37
<Insert Picture Here> Getting Coherence: Introduction to Data Grids South Florida User Group Cameron Purdy Cameron Purdy Vice President of Development

Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Getting Coherence: Introduction to Data GridsSouth Florida User GroupCameron PurdyCameron PurdyVice President of Development

Page 2: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Speaker

Cameron Purdy is Vice President of Development at Cameron Purdy is Vice President of Development at Oracle. Prior to joining Oracle, Mr. Purdy was the CEO of Tangosol, whose revolutionary Coherence Data Grid product provides reliable and scalable data Data Grid product provides reliable and scalable data management across the enterprise.Mr. Purdy has over ten years of experience with Java and Java-related technology; he regularly Java and Java-related technology; he regularly participates in industry standards development and is a specification lead for the Java Community Process.a specification lead for the Java Community Process.

Page 3: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Coherence History

Page 4: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Tangosol Coherence2000-20072000-2007

Pioneer of Data Grid technologyFirst to provide Coherent Clustered CachingFirst to provide Dynamic HA PartitioningFirst to localize Data ProcessingFirst to localize Data ProcessingFirst to achieve Parallel Aggregation

Consistent annual growth over 100%Innovation driven by a loyal, growing customer baseInnovation driven by a loyal, growing customer base

#1 in Data Grids#1 in Data Grids#1 in Transaction Volumes#1 in Install Base#1 in Install Base

Page 5: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Tangosol acquires Oracle20072007

Kick in the AfterburnerContinuing Innovation: Increased investment in developmentGrowing Adoption: Leveraging Oracle s scaleVibrant Ecosystem: Worldwide support, service, consulting, partnersVibrant Ecosystem: Worldwide support, service, consulting, partners

Increased annual growthInnovation continues to be driven by our customers

Like a kid in a toy-store Berkeley DBBerkeley DBTopLink Grid

Page 6: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Orasol acquires BEA20082008

Coming full circle: Immediate realized synergiesReal-Time Data Grid: jRockit RealTime and CoherenceWebLogic Suite and WebLogic Application GridWebLogic Liquid Operations ControlWebLogic Liquid Operations ControlWebLogic Server and Portal IntegrationCEP Engine Integration (OSGi)

Oracle now leads in every middleware category according to Gartner Magic Quadrants!to Gartner Magic Quadrants!Leading Developer Mindshare & Best of Breed Technology

Page 7: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Data Grid Introduction

The Oracle Coherence In-Memory Data Grid is a data management system for application

objects that are shared across multiple servers, objects that are shared across multiple servers, require low response time, very high throughput,

predictable scalability, continuous availability and information reliability.and information reliability.

Page 8: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Data Grid

Who: Oracle Coherence, IBM WebSphere XSWho: Oracle Coherence, IBM WebSphere XSAlso Gemstone Gemfire, Gigaspaces, ScaleOut Software

What: Using a grid infrastructure to host both reference and transactional data, eliminating data reference and transactional data, eliminating data bottlenecks and improving performance and efficiencyWhere: Inside the data centerWhere: Inside the data centerWhen: Usually 24x7x366, but sometimes only for How: Using clustering technology

Partitioning for scale (and reliability)Replicating for availability (and performance)

Page 9: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Oracle Coherence: A Unique Approach

Peer-to-Peer Clustering and Data Management Technology

No Single Points of FailureNo Single Points of Failure

No Single Points of Bottleneck

No Masters / Slaves / Registries etc

All members have responsibility to;

Manage Cluster Health & Data

Perform Processing and QueriesPerform Processing and Queries

Work as a team in parallel

Communication is point-to-point (not TCP/IP) and/or one-to-many(not TCP/IP) and/or one-to-many

Scale to limit of the back-plane

Use with commodity infrastructure

Linearly Scalable By Design

(c) Copyright 2008. Oracle Corporation

Page 10: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Oracle Coherence: Real World Results

From a presentation by Royal Bank of Scotland:

Further competitive evaluation conducted that ratified Coherence as best of breed

Easiest to configureEasiest to configureStable performance under loadMuch harder to break

Coherence Best of restCoherence Best of rest

Page 11: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Oracle Coherence: A Unique Approach

Data is automatically partitioned and load-balanced across the Server Cluster

Data is synchronously replicated for Data is synchronously replicated for continuous availability

Servers monitor the health of each other

When in doubt, servers work together to diagnose statusdiagnose status

Healthy servers assume responsibility for Healthy servers assume responsibility for failed server (in parallel)

Continuous Operation: No interruption to service or data loss due to a server failure

(c) Copyright 2008. Oracle Corporation

Page 12: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Data Grid: Capability Summary

A Data Grid combines data management with data processing in a scale-out environmentprocessing in a scale-out environment

Some or all of the servers in the grid are responsible for reliably managing live informationAny or all of the servers in the grid are able to simultaneously access and manipulate a shared, consistent view of that informationProcessing power far exceeds aggregate network bandwidth, so data access and manipulation must be parallelized and localized within the gridData locality is both dynamic and transparent

Page 13: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Oracle Coherence:

Technical IllustrationsTechnical Illustrations

Page 14: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Partitioned Topology : Data AccessData spread and backed Data spread and backed up across Members

Transparent to developer

Members have access to all Data

All Data locations are All Data locations are known no lookup & no registry!

Page 15: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Partitioned Topology : Data UpdateSynchronous Update

Avoids potential Data Loss & Corruption

Predictable Performance

Backup Partitions are partitioned away from partitioned away from Primaries for resilience

No engineering requirement to setup requirement to setup Primaries or Backups

Automatically and Dynamically ManagedDynamically Managed

Page 16: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Partitioned Topology : RecoveryMembership changes (new members added or members leaving)

Other members, using Other members, using consensus, recover and repartition automatically

No in-flight operations No in-flight operations lost, no availability gap!

Some latencies (due to higher priority of higher priority of asynchronous recovery)

Information Reliability & Continuous Availability are the prioritiesare the priorities

Page 17: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Partitioned Topology : Local StorageSome members are used to manage data

Other members are temporary in a cluster, temporary in a cluster, or do not have memory to spare for managing datadata

They should not cause repartitioning

Specialization of roles Specialization of roles within a Data Grid: Clients and Servers

Page 18: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Read-Through & Write-ThroughRead-Through & Write-ThroughAccess to the data sources go through the Data Grid.Data Grid.Read and write operations are always managed by the node that owns the data within the Data Grid.data within the Data Grid.Concurrent accesses are combined, greatly reducing database load.reducing database load.Write-Through keeps the in-memory data and the database in sync.database in sync.

Page 19: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Write-BehindWrite-BehindWrite-Behind accepts data modifications directly into the Data GridData Grid

The modifications are then asynchronously written back to the data source, to the data source, optionally after a specified delay

All write-behind data is synchronously and redundantly managed, redundantly managed, making it resilient to server failure

Page 20: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Topology Composition : Near Topology

Page 21: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Features : Observable Interface

Page 22: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Features : QueryMap Interface

Page 23: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

ConcurrencyConcurrency

Implicit: Queueing of operations

Virtual queue & thread per entry

Explicit: Pessimistic lockingGrid-Wide MutexGrid-Wide Mutex

Transactions: Unit of work management

Both optimistic and pessimistic transactions

Isolation levels from read-committed through serializablethrough serializable

Integrated with JTA

Page 24: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Features : InvocableMap Interface

Page 25: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Data Grid Use Cases

Page 26: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

General Use Cases for a Data Grid

CachingApplications request data from the Data Grid rather than Applications request data from the Data Grid rather than backend data sources

AnalyticsApplications ask the Data Grid questions from simple Applications ask the Data Grid questions from simple queries to advanced scenario modeling

TransactionsTransactionsData Grid acts as a transactional System of Record, hosting data and business logic

EventsEventsAutomated processing based on event

Page 27: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Use Case: Caching

Applications request data from the Data Grid rather than backend data sources

Enable faster access to frequently accessed dataEnable faster access to frequently accessed dataReduce load on shared data sourcesObviates the impact of data source outages

Use CasesUse CasesReference dataUser preferencesMarket dataMarket data

CoherenceManageable and scalable host for the cache Guarantees consistent data and data integrityGuarantees consistent data and data integrityBroad industry support as a plug-in cache

Page 28: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Use Case: AnalyticsApplications ask the Data Grid questions from simple queries to advanced scenario modeling:

Enables query rates beyond what a database can handleEnables query rates beyond what a database can handleLarger data sets available in memoryComplex analytics through massive parallel processing across gridIncreased availability

Use CasesRisk managementRisk managementPortfolio managementDerivatives pricing

CoherenceCoherenceBuilt-in query support User-defined parallel calculationsStable results even with server failureStable results even with server failure

Page 29: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Use Case: Transactions

The Data Grid acts as a transactional System of Record, hosting data and business logic:hosting data and business logic:

Higher transaction volumesMore predictable scalabilityMore predictable cost of scaleMore predictable cost of scaleFlexible capacity

Use CasesOrder book, execution, fill and reconciliationOrder book, execution, fill and reconciliationE-Commerce, real-time inventoryLarge-scale data collection: Buffered transactions

CoherenceCoherenceMultiple isolation levels, both optimistic and pessimistic modesReliability is key to transactional integrityReliability is key to transactional integrity

Page 30: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Use Case: EventsIn addition to transactions, the Data Grid reliably manages each resulting event and delivers it to the related business logic:

Real-time management for both data-driven and event-driven architecturesarchitecturesFlexible data, event and processing capacity

Use CasesMarket Data-driven processes such as Algorithmic TradingMarket Data-driven processes such as Algorithmic TradingFraud DetectionLarge-scale work-flow: SEDA / Staged Business Event TransitionsTransitions

CoherenceCo-located processing of data and events for low latency and high throughput high throughput Reliable Once-And-Only-Once processing for eventsScalable to millions of events per second without sacrificing reliabilityreliability

Page 31: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Financial Institution: Risk Analytics

Large Data SetsParallel AggregationCapable of Absorbing Capable of Absorbing Real Time FeedsCalc on Demand / Recalc on Changes100% Data Locality for

Node3

Node2Node1

Scenarios100% Data Locality for computationsTrue Linear ScaleReal World Results:

Aggregated Risk

Real World Results:50 Days -> 1 Hour

Page 32: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Financial Institution: Foreign Exchange.. executes any number of

Business Logic driven by State ChangesOnce-and-Only-Once

.. executes any number of

Once-and-Only-Once GuaranteesEncapsulated Transactions

No Global-TXState Change Event Business Logic100% Data Locality for

Business Logic executionReal World Results:

From Big Boxes to Blades1000x throughput increaseContinuous Availability

.. creates any number of

Page 33: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Coherence Update

The latest news about Oracle Coherence and related projects coming out of Oracle, as of

Q2/FY09.Q2/FY09.

Page 34: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Oracle Coherence 3.4Launched at Oracle OpenWorldLaunched at Oracle OpenWorld

http://www.oracle.com/technology/products/coherence/index.html

POF: End-to-end support for Portable Object FormatPluggable Serializers: Control wire & memory formatsFeature parity across Java, C# & .NET, C++ (New!)Triggers: Validate, modify and/or reject operationsTriggers: Validate, modify and/or reject operationsEvent Transformers: Alter events at their originBackup-Count-After-Write-Behind: Purges backup copies Backup-Count-After-Write-Behind: Purges backup copies once data has been committed to a databaseJMX Reporter: Cluster-wide black-box recorderJMX Reporter: Cluster-wide black-box recorderJava Platform Mbeans: Cluster-wide JVM statistics

Page 35: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Oracle TopLink GridLaunched at Oracle OpenWorldLaunched at Oracle OpenWorld

http://www.oracle.com/technology/products/ias/toplink/tl_grid.html

JPA for Oracle CoherenceOracle TopLink is the JPA Reference Implementation

TopLink Grid stores all or part of an application s domain model in a Coherence Data Gridmodel in a Coherence Data GridFeatures:

Simple configuration using annotations that align with standard JPAAbility to choose which entities are stored in the grid versus those stored directly in the backing data storeSupport for queries being executed against either the Grid or directly Support for queries being executed against either the Grid or directly against the data storeSupport for sequence generation and locking using either the Grid or the backing data storeor the backing data store

Page 36: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

Oracle IncubatorLaunched 15 October 2008!Launched 15 October 2008!

http://coherence.oracle.com/display/INCUBATOR/HomeCommand Pattern - This is a scale-out and Highly Available distributed implementation of the classic "Command Pattern".Store and Forward Messaging Pattern - This is a scale-Store and Forward Messaging Pattern - This is a scale-out and Highly Available framework for distributed Store and Forward Messaging.Push Replication Pattern - This is a powerful framework for replicating the transactional data in real-time systems from one data center to another; it supports bi-systems from one data center to another; it supports bi-directional WAN replication with application-customizable collision-reconciliation policies.

Page 37: Intro To Data Grids - Oracle User Groupsfoug.org/Downloads/data_grids_20081119.pdf · Orasol acquires BEA 2008 Coming full circle: Immediate realized synergies Real-Time Data Grid:

<Insert Picture Here>

Q&A

Learn more online: http://www.oracle.com/technology/products/coherence/