27
Distributed DBMS By :-Bharat P. Patil Bihag Mehta Ronak Thakkar Prashant Jaiswal

Distributed dbms

Embed Size (px)

Citation preview

Page 1: Distributed dbms

Distributed DBMS

By :-Bharat P. Patil Bihag Mehta Ronak Thakkar Prashant Jaiswal

Page 2: Distributed dbms

DISTRIBUTED DBMSDatabase:- Logical interrelated

collection of shared data, along with description of data, physically distributed over a computer network.

DBMS:- The software system that permits the management of the distributed database and makes the distribution transparent to users

Page 3: Distributed dbms

DISTRIBUTED DBMS

Page 4: Distributed dbms

TWO types of Applications

• Local application : Application that do not required data from other sites.

• Global application : Application that required data from other sites.

CONCEPTS

Page 5: Distributed dbms

Characteristics DDBMS

• A collection of logically related shared data.

• Data is split into a number of fragments.

• Fragments are replicated.

•Fragments/ replicas are allocated to sites.

CONCEPTS

Page 6: Distributed dbms

Characteristics Of DDBMS

• Sites are linked by a communication network.

• Data at each site is in the control of a DBMS.

•DBMS at each site can handle local applications

• Each DBMS participates in at least one global application

CONCEPTS

Page 7: Distributed dbms

Properties of DDBMS

• Distributed Data Independence : Users should not have to know where data is located (extends Physical and Logical Data Independence principles).

•Distributed Transaction Atomicity : Users should be able to write Xacts accessing multiple sites just like local Xacts.

Concepts

Page 8: Distributed dbms

T T T

COM 1

T T T

COM 2

Communication Network

T T T

COM 3

DB

DISTRIBUTED PROCESSING

Page 9: Distributed dbms

•Reflects organizational structure

•Improved shareability and local autonomy

•Improved availability

•Improved reliability

•Improved performance

•Economics

•Modular growth

ADVANTAGES

Page 10: Distributed dbms

•Complexity

•Cost

•Security

•Integrity control more difficult

•Lack of standards

•Lack of experience

•Database design more complex

DISADVANTAGES

Page 11: Distributed dbms

Reference Architecture for DDBMS

Due to diversity, no accepted architecture equivalent to ANSI/SPARC 3-level architecture.

A reference architecture consists of:◦ Set of global external schemas.◦ Global conceptual schema (GCS).◦ Fragmentation schema and allocation schema.◦ Set of schemas for each local DBMS conforming to 3-level

ANSI/SPARC .Some levels may be missing, depending on

levels of transparency supported.Can be homogeneous or heterogeneous

Page 12: Distributed dbms

Reference Architecture for DDBMS

Page 13: Distributed dbms

• Homogeneous DDBMS

• All sites use same DBMS product.

• Heterogeneous DDBMS

• All sites use different DBMS product.

TYPE OF DDBMS

Page 14: Distributed dbms

• Local DBMS

• Data Communication Component

• Global System Catalog

• Distributed DBMS component

COMPONENET ARCHITECTURE FOR DDBMS

Page 15: Distributed dbms

• Fragmentation: that are used to break up the database into logical units,called fragments.

• Replication: which permits certain data to be stored in more than one site.

• Allocation: process of allocating fragments or replicas of fragments for storage at the various site.

DISTRIBUTED DATABASE DESIGN AND TECHNIQUE

Page 16: Distributed dbms

• Horizontal fragment of a relation is a subset of the tuples in that relation.

• Vertical fragment of a relation keeps only certain attributes of a relation.

• Mixed – both horizontal and vertical.

• Derived – natural join first to get additional information required then fragment.

• Must be able to reconstruct original table.

• Can query and update through fragment.

TYPES OF FRAGMENTATION

Page 17: Distributed dbms

FRAGMENTATION• Strategize to achieve:• Locality of Reference• Improved Reliability and Availability• Improved Performance• Balanced Storage Capacities and Costs• Minimal Communication Costs.

• Quantitative and quantitative information

• Correctness of Fragmentation• Completeness• Reconstruction• Disjointness.

Page 18: Distributed dbms

REPLICATION

• Storing data at multiple sites• Example – Internet grocer with multiple

warehouses.• CUSTOMER (Cust#, Addr, Location)• Customer info at central location• Location is warehouse that makes deliveries

• Where do we store tables?• Fragment?• Replicate?

Page 19: Distributed dbms
Page 20: Distributed dbms

• Transparencies hide implementation details from the user

• Example in Centralized databases : Data independence

• Main types of transparencies in

• DDBMS:Distributed Transparency

• Transaction Transparency

TRANSPERENCIES IN DDBMS

Page 21: Distributed dbms

Allows the user to see the database as asingle, logical entity.

If this transparency is exhibited then theuser does not need to know that 1. The data are partitioned. 2. Data can be replicated at several sites. 3. Data location.

DISTRIBUTED TRANSPARENCY

Page 22: Distributed dbms

If it is provided then the user does not needto know the data is fragmented.

Example:

SELECT fName, lName FROM StaffWHERE position = ‘ Manager ’

FRAGMENTATION TRANSPERANCY

Page 23: Distributed dbms

• It maintains distributed database’s integrity and consistency.

TRANSACTION TRANSPARENCY

Page 24: Distributed dbms

A DBMS running across multiple processors and disks that has been designed to execute operations in parallel, whenever possible, in order to improve performance.

PARALLEL DBMS

Page 25: Distributed dbms

Date’s 12 Rules for a DDBMS

Fundamental PrincipleTo the user, a distributed system should look

exactly like a non distributed system.

1. Local Autonomy2. No Reliance on a Central Site3. Continuous Operation4. Location Independence5. Fragmentation Independence6. Replication Independence

Page 26: Distributed dbms

Date’s 12 Rules for a DDBMS

7. Distributed Query Processing8. Distributed Transaction Processing9. Hardware Independence10. Operating System Independence11. Network Independence12. Database Independence

Last four rules are ideals.

Page 27: Distributed dbms