Distributed DB & DBMS
Concepts
Distributed Database
A logically interrelated collection of shared data, physically distributed
over a computer network.
Distributed DBMS
Software system that permits the management of the distributed
database and makes the distribution transparent to users.
Main Characteristics
• Data at multiple sites
• DM at each site.
• Local requirements
• Global perspective
Distributed Processing
Concepts
• Distribution
– Refers to the distribution of data. i.e. physical distribution of data over
multiple sites that looks as one logical pool to the user.
– Client/Server distribution
– Peer-to-peer system: Each machine has full DBMS functionality
• Autonomy
– Autonomy is the distribution of control, not data.
– It is the level to which individual DBMSs can operate independently.
– The local operations of the individual DBMSs are not affected by their
participation in the multidatabase system.
– The queries process by individual DBMSs should not be affected by the
execution of global queries
Concepts
• Heterogeneity
– Heterogeneity may occur in various form in distributed system, ranging from
hardware heterogeneity and differences in networking protocols, data model,
query language and transaction management protocols
• Homogeneous DDBMS
– All sites use same DBMS product.
– Much easier to design and manage.
Distributed DBMS Architecture
showing Local Transaction Steps
3 5
Local transaction – all data stored locally
Local Transaction Steps
• Application makes request to distributed DBMS
• Distributed DBMS checks distributed data repository for
location of data. Finds that it is local
• Distributed DBMS sends request to local DBMS
• Local DBMS processes request
• Local DBMS sends results to application
Distributed DBMS architecture
showing global transaction steps
2
3
1
7 6
8
4
Global transaction – some data is at remote site(s)
Global Transaction Steps
• Application makes request to distributed DBMS
• Distributed DBMS checks distributed data repository for location of data. Finds
that it is remote
• Distributed DBMS routes request to remote site
• Distributed DBMS at remote site translates request for its local DBMS if
necessary, and sends request to local DBMS
• Local DBMS at remote site processes request
• Local DBMS at remote site sends results to distributed DBMS at remote site
• Remote distributed DBMS sends results back to originating site
• Distributed DBMS at originating site sends results to application
Advantages of DDBMS
• Organizational Structure : Organizations naturally distributed
• local autonomy :Data can be placed at the site close to the user
• Improved availability : Continue function despite failure
• Improved Reliability : Data replication
• Improved Performance: Data near the site (access speed factor)
• Economics :Data transmission cost (centralized vs Distributed)
• Modular Growth :Adding new sites is easy without effecting the operation of
other site
Disadvantages of DDBMS
• Complexity : Hides distributed nature from user
• Cost :Maintenance cost increase with complexity
• Security: Multiple locations
• Integrity: Communication and processing costs
• Lack of Standards: No tools to convert centralized DBMS to distributed DBMS
• Lack of Experience: Not much general purpose DDBMS
• Database Design more complex : Replication