
Google Bigtable

1. Describe the data model of Bigtable.


A Bigtable is a sparse, distributed, persistent multidimensional sorted map. The map is indexed
by a row key, column key, and a timestamp; each value in the map is an uninterpreted array of
bytes.

(row:string, column:string, time:int64) -> string
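As a rough illustration only (this is not Bigtable code), the logical model can be pictured as a single sorted, sparse map in C++ with a composite (row, column, timestamp) key; the row and column names below are borrowed from the paper's webtable example.

// Illustrative sketch of the logical data model as one sorted, sparse map.
// In the real system timestamps sort newest-first; std::map's default
// ascending order is kept here for brevity.
#include <cstdint>
#include <map>
#include <string>
#include <tuple>

using CellKey = std::tuple<std::string, std::string, int64_t>;  // (row, column, time)
using Cells   = std::map<CellKey, std::string>;                 // sorted by row, then column, then time

int main() {
  Cells webtable;
  webtable[{"com.cnn.www", "anchor:cnnsi.com", 9}] = "CNN";        // illustrative cell
  webtable[{"com.cnn.www", "contents:", 6}]        = "<html>...";  // illustrative cell
  return 0;
}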

2. Describe the design of the Bigtable API.


The Bigtable API provides functions for creating and deleting tables and column families. It also
provides functions for changing cluster, table, and column family metadata, such as access control
rights. Client applications can write or delete values in Bigtable, look up values from individual
rows, or iterate over a subset of the data in a table.
// Open the table
Table *T = OpenOrDie("/bigtable/web/webtable");
// Write a new anchor and delete an old anchor
RowMutation r1(T, "com.cnn.www");
r1.Set("anchor:www.c-span.org", "CNN");
r1.Delete("anchor:www.abc.com");
Operation op;
Apply(&op, &r1);
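For reads, the paper pairs this with a Scanner example (reproduced here as it appears in the Bigtable paper, so the helper names come from that example rather than from this summary), which iterates over all versions of the anchor column family for a single row:

// Iterate over all anchors of row "com.cnn.www", returning all versions
Scanner scanner(T);
ScanStream *stream;
stream = scanner.FetchColumnFamily("anchor");
stream->SetReturnAllVersions();
scanner.Lookup("com.cnn.www");
for (; !stream->Done(); stream->Next())
    printf("%s %s %lld %s\n",
           scanner.RowName(),
           stream->ColumnName(),
           stream->MicroTimestamp(),
           stream->Value());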

Bigtable supports several other features that allow the user to manipulate data in more
complex ways:
1. Bigtable supports single-row transactions, which can be used to perform atomic read-
modify-write sequences on data stored under a single row key.
2. Bigtable allows cells to be used as integer counters.
3. And finally, Bigtable supports the execution of client-supplied scripts in the address
spaces of the servers.
3. Present the Google SSTable file format.

The Google SSTable file format is used internally to store Bigtable data. An SSTable provides a
persistent, ordered immutable map from keys to values, where both keys and values are arbitrary
byte strings. Operations are provided to look up the value associated with a specified key, and to
iterate over all key/value pairs in a specified key range. Internally, each SSTable contains a
sequence of blocks (typically each block is 64KB in size, but this is configurable).
A block index (stored at the end of the SSTable) is used to locate blocks; the index is
loaded into memory when the SSTable is opened. A lookup can be performed with a single disk
seek: we first find the appropriate block by performing a binary search in the in-memory index,
and then reading the appropriate block from disk. Optionally, an SSTable can be completely
mapped into memory, which allows us to perform lookups and scans without touching disk.
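A rough sketch of that lookup path follows (illustrative C++ only; the index layout and the block-reading helper are assumptions of this sketch, not the real SSTable format):

// Sketch: point lookup using the in-memory block index plus one block read.
#include <algorithm>
#include <cstdint>
#include <string>
#include <vector>

struct IndexEntry {
  std::string last_key;  // largest key contained in the block
  uint64_t offset;       // block position within the SSTable file
  uint64_t size;         // block length (typically ~64KB)
};

// Placeholder standing in for the single disk read plus an in-block scan for `key`.
std::string ReadBlockAndFind(uint64_t offset, uint64_t size, const std::string& key) {
  (void)offset; (void)size; (void)key;
  return "";  // stubbed out in this sketch
}

// `index` is sorted by last_key and was loaded when the SSTable was opened.
std::string Lookup(const std::vector<IndexEntry>& index, const std::string& key) {
  // Binary search the in-memory index for the first block that could hold the key...
  auto it = std::lower_bound(index.begin(), index.end(), key,
      [](const IndexEntry& e, const std::string& k) { return e.last_key < k; });
  if (it == index.end()) return "";  // key is beyond the last block
  // ...then a single disk seek/read fetches the candidate block.
  return ReadBlockAndFind(it->offset, it->size, key);
}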

4. Describe the role of the Google Chubby service.

Bigtable relies on a highly-available and persistent distributed lock service called Chubby.
A Chubby service consists of five active replicas, one of which is elected to be the master and
actively serve requests. The service is live when a majority of the replicas are running and can
communicate with each other. Chubby uses the Paxos algorithm to keep its replicas consistent in
the face of failure. Chubby provides a namespace that consists of directories and small files. Each
directory or file can be used as a lock, and reads and writes to a file are atomic. The Chubby client
library provides consistent caching of Chubby files. Each Chubby client maintains a session with
a Chubby service. A client’s session expires if it is unable to renew its session lease within the
lease expiration time. When a client’s session expires, it loses any locks and open handles. Chubby
clients can also register callbacks on Chubby files and directories for notification of changes or
session expiration.

Bigtable uses Chubby for a variety of tasks:


- to ensure that there is at most one active master at any time;
- to store the bootstrap location of Bigtable data;
- to discover tablet servers and finalize tablet server deaths;
- to store Bigtable schema information (the column family information for each table);
- and to store access control lists.

If Chubby becomes unavailable for an extended period of time, Bigtable becomes unavailable.
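The interface below is purely hypothetical (it is not Chubby's actual API), but it sketches how an exclusive lock on a well-known file gives the "at most one active master" guarantee:

// Hypothetical lock-service interface; the path and method names are invented
// for illustration only.
#include <functional>
#include <string>

class LockService {
 public:
  virtual ~LockService() = default;
  // Try to take an exclusive lock on the named file; false if already held.
  virtual bool TryAcquireExclusive(const std::string& path) = 0;
  // Register a callback fired when the session (and thus the lock) expires.
  virtual void OnSessionExpired(std::function<void()> callback) = 0;
};

bool BecomeMaster(LockService& lock_service) {
  if (!lock_service.TryAcquireExclusive("/bigtable/cluster/master-lock"))
    return false;  // some other master instance already holds the lock
  // If the session is ever lost, this master must stop acting as master.
  lock_service.OnSessionExpired([] { /* step down, stop serving */ });
  return true;
}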

5. Explain the architecture of the Google Bigtable server.

The Bigtable implementation has three major components:


1. a library that is linked into every client,
2. one master server, and
3. many tablet servers.
6. Describe the role of the Master server.

The master is responsible for assigning tablets to tablet servers, detecting the addition and
expiration of tablet servers, balancing tablet-server load, and garbage collection of files in GFS. In
addition, it handles schema changes such as table and column family creations. Client data does
not move through the master: clients communicate directly with tablet servers for reads and writes.
Because Bigtable clients do not rely on the master for tablet location information, most clients
never communicate with the master. As a result, the master is lightly loaded in practice.

7. Describe the role of the Tablet server in Bigtable.

Each tablet server manages a set of tablets (typically we have somewhere between ten and a
thousand tablets per tablet server). The tablet server handles read and write requests to the tablets
that it has loaded, and also splits tablets that have grown too large.
Tablet servers can be dynamically added to (or removed from) a cluster to accommodate
changes in workloads. A Bigtable cluster stores a number of tables. Each table consists of a set of
tablets, and each tablet contains all data associated with a row range. Initially, each table consists
of just one tablet. As a table grows, it is automatically split into multiple tablets, each
approximately 100-200 MB in size by default.

8. Present the protocol for locating a tablet.

We use a three-level hierarchy analogous to that of a B+-tree to store tablet location information.

The first level is a file stored in Chubby that contains the location of the root tablet. The root tablet
contains the location of all tablets in a special METADATA table. Each METADATA tablet
contains the location of a set of user tablets.
The root tablet is just the first tablet in the METADATA table, but is treated specially - it is never
split - to ensure that the tablet location hierarchy has no more than three levels. The METADATA
table stores the location of a tablet under a row key that is an encoding of the tablet’s table identifier
and its end row.
The client library caches tablet locations. If the client does not know the location of a tablet, or if
it discovers that cached location information is incorrect, then it recursively moves up the tablet
location hierarchy. If the client’s cache is empty, the location algorithm requires three network
round-trips, including one read from Chubby. If the client’s cache is stale, the location algorithm
could take up to six round-trips, because stale cache entries are only discovered upon misses
(assuming that METADATA tablets do not move very frequently). Although tablet locations are
stored in memory, so no GFS accesses are required, we further reduce this cost in the common
case by having the client library prefetch tablet locations: it reads the metadata for more than one
tablet whenever it reads the METADATA table. We also store secondary information in the
METADATA table, including a log of all events pertaining to each tablet (such as when a server
begins serving it). This information is helpful for debugging and performance analysis.
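A compile-only sketch of the client-side lookup described above is given below; every helper, the METADATA key encoding, and the treatment of the row as the tablet's end row are simplifying assumptions of this sketch:

// Sketch: three-level tablet location lookup with a client-side cache.
#include <map>
#include <string>

struct TabletLocation { std::string tablet_server; };

// Placeholders for the three levels of the hierarchy (Chubby file, root
// tablet, METADATA tablet); a real client would issue RPCs here.
TabletLocation ReadRootLocationFromChubby() { return {"root-location"}; }
TabletLocation ReadMetadataTabletLocation(const TabletLocation&, const std::string&) { return {"metadata-server"}; }
TabletLocation ReadUserTabletLocation(const TabletLocation&, const std::string&) { return {"tablet-server"}; }

// METADATA row key: an encoding of the tablet's table identifier and end row.
std::string MetadataKey(const std::string& table_id, const std::string& end_row) {
  return table_id + ";" + end_row;  // ';' separator is an assumption of this sketch
}

TabletLocation Locate(std::map<std::string, TabletLocation>& cache,
                      const std::string& table_id, const std::string& row) {
  const std::string key = MetadataKey(table_id, row);  // simplification: row used as end row
  auto hit = cache.find(key);
  if (hit != cache.end()) return hit->second;  // common case: answered from the cache
  // Cache miss (or a stale entry was just invalidated): walk the hierarchy.
  TabletLocation root = ReadRootLocationFromChubby();           // level 1: Chubby file
  TabletLocation meta = ReadMetadataTabletLocation(root, key);  // level 2: root tablet
  TabletLocation user = ReadUserTabletLocation(meta, key);      // level 3: METADATA tablet
  cache[key] = user;
  return user;
}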

9. Describe how and when tablets are discovered, assigned, and unassigned.
Each tablet is assigned to one tablet server at a time. The master keeps track of the set of live tablet
servers, and the current assignment of tablets to tablet servers, including which tablets are
unassigned. When a tablet is unassigned, and a tablet server with sufficient room for the tablet is
available, the master assigns the tablet by sending a tablet load request to the tablet server. Bigtable
uses Chubby to keep track of tablet servers. When a tablet server starts, it creates, and acquires an
exclusive lock on, a uniquely-named file in a specific Chubby directory. The master monitors this
directory (the servers directory) to discover tablet servers. A tablet server stops serving its tablets
if it loses its exclusive lock. A tablet server will attempt to reacquire an exclusive lock on its file
as long as the file still exists. If the file no longer exists, then the tablet server will never be able to
serve again, so it kills itself.
The master is responsible for detecting when a tablet server is no longer serving its tablets, and for
reassigning those tablets as soon as possible. To detect when a tablet server is no longer serving
its tablets, the master periodically asks each tablet server for the status of its lock. If a tablet server
reports that it has lost its lock, or if the master was unable to reach a server during its last several
attempts, the master attempts to acquire an exclusive lock on the server’s file. If the master is able
to acquire the lock, then Chubby is live and the tablet server is either dead or having trouble
reaching Chubby, so the master ensures that the tablet server can never serve again by deleting its
server file. Once a server’s file has been deleted, the master can move all the tablets that were
previously assigned to that server into the set of unassigned tablets.
When a master is started by the cluster management system, it needs to discover the current tablet
assignments before it can change them. The master executes the following steps at startup.
1. The master grabs a unique master lock in Chubby, which prevents concurrent master instantiations.
2. The master scans the servers directory in Chubby to find the live servers.
3. The master communicates with every live tablet server to discover what tablets are already assigned to each server.
4. The master scans the METADATA table to learn the set of tablets. Whenever this scan encounters a tablet that is not already assigned, the master adds the tablet to the set of unassigned tablets, which makes the tablet eligible for tablet assignment.
One complication is that the scan of the METADATA table cannot happen until the METADATA
tablets have been assigned. Therefore, before starting this scan (step 4), the master adds the root
tablet to the set of unassigned tablets if an assignment for the root tablet was not discovered during
step 3. This addition ensures that the root tablet will be assigned. Because the root tablet contains
the names of all METADATA tablets, the master knows about all of them after it has scanned the
root tablet.
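Those startup steps might be sketched as follows; every helper is a placeholder standing in for the master's real Chubby, RPC, and METADATA operations, not the actual implementation:

// Compile-only sketch of the master startup sequence described above.
#include <set>
#include <string>
#include <vector>

static const std::string kRootTablet = "ROOT";

bool AcquireMasterLock() { return true; }                        // step 1 placeholder
std::vector<std::string> ScanServersDirectory() { return {}; }   // step 2 placeholder
std::set<std::string> QueryAssignedTablets(const std::vector<std::string>&) { return {}; }  // step 3 placeholder
std::vector<std::string> ScanMetadataTable() { return {kRootTablet}; }  // step 4 placeholder
void MarkUnassigned(const std::string&) {}

void MasterStartup() {
  if (!AcquireMasterLock()) return;               // prevents concurrent masters
  auto servers  = ScanServersDirectory();         // live tablet servers
  auto assigned = QueryAssignedTablets(servers);  // tablets already being served
  if (!assigned.count(kRootTablet))
    MarkUnassigned(kRootTablet);                  // root must be assigned before step 4
  for (const auto& tablet : ScanMetadataTable())  // full set of tablets
    if (!assigned.count(tablet)) MarkUnassigned(tablet);
}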

10. Explain how and when tablets are merged and split.
The set of existing tablets only changes when a table is created or deleted, two existing tablets are
merged to form one larger tablet, or an existing tablet is split into two smaller tablets. The master
is able to keep track of these changes because it initiates all but the last. Tablet splits are treated
specially since they are initiated by a tablet server. The tablet server commits the split by
recording information for the new tablet in the METADATA table. When the split has committed,
it notifies the master. In case the split notification is lost (either because the tablet server or the
master died), the master detects the new tablet when it asks a tablet server to load the tablet that
has now split. The tablet server will notify the master of the split, because the tablet entry it finds
in the METADATA table will specify only a portion of the tablet that the master asked it to load.

11. Describe tablet read and write operations.


When a write operation arrives at a tablet server, the server checks that it is well-formed, and that
the sender is authorized to perform the mutation. Authorization is performed by reading the list of
permitted writers from a Chubby file (which is almost always a hit in the Chubby client cache). A
valid mutation is written to the commit log. Group commit is used to improve the throughput of
lots of small mutations. After the write has been committed, its contents are inserted into the
memtable.
When a read operation arrives at a tablet server, it is similarly checked for well-formedness and
proper authorization. A valid read operation is executed on a merged view of the sequence of
SSTables and the memtable. Since the SSTables and the memtable are lexicographically sorted
data structures, the merged view can be formed efficiently.
Incoming read and write operations can continue while tablets are split and merged.
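As a simplified sketch of the read path (assuming, for illustration, that the memtable and each SSTable behave like sorted maps from key to value), a point read consults the memtable first and then the SSTables from newest to oldest:

// Sketch: a point read over the memtable plus a stack of SSTables, newest first.
// Deletion entries (see question 12) are ignored here for brevity.
#include <map>
#include <optional>
#include <string>
#include <vector>

using SortedRun = std::map<std::string, std::string>;  // stands in for memtable or an SSTable

std::optional<std::string> Read(const SortedRun& memtable,
                                const std::vector<SortedRun>& sstables,  // oldest first, newest last
                                const std::string& key) {
  // The memtable holds the most recent mutations, so it is consulted first.
  if (auto it = memtable.find(key); it != memtable.end()) return it->second;
  // Then the SSTables, from newest to oldest; the first hit is the current value.
  for (auto run = sstables.rbegin(); run != sstables.rend(); ++run) {
    if (auto it = run->find(key); it != run->end()) return it->second;
  }
  return std::nullopt;  // key not present in any run
}

For range scans, all of the runs would instead be merged with a k-way merge, which is efficient precisely because each run is already sorted.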

12. Describe when and how tablets are compacted.


As write operations execute, the size of the memtable increases. When the memtable size reaches
a threshold, the memtable is frozen, a new memtable is created, and the frozen memtable is
converted to an SSTable and written to GFS.
This minor compaction process has two goals:
- it shrinks the memory usage of the tablet server, and
- it reduces the amount of data that has to be read from the commit log during recovery if this server dies.
Incoming read and write operations can continue while compactions occur.
Every minor compaction creates a new SSTable. If this behavior continued unchecked, read
operations might need to merge updates from an arbitrary number of SSTables. Instead, we bound
the number of such files by periodically executing a merging compaction in the background. A
merging compaction reads the contents of a few SSTables and the memtable, and writes out a new
SSTable. The input SSTables and memtable can be discarded as soon as the compaction has
finished.
A merging compaction that rewrites all SSTables into exactly one SSTable is called a major
compaction. SSTables produced by non-major compactions can contain special deletion entries
that suppress deleted data in older SSTables that are still live. A major compaction, on the other
hand, produces an SSTable that contains no deletion information or deleted data. Bigtable cycles
through all of its tablets and regularly applies major compactions to them. These major
compactions allow Bigtable to reclaim resources used by deleted data, and also allow it to ensure
that deleted data disappears from the system in a timely fashion, which is important for services
that store sensitive data.
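The memtable threshold check and the bound on the number of SSTables could be sketched as below; the threshold values, the representation of an SSTable as an in-memory sorted map, and the merge policy are all assumptions of this sketch:

// Sketch: minor compaction when the memtable grows too large, plus a merging
// compaction that bounds how many SSTables a read has to consult.
#include <cstddef>
#include <map>
#include <string>
#include <vector>

using SortedRun = std::map<std::string, std::string>;  // stands in for memtable or an SSTable

struct Tablet {
  SortedRun memtable;
  std::vector<SortedRun> sstables;  // oldest first, newest last
};

const std::size_t kMemtableThreshold = 4096;  // illustrative; measured in entries, not bytes
const std::size_t kMaxSSTables = 8;           // illustrative bound on SSTable count

void MaybeCompact(Tablet& t) {
  // Minor compaction: freeze the memtable, write it out as a new SSTable,
  // and start a fresh memtable (the GFS write is elided here).
  if (t.memtable.size() >= kMemtableThreshold) {
    t.sstables.push_back(std::move(t.memtable));
    t.memtable.clear();
  }
  // Merging compaction: fold the two oldest SSTables into one so that reads
  // never have to merge an unbounded number of files.
  while (t.sstables.size() > kMaxSSTables) {
    SortedRun merged = std::move(t.sstables[0]);
    for (auto& [key, value] : t.sstables[1])
      merged.insert_or_assign(key, value);  // the newer run wins on duplicate keys
    t.sstables.erase(t.sstables.begin(), t.sstables.begin() + 2);
    t.sstables.insert(t.sstables.begin(), std::move(merged));
  }
}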
