0% found this document useful (0 votes)

26 views5 pages

Google App Engine and Google File System

Google App Engine (GAE) is a PaaS that enables developers to build and run web applications without managing servers, offering features like automatic scaling, load balancing, and persistent data storage. It supports languages like Java and Python and provides built-in APIs for various functionalities. Google File System (GFS) is a distributed file system designed for large data storage, featuring a master-chunk server architecture that ensures fault tolerance and high throughput.

Uploaded by

Pavithra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views5 pages

Google App Engine and Google File System

Uploaded by

Pavithra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

1.

Explain the basics of the Google App Engine (GAE) infrastructure programming
model.

Introduction:

Google App Engine (GAE) is a Platform as a Service (PaaS) provided by Google that
allows developers to build, deploy, and run web applications on Google’s infrastructure
without worrying about managing servers or hardware.

GAE offers a complete platform including computing power, data storage, security, and load
balancing.

Key Features of GAE:

1. Supports Programming Languages:

o Java and Python are mainly supported.

o Developers can use web frameworks like Django (Python) and Google Web
Toolkit (Java).
2. Automatic Scaling:

o GAE automatically adjusts resources like CPU and memory depending on

traffic.
o No need for manual scaling or managing servers.

3. Load Balancing:

o Distributes incoming traffic efficiently across multiple servers for high

performance.
4. Sandboxed Environment:

o Each app runs in a secure, isolated environment which increases security and
stability.

5. Persistent Data Storage:

o GAE uses BigTable (a NoSQL database) to store structured data.

o Blobstore is available for large file storage (up to 2 GB).

6. APIs and Services:

o Provides built-in APIs for:

▪ Sending emails
▪ Authenticating users via Google accounts
▪ Accessing images, URLs, etc.

7. Free and Pay-as-you-go Model:

o Free usage up to a quota.

o Charges apply only when you exceed the quota.

GAE Architecture:

Component Function

DataStore Stores data using BigTable with support for transactions.

Provides an environment to run Java/Python apps

Application Runtime
securely.

Admin Console Used to deploy, monitor, and manage applications easily.

Google Secure Data Connector

Provides secure access to private data from the cloud.
(SDC)

Allows developers to test apps locally before deploying

Local SDK
to the cloud.
Real-World Applications Built on GAE:

• Gmail

• Google Docs
• Google Maps

• Google Earth

• These apps are scalable and support millions of users globally.

Summary:

Google App Engine allows developers to focus on writing application logic while Google
handles everything else like infrastructure, scaling, and performance. It’s a powerful tool for
building reliable and scalable web applications easily.

2. Outline the architecture of Google File System (GFS).

Introduction:

Google File System (GFS) is a distributed file system created by Google to store and
manage huge amounts of data across many servers. It is mainly used for internal Google
applications like search indexing, Gmail, etc.

Key Design Goals of GFS:

• Handle very large files (hundreds of MB or GB).

• Be fault-tolerant (hardware failures are common).

• Support high throughput rather than low latency.

• Optimized for write-once, read-many usage patterns.

GFS Architecture:

GFS uses a Master–Chunk Server model:

Component Description

Master Controls the file system. Maintains metadata such as file names, chunk
Server locations, and namespace.
Component Description

Chunk Store actual file data in chunks (default size: 64 MB). Each chunk is
Servers replicated on multiple servers (usually 3).

Request file data from the master, then communicate directly with chunk
Clients
servers to read/write chunks.

Data Flow in GFS (Write Operation):

1. Client → Master: Client asks the master which chunk server holds the data and
where the replicas are.
2. Master Response: Master tells the client which server is the primary and the list of
secondaries.

3. Client → Replicas: Client sends the data to all replicas (primary + secondaries).
4. Client → Primary: Once all servers receive the data, the client sends a write
command to the primary server.

5. Primary → Secondaries: Primary assigns a serial number and forwards the

command.

6. All Confirm: Once all secondaries finish writing, they confirm back.
7. Primary → Client: Finally, the primary server informs the client that the write was
successful.

Key Features:
• Fault Tolerance:

o Every chunk is replicated (usually 3 times) across different servers/racks.

o Ensures data availability even if some servers fail.

• Efficient Data Management:

o Large block size (64 MB) helps reduce metadata size and speeds up sequential
data access.

• Master Server Role:

o Handles metadata and gives instructions.

o Doesn’t participate in actual data transfer, improving performance.

• Shadow Master:

o A backup copy of the master to ensure continuity during failures.

Real-Time Example:

Let’s say Google Search needs to index web pages:

• The data is stored in GFS as large files.

• GFS breaks them into chunks, stores them across different servers.

• If one server fails, GFS can still fetch data from its replicas.

Summary:

GFS provides a scalable, fault-tolerant, and high-performance storage system to support

Google’s massive data needs. Its architecture is simple but powerful—based on a central
master, chunk servers, and intelligent client communication.

Unit 4
No ratings yet
Unit 4
41 pages
Unit - 4-Cloud
No ratings yet
Unit - 4-Cloud
122 pages
Programming Environment For GAE
No ratings yet
Programming Environment For GAE
35 pages
GFS - Architecture M5 GFS - Architecture M5
No ratings yet
GFS - Architecture M5 GFS - Architecture M5
25 pages
TLW Assignment 3 27-Sep-2024 10-32-28
No ratings yet
TLW Assignment 3 27-Sep-2024 10-32-28
28 pages
CC
No ratings yet
CC
17 pages
Google App Engine Programming Guide
No ratings yet
Google App Engine Programming Guide
8 pages
UNIT-IV Notes
No ratings yet
UNIT-IV Notes
15 pages
CHPT 4 Ques
No ratings yet
CHPT 4 Ques
5 pages
Ccs335 CC Unit IV Cloud Computing Unit 4 Notes
No ratings yet
Ccs335 CC Unit IV Cloud Computing Unit 4 Notes
42 pages
CC Unit-IV
No ratings yet
CC Unit-IV
41 pages
Rapid Application Development and Short-Time To The Market Low Latency Scalability High Availability Consistent View of The Data
No ratings yet
Rapid Application Development and Short-Time To The Market Low Latency Scalability High Availability Consistent View of The Data
21 pages
Unit 4
No ratings yet
Unit 4
14 pages
Storage Systems
No ratings yet
Storage Systems
23 pages
Google Architecture Insights
No ratings yet
Google Architecture Insights
7 pages
The Google File System: Alexandru Costan
No ratings yet
The Google File System: Alexandru Costan
38 pages
Google App Engine
No ratings yet
Google App Engine
9 pages
Storage Systems
No ratings yet
Storage Systems
23 pages
Google Distributed Systems Overview
No ratings yet
Google Distributed Systems Overview
23 pages
UNIT5
No ratings yet
UNIT5
34 pages
Chapter 2 Google File System 250525 070947
No ratings yet
Chapter 2 Google File System 250525 070947
42 pages
Cloud Platforms: GAE, AWS, Azure
No ratings yet
Cloud Platforms: GAE, AWS, Azure
19 pages
Google File System Review 2016
No ratings yet
Google File System Review 2016
4 pages
15 Gfs
No ratings yet
15 Gfs
40 pages
Google App Enginee
No ratings yet
Google App Enginee
4 pages
Ccomputing Madurya
No ratings yet
Ccomputing Madurya
20 pages
Distributed File System Study
No ratings yet
Distributed File System Study
4 pages
An Overview of Google File System (GFS) - Medium
No ratings yet
An Overview of Google File System (GFS) - Medium
10 pages
Unit-5 Final
No ratings yet
Unit-5 Final
19 pages
Unit V Case Studies
No ratings yet
Unit V Case Studies
37 pages
Chapter 2 1712934164766
No ratings yet
Chapter 2 1712934164766
21 pages
Google Architecture Case Study
No ratings yet
Google Architecture Case Study
44 pages
What Is Distributed Data Processing?
No ratings yet
What Is Distributed Data Processing?
2 pages
GCP (Google Cloud Platform)
No ratings yet
GCP (Google Cloud Platform)
16 pages
Lecture 4.1 - Hadoop - MapReduce - Hbase
No ratings yet
Lecture 4.1 - Hadoop - MapReduce - Hbase
94 pages
CC Ques Bank Cloud Computing QB UNIT 4
No ratings yet
CC Ques Bank Cloud Computing QB UNIT 4
11 pages
Google Distributed System
No ratings yet
Google Distributed System
40 pages
Refer Slide Time: 00:15
No ratings yet
Refer Slide Time: 00:15
31 pages
Cloud Unit3
No ratings yet
Cloud Unit3
26 pages
Google File System Basics: Google World Wide Web Computers
No ratings yet
Google File System Basics: Google World Wide Web Computers
5 pages
ICS 408 Exam A
No ratings yet
ICS 408 Exam A
5 pages
Google App Engine
No ratings yet
Google App Engine
10 pages
Cloud
No ratings yet
Cloud
4 pages
HDFS & MapReduce Explained
No ratings yet
HDFS & MapReduce Explained
16 pages
5.3.1 Google App Engine
No ratings yet
5.3.1 Google App Engine
5 pages
5.cloud Computing Lecture
No ratings yet
5.cloud Computing Lecture
7 pages
Case Study: Google File System
No ratings yet
Case Study: Google File System
7 pages
Google File System
No ratings yet
Google File System
22 pages
Sodapdf
No ratings yet
Sodapdf
6 pages
System Design Interviews
No ratings yet
System Design Interviews
151 pages
CCS335-Cloud Computing: Unit IV Cloud Deployment Environment Topic: GAE
100% (1)
CCS335-Cloud Computing: Unit IV Cloud Deployment Environment Topic: GAE
13 pages
Cloud Storage Systems: Unit-Iii
No ratings yet
Cloud Storage Systems: Unit-Iii
40 pages
20 GFS BigTable
No ratings yet
20 GFS BigTable
36 pages
Cloud Computing Question Bank Unit IV and Unit V Updated
No ratings yet
Cloud Computing Question Bank Unit IV and Unit V Updated
25 pages
BDA Unit-1
No ratings yet
BDA Unit-1
19 pages
Google Architecture
No ratings yet
Google Architecture
9 pages
Google Casestudy
No ratings yet
Google Casestudy
33 pages
The Google File System Final
No ratings yet
The Google File System Final
20 pages
L1.4 - The World of Computing
No ratings yet
L1.4 - The World of Computing
34 pages
Simple-Setting Mikrotik
No ratings yet
Simple-Setting Mikrotik
11 pages
AWS PPT Attached With Course
No ratings yet
AWS PPT Attached With Course
215 pages
Azure Kubernetes Setup Guide
No ratings yet
Azure Kubernetes Setup Guide
9 pages
Cloud Firestore Is Firebase
No ratings yet
Cloud Firestore Is Firebase
6 pages
Proxy Kampret
No ratings yet
Proxy Kampret
4 pages
IoT Internship Report: AWS Cloud
No ratings yet
IoT Internship Report: AWS Cloud
34 pages
Communication: Distributed Systems Principles and Paradigms
No ratings yet
Communication: Distributed Systems Principles and Paradigms
60 pages
DS - Unit Wise Question Bank
No ratings yet
DS - Unit Wise Question Bank
2 pages
AWS Partner-AWS Cloud Practitioner Essentials-Student Guide
No ratings yet
AWS Partner-AWS Cloud Practitioner Essentials-Student Guide
434 pages
Microsoft Azure for IT Professionals
No ratings yet
Microsoft Azure for IT Professionals
23 pages
Cloud Services for IT Professionals
No ratings yet
Cloud Services for IT Professionals
49 pages
T4 Worksheet 4
No ratings yet
T4 Worksheet 4
4 pages
MCA Cloud Computing Exam 2017-18
No ratings yet
MCA Cloud Computing Exam 2017-18
1 page
Pic Favorite
No ratings yet
Pic Favorite
128 pages
SAA-C03 Unique Study Guide Summary
No ratings yet
SAA-C03 Unique Study Guide Summary
3 pages
AWS
No ratings yet
AWS
2 pages
Cloud Digital Leader Insights
No ratings yet
Cloud Digital Leader Insights
18 pages
Peserta PRO DTS-CKO REDHAT ?
No ratings yet
Peserta PRO DTS-CKO REDHAT ?
17 pages
Tosca White Modern Professional Thesis Defense Presentation
No ratings yet
Tosca White Modern Professional Thesis Defense Presentation
12 pages
AWS & Azure Cloud Services Quiz
No ratings yet
AWS & Azure Cloud Services Quiz
15 pages
AWS IaC for Developers & Architects
No ratings yet
AWS IaC for Developers & Architects
24 pages
Docker & Kubernetes CLI Guide
No ratings yet
Docker & Kubernetes CLI Guide
4 pages
CC
No ratings yet
CC
2 pages
325E6D
No ratings yet
325E6D
2 pages
ACTE Module 3 Resource
No ratings yet
ACTE Module 3 Resource
25 pages
AWS Cloud Services Overview
No ratings yet
AWS Cloud Services Overview
16 pages
CCL Exp 5
No ratings yet
CCL Exp 5
6 pages
Cloud Security
No ratings yet
Cloud Security
25 pages
Building Web Services With Abap and Sap Web Application Server
No ratings yet
Building Web Services With Abap and Sap Web Application Server
0 pages

Google App Engine and Google File System

Uploaded by

Google App Engine and Google File System

Uploaded by

1.

Key Features of GAE:

1. Supports Programming Languages:

o Java and Python are mainly supported.

o GAE automatically adjusts resources like CPU and memory depending on

o Distributes incoming traffic efficiently across multiple servers for high

5. Persistent Data Storage:

o GAE uses BigTable (a NoSQL database) to store structured data.

o Blobstore is available for large file storage (up to 2 GB).

6. APIs and Services:

o Provides built-in APIs for:

7. Free and Pay-as-you-go Model:

o Free usage up to a quota.

o Charges apply only when you exceed the quota.

DataStore Stores data using BigTable with support for transactions.

Provides an environment to run Java/Python apps

Admin Console Used to deploy, monitor, and manage applications easily.

Google Secure Data Connector

Allows developers to test apps locally before deploying

• These apps are scalable and support millions of users globally.

2. Outline the architecture of Google File System (GFS).

Key Design Goals of GFS:

• Handle very large files (hundreds of MB or GB).

• Be fault-tolerant (hardware failures are common).

• Support high throughput rather than low latency.

GFS uses a Master–Chunk Server model:

Data Flow in GFS (Write Operation):

5. Primary → Secondaries: Primary assigns a serial number and forwards the

o Every chunk is replicated (usually 3 times) across different servers/racks.

o Ensures data availability even if some servers fail.

• Efficient Data Management:

• Master Server Role:

o Doesn’t participate in actual data transfer, improving performance.

o A backup copy of the master to ensure continuity during failures.

Let’s say Google Search needs to index web pages:

• The data is stored in GFS as large files.

GFS provides a scalable, fault-tolerant, and high-performance storage system to support

You might also like