
Solution Guide: Collect, process, and store data in BigQuery
The Collect, process, and store data in BigQuery lab is a portion of the capstone project
that puts your data analysis skills to the test. The lab includes a set of tasks and challenges
that involve transforming data within a business scenario. Each task in the lab guides you to
apply the skills you learned throughout the course, focusing on data collection, processing,
and storage within the BigQuery environment. The lab also requires you to tackle two
challenges to assess your skills on your own: transform data and create a report.

This solution guide provides the results of each guided task in the lab so you can check them
against your own work. It also includes the solution query and results for the two challenges
so that you can evaluate your approach and identify potential areas for improvement.

Task 1: Get started with BigQuery


To complete this task, open the BigQuery environment. Select the project that matches the
Google Cloud project ID provided during login, and locate the fintech dataset in the Explorer
pane.

Task 2: Explore the Fintech data
To complete this task, expand the fintech dataset to view the customers and loans tables.
Then, click on each name to select the table and review the Details, Preview, and Schema tabs.
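
If you prefer to explore schemas with a query rather than by clicking through the tabs, BigQuery exposes table metadata through a dataset's INFORMATION_SCHEMA views. This sketch is optional and not part of the lab steps:

SQL
SELECT table_name, column_name, data_type, is_nullable
FROM fintech.INFORMATION_SCHEMA.COLUMNS
ORDER BY table_name, ordinal_position;

The results list every column in every table of the fintech dataset, which is a quick way to confirm what the Schema tab shows.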

Task 3: Import a CSV file and create a standard table


To complete this task, run the provided code in the Query Editor to import a CSV file from
Cloud Storage. Then, review the table using the Preview tab. A new table called state_region
will be added to the fintech dataset.
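
Use the code provided in the lab for this step. For context, a CSV import from Cloud Storage can be expressed with BigQuery's LOAD DATA statement; this is only a sketch, and the bucket URI below is a placeholder, not the lab's actual path:

SQL
LOAD DATA OVERWRITE fintech.state_region
FROM FILES (
  format = 'CSV',
  uris = ['gs://your-bucket/state_region.csv'],
  skip_leading_rows = 1
);

The OVERWRITE keyword replaces the table if it already exists, and skip_leading_rows = 1 skips the CSV header row.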

Task 4: Join data from two tables
To complete this task, run Query B in the Query Editor to join the two tables and review the
results in the Query results panel.
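
Query B is provided in the lab, so run it as written. As a rough illustration of the shape of such a join (the join key and column names below are assumptions for the sketch; the lab's query is authoritative):

SQL
SELECT l.loan_id, l.state, sr.region
FROM fintech.loan AS l
JOIN fintech.state_region AS sr
  ON l.state = sr.state;

A join like this enriches each loan row with its region by matching the state value against the state_region lookup table.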

Task 5: Create a table based on the results of a query using CTAS
To complete this task, run the provided query in the Query Editor to create a new table named
loan_with_region. The new table appears in the fintech dataset. Then, export the data to
Google Sheets to review the loan_with_region data.

Note: If the export to Google Sheets fails, an error will appear stating that Google Sheets will
not open. Try exporting the data again.
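
The lab provides the CTAS query for this step. For reference, it follows the same Create Table as Select pattern used in the challenges; the join key below is an assumption for the sketch:

SQL
CREATE TABLE fintech.loan_with_region AS
SELECT l.*, sr.region
FROM fintech.loan AS l
JOIN fintech.state_region AS sr
  ON l.state = sr.state;

CTAS creates the new table and populates it with the query results in a single statement, which is why loan_with_region appears in the fintech dataset as soon as the query finishes.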

Task 6: Work with nested data


To complete this task, use dot notation to query the purpose column, which is nested inside
the application record. The results of the query will be a table with two columns: loan_id and
purpose.
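
The dot notation described above looks like this (consistent with the application.purpose reference used in the Task 7 solution):

SQL
SELECT loan_id, application.purpose
FROM fintech.loan;

Here, application is a record (STRUCT) column, and application.purpose reaches into it to select the nested field as an ordinary column in the results.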

Task 7 challenge: Deduplicate
To complete this challenge, write a query to create a table named fintech.loan_purposes that
has a single column named purpose. The purpose column should include the unique results
found in the nested purpose column in the loan table of the fintech dataset.

Solution
You can use a Create Table as Select (CTAS) statement to create the table and dot notation
to select the purpose column that is nested in the application record.

Here’s the query:

SQL
CREATE TABLE fintech.loan_purposes AS
SELECT DISTINCT application.purpose
FROM fintech.loan;

1. Copy and paste the above query into the Query Editor.
2. Click Run.

As a result of this query, a new table named loan_purposes is added to the fintech dataset.
This table has one column that selects the distinct values from the purpose column within the
application record of the loan table.
Task 8 challenge: Answer business questions with a report
To complete this challenge, write a query to create a table called loan_count_by_year in the
fintech dataset that counts loans grouped by issue_year.

Solution
You can use a Create Table as Select (CTAS) statement to create the table and COUNT and
GROUP BY to count the loans and group them by issue_year.

Here’s the query:

SQL
CREATE TABLE fintech.loan_count_by_year AS
SELECT issue_year, COUNT(loan_id) AS loan_count
FROM fintech.loan
GROUP BY issue_year;

1. Copy and paste the above query into the Query Editor.
2. Click Run.

As a result of this query, a new table named loan_count_by_year is added to the fintech
dataset. This table has two columns: issue_year and loan_count. The loan_count column counts
the number of loans by issue year.

Resources for more information


Use these readings to support you as you work through the solution:
● SQL query terms reading available in course 1 module 1
● Guide to BigQuery reading available in course 2 module 1
