8000 GitHub - PacktPublishing/Scalable-Data-Architecture-with-Java: Scalable Data Architecture with Java, published by Packt
[go: up one dir, main page]

Skip to content

PacktPublishing/Scalable-Data-Architecture-with-Java

Repository files navigation

Packt Conference

3 Days, 20+ AI Experts, 25+ Workshops and Power Talks

Code: USD75OFF

Scalable Data Architecture with Java

Scalable Data Architecture with Java

This is the code repository for Scalable Data Architecture with Java, published by Packt.

Build efficient enterprise-grade data architecting solutions using Java

What is this book about?

Java architectural patterns and tools help architects to build reliable, scalable, and secure data engineering solutions that collect, manipulate, and publish data. This book will help you make the most of the architecting data solutions available with clear and actionable advice from an expert.

You’ll start with an overview of data architecture, exploring responsibilities of a Java data architect, and learning about various data formats, data storage, databases, and data application platforms as well as how to choose them. Next, you’ll understand how to architect a batch and real-time data processing pipeline. You’ll also get to grips with the various Java data processing patterns, before progressing to data security and governance. The later chapters will show you how to publish Data as a Service and how you can architect it. Finally, you’ll focus on how to evaluate and recommend an architecture by developing performance benchmarks, estimations, and various decision metrics.

By the end of this book, you’ll be able to successfully orchestrate data architecture solutions using Java and related technologies as well as to evaluate and present the most suitable solution to your clients.

This book covers the following exciting features:

  • Analyze and use the best data architecture patterns for problems
  • Understand when and how to choose Java tools correctly for a data architecture
  • Build batch and real-time data engineering solutions using Java
  • Discover how to apply security and governance to a solution
  • Measure performance, publish benchmarks, and optimize solutions
  • Evaluate, choose, and present the best architectural alternatives
  • Understand how to publish Data as a Service using GraphQL and a REST API

If you feel this book is for you, get your copy today!

https://www.packtpub.com/

Instructions and Navigations

All of the code is organized into folders.

The code will look like the following:

public interface Transformer<K, V, R> {
  void init(ProcessorContext var1);
  R transform(K var1, V var2);
  void close();
}

Following is what you need for this book: Data architects, aspiring data architects, Java developers and anyone who wants to develop or optimize scalable data architecture solutions using Java will find this book useful. A basic understanding of data architecture and Java programming is required to get the best from this book.

With the following software and hardware list you can run all code files present in the book (Chapter 1-12).

Software and Hardware List

Chapter Software required OS required
1-12 Java SDK 8 or 11 Windows, Mac OS, and Linux
1-12 Apache Maven 3.6 or above Windows, Mac OS, and Linux
1-12 IntelliJ IDEA Community Edition Windows, Mac OS, and Linux
1-12 Apache Spark 3.0 or above Windows, Mac OS, and Linux
1-12 AWS S3, Lambda, EMR, ECR, API Gateway AWS CLoud
1-12 Docker Desktop Windows, Mac OS, and Linux
1-12 Minikube v1.23.2 Windows, Mac OS, and Linux
1-12 PostgreSQL 14.0 Windows, Mac OS, and Linux
1-12 MongoDB Atlas AWS CLoud
1-12 Apache Kafka 2.8.2 Windows, Mac OS, and Linux
1-12 Apache NIFI 1.12.0 Windows, Mac OS, and Linux
1-12 DataHub Docker/Kubernetes
1-12 Postman Windows, Mac OS, and Linux
1-12 GraphQL Playground 1.8.10 Windows, Mac OS, and Linux
1-12 JMeter 5.5 Windows, Mac OS, and Linux

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.

Related products

Get to Know the Author(s)

Sinchan Banerjee is a principal data architect at UST Inc. He works for their client Anthem to architect, build, and deliver scalable, robust data engineering solutions to solve their business problems. Prior to his journey with UST, he worked for various Fortune 500 organizations, such as Amex, Optum, Impetus, and HP, designing, architecting, and building robust data engineering solutions for very high volumes of data. He is the lead author of a patent on storage capacity forecasting and is the co-author of multiple international publications. He is also a certified AWS Professional and a certified Java programmer. He has also been a recipient of multiple awards and accolades for exceptional technical contribution, leadership, and innovation.

Download a free PDF

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.

https://packt.link/free-ebook/9781801073080

About

Scalable Data Architecture with Java, published by Packt

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  
0