3 Days, 20+ AI Experts, 25+ Workshops and Power Talks
Code: USD75OFF
This is the code repository for Scalable Data Architecture with Java, published by Packt.
Build efficient enterprise-grade data architecting solutions using Java
Java architectural patterns and tools help architects to build reliable, scalable, and secure data engineering solutions that collect, manipulate, and publish data. This book will help you make the most of the architecting data solutions available with clear and actionable advice from an expert.
You’ll start with an overview of data architecture, exploring responsibilities of a Java data architect, and learning about various data formats, data storage, databases, and data application platforms as well as how to choose them. Next, you’ll understand how to architect a batch and real-time data processing pipeline. You’ll also get to grips with the various Java data processing patterns, before progressing to data security and governance. The later chapters will show you how to publish Data as a Service and how you can architect it. Finally, you’ll focus on how to evaluate and recommend an architecture by developing performance benchmarks, estimations, and various decision metrics.
By the end of this book, you’ll be able to successfully orchestrate data architecture solutions using Java and related technologies as well as to evaluate and present the most suitable solution to your clients.
This book covers the following exciting features:
- Analyze and use the best data architecture patterns for problems
- Understand when and how to choose Java tools correctly for a data architecture
- Build batch and real-time data engineering solutions using Java
- Discover how to apply security and governance to a solution
- Measure performance, publish benchmarks, and optimize solutions
- Evaluate, choose, and present the best architectural alternatives
- Understand how to publish Data as a Service using GraphQL and a REST API
If you feel this book is for you, get your copy today!
All of the code is organized into folders.
The code will look like the following:
public interface Transformer<K, V, R> {
void init(ProcessorContext var1);
R transform(K var1, V var2);
void close();
}
Following is what you need for this book: Data architects, aspiring data architects, Java developers and anyone who wants to develop or optimize scalable data architecture solutions using Java will find this book useful. A basic understanding of data architecture and Java programming is required to get the best from this book.
With the following software and hardware list you can run all code files present in the book (Chapter 1-12).
Chapter | Software required | OS required |
---|---|---|
1-12 | Java SDK 8 or 11 | Windows, Mac OS, and Linux |
1-12 | Apache Maven 3.6 or above | Windows, Mac OS, and Linux |
1-12 | IntelliJ IDEA Community Edition | Windows, Mac OS, and Linux |
1-12 | Apache Spark 3.0 or above | Windows, Mac OS, and Linux |
1-12 | AWS S3, Lambda, EMR, ECR, API Gateway | AWS CLoud |
1-12 | Docker Desktop | Windows, Mac OS, and Linux |
1-12 | Minikube v1.23.2 | Windows, Mac OS, and Linux |
1-12 | PostgreSQL 14.0 | Windows, Mac OS, and Linux |
1-12 | MongoDB Atlas | AWS CLoud |
1-12 | Apache Kafka 2.8.2 | Windows, Mac OS, and Linux |
1-12 | Apache NIFI 1.12.0 | Windows, Mac OS, and Linux |
1-12 | DataHub | Docker/Kubernetes |
1-12 | Postman | Windows, Mac OS, and Linux |
1-12 | GraphQL Playground 1.8.10 | Windows, Mac OS, and Linux |
1-12 | JMeter 5.5 | Windows, Mac OS, and Linux |
We also provide a PDF file that has color images of the screenshots/diagrams used in this book. Click here to download it.
Sinchan Banerjee is a principal data architect at UST Inc. He works for their client Anthem to architect, build, and deliver scalable, robust data engineering solutions to solve their business problems. Prior to his journey with UST, he worked for various Fortune 500 organizations, such as Amex, Optum, Impetus, and HP, designing, architecting, and building robust data engineering solutions for very high volumes of data. He is the lead author of a patent on storage capacity forecasting and is the co-author of multiple international publications. He is also a certified AWS Professional and a certified Java programmer. He has also been a recipient of multiple awards and accolades for exceptional technical contribution, leadership, and innovation.
If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.