DataStream Debezium
1 Integration with GCP Services Native 5 Open source, widely used in 3
microservices architectures
2 Data Source Compatibility
(SQL Server, Oracle, MySQL,
PostgreSQL)
3 NOSql and Complex data types?
(Nested objects, Arrays, Maps,
JSON, XML)
4 Change Detection Method
(log-based CDC, trigger-based
CDC, or polling-based CDC)
5 Data Volume and Velocity(the
speed at which data changes
occurs)
6 Latency
7 Real-Time vs. Batch
8 Data Consistency and Integrity
(updates, inserts, deletes, and
conflicts)
9 Fault Tolerance and
Reliability(Network , DB failure,
HA)
10 Security
11 Monitoring and Alerting
12 Data Recovery
13 Scalability
14 Cost
15 Data Validation
16 Disaster Recovery
(ex: restoring data from backups
or failover to a secondary GCP
region)
17 Maintenance and Support
18 Vendor Comparison
19 Schema Evolution / Backward
Compatibility
20 Support for Pubsub Sink
21 Support for GCS Sink
22 Referential Integrity(e.g., foreign
keys)
Data Retention and Cleanup( for
auditing or compliance)
Data Retention
Data Transformation
Authentication of source and
target
● Debezium
○ Architecture
■ Kafka Connect
■ Debezium server.
■ Embedded Engine
○ Connectors
■ Debezium connector for Cassandra
■ Debezium connector for Db2
■ Debezium connector for MongoDB
■ Debezium connector for MySQL
■ Debezium connector for Oracle Database
■ Debezium connector for PostgreSQL
■ Debezium connector for SQL Server
■ Debezium connector for Vitess