Aditya KunarCTAB-GAN: Effective Table Data SynthesizingBy Zilong Zhao, Aditya Kunar, Hiek Van der Scheer, Robert Birke & Lydia Y. Chen5 min read·Jun 7, 2021--1
Aditya KunarDTGAN: Differential Private Training for Tabular GANsBy Aditya Kunar, Robert Birke, Zilong Zhao & Lydia Y. Chen6 min read·Aug 2, 2021--
AmritKmrBig Data ETL with AWS GlueWhen it comes to loading Big Data from an S3 bucket to a DynamoDB Table, conventional lambda and/or python scripts will fail to deliver…10 min read·Aug 30--
Gaurav PatilinGlobantSeamless Data Processing: Spark Structured Streaming for AWS Kinesis Message StreamsHarnessing Spark Structured Streaming for efficient AWS Kinesis message processing and analysis9 min read·Jun 22--
Yusuf GaniyuinPython in Plain EnglishChange Data Capture (CDC) Realtime Streaming with Postgres, Debezium, Kafka, Apache Spark, and…In the dynamic world of data processing and analytics, Change Data Capture (CDC) stands out as a critical technology for real-time data…·8 min read·6 days ago--
Dogukan UluData Engineering End-to-End Project — Part 1 — Spark, Kafka, Elasticsearch, Kibana, MinIO, Docker…Repository8 min read·5 days ago--1
SirajSmall File, Large Impact — Addressing the Small File Issue in SparkIntroduction:4 min read·Jul 14--
Shuvradeep GhoshPySpark for AWS Glue: A Comprehensive Guide to Big Data ProcessingIntroduction10 min read·Aug 6--