Aditya KunarCTAB-GAN: Effective Table Data SynthesizingBy Zilong Zhao, Aditya Kunar, Hiek Van der Scheer, Robert Birke & Lydia Y. Chen5 min read·Jun 7, 2021--1
Aditya KunarDTGAN: Differential Private Training for Tabular GANsBy Aditya Kunar, Robert Birke, Zilong Zhao & Lydia Y. Chen6 min read·Aug 2, 2021--
Nicholas LeongHow I Built a Data Lakehouse With Delta Lake ArchitectureData Engineer Explains the Data Lakehouse Architecture·10 min read·Sep 18--4
Bruno PistoneMastering Big Data Processing: A Comprehensive Guide to use SageMaker Processing with Spark…Authors: Bruno Pistone, Gabrielle Dompreh16 min read·Mar 31--
Roshmita DeyMastering DataFrames in PySparkIn the world of big data processing and analysis, PySpark has emerged as a powerful and flexible framework. At the core of PySpark’s data…5 min read·Sep 3--
Felipe HoffainSnowflakeDiscover the new Snowpark ML Toolkit + dbt Python modelsLet’s do some feature engineering, training, and inference with Snowpark ML and the dbt Python models. First with with 50k rows and then…12 min read·Sep 20--1
TransUnion TechnologyModern Data Streaming ArchitectureIf you had to write software to identify all the red cars parked in a garage, it is a relatively simple (more traditional) software…6 min read·Jun 27--