Writing
Articles about AI, distributed computing, and data engineering.
Career
- Breaking Into Data and AIJan 2026
AI
Drata
Fugue
- Large Scale Image Processing with SparkJan 2023
- Data Quality with whylogs and FugueOct 2022
- Why SQL-Like Interfaces are Sub-optimal for Distributed ComputingAug 2022
- Why Pandas-like Interfaces are Sub-optimal for Distributed ComputingJun 2022
- Introducing FugueFeb 2022
- Scaling PyCaret with Spark (or Dask)Jan 2022
- Delivering Spark Projects Faster and CheaperNov 2021
- Using FugueSQL on Spark DataFramesNov 2021
- Porting Pandas Code to SparkAug 2021
- Data Validation with Pandera and FugueMay 2021
Prefect
- Docker Without the HassleSep 2021
- Interoperable Python and SQLApr 2021