Bauplan blog
Engineering
Jan 13, 2025
Written by Nathan LeClaire
Lessons learned crafting a Serverless Lakehouse from spare parts
Read more
Reference
Dec 20, 2024
Written by Ciro Greco
Full-stack recommender system with Bauplan for data preparation and training, and MongoDB Atlas for real-time inference.
Read more
Research
Dec 2, 2024
Written by Jacopo Tagliabue, Tyler Caraza-Harter and Ciro Greco
Paper presented at WoSC10 2024. In collaboration with The University of Wisconsin.
Read more
Engineering
Nov 22, 2024
Written by Nathan LeClaire and Ciro Greco
Making the experience of running data workflow in the cloud indistinguishable from doing it locally.
Read more
Engineering
Nov 20, 2024
Written by Luca Bigon and Jacopo Tagliabue
DAG planning using an in-memory graph database. In collaboration with Kùzu
Read more
Research
Nov 12, 2024
Written by Jacopo Tagliabue, Ryan Curtin, Ciro Greco
Paper presented at DEMAI@IEEE Big Data 2024.
Read more
Reference
Oct 21, 2024
Written by Jacopo Tagliabue and Chris White (CTO, Prefect)
A reference implementation to implement a Write-Audit-Publish (WAP) pattern with Bauplan and Prefect 3.0.
Read more
Engineering
Sep 18, 2024
Written by Ciro Greco
Find the right balance between cost control and fast startup time for your Spark clusters.
Read more
Open Source
Apr 11, 2024
Written by Ciro Greco
An open source implementation of WAP using Apache Iceberg, Lambdas, and Nessie all running entirely Python.
Read more
Research
Jun 9, 2024
Written by Jacopo Tagliabue and Ciro Greco
Paper presented at SIGMOD/PODS 2024. Awarded best paper DEEM@SIGMOD.
Read more
Engineering
Mar 1, 2024
Written by Ciro Greco
Working on production data is the only way to know whether our applications will work.
Read more
Engineering
Nov 27, 2023
Written by Ciro Greco
Why production cloud environment are too slow and hard to develop in them.
Read more
Engineering
Sep 6, 2023
Written by Ciro Greco and Jacopo Tagliabue
The greatest invention since sliced Virtual Machines.
Read more
Research
Aug 10, 2023
Written by Jacobo Tagliabue, Ciro Greco, and Luca Bigon
Paper presented at VLDB 2023.
Read more
Open Source
Jun 4, 2023
Written by Ciro Greco
An open-source implementation of a Data Lake with DuckDB and AWS Lambdas.
Read more