bauplan
Built with Bauplan
Docs
Blog
Watch a Demo
Try Bauplan
See what you can build when infrastructure becomes Python code.
Analyze PDFs using Bauplan for data preparation and OpenAI’s GPT for text analysis.
PDF
Open AI
Pandas
Patrick Chia
Founding Eng
@atelier-atelico
Orchestrated Write-Audit-Publish pattern for ingesting parquet files to Iceberg tables.
prefect
pandas
iceberg
Chris White
CTO @Prefect
@prefect
Build near real-time analytics pipeline with WAP pattern and visualize metrics with Streamlit.
duckdb
streamlit
Sam Jafari
Dir. Data and AI
@lucid
Convert PDFs into structured, analyzable tables using LLMs.
OpenAI
Streamlit
Jacopo
CTO @bauplan
@bauplan
End-to-end ML pipeline for predicting taxi trip tips with Streamlit data viz.
Scikit-Learn
Notebook
Christine Yu
ML Tech Lead
@sama
Embedding-based recommender system for music playlists.
mongoDB
Vectors
Recs
Ciro
CEO @bauplan
End-to-end entity matching for e-commerce, using OpenAI’s off-the-shelf LLM APIs for accurate matching.
LLM
DuckDB
Nate
End-to-end data engineering repo using Mage & the medallion architecture.
Medallion
Mage
Polars
George Zefkilis
Data Eng
@Novo Nordisk
Build a RAG system with Pinecone and OpenAI over StackOverflow data.
RAG
Pinecone
Serverless data product with built-in quality checks using Lambda and Bauplan.
Dataprod
Lamba
Quality
Andrea Gioia
CTO @Quantyca
Quantyca
A scalable pipeline turning high-volume NetFlow traffic into AI-ready features.
NetFlow
Data Quality
Claude
Marco Graziano
CEO @GrazianoLabs
Graziano Labs
Implement data quality checks using expectations.
PyArrow
Build an interactive dashboard to visualize taxi pickup locations in NYC.