PuffinDB includes a DuckDB Extension implementing the following features:
- Distribute queries across thousands of serverless functions and a Monostore
- Read from and write to hundreds of applications using any Airbyte connector
- Collaborate on the same Iceberg tables with other users
- Write back to an Iceberg table with ACID transactional integrity
- Execute cross-database joins (Cf. Edge-Driven Data Integration)
- Translate between 19 SQL dialects
- Invoke remote query generators
- Invoke curl commands
- Execute incremental and observable data pipelines
- Turn DuckDB into a next-generation vector database
- Support the Lance file format for 100× faster random access
- Accelerate and | or schedule the downloading of large tables to your client
- Cache tables and run computations at the edge (Amazon CloudFront × Lambda@Edge)
- Log queries on your data lake
The PuffinDB extension allows the invocation of curl directly from DuckDB, using the following syntax:
SELECT * FROM curl(url="https://myURL", content-type="text/jsonl");