Expose issues in Parquet format via pipeline API #106

Open
Ben-Hodgkiss opened this issue Oct 18, 2024 · 0 comments
Ben-Hodgkiss commented Oct 18, 2024

Overview

Expose issues in Parquet format via new pipeline API

Background
Following the design proposal for an internal API, we would like to validate some of the proposed technology choices, including the use of FastAPI with DuckDB reading Parquet files on S3.

This work was identified during the spike on API design.

By exposing the issues in Parquet format via an internal API, we will learn whether Parquet is an appropriate format for consumption via an internal API as well as via Datasette.

Builds on work to be delivered via https://trello.com/c/NZevJ4wU/3581-spike-prove-fast-api-with-duckdb-and-parquet-on-s3 .

Tech Approach

  • Build on the FastAPI implementation of the Pipeline API by adding unit and integration tests
  • Add a GitHub Actions (GHA) CI workflow to publish the Docker image
  • Provision AWS resources to run the Pipeline API in dev, staging and prod
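
As a rough illustration of the approach being proven, the following is a minimal sketch of a FastAPI endpoint that uses DuckDB to read issues from Parquet on S3. It is not the pipeline-api implementation: the bucket path, the `/issues` route, the `dataset` filter column and the helper names are all assumptions made for the example, and S3 credential configuration is left out.

```python
from typing import Optional

# Hypothetical location of the issues Parquet files; the real bucket and
# key layout are not stated in this ticket.
ISSUES_PARQUET_GLOB = "s3://example-bucket/issues/*.parquet"


def build_issues_query(
    dataset: Optional[str] = None,
    limit: int = 100,
    source: str = ISSUES_PARQUET_GLOB,
) -> tuple[str, list]:
    """Return a parameterised DuckDB query and its parameter list.

    The `dataset` column is an assumed column name used for illustration.
    """
    sql = "SELECT * FROM read_parquet(?)"
    params: list = [source]
    if dataset is not None:
        sql += " WHERE dataset = ?"
        params.append(dataset)
    sql += " LIMIT ?"
    params.append(limit)
    return sql, params


def create_app():
    """Build the FastAPI app (requires the fastapi and duckdb packages)."""
    # Imported lazily so the query helper above can be unit tested
    # without the web stack installed.
    import duckdb
    from fastapi import FastAPI, Query

    app = FastAPI(title="Pipeline API (sketch)")

    @app.get("/issues")
    def list_issues(
        dataset: Optional[str] = None,
        limit: int = Query(100, ge=1, le=1000),
    ):
        sql, params = build_issues_query(dataset, limit)
        con = duckdb.connect()  # in-memory database, one per request
        try:
            # httpfs provides s3:// support; credentials are assumed to be
            # configured separately and are omitted from this sketch.
            con.execute("INSTALL httpfs")
            con.execute("LOAD httpfs")
            cur = con.execute(sql, params)
            columns = [col[0] for col in cur.description]
            return [dict(zip(columns, row)) for row in cur.fetchall()]
        finally:
            con.close()

    return app
```

Keeping the query construction in a plain function means it can be covered by unit tests without S3 access, while integration tests could point `source` at a local Parquet fixture.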

Acceptance Criteria/Tests

  • Code merged within pipeline-api repo
  • Pipeline API exposing issues in Parquet format deployed to production environment
  • Pipeline API integrated with Sentry

Ticket Management - DELETE this section once completed

  • Complete all relevant tags - make sure Infrastructure is tagged so it is picked up by our filters!
  • Complete the time estimate field
  • Make sure you have a PR link in the Overview above.
  • If relevant, link to the relevant OKR as an attachment.
  • Link to any tickets in other boards that are dependent on it.

Ben-Hodgkiss converted this from a draft issue Oct 18, 2024
Ben-Hodgkiss moved this from Refine, Prioritise & Plan to Backlog in Infrastructure Oct 22, 2024
Ben-Hodgkiss moved this from Backlog to Refine, Prioritise & Plan in Infrastructure Nov 18, 2024
Projects
Status: Refine, Prioritise & Plan