-
Notifications
You must be signed in to change notification settings - Fork 906
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Stress Testing] - Create example projects to assess Kedro performance for complex pipelines #3866
Milestone
Comments
merelcht
changed the title
Assess Kedro performance for complex pipelines
Create example projects to assess Kedro performance for complex pipelines
Jun 17, 2024
7 tasks
noklam
changed the title
Create example projects to assess Kedro performance for complex pipelines
[Stress Testing] - Create example projects to assess Kedro performance for complex pipelines
Aug 29, 2024
Since this is in sprint already, I update the title so this will be the ticket for pipeline stress testing. I will create separate tickets for the component part. |
1 task
7 tasks
7 tasks
This was
linked to
pull requests
Oct 17, 2024
7 tasks
merelcht
modified the milestones:
Improve Developer Experience,
Improve performance of Kedro
Oct 18, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Description
#3957 needs to be done first as pre-work.
There are several features across the Kedro organisation that could benefit from manual testing on large projects to evaluate performance. The proposal is to create several kedro projects of varying size that can be used to test and experiment with.
The example projects don't have to be "legit" as in tackling real data science problems. It can be toy examples. The main point here is creating projects with scale.
This could be particularly useful for testing Viz features, CC @rashidakanchwala, @NeroOkwa
Context
The Kedro-Viz team carried out a performance analysis using an internal QB pipeline, with preliminary results shown here kedro-org/kedro-viz#1064
(from kedro-org/kedro-viz#1064 (comment), summary of internal report).
There is preliminary evidence that the Kedro Framework CLI is a bottleneck for Kedro Viz.
This is on top of the already existing evidence that Kedro takes a lot of time to load even for trivial commands or almost empty projects #1476
We noted that there are several factors that make a pipeline "complex":
In I expanded on @AhdraMeraliQB's original proposal and suggested that we create a family of pipelines, comprising
Comes from #3790
Originally posted by AhdraMeraliQB January 6, 2024
The text was updated successfully, but these errors were encountered: