
Refactor benchmarks #12

Closed · wants to merge 15 commits

Conversation

oruebel (Contributor) commented Jan 27, 2024

Refactor the benchmarks to separate the definition of test cases from the specific metrics. The idea is to define the test case in a base class, while the actual benchmark class defines the metric to be computed for that test case. The goal is to make it easy to run the same test with different metrics without having to repeat the implementation of the test case.
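A minimal sketch of the kind of split being described, assuming an asv-style benchmark suite where methods prefixed with time_/peakmem_ define the metric; all class names, method names, and the URL below are illustrative, not code from this PR:

```python
# Illustrative sketch only; names are hypothetical, not taken from this PR.

class FileReadTestCase:
    """Base class: defines *what* is exercised, independent of any metric."""

    s3_url = "https://example-bucket.s3.amazonaws.com/example.nwb"  # hypothetical URL

    def setup(self):
        # Open the remote file once per benchmark invocation.
        self.file = self.open_file(self.s3_url)

    def open_file(self, url):
        # The read protocol under test; defined by a subclass or the base itself.
        raise NotImplementedError

    def operation(self):
        # The test case proper, e.g. reading a slice of an ElectricalSeries.
        raise NotImplementedError


class TimeFileRead(FileReadTestCase):
    """Benchmark class: defines *which metric* is computed for the test case."""

    def time_operation(self):
        self.operation()


class PeakMemFileRead(FileReadTestCase):
    """Same test case, different metric: peak memory instead of wall-clock time."""

    def peakmem_operation(self):
        self.operation()
```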

oruebel (Contributor, Author) commented Jan 27, 2024

@CodyCBakerPhD what do you think about a design like this for the tests? The main idea is that I would like to be able to compute multiple metrics for the same test case, but I would like to avoid having to copy-paste the same case multiple times just to change the metric.

CodyCBakerPhD (Collaborator) commented

Overall definitely a good idea; left some suggestions for improving the hierarchy

We could even go down the road of having 'mixin' tests if we want to define common types that ought to apply across mix-and-matchable setups.
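A hypothetical sketch of what such mixin tests might look like: one mixin supplies the setup (the protocol under test), another supplies the shared operation, and the concrete benchmark mixes them and picks the metric. Every name and the URL below are assumptions made for illustration only:

```python
# Hypothetical mixin-style layout; all names below are illustrative, not from this PR.
import fsspec


class FsspecSetupMixin:
    """Setup mixin: defines the read protocol under test."""

    def setup(self):
        # self.s3_url is expected to be defined on the concrete benchmark class.
        self.byte_stream = fsspec.open(self.s3_url, mode="rb").open()


class ReadAllBytesMixin:
    """Operation mixin: defines a common test case shared across protocols."""

    def read_all_bytes(self):
        return self.byte_stream.read()


class TimeFullFileReadFsspec(FsspecSetupMixin, ReadAllBytesMixin):
    """Concrete benchmark: mix and match a setup with an operation, then pick a metric."""

    s3_url = "https://example-bucket.s3.amazonaws.com/example.nwb"  # hypothetical URL

    def time_read_all_bytes(self):
        self.read_all_bytes()
```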

oruebel (Contributor, Author) commented Jan 27, 2024

left some suggestions for improving the hierarchy

Thanks @CodyCBakerPhD for the helpful suggestions. I made some further changes to implement what you had suggested. Can you please take another look?

CodyCBakerPhD (Collaborator) commented

@oruebel Thanks for working with me on this; I've opened #19, #20, and #21 in draft mode as demonstrations of alternative strategies

We should really put some thorough thought into what will make it easiest for us to both (a) write tests quickly (as in, now, during development) without, as you say, copying/pasting too much, and (b) read them (in the far future) to quickly remember and understand what is being set up and tested in each case.

The earlier a refactor is decided on and put in place, the less work it will be compared to fixing it down the line.

Too much implicit structure and redirection can hamper (b), but we need some amount of it to speed up (a) and keep the main benchmarks relatively clean, so a balance must be struck somehow.

For example, on this PR, if we wanted to answer the question 'what is FileReadStreaming testing?'

We can see quickly that it runs 3 time tests that redirect to class methods. We follow the parent to the streaming base to discover the method definitions in each case, so this one is not too difficult to go back and read.

However, if we wanted to answer the question 'what is ElectricalSeriesStreamingFsspecBase testing?'

We can see that a single time test is being run, and that it performs a slice request. But this time we have to follow the parent to find the setup definition to know/check that the protocol is indeed fsspec-based, and then follow its parent to find the slice request definition (note that none of this is actually a mixin, strictly speaking, just pure inheritance).

Also note that in both cases, for development, if we wanted to make a slight change to the fsspec read protocol, we'd have to remember to modify it in both the FileReadStreamingBase and the ElectricalSeriesStreamingFsspecBase to ensure a fair comparison and synchrony between the two cases; this could easily be missed because the read method isn't entirely centralized.
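A condensed, hypothetical illustration of the reading chain described above, reusing the class names mentioned in the comment; the method bodies, slice bounds, acquisition name, and URL are assumptions rather than code from this PR:

```python
# To understand the concrete benchmark, a reader has to walk up two levels of
# pure inheritance; the protocol is only revealed in the parent's setup.
import fsspec
import h5py
from pynwb import NWBHDF5IO


class ElectricalSeriesStreamingBase:
    """Grandparent: defines the slice request, but says nothing about the protocol."""

    def slice_request(self):
        return self.electrical_series.data[:30_000, :384]  # hypothetical slice bounds


class ElectricalSeriesStreamingFsspecBase(ElectricalSeriesStreamingBase):
    """Parent: the setup is where the fsspec protocol is finally revealed."""

    def setup(self):
        byte_stream = fsspec.filesystem("https").open(self.s3_url, "rb")
        self.io = NWBHDF5IO(file=h5py.File(byte_stream), load_namespaces=True)
        self.electrical_series = self.io.read().acquisition["ElectricalSeries"]


class TimeElectricalSeriesStreamingFsspec(ElectricalSeriesStreamingFsspecBase):
    """Concrete benchmark: a single time test that redirects up two levels."""

    s3_url = "https://example-bucket.s3.amazonaws.com/example.nwb"  # hypothetical URL

    def time_slice_request(self):
        self.slice_request()
```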

As an aside, I realized the context method is also going to track the __exit__ conditions and the ensuing garbage collection.
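A small hypothetical sketch of that aside: when a timed method wraps the work in a with block, the measurement includes whatever __exit__ does, not only the read itself. The class, method, and URL here are illustrative:

```python
import fsspec


class TimeContextManagedRead:
    s3_url = "https://example-bucket.s3.amazonaws.com/example.nwb"  # hypothetical URL

    def time_read_within_context(self):
        with fsspec.open(self.s3_url, mode="rb") as byte_stream:
            byte_stream.read(1024 * 1024)  # the work we intend to measure
        # The timer stops only after __exit__ has closed the stream, so that
        # teardown cost (and any cleanup it triggers) is part of the reported time.
```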

oruebel (Contributor, Author) commented Jan 29, 2024

Thanks for working with me on this; I've opened #19, #20, and #21 in draft mode as demonstrations of alternative strategies

We should really put some thorough thought into what will make it easiest for us to both (a) write tests quickly (as in, now, during development) without as you say, copying/pasting too much, as well as (b) for us to read (in the far future) to quickly remember and understand what is being setup and tested in each case

I totally agree. I appreciate that you are taking such a thoughtful approach to this. I think it would be easiest for us to review the different proposed variants via Zoom so that we can decide on a strategy.

CodyCBakerPhD (Collaborator) commented

replaced by #21

@CodyCBakerPhD CodyCBakerPhD deleted the refactor/benchmarks branch February 21, 2024 20:54