Add Conversion Progress Bars #778

Merged
merged 36 commits into main from hdmf-progress-bars
Jun 2, 2024
Conversation

garrettmflynn
Member

This PR implements the basic skeleton for parallel conversion progress bars, specifically a parallelized global bar for conversions.

Pretty sure this still requires updates to NeuroConv to allow for within-file updates. Specifically, we need to ingest the following conversion options:
[Screenshot taken 2024-05-13 at 4:19 PM showing the conversion options in question]

@garrettmflynn
Member Author

@CodyCBakerPhD Actually I see now in #774 that you don't think we need these options.

I'm getting this error with the above code enabled:

jsonschema.exceptions.ValidationError: Additional properties are not allowed ('iterator_opts' was unexpected)

Failed validating 'additionalProperties' in schema['properties']['Phy Sorting']:
    {'additionalProperties': False,
     'properties': {'stub_test': {'default': False,
                                  'description': 'If True, will truncate '
                                                 'the data to run the '
                                                 'conversion faster and '
                                                 'take up less memory.',
                                  'type': 'boolean'},
                    'units_description': {'default': 'Autogenerated by '
                                                     'neuroconv.',
                                          'type': 'string'},
                    'units_name': {'default': 'units',
                                   'description': 'The name of the units '
                                                  'table. If '
                                                  "write_as=='units', then "
                                                  'units_name must also be '
                                                  "'units'.",
                                   'type': 'string'},
                    'write_as': {'default': 'units',
                                 'description': 'How to save the units '
                                                'table in the nwb file. '
                                                'Options:\n'
                                                "- 'units' will save it to "
                                                'the official '
                                                'NWBFile.Units position; '
                                                'recommended only for the '
                                                'final form of the data.\n'
                                                "- 'processing' will save "
                                                'it to the processing '
                                                'module to serve as a '
                                                'historical provenance for '
                                                'the official table.',
                                 'enum': ['units', 'processing']},
                    'write_ecephys_metadata': {'default': False,
                                               'description': 'Write '
                                                              'electrode '
                                                              'information '
                                                              'contained '
                                                              'in the '
                                                              'metadata.',
                                               'type': 'boolean'}},
     'required': [],
     'type': 'object'}

On instance['Phy Sorting']:
    {'iterator_opts': {'display_progress': True,
                       'progress_bar_class': <class 'tqdm_publisher._subscriber.TQDMProgressSubscriber'>,
                       'progress_bar_options': {'mininterval': 0,
                                                'on_progress_update': <function convert_to_nwb.<locals>.update_conversion_progress at 0x125b33ca0>}},
     'stub_test': True}

During handling of the above exception, another exception occurred:

"""
Traceback (most recent call last):
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/concurrent/futures/process.py", line 211, in _sendback_result
    result_queue.put(_ResultItem(work_id, result=result,
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/multiprocessing/queues.py", line 371, in put
    obj = _ForkingPickler.dumps(obj)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/multiprocessing/reduction.py", line 51, in dumps
    cls(buf, protocol).dump(obj)
AttributeError: Can't pickle local object 'convert_to_nwb.<locals>.update_conversion_progress'
"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/Users/garrettflynn/Documents/GitHub/nwb-guide/pyflask/apis/neuroconv.py", line 117, in post
    return convert_all_to_nwb(url, **neuroconv_api.payload)
  File "/Users/garrettflynn/Documents/GitHub/nwb-guide/pyflask/manageNeuroconv/manage_neuroconv.py", line 858, in convert_all_to_nwb
    output_filepath = future.result()
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/concurrent/futures/_base.py", line 439, in result
    return self.__get_result()
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/concurrent/futures/_base.py", line 391, in __get_result
    raise self._exception
AttributeError: Can't pickle local object 'convert_to_nwb.<locals>.update_conversion_progress'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask/app.py", line 1484, in full_dispatch_request
    rv = self.dispatch_request()
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask/app.py", line 1469, in dispatch_request
    return self.ensure_sync(self.view_functions[rule.endpoint])(**view_args)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask_restx/api.py", line 404, in wrapper
    resp = resource(*args, **kwargs)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask/views.py", line 109, in view
    return current_app.ensure_sync(self.dispatch_request)(**kwargs)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask_restx/resource.py", line 46, in dispatch_request
    resp = meth(*args, **kwargs)
  File "/Users/garrettflynn/Documents/GitHub/nwb-guide/pyflask/apis/neuroconv.py", line 123, in post
    neuroconv_api.abort(500, str(exception))
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask_restx/namespace.py", line 153, in abort
    abort(*args, **kwargs)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask_restx/errors.py", line 28, in abort
    flask.abort(code)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/flask/helpers.py", line 277, in abort
    current_app.aborter(code, *args, **kwargs)
  File "/Users/garrettflynn/miniconda3/envs/nwb-guide/lib/python3.9/site-packages/werkzeug/exceptions.py", line 863, in __call__
    raise self.mapping[code](*args, **kwargs)
werkzeug.exceptions.InternalServerError: 500 Internal Server Error: The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.
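
The root cause in the remote traceback is that `ProcessPoolExecutor` must pickle everything crossing the process boundary, and a function defined inside another function has no importable qualified name. A minimal reproduction plus the usual fix (a module-level callback, optionally bound with `functools.partial`); the names here are illustrative, not the PR's actual code:

```python
import pickle
from functools import partial

def update_conversion_progress(request_id, progress):
    """Module-level callback: resolvable by qualified name, so it pickles."""
    pass

def make_local_callback():
    def update_conversion_progress(progress):  # defined in a local scope
        pass
    return update_conversion_progress

# A partial over a module-level function crosses process boundaries fine.
pickle.dumps(partial(update_conversion_progress, "abc123"))

# A local closure raises the same AttributeError seen in the traceback above.
try:
    pickle.dumps(make_local_callback())
except AttributeError as err:
    print(err)  # Can't pickle local object 'make_local_callback.<locals>.update_conversion_progress'
```

The practical takeaway is that any progress callback handed to a worker process has to live at module level (with per-request state bound via `partial` or passed as an argument), rather than being closed over inside `convert_to_nwb`.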

Base automatically changed from inspect-progress-bars to main May 14, 2024 18:15
Comment on lines 722 to 724
# "iterator_opts": dict( display_progress=True, progress_bar_class=TQDMProgressSubscriber, progress_bar_options=progress_bar_options )
}
if available_options.get("properties").get(interface).get("properties", {}).get("stub_test")
Collaborator

I would not bother displaying the per file progress when running in stub mode - just the total bar should be fine (since individual files should be instantaneous)

This feature is just for non-stub full conversions with many files
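
In option-building terms, the suggestion amounts to attaching the per-file progress machinery only for full conversions. A hypothetical sketch (the function and key names are illustrative, not NeuroConv's API):

```python
def build_interface_options(base_options: dict, stub_test: bool) -> dict:
    """Attach per-file progress reporting only for full (non-stub) runs,
    since stub conversions finish too quickly for a sub-bar to matter."""
    options = dict(base_options, stub_test=stub_test)
    if not stub_test:
        # In the PR this would carry TQDMProgressSubscriber and its update
        # callback; shown here as a plain flag for illustration.
        options["iterator_opts"] = {"display_progress": True}
    return options

print(build_interface_options({}, stub_test=True))  # → {'stub_test': True}
```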

@CodyCBakerPhD
Collaborator

> @CodyCBakerPhD Actually I see now in #774 that you don't think we need these options.
>
> I'm getting this error with the above code enabled:

This only applies to recording and imaging interfaces; sorting, such as Phy, do not currently support it at all since they don't buffer at all (source data is small enough to load entirely to RAM)
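
That distinction could be expressed as a simple guard when deciding which interfaces receive iterator options. A hypothetical sketch (the suffix convention and helper name are illustrative, based on the comment above rather than NeuroConv's actual API):

```python
# Only buffered interfaces (recording, imaging) iterate over chunks and
# accept iterator_opts; sorting interfaces load source data fully into RAM.
BUFFERED_SUFFIXES = ("RecordingInterface", "ImagingInterface")

def supports_iterator_opts(interface_name: str) -> bool:
    """Return True if the interface buffers its data and can report
    within-file progress; sorting interfaces (e.g. Phy) cannot."""
    return interface_name.endswith(BUFFERED_SUFFIXES)

print(supports_iterator_opts("PhySortingInterface"))        # → False
print(supports_iterator_opts("SpikeGLXRecordingInterface"))  # → True
```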

@garrettmflynn
Member Author

Using all the open PRs together, this is working!

[Screenshot taken 2024-05-15 at 11:52 AM showing the working progress bars]

For some reason, there's a small delay between the completion of each file and the registration of their success on the global bar.

@garrettmflynn
Member Author

This behavior may be related to the fact that the raw TQDM updates indicate n=2 with total=1 for the final update, which still comes slightly before registration globally. Not sure why that mismatch between n and total exists.

@CodyCBakerPhD
Collaborator

> For some reason, there's a small delay between the completion of each file and the registration of their success on the global bar.

That might just come down to an edge case on the size of this (small) data

I can try with some larger files

@CodyCBakerPhD
Collaborator

@garrettmflynn it might be a while until the entire stack comes into sync with this; can you splinter out the logging improvement to a separate PR?

Also, would it be possible to limit the number of error messages displayed at a time so the screen doesn't get flooded?

@CodyCBakerPhD
Collaborator

@garrettmflynn Any idea what's up with the ubuntu pipelines here? I reran them 3 times

@garrettmflynn
Member Author

Looks like we saw this same error in #676 (comment).

Last time, we were able to track this down by looking at the artifacts. I'll do this now.

@garrettmflynn
Member Author

At least for the latest run, it looks like progression through one of the pipelines (previously an earlier pipeline, this time CellExplorer) freezes, and the Puppeteer instance decides it's been too long after 5 min.

Looking at the screenshots, everything looks good. The only weird part is that there aren't any after CellExplorer File Metadata, since that's where the stall occurs.

@CodyCBakerPhD
Collaborator

And it's happening on all of the CI now?

@garrettmflynn
Member Author

My bad, looks like I didn't wait for the final screenshot to finish before closing Puppeteer. Had to significantly refactor the iteration through all the pipeline pages so a screenshot could be injected after each, and it looks like this slipped through.

@garrettmflynn
Member Author

So I'm still not sure why this is happening, but I can at least tell you what is happening:

At a seemingly random pipeline, the tests get stuck on the stub conversion despite it finishing 3s after the page transition is triggered:
[Screenshots: metadata, metadata-after-0, metadata-after-2]

This screenshot doesn't change for another 2+ minutes until the Ubuntu test fails. This doesn't happen on Windows or Mac tests.

@CodyCBakerPhD
Collaborator

Just tried this out locally on Windows using the multi-session tutorial dataset that was modified (by changing line https://github.com/NeurodataWithoutBorders/nwb-guide/blob/main/src/pyflask/manageNeuroconv/manage_neuroconv.py#L1347 from 3.0 to 100.0)

Since the number of jobs is not yet exposed to the frontend (so no parallel writing of multiple files), I would have expected only one extra bar below the main one (counting how many files remain), with the second bar tracking progress on writing the 'long' recording for each file.

However, I did not see any extra bar displayed

@garrettmflynn
Member Author

Was this a full or stub conversion? Just tested on my M2 and I'm able to observe sub-bars for each file, though this is disabled for stubs.

@garrettmflynn
Member Author

While the option isn't exposed on the UI, we are telling the backend to use 2 jobs at the moment, so you should see four sub-bars populate in groups of two.

Though to clarify, you're expecting to see one sub-bar if the n_jobs is 1?

@CodyCBakerPhD CodyCBakerPhD marked this pull request as ready for review June 2, 2024 20:51
@CodyCBakerPhD
Collaborator

It was a full conversion; I don't expect sub bars for stub mode (though I would expect parallelization)

Odd, I just tried with my M1 mac and they show up fine; I'll try again on Windows later and raise an issue

> Though to clarify, you're expecting to see one sub-bar if the n_jobs is 1?

Yeah, I'd always expect sub-bars per file to show how long each individual file is taking to write
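
The "groups of two" behavior discussed above follows directly from the pool size: with n_jobs=2 and four files, at most two conversions (and hence two sub-bars) are ever active at once. A thread-based sketch of that scheduling (the real code uses processes; file names and the concurrency counter are illustrative):

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

active = 0
peak = 0
lock = threading.Lock()

def convert_file(name):
    """Stand-in for one file conversion; tracks how many run concurrently."""
    global active, peak
    with lock:
        active += 1
        peak = max(peak, active)
    time.sleep(0.05)  # simulate the write work one sub-bar would track
    with lock:
        active -= 1
    return name

files = ["sess1.nwb", "sess2.nwb", "sess3.nwb", "sess4.nwb"]
with ThreadPoolExecutor(max_workers=2) as pool:  # n_jobs = 2
    results = list(pool.map(convert_file, files))

print(peak)  # never exceeds 2: sub-bars appear in groups of two
```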

@CodyCBakerPhD CodyCBakerPhD merged commit 224675f into main Jun 2, 2024
22 checks passed
@CodyCBakerPhD CodyCBakerPhD deleted the hdmf-progress-bars branch June 2, 2024 20:58