
Fix parallel buffered slice writer bug #922

Merged

markbader merged 20 commits into master from Fix_parallel_BufferedSliceWriter_bug on Aug 14, 2023

Conversation

@markbader (Contributor) commented Jul 11, 2023:

Description:

  • Hotfix implementation of the bbox extension to avoid parallel access to the properties JSON during Dataset.from_images

Issues:

Todos:

Make sure to delete unnecessary points or to check all before merging:

  • Updated Changelog

@markbader self-assigned this Jul 11, 2023
@markbader (Contributor, Author):

I added a boolean argument update_bbox to the named partial copy_to_view and propagated it to the actual write on the view or the mag view. This makes sure that the bounding box is not updated by multiple processes, so corruption of the properties JSON file should be avoided.
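For illustration, a minimal sketch of the pattern described above, using functools.partial as a stand-in for the lib's named partial; target_view, source_chunks, and executor are illustrative names, not the actual code:

from functools import partial

# Binding update_bbox=False into the per-chunk job means that no subjob
# touches the bounding box in the properties JSON; only the parent
# process updates it afterwards.
func_per_chunk = partial(copy_to_view, target_view, update_bbox=False)
executor.map(func_per_chunk, source_chunks)  # hypothetical executor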

@markbader requested a review from fm3 July 17, 2023 12:09
@fm3 (Member) left a comment:

Cool stuff! I added a few comments, mostly about naming things and defaults. Could you have a look?

@@ -34,6 +34,7 @@ def __init__(
     self,
     view: "View",
     offset: Optional[Vec3IntLike] = None,
+    update_bbox: bool = False,
@fm3 (Member):

I'd say the default should be True, and there could be a comment explaining what it does and encouraging setting it to False if parallel access is intended.

@@ -192,6 +192,7 @@ def write(
     self,
     data: np.ndarray,
     offset: Optional[Vec3IntLike] = None,  # deprecated, relative, in current mag
+    update_bbox: bool = True,
@fm3 (Member):

Maybe in this context it is rather something like allow_write_outside_view_bbox: bool = False? (This method does not update the bbox, right?)

@fm3 (Member):

Also, did you test what happens if there is a write outside of the existing bbox? Does it just work?

@markbader (Contributor, Author):

  1. I agree that the name allow_write_outside_view_bbox fits better for View.write(), but it seems a bit odd to have different signatures for View and MagView.

  2. Yes, it just worked for the files that I used to test it.

@fm3 (Member) commented Jul 17, 2023:

@philippotto if you have time, could you also have a look at these changes? I'm not super sure about the semantics of View vs. MagView, and I think another opinion would make sense.

@philippotto (Member) left a comment:

As a band-aid hotfix this seems okay to me (however, shouldn't the default for update_bbox be False everywhere to avoid the runtime errors?).

Is there a plan for a proper fix? If I understand it correctly, the PR could potentially cause bounding boxes to not be set correctly?

Comment on lines 132 to 147
if isinstance(self.view, MagView):
    self.view.write(
        data,
        offset=buffer_start.add_or_none(self.offset),
        relative_offset=buffer_start_mag1.add_or_none(self.relative_offset),
        absolute_offset=buffer_start_mag1.add_or_none(self.absolute_offset),
        update_bbox=self.update_bbox,
    )
else:
    self.view.write(
        data,
        offset=buffer_start.add_or_none(self.offset),
        relative_offset=buffer_start_mag1.add_or_none(self.relative_offset),
        absolute_offset=buffer_start_mag1.add_or_none(self.absolute_offset),
        allow_write_outside_bbox=not self.update_bbox,
    )
@philippotto (Member):

This can be DRYed with something like this, I think:

# Pick the keyword argument matching the respective write() signature.
if isinstance(self.view, MagView):
    kwargs = {"update_bbox": self.update_bbox}
else:
    kwargs = {"allow_write_outside_bbox": not self.update_bbox}

self.view.write(
    data,
    offset=buffer_start.add_or_none(self.offset),
    relative_offset=buffer_start_mag1.add_or_none(self.relative_offset),
    absolute_offset=buffer_start_mag1.add_or_none(self.absolute_offset),
    **kwargs,
)

    offset=buffer_start.add_or_none(self.offset),
    relative_offset=buffer_start_mag1.add_or_none(self.relative_offset),
    absolute_offset=buffer_start_mag1.add_or_none(self.absolute_offset),
    allow_write_outside_bbox=not self.update_bbox,
@philippotto (Member):

I don't understand this logic. If the bbox must not be updated, writing outside of the bbox is allowed? Shouldn't this be inverted?

@fm3 (Member) commented Jul 18, 2023:

Is there a plan for a proper fix? The PR would cause bounding boxes to be not set correctly potentially, if I understand it correctly?

I'm unsure how to go about this. The idea here was to move the responsibility of setting the full bbox from the subjobs (which do the write call) to the main process (afterwards). The benefit would be that the subjobs don't run into this parallel-editing-of-the-JSON problem, and yet the final bbox would be correct (as the caller, in this case add_layer_from_images, can then set it).

The idea was to have the defaults be True, which is the intuitive behavior (everything updates), and only set them to False explicitly in cases where concurrent usage is intended (creating a need to set the bbox in the caller code afterwards).

A problem that arose from this is that the write calls must now be able to write outside of the (previous, smaller) bbox, because it will be set only at the end. That is why there is the allow_write_outside_bbox param in this case.

I agree that this is all a bit hard to comprehend from the code alone. I am not super happy with this solution, but I also don’t have an idea for a “proper fix” – if you have ideas, please share them!

Another idea was to set the bbox to a huge one (millions of voxels per dimension) before calling the subjobs. In that case, the subjobs would not touch the JSON, as nothing is bigger than expected. That would alleviate the need for passing these booleans. However, it would introduce magic numbers for this huge bbox, and also not feel like a proper fix.

We could probably also implement a locking mechanism for the JSON, but I'm not sure how to do that (in a way that does not impede performance).

@philippotto (Member):

I see two possible solutions for this problem (I prefer (2)):

  1. Defer the mutation of the JSON. This is similar to what the PR is currently doing, but I'd like to see an automatic fix-up of the JSON, so that the users of this library don't need to take care of this. Also, they shouldn't need to pass special parameters if they use multiprocessing as this is complicated. The automated fix-up could be done by remembering all the written bboxes and passing them to the parent job in the end.

  2. Use a locking mechanism, as you also suggested. We have something similar in voxelytics and it looks relatively simple (however, I did not author this, maybe @normanrz wants to chime in?). It's file-system based, which should be okay since the JSON itself is FS-based, too. The lock file could be named datasource-properties.json.lock. Since the update of the JSON should take very little time (in comparison to writing image data), I don't think that performance will be a problem.
    We should take extra care to reduce the chance that the lock file is not cleaned up. That is, the locking phase should be as short as possible (i.e., only when updating the JSON and only when the JSON really changes). If the JSON update raises an exception for some reason, the lock file should also be cleaned up again. Then, the lock file would only be a problem when the Python process is terminated during the update of the JSON, in which case a following run would probably deadlock. A warning could be emitted if the lock is awaited for more than X seconds, with a hint on how it can be deleted manually.

I'm strongly in favor of (2) since it seems way simpler to me.

@fm3 (Member) commented Jul 18, 2023:

I would have assumed that file locks are not guaranteed to be multiprocessing-safe, but I may be wrong about that. Do we use it in voxelytics in the same way without issue? Then I’m ok with that approach. Maybe you could give @markbader an introduction on how it is done in voxelytics so that he can adapt it here?

@philippotto (Member):

I would have assumed that file locks are not guaranteed to be multiprocessing-safe, but I may be wrong about that.

I think it should be safe. The package describes the locking mechanism as "a simple way of inter-process communication".

Do we use it in voxelytics in the same way without issue? Then I’m ok with that approach.

In vx we use the locking to avoid running the same workflow task multiple times (e.g., due to user error). So I think this should work for the libs use case, too.

Maybe you could give @markbader an introduction on how it is done in voxelytics so that he can adapt it here?

@markbader See this code. I think it's as simple as importing the module and adding with SoftFileLock(lock_path, timeout=3): around the update of the JSON file. Let me know if you have questions :)
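For illustration, a minimal sketch of that suggestion, assuming a local filesystem; the lock-file name follows the earlier suggestion, and update_properties_json is a hypothetical stand-in for the actual JSON update:

from pathlib import Path
from filelock import SoftFileLock, Timeout

properties_path = Path("dataset/datasource-properties.json")  # illustrative path
lock_path = str(properties_path) + ".lock"  # datasource-properties.json.lock

try:
    # SoftFileLock takes the lock by creating the lock file; competing
    # processes block until it is released or the timeout expires.
    with SoftFileLock(lock_path, timeout=3):
        update_properties_json(properties_path)  # hypothetical update routine
except Timeout:
    # Not acquired within 3 seconds; a stale lock file may have to be
    # removed manually (see the warning idea discussed above).
    raise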

@markbader (Contributor, Author):

@philippotto Thanks for your feedback and implementation ideas. I tried to implement the SoftFileLock approach but ran into some issues:

  1. The filelock Python library does not support UPaths right now, and is therefore not usable for remote datasets.
  2. I adapted the filelock library to support UPaths and replaced the os.open calls with pathlib.Path(..).open('x'), as that is the Python way to atomically create a file and raise an exception if it already exists (see the sketch below). Sadly, the s3fs library does not support the mode "x" for open. I think it is out of scope to change the s3fs library as well to implement a file lock.
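
For illustration, the exclusive-create pattern from point 2 as a minimal sketch; the retry loop is illustrative, and the lock name follows the earlier suggestion:

import time
from pathlib import Path

lock = Path("datasource-properties.json.lock")

# Mode "x" atomically creates the file and raises FileExistsError if it
# already exists, which is exactly the primitive a soft file lock needs.
while True:
    try:
        with lock.open("x"):
            break  # lock acquired
    except FileExistsError:
        time.sleep(0.1)  # another process holds the lock; retry

try:
    ...  # update the properties JSON here
finally:
    lock.unlink()  # always release the lock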

I just discussed with @daniel-wer that under the given circumstances, the implementation of your first proposed solution would be better. @philippotto do you have another idea for an implementation of a locking mechanism, or would you agree that I should implement solution 1?

@philippotto (Member):

I just discussed with @daniel-wer that under the given circumstances, the implementation of your first proposed solution would be better. @philippotto do you have another idea for an implementation of a locking mechanism, or would you agree that I should implement solution 1?

I would hope that a filelock implementation would be possible with S3, but I also don't see an easy way to achieve it without changing the entire approach used by the filelock library. Alternatively, one could use a file lock somewhere else (e.g., in the current working directory) that is shared between all workers. For multiprocessing this would be easy (the only downside is that this wouldn't catch independent actors writing to the same DS). However, I don't see how this would generalize to slurm etc.

Therefore, I'm fine with doing (1) :)
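
For illustration, a minimal sketch of solution (1) as discussed above: each subjob reports the bbox it wrote, and only the parent process touches the properties JSON at the end. write_chunk, layer, and extended_by are illustrative stand-ins, not the lib's actual code:

from functools import reduce
from multiprocessing import Pool

def run_subjob(job_args):
    # Hypothetical: writes the data but performs no JSON update.
    return write_chunk(job_args)

with Pool() as pool:
    written_bboxes = pool.map(run_subjob, all_job_args)

# Merge the reported bboxes and update the JSON once, in the parent.
layer.bounding_box = reduce(lambda a, b: a.extended_by(b), written_bboxes)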

@markbader marked this pull request as draft July 26, 2023 13:54
@markbader marked this pull request as ready for review August 2, 2023 08:41
@philippotto (Member) left a comment:

Great, looks good to me :) Two things:

  • The documentation should state somewhere explicitly which restrictions exist for parallel access to wk datasets (see the sketch after this comment). Namely:
    • When writing shards in parallel, json_update_allowed should be set to False to disable the automatic update of the bounding box metadata; otherwise, race conditions may happen. The user is responsible for updating the bounding box manually.
    • When writing to chunks in shards, one chunk may only be written to by one actor at any time.
    • When writing to compressed shards, one shard may only be written to by one actor at any time.
    • Reading in parallel should be fine.
  • See my two review comments.

@normanrz could you double check my restrictions from above?
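
For illustration, a minimal sketch of the parallel-write pattern these restrictions imply; the exact signatures are assumptions based on this discussion, not confirmed API:

import webknossos as wk

dataset = wk.Dataset.open("path/to/dataset")  # illustrative path
mag_view = dataset.get_layer("color").get_mag("1")

def write_shard(args):
    data, offset = args
    # json_update_allowed=False avoids concurrent edits of
    # datasource-properties.json (see the restrictions above).
    mag_view.write(data, absolute_offset=offset, json_update_allowed=False)

# ... run write_shard in parallel (multiprocessing, slurm, ...) ...

# Afterwards, the caller is responsible for the bounding box:
dataset.get_layer("color").bounding_box = wk.BoundingBox(
    topleft=(0, 0, 0), size=(1024, 1024, 1024)  # illustrative extent
)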

@normanrz (Member):

  • When writing to chunks in shards, one chunk may only be written to by one actor at any time.
  • When writing to compressed shards, one shard may only be written to by one actor at any time.

Ok for now. With Zarr, parallel writes to a shard will no longer be allowed at all. Maybe we should already put such wording in.

  • Reading in parallel should be fine.

Reading in parallel without concurrent writes is ok.

@markbader (Contributor, Author):

@philippotto Thanks for your review! I added a paragraph about parallel access to Datasets in the Dataset Usage part of the docs.

@philippotto (Member) left a comment:

Excellent, looks good to me :)

@fm3 (Member) commented Aug 14, 2023:

With Zarr, parallel writes to a shard will no longer be allowed at all.

That sounds like trouble for some of our applications that do slice-wise operations. What would be the right way to deal with that? Shards with a very small z size so that we can do the parallelization? Or ditch the fine-grained parallelization and write each of the large chunk files sequentially? Or re-chunk in a second pass? All seem to have drawbacks compared to now.

@philippotto (Member):

With Zarr, parallel writes to a shard will no longer be allowed at all.

That sounds like trouble for some of our applications that do slice-wise operations. What would be the right way to deal with that? Shards with a very small z size so that we can do the parallelization? Or ditch the fine-grained parallelization and write each of the large chunk files sequentially? Or re-chunk in a second pass? All seem to have drawbacks compared to now.

For the vx align case, picking a small shard z dimension should be a good option imo, especially because Zarr supports non-cubic shards (so x and y can still have larger sizes). After materialization, a re-cubing would be necessary (not strictly, but good for WK), which is a drawback, but on the other hand a low shard z size would allow higher parallelization.

@markbader merged commit 1426ec1 into master Aug 14, 2023
18 checks passed
@markbader deleted the Fix_parallel_BufferedSliceWriter_bug branch August 14, 2023 15:10

Successfully merging this pull request may close these issues.

webknossos new CLI JSON error with multiprocessing futures