Parallelize search for solutions #3

jmitchell · 2017-01-14T23:44:35Z

Backtracking problems are embarrassingly parallel, and Backtrex should exploit that internally.

After the core API is stabilized, including implementing #2, split the search space into multiple sub-problems and delegate them to a worker pool. As solutions are discovered, have workers report them back so they can be concatenated to the solution stream.

The only clear drawback is the order in which solutions are discovered would no longer be consistent, but providing an API for sequential search (like the existing one) fixes that.

The benefits of supporting parallelism include scaling to the limits of a single BEAM node, and potentially beyond to more nodes until communication latency becomes the bottleneck. If sub-problem delegation and work stealing are implemented in a way that optimizes communication between coordinating processes to other coordinating processes or workers, the potential horizontal scalability could go far.

More detailed walkthrough

Rough sketch of the concept with the Sudoku solver in mind:

Spawn a pool of worker processes.
Ask the Sudoku.Solver for the next unknown cell and its possible values.
While there are worker processes ready for more work, assign one the original problem, except with the current unknown cell assigned to one of the possible values that hasn't been explored.
If all possible values for the current cell have been assigned and more workers are unoccupied, allow each of them to ask for an unexplored sub-problem from one of the occupied workers.
Whenever a worker process finds a solution, it sends it back to the behaviour.
Whenever a worker finishes exploring the entire solution space of its sub-problem, re-enqueue it to the worker pool (may need to explicitly trigger work stealing described in step 4).

Existing tools?

OTP behaviours and newer Elixir behaviours, like GenStage, may be well suited for implementing this concept. They may even offer better strategies. Research what's out there, and consider options while avoiding assumptions about the topology of the search space (number of unknowns, number of values for each unknown, and time require to compute them), the number workers, or the delegation strategy.

Desired invariants

The union of solution spaces of all delegated sub-problems must be the solution space of the original problem (nothing is missed).
The intersection of every explored sub-problem with every other explored sub-problem is the empty set (no work is repeated).
After the requested number of solutions have been found, searching should suspend at a resumable checkpoint. Lazy Streams should help here. (Consider adding a function to suspend work before all the requested solutions have been found.)

Invariants 1 and 2 may seem obvious, but it can be challenging when processes crash. It may even be impossible when certain coordinating processes crash. Invariant 1 is a strong requirement, whereas bounded compromise on 2 (rework) may be acceptable.

The text was updated successfully, but these errors were encountered:

jmitchell · 2017-01-16T17:36:34Z

Existing backtracking research may clarify details of the design. I've started a wiki page to collect resources and thoughts on how they do or don't apply to this problem.

jmitchell · 2017-01-17T00:37:01Z

Study this optimized, parallel N-queens solver in Rust.

jmitchell · 2017-01-17T18:21:51Z

Finish #12 (Generate profiling reports) first.

cognivore · 2023-01-09T17:38:27Z

👀

jmitchell added the enhancement label Jan 14, 2017

This was referenced Jan 15, 2017

Add more examples #5

Open

Optimize the sequential solver #6

Open

Optimization tips guide for Backtrex users #7

Open

jmitchell mentioned this issue Jan 16, 2017

Concurrent competing optimization strategies #11

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallelize search for solutions #3

Parallelize search for solutions #3

jmitchell commented Jan 14, 2017 •

edited

Loading

jmitchell commented Jan 16, 2017

jmitchell commented Jan 17, 2017

jmitchell commented Jan 17, 2017

cognivore commented Jan 9, 2023

Parallelize search for solutions #3

Parallelize search for solutions #3

Comments

jmitchell commented Jan 14, 2017 • edited Loading

More detailed walkthrough

Existing tools?

Desired invariants

jmitchell commented Jan 16, 2017

jmitchell commented Jan 17, 2017

jmitchell commented Jan 17, 2017

cognivore commented Jan 9, 2023

jmitchell commented Jan 14, 2017 •

edited

Loading