Local models and datasets #788

khurram-ghani · 2023-10-10T19:07:39Z

Related issue(s)/PRs: #782

Summary

This PR adds support for local models and datasets. The following scenarios are supported:

global model and global dataset -- as previously
local models and local datasets -- one-to-one mapping
global model and local datasets -- many-to-one mapping

The initial dataset can be a global dataset sampled from the whole search space. This data will be replicated to each of the regions on the first iteration and subsequently each region will have an associated local dataset. For batch-TR algorithm, the dataset for each region are filtered after each iteration to only contain the points in the region (but TREGO doesn't do this).

Note: this replication of initial data can potentially cause an issue when a global model is being used, as the points may be repeated. This will only be an issue if regions overlap and both contain that initial data-point (as filtering would otherwise remove duplicates). The main way to avoid this issue is to provide local initial datasets, instead of a global initial dataset.

The trust_region notebook contains a new temporary TEST section, just to show how local models can be used in the notebook. It is worth noting in the gif that the query-points are filtered to be only inside the boxes for each iteration. This section is only for demonstration and will be removed before merging this PR. A follow-on PR will update the TURBO section to use local models and demonstrate this functionality.

Fully backwards compatible: no

The BatchTrustRegion rule acquisition returns rank 3 points, instead of rank 2 as for other rules (and previous trust-region rules). This means the users should use the new batched observer with this rule. That is already taken care of in BayesianOptimizer. However, with AskTellOptimizer the users should use the batched observer as follows:

from trieste.objectives.utils import mk_batch_observer

observer = ...
batch_observer = mk_batch_observer(observer)

new_points = ask_tell.ask()
new_data = batch_observer(new_points)
ask_tell.tell(new_data)

PR checklist

The quality checks are all passing
The bug case / new feature is covered by tests
Any new features are well-documented (in docstrings or notebooks)

trieste/acquisition/rule.py

tests/unit/test_ask_tell_optimization.py

trieste/acquisition/rule.py

uri-granta

Some initial comments on the first couple of files. Will hopefully have more time to continue looking at this.

trieste/objectives/utils.py

trieste/utils/misc.py

uri-granta

Still need to look at the tests but looking good so far! Various small comments, suggestions and questions.

trieste/utils/misc.py

trieste/acquisition/utils.py

trieste/bayesian_optimizer.py

trieste/acquisition/rule.py

uri-granta

(a few more comments)

trieste/ask_tell_optimization.py

tests/unit/test_ask_tell_optimization.py

tests/unit/objectives/test_utils.py

tests/integration/test_ask_tell_optimization.py

docs/notebooks/trust_region.pct.py

hstojic

I haven't gone through it in details, there is a lot going on here...

my biggest comment is on adding additional complexity to ask-tell and bayesian_optimizer - do we really need to take care of the local models there?
very few acq rules will make use of local models, why not taking care of datasets and model training in the acquisition rule itself, in those few that make use of it?

trieste/ask_tell_optimization.py

trieste/objectives/utils.py

trieste/acquisition/rule.py

hstojic

ok after a discussion

khurram-ghani added 15 commits September 21, 2023 18:22

Add support for local models and datasets (WIP)

c89583e

Add unit test for local models (WIP)

c8aebec

Merge remote-tracking branch 'origin/develop' into khurram/batch_turbo

b361e6d

Update multi model/dataset test (WIP)

4b3c2de

Add unit test for keep datasets in regions

28803aa

Add more tests and move to local tags class

e99df96

Always include global dataset in mapping

2de6271

Add filter_mask method to trust region

6236c95

Add more testing

e96ebe8

Fix mypy model type issues

9a7dd23

Add ask_tell testing

4622a98

Fix summary init when only global dataset

af44e48

Remove walrus operator

ef9e455

Update test, ask_tell data not changed in-place

59e0cab

Add some test comments

e4a5342

vpicheny reviewed Oct 11, 2023

View reviewed changes

trieste/acquisition/rule.py Outdated Show resolved Hide resolved

khurram-ghani added 4 commits October 11, 2023 15:21

Add some rule comments

69a8590

Allow input-multi-observers for batch observer

40df585

Allow multiple models/datasets for base rule

ee1ee56

Support multiple models/datasets in region selects

7d03a04

khurram-ghani commented Oct 12, 2023

View reviewed changes

tests/unit/test_ask_tell_optimization.py Outdated Show resolved Hide resolved

khurram-ghani requested review from hstojic, henrymoss and uri-granta October 12, 2023 13:48

khurram-ghani added 2 commits October 12, 2023 17:23

Fix TR plotting history colors

8551d72

Add notebook init points explanation

b571733

vpicheny reviewed Oct 13, 2023

View reviewed changes

trieste/acquisition/rule.py Outdated Show resolved Hide resolved

vpicheny reviewed Oct 13, 2023

View reviewed changes

trieste/acquisition/rule.py Show resolved Hide resolved

vpicheny reviewed Oct 13, 2023

View reviewed changes

trieste/acquisition/rule.py Outdated Show resolved Hide resolved

Rename region index and add init param

2a934d1

khurram-ghani and others added 4 commits October 16, 2023 17:05

Merge branch 'develop' into khurram/local_models

bea54ea

Remove old comment

52f7975

Tidy-up redundant expression

b2eb662

Keep full datasets along with filtered ones

523aaab

uri-granta reviewed Oct 27, 2023

View reviewed changes

uri-granta self-requested a review October 31, 2023 10:40

Make changes from PR feedback

5b3ec0f

uri-granta reviewed Nov 22, 2023

View reviewed changes

khurram-ghani added 9 commits November 22, 2023 17:57

Merge remote-tracking branch 'origin/develop' into khurram/local_models

2c21f85

Address some of the recent feedback

e5cccc8

Fix dataset mypy error

25da01b

Copy dataset in optimizers to avoid changing it

292faaa

Share DatasetChecker and tidy-up exp values in tests

9170c64

Address more feedback

efd2fc0

Merge remote-tracking branch 'origin/develop' into khurram/local_models

ff84690

Avoid default num_models in integ tests

505631a

Fix old python typing issue

b90d3a1

hstojic reviewed Dec 13, 2023

View reviewed changes

trieste/ask_tell_optimization.py Show resolved Hide resolved

trieste/objectives/utils.py Outdated Show resolved Hide resolved

trieste/acquisition/rule.py Outdated Show resolved Hide resolved

trieste/acquisition/rule.py Show resolved Hide resolved

hstojic approved these changes Dec 13, 2023

View reviewed changes

khurram-ghani added 2 commits December 13, 2023 18:24

Merge remote-tracking branch 'origin/develop' into khurram/local_models

1de7e20

Use flatten_... func and add comment

a75e618

khurram-ghani merged commit f766953 into develop Dec 14, 2023
12 checks passed

khurram-ghani deleted the khurram/local_models branch December 14, 2023 12:34

khurram-ghani mentioned this pull request Feb 19, 2024

Batch trust regions should use latest query point #782

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local models and datasets #788

Local models and datasets #788

khurram-ghani commented Oct 10, 2023 •

edited

Loading

uri-granta left a comment •

edited

Loading

uri-granta left a comment

uri-granta left a comment

hstojic left a comment

hstojic left a comment

Local models and datasets #788

Local models and datasets #788

Conversation

khurram-ghani commented Oct 10, 2023 • edited Loading

Summary

PR checklist

uri-granta left a comment • edited Loading

Choose a reason for hiding this comment

uri-granta left a comment

Choose a reason for hiding this comment

uri-granta left a comment

Choose a reason for hiding this comment

hstojic left a comment

Choose a reason for hiding this comment

hstojic left a comment

Choose a reason for hiding this comment

khurram-ghani commented Oct 10, 2023 •

edited

Loading

uri-granta left a comment •

edited

Loading