Categorical trust regions #865

uri-granta · 2024-08-07T09:36:02Z

Related issue(s)/PRs:

Summary

Follows on from #864.

Fully backwards compatible: yes / no

PR checklist

The quality checks are all passing
The bug case / new feature is covered by tests
Any new features are well-documented (in docstrings or notebooks)

…_encoded_models

…h_encoded_models

…l_trust_regions

…ust_regions

khurram-ghani · 2024-08-21T14:48:51Z

trieste/acquisition/rule.py

@@ -1624,7 +1626,13 @@ def __init__(
        self._y_min = tf.constant(np.inf, dtype=self.location.dtype)

    def _init_eps(self) -> None:
-        self.eps = self._zeta * (self.global_search_space.upper - self.global_search_space.lower)
+        if not isinstance(self.global_search_space, HasOneHotEncoder):


Using an isinstance check in this file to detect categorical spaces makes me a bit uncomfortable, as it forces developers to have to inherit from HasOneHotEncoder, and it feels too much of a special case. I think our original design decision was to use a property like is_categorical to do this. I don't have a strong objection, but just wanted to highlight that. I think we already discussed this before, but can't remeber the conclusion.

Remind me: are there any categorical spaces (where we would wish to use Hamming distances) that are also numerically bounded? Because looking at it now, it seems much more natural to write

Suggested change

if not isinstance(self.global_search_space, HasOneHotEncoder):

if self.global_search_space.has_bounds:

Certainly we shouldn't calculate eps for unbounded spaces.

I don't think so. We won't have categorical spaces that have bounds. However, we could potentially have the reverse, i.e. spaces that are not bounded and are also not categorical.

Well we can't use eps as written for unbounded spaces, even if they're not categorical, as it uses the bounds. Is there any reason not to default to Hamming distance for those cases, at least for now?

That seems fine to me. @vpicheny what do you think?

khurram-ghani · 2024-08-21T15:03:22Z

trieste/acquisition/rule.py

+            # use Hamming distance for categorical spaces
+            return tf.math.reduce_sum(
+                tf.where(tf.expand_dims(points, -2) == tf.expand_dims(points, -3), 0, 1),
+                axis=-1,
+                keepdims=True,  # (keep last dim for distance calculation below)
+            )  # [num_points, num_points, 1]
+        else:


I think maybe we should add more of an explanation here, as the categorical and numerical cases are slightly inconsistent.

The size of the last dimension for the numerical case is D, i.e. the distance in each dimension is calculated separately. Each dimesion is then separately tested against distance in _get_points_within_distance and it selects neighbors if all dimensions are within distance (i.e. reduce_all below).

For the categorical case the last dimension is 1 as we do a reduce_sum. I can see why that is, as we want to effectively do a reduce_any in _get_points_within_distance, i.e. the neighbors are selected if they are within distance in any dimension.

So we can add an explanation, or alternatively do the selection below and explicitly add a reduce_any.

tests/unit/acquisition/test_rule.py

tests/integration/test_mixed_space_bayesian_optimization.py

Uri Granta added 30 commits July 26, 2024 11:09

CategoricalSearchSpace

c82a549

Fix AutoGraph error

260fff7

EncoderFunction

976953f

Move one hot encoder to space.py

2a863b5

DiscreteSearchSpaceABC

5e3cae0

Support one-hot encoding mixed search spaces

33aa239

Not yet using latest gpflow

5e135eb

mypy

8a541a1

Refactor to allow categorical TR spaces

6bc5d4d

Add more tests

74122e5

More tests

d0f9376

Test to_tags

6d2a4e9

has_bounds property

c0cfd42

encode_query_points decorator

9c588a9

Encode some more query points

e6a0692

Categorical Trust Regions

003d8a7

Tweaks

25d20ca

Categorical search spaces

69e7c0b

Remove superfluous encodings

c071112

Migrate to encode method

563e765

Experiment with encoded model approaches

6acd6b7

Merge branch 'uri/categorical_search_spaces' into uri/experiment_with…

d864468

…_encoded_models

Fix typing

d8be1b7

Better name

cd9b89e

Make encoded methods final

7a0dde0

Docstrings

5867830

EncodedFastUpdateModel

110fc28

Missed finals

b05d413

inherit_check_shapes

485e284

Review comments

1209ff4

Uri Granta added 18 commits August 8, 2024 16:08

Add a few unit tests

78ca4ed

mypy

d3254f4

Check we can use Embedding layer as an encoder

355d9e1

Start writing integration test (and fix one_hot_encoder dtype issue)

ab37c52

Encode initial model data too

8a8dcd8

Custom gpr kernel

1ea01df

Merge remote-tracking branch 'origin/develop' into uri/experiment_wit…

073c95e

…h_encoded_models

Consistent dtype in encoder unit test

09c727e

Merge branch 'uri/experiment_with_encoded_models' into uri/categorica…

1546cce

…l_trust_regions

Eps

cc018c8

one_hot_encoded_space

d39679b

Couple of unit tests

b682395

Adress review comments

d4949ec

Merge branch 'uri/experiment_with_encoded_models' into uri/categorica…

4833999

…l_trust_regions

Unit test

bd6ed69

Fix typo and hidden optimizer issue

bba0f1b

Merge branch 'uri/experiment_with_encoded_models' into uri/categorica…

4b77fd8

…l_trust_regions

Integration test and fix thompson sampling

d1cb88f

Base automatically changed from uri/experiment_with_encoded_models to develop August 21, 2024 07:30

Merge remote-tracking branch 'origin/develop' into uri/categorical_tr…

e91ce75

…ust_regions

uri-granta marked this pull request as ready for review August 21, 2024 07:35

See whether increasing steps fixes test_old

c48a40a

uri-granta requested a review from khurram-ghani August 21, 2024 10:02

khurram-ghani reviewed Aug 21, 2024

View reviewed changes

Uri Granta added 3 commits August 22, 2024 12:54

Review comments

31fb555

Switch num_steps

e2e1a06

Revert to 8

2b64c8f

khurram-ghani approved these changes Aug 27, 2024

View reviewed changes

uri-granta merged commit 2c725f9 into develop Aug 27, 2024
12 checks passed

uri-granta deleted the uri/categorical_trust_regions branch August 27, 2024 10:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Categorical trust regions #865

Categorical trust regions #865

uri-granta commented Aug 7, 2024 •

edited

Loading

khurram-ghani Aug 21, 2024

uri-granta Aug 22, 2024 •

edited

Loading

khurram-ghani Aug 22, 2024

uri-granta Aug 22, 2024

khurram-ghani Aug 22, 2024

khurram-ghani Aug 21, 2024

	if not isinstance(self.global_search_space, HasOneHotEncoder):
	if self.global_search_space.has_bounds:

Categorical trust regions #865

Categorical trust regions #865

Conversation

uri-granta commented Aug 7, 2024 • edited Loading

Summary

PR checklist

khurram-ghani Aug 21, 2024

Choose a reason for hiding this comment

uri-granta Aug 22, 2024 • edited Loading

Choose a reason for hiding this comment

khurram-ghani Aug 22, 2024

Choose a reason for hiding this comment

uri-granta Aug 22, 2024

Choose a reason for hiding this comment

khurram-ghani Aug 22, 2024

Choose a reason for hiding this comment

khurram-ghani Aug 21, 2024

Choose a reason for hiding this comment

uri-granta commented Aug 7, 2024 •

edited

Loading

uri-granta Aug 22, 2024 •

edited

Loading