[tune] remove dependency on gpy #59152
Conversation
Signed-off-by: Matthew Deng <[email protected]>
TimothySeah left a comment
I'm not super familiar with Gaussian processes or their implementations in scikit-learn/GPy. Also, the GPy -> scikit-learn migration does not appear to be a one-for-one replacement. It might be worth getting a review from someone more familiar and/or having a good testing plan to ensure there are no regressions.
try:
    m.optimize()
m = GaussianProcessRegressor(
Is m = GPy.models.GPRegression(X, y, kernel) equivalent to
m = GaussianProcessRegressor(
    kernel=kernel, optimizer="fmin_l_bfgs_b", alpha=1e-10
)
m.fit(X, y)
? Might be worth double-checking if this matters.
Yeah, these are the default values of GaussianProcessRegressor
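As a rough sketch of the comparison being discussed (placeholder data, and an RBF kernel standing in for the PR's custom kernel, so not the actual PR code), the point is only that the explicit arguments in the diff match sklearn's defaults:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

X = np.random.rand(10, 2)       # placeholder data
y = np.random.rand(10, 1)
kernel = RBF(length_scale=1.0)  # stand-in for the PR's custom kernel

# The diff's explicit arguments...
explicit = GaussianProcessRegressor(
    kernel=kernel, optimizer="fmin_l_bfgs_b", alpha=1e-10
)
# ...are sklearn's defaults, so this is equivalent.
defaults = GaussianProcessRegressor(kernel=kernel)

# Like GPy's m.optimize(), fit() optimizes the kernel hyperparameters
# (with L-BFGS-B) and then conditions the GP on (X, y).
explicit.fit(X, y)
defaults.fit(X, y)
```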
def __init__(
    self, input_dim, variance=1.0, lengthscale=1.0, epsilon=0.0, active_dims=None
    self,
Do input_dim and active_dims no longer matter?
Yeah, sklearn's Kernel doesn't need these passed in during construction; it determines the dimensionality from the data at runtime.
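To illustrate (a minimal toy kernel, not the PR's TV_SquaredExp): a sklearn Kernel has no input_dim/active_dims constructor arguments because the dimensionality is simply read from X when the kernel is evaluated.

```python
import numpy as np
from sklearn.gaussian_process.kernels import Kernel
from sklearn.metrics.pairwise import euclidean_distances

class DimAgnosticRBF(Kernel):
    """Toy RBF-style kernel with no input_dim/active_dims parameters."""

    def __init__(self, lengthscale=1.0):
        self.lengthscale = lengthscale

    def __call__(self, X, Y=None, eval_gradient=False):
        X = np.atleast_2d(X)
        Y = X if Y is None else np.atleast_2d(Y)
        # The input dimension is only needed here, at call time, so nothing
        # about it has to be declared when the kernel is constructed.
        dist2 = np.square(euclidean_distances(X, Y))
        K = np.exp(-0.5 * dist2 / self.lengthscale ** 2)
        if eval_gradient:
            raise NotImplementedError("gradient omitted in this sketch")
        return K

    def diag(self, X):
        return np.ones(np.atleast_2d(X).shape[0])

    def is_stationary(self):
        return True

K = DimAgnosticRBF()(np.random.rand(5, 7))  # works for any number of columns
```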
if Y is None:
    Y = X

epsilon = np.clip(self.epsilon, 1e-5, 0.5)
Where did the 1e-5 lower bound come from?
It's consistent with epsilon_bounds. I think this should already be handled by sklearn, but it's kept here for consistency with the previous logic.
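For context, the same (1e-5, 0.5) interval typically shows up twice in a sklearn kernel; this is illustration only, with names following the discussion above, and the actual TV_SquaredExp code may differ.

```python
# In a sklearn Kernel, bounds are usually declared via a Hyperparameter
# property, e.g.:
#
#     @property
#     def hyperparameter_epsilon(self):
#         return Hyperparameter("epsilon", "numeric", self.epsilon_bounds)
#
# With epsilon_bounds = (1e-5, 0.5), the default L-BFGS-B optimizer already
# keeps epsilon inside that interval, so the explicit clip in __call__ acts
# as a redundant guard mirroring the pre-migration GPy logic.
import numpy as np

epsilon_bounds = (1e-5, 0.5)
print(np.clip(0.7, *epsilon_bounds))   # 0.5
print(np.clip(1e-8, *epsilon_bounds))  # 1e-05
```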
def __call__(self, X, Y=None, eval_gradient=False):
    X = np.atleast_2d(X)
    if Y is None:
        Y = X
Do we need to do Y = np.copy(X) since it was X2 = np.copy(X) before?
No because we replaced in-place modification of
    X = X[:, 1:]
    X2 = X2[:, 1:]
with
    X_spatial = X[:, 1:]
    Y_spatial = Y[:, 1:]
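A toy illustration (not the PR code) of why the alias is safe: the new names only read from X, and nothing writes back into it.

```python
import numpy as np

X = np.arange(12, dtype=float).reshape(4, 3)
Y = X  # alias, as in the new __call__ when Y is None

X_spatial = X[:, 1:]  # new name bound to a slice; X itself is not modified
Y_spatial = Y[:, 1:]

assert np.shares_memory(X, X_spatial)                  # it's a view, not a copy...
assert np.array_equal(X, np.arange(12).reshape(4, 3))  # ...and X is unchanged
```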
return self.variance * np.ones(X.shape[0])

if eval_gradient:
    K_gradient_variance = K
dist2 = np.square(euclidean_distances(X_spatial, Y_spatial))
Not sure why this is no longer divided by lengthscale - see the previous code: dist2 = np.square(euclidean_distances(X, X2)) / self.lengthscale.
IIUC, __call__ in the scikit-learn implementation is similar to K plus update_gradients_full in the GPy implementation, but the update_gradients_full portion seems different.
Yeah, this is because sklearn works in log space (kernel gradients are taken with respect to the log of the hyperparameters).
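Stated generally (not the exact TV_SquaredExp algebra): sklearn's eval_gradient=True returns the gradient of K with respect to the log of each hyperparameter, so by the chain rule

$$\frac{\partial K}{\partial \log \ell} \;=\; \ell\,\frac{\partial K}{\partial \ell},$$

which is why a term that appeared as a division by lengthscale in the GPy gradient code can cancel or move in the sklearn version.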
m.fit(X, y)

scores.append(m.log_likelihood())
scores.append(m.log_marginal_likelihood_value_)
Log likelihood isn't quite the same as log marginal likelihood but maybe that doesn't affect this algorithm in a meaningful way?
GPy's implementation returns _log_marginal_likelihood, so it's the same thing.
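A quick sketch (placeholder data only) of where the corresponding value lives on the sklearn side after fit():

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

X = np.random.rand(15, 2)  # placeholder data
y = np.random.rand(15)

m = GaussianProcessRegressor().fit(X, y)
# Log marginal likelihood at the optimized hyperparameters, analogous to
# what GPy's m.log_likelihood() returned.
print(m.log_marginal_likelihood_value_)
print(m.log_marginal_likelihood(m.kernel_.theta))  # same quantity, recomputed
```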
TimothySeah left a comment
LGTM but might be worth adding testing notes to the PR description.
Signed-off-by: Matthew Deng <[email protected]>
Description
Replace GPy with scikit-learn's GaussianProcessRegressor in the PB2 (Population Based Bandits) scheduler.

Additional information
- Updated the TV_SquaredExp kernel to implement sklearn's Kernel interface instead of GPy's Kern
- Replaced GPy.models.GPRegression with sklearn.gaussian_process.GaussianProcessRegressor
- Removed gpy from tune-test-requirements.txt

Testing
Re-ran existing PB2 tests in test_trial_scheduler_pbt.py to validate the change.
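One hypothetical way to re-run those tests locally; the file path and keyword filter are assumptions based on Ray's usual test layout, not taken from the PR.

```python
# Hypothetical invocation; path and -k filter are assumptions, not from the PR.
import pytest

pytest.main(["-v", "python/ray/tune/tests/test_trial_scheduler_pbt.py", "-k", "pb2"])
```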