feat: Graph-Aware Bayesian Optimization #179

vladislavalerievich · 2025-01-24T13:52:51Z

This PR introduces a new feature to extend BoTorch's functionality for handling graph-aware Bayesian optimization. The key additions are custom implementation of the Weisfeiler-Lehman (WL) kernel represented by BoTorchWLKernel and TorchWLKernel classes, which allow measuring similarity between graphs based on their structural properties. The PR also includes a comprehensive test suite to validate the correctness of the new functionality and related utilities.

Usage example:

from __future__ import annotations

import time
from itertools import product
from typing import TYPE_CHECKING

import networkx as nx
import torch
from botorch import fit_gpytorch_mll, settings
from botorch.acquisition import LinearMCObjective, qLogNoisyExpectedImprovement
from botorch.models import SingleTaskGP
from botorch.models.gp_regression_mixed import CategoricalKernel, ScaleKernel
from gpytorch import ExactMarginalLogLikelihood
from gpytorch.kernels import AdditiveKernel, MaternKernel

from neps.optimizers.models.graphs.context_managers import set_graph_lookup
from neps.optimizers.models.graphs.kernels import BoTorchWLKernel, compute_kernel
from neps.optimizers.models.graphs.optimization import optimize_acqf_graph
from neps.optimizers.models.graphs.utils import min_max_scale, seed_all

if TYPE_CHECKING:
    from gpytorch.distributions.multivariate_normal import MultivariateNormal

start_time = time.time()
settings.debug._set_state(True)
seed_all()

TRAIN_CONFIGS = 50
TEST_CONFIGS = 10
TOTAL_CONFIGS = TRAIN_CONFIGS + TEST_CONFIGS

N_NUMERICAL = 2
N_CATEGORICAL = 1
N_CATEGORICAL_VALUES_PER_CATEGORY = 2
N_GRAPH = 1

assert N_GRAPH == 1, "This example only supports a single graph feature"

# Generate random data
X = torch.cat([
    torch.rand((TOTAL_CONFIGS, N_NUMERICAL), dtype=torch.float64),
    torch.randint(0, N_CATEGORICAL_VALUES_PER_CATEGORY, (TOTAL_CONFIGS, N_CATEGORICAL),
                  dtype=torch.float64),
    torch.arange(TOTAL_CONFIGS, dtype=torch.float64).unsqueeze(1)
], dim=1)

# Generate random graphs
graphs = [nx.erdos_renyi_graph(50, 0.5) for _ in range(TOTAL_CONFIGS)]

# Generate random target values
y = torch.rand(TOTAL_CONFIGS, dtype=torch.float64) + 0.5

# Split into train and test sets
train_x, test_x = X[:TRAIN_CONFIGS], X[TRAIN_CONFIGS:]
train_graphs, test_graphs = graphs[:TRAIN_CONFIGS], graphs[TRAIN_CONFIGS:]
train_y, test_y = y[:TRAIN_CONFIGS].unsqueeze(-1), y[TRAIN_CONFIGS:].unsqueeze(-1)

train_x, test_x = min_max_scale(train_x), min_max_scale(test_x)

kernels = [
    ScaleKernel(
        MaternKernel(nu=2.5, ard_num_dims=N_NUMERICAL, active_dims=range(N_NUMERICAL))),
    ScaleKernel(CategoricalKernel(
        ard_num_dims=N_CATEGORICAL,
        active_dims=range(N_NUMERICAL, N_NUMERICAL + N_CATEGORICAL))),
    ScaleKernel(BoTorchWLKernel(
        graph_lookup=train_graphs, n_iter=5, normalize=True,
        active_dims=(X.shape[1] - 1,)))
]

# Create the Gaussian Process model
gp = SingleTaskGP(train_X=train_x, train_Y=train_y, covar_module=AdditiveKernel(*kernels))

# Compute the posterior distribution
multivariate_normal: MultivariateNormal = gp.forward(train_x)

# Making predictions on test data
with torch.no_grad(), set_graph_lookup(gp, train_graphs + test_graphs, append=False):
    posterior = gp.forward(test_x)
    predictions = posterior.mean
    uncertainties = posterior.variance.sqrt()
    covar = posterior.covariance_matrix

# Fit the GP model
mll = ExactMarginalLogLikelihood(gp.likelihood, gp)
fit_gpytorch_mll(mll)

# Define the acquisition function
acq_function = qLogNoisyExpectedImprovement(
    model=gp,
    X_baseline=train_x,
    objective=LinearMCObjective(weights=torch.tensor([-1.0])),
    prune_baseline=True,
)

# Define the bounds for optimization
bounds = torch.tensor([
    [0.0] * N_NUMERICAL + [0.0] * N_CATEGORICAL + [-1.0] * N_GRAPH,
    [1.0] * N_NUMERICAL + [
        float(N_CATEGORICAL_VALUES_PER_CATEGORY - 1)] * N_CATEGORICAL + [
        len(X) - 1] * N_GRAPH,
])

# Define fixed categorical features
cats_per_column = {i: list(range(N_CATEGORICAL_VALUES_PER_CATEGORY)) for i in
                   range(N_NUMERICAL, N_NUMERICAL + N_CATEGORICAL)}
fixed_cats = [dict(zip(cats_per_column.keys(), combo, strict=False)) for combo in
              product(*cats_per_column.values())]

# Optimize the acquisition function with graph sampling
best_candidate, best_graph, best_score = optimize_acqf_graph(
    acq_function=acq_function,
    bounds=bounds,
    fixed_features_list=fixed_cats,
    train_graphs=train_graphs,
    num_graph_samples=16,
    num_restarts=2,
    raw_samples=32,
    q=1,
)

# Print the results
print(f"Best candidate: {best_candidate}")
print(f"Best graph: {best_graph}")
print(f"Best score: {best_score}")
print(f"Execution time: {time.time() - start_time:.2f} seconds")

vladislavalerievich added 2 commits January 24, 2025 14:11

Add BO over graphs

6794a05

Return the best graph from optimize_acqf_graph function

e94bbe8

vladislavalerievich self-assigned this Jan 24, 2025

vladislavalerievich requested a review from eddiebergman January 24, 2025 13:53

vladislavalerievich added the enhancement New feature or request label Jan 24, 2025

vladislavalerievich changed the title ~~Feature - Graph-Aware Bayesian Optimization~~ feat - Graph-Aware Bayesian Optimization Jan 24, 2025

vladislavalerievich changed the title ~~feat - Graph-Aware Bayesian Optimization~~ feat: Graph-Aware Bayesian Optimization Jan 24, 2025

vladislavalerievich added 2 commits January 24, 2025 18:45

Refactor method that uses lru_cache into a standalone function

1587778

Remove examples

10221a9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Graph-Aware Bayesian Optimization #179

feat: Graph-Aware Bayesian Optimization #179

vladislavalerievich commented Jan 24, 2025 •

edited

Loading

feat: Graph-Aware Bayesian Optimization #179

Are you sure you want to change the base?

feat: Graph-Aware Bayesian Optimization #179

Conversation

vladislavalerievich commented Jan 24, 2025 • edited Loading

vladislavalerievich commented Jan 24, 2025 •

edited

Loading