Synthetic Clusters

Prerequisites

Add the Congrads package to the current Colab notebook environment and install it.

Show code cell content

Hide code cell content

!pip install "congrads[examples]==0.3.0"

Requirement already satisfied: congrads==0.3.0 in /usr/local/lib/python3.12/dist-packages (from congrads[examples]==0.3.0) (0.3.0)
Requirement already satisfied: numpy>=1.24.0 in /usr/local/lib/python3.12/dist-packages (from congrads==0.3.0->congrads[examples]==0.3.0) (2.0.2)
Requirement already satisfied: pandas>=1.5.0 in /usr/local/lib/python3.12/dist-packages (from congrads==0.3.0->congrads[examples]==0.3.0) (2.2.2)
Requirement already satisfied: torch>=2.0.0 in /usr/local/lib/python3.12/dist-packages (from congrads==0.3.0->congrads[examples]==0.3.0) (2.9.0+cpu)
Requirement already satisfied: torchvision>=0.15.1 in /usr/local/lib/python3.12/dist-packages (from congrads==0.3.0->congrads[examples]==0.3.0) (0.24.0+cpu)
Requirement already satisfied: tqdm>=4.65.0 in /usr/local/lib/python3.12/dist-packages (from congrads==0.3.0->congrads[examples]==0.3.0) (4.67.1)
Requirement already satisfied: matplotlib>=3.7.0 in /usr/local/lib/python3.12/dist-packages (from congrads[examples]==0.3.0) (3.10.0)
Requirement already satisfied: tensorboard>=2.18.0 in /usr/local/lib/python3.12/dist-packages (from congrads[examples]==0.3.0) (2.19.0)
Requirement already satisfied: contourpy>=1.0.1 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (1.3.3)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (0.12.1)
Requirement already satisfied: fonttools>=4.22.0 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (4.61.1)
Requirement already satisfied: kiwisolver>=1.3.1 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (1.4.9)
Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (25.0)
Requirement already satisfied: pillow>=8 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (11.3.0)
Requirement already satisfied: pyparsing>=2.3.1 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (3.3.1)
Requirement already satisfied: python-dateutil>=2.7 in /usr/local/lib/python3.12/dist-packages (from matplotlib>=3.7.0->congrads[examples]==0.3.0) (2.9.0.post0)
Requirement already satisfied: pytz>=2020.1 in /usr/local/lib/python3.12/dist-packages (from pandas>=1.5.0->congrads==0.3.0->congrads[examples]==0.3.0) (2025.2)
Requirement already satisfied: tzdata>=2022.7 in /usr/local/lib/python3.12/dist-packages (from pandas>=1.5.0->congrads==0.3.0->congrads[examples]==0.3.0) (2025.3)
Requirement already satisfied: absl-py>=0.4 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (1.4.0)
Requirement already satisfied: grpcio>=1.48.2 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (1.76.0)
Requirement already satisfied: markdown>=2.6.8 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (3.10)
Requirement already satisfied: protobuf!=4.24.0,>=3.19.6 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (5.29.5)
Requirement already satisfied: setuptools>=41.0.0 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (75.2.0)
Requirement already satisfied: six>1.9 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (1.17.0)
Requirement already satisfied: tensorboard-data-server<0.8.0,>=0.7.0 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (0.7.2)
Requirement already satisfied: werkzeug>=1.0.1 in /usr/local/lib/python3.12/dist-packages (from tensorboard>=2.18.0->congrads[examples]==0.3.0) (3.1.5)
Requirement already satisfied: filelock in /usr/local/lib/python3.12/dist-packages (from torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (3.20.3)
Requirement already satisfied: typing-extensions>=4.10.0 in /usr/local/lib/python3.12/dist-packages (from torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (4.15.0)
Requirement already satisfied: sympy>=1.13.3 in /usr/local/lib/python3.12/dist-packages (from torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (1.14.0)
Requirement already satisfied: networkx>=2.5.1 in /usr/local/lib/python3.12/dist-packages (from torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (3.6.1)
Requirement already satisfied: jinja2 in /usr/local/lib/python3.12/dist-packages (from torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (3.1.6)
Requirement already satisfied: fsspec>=0.8.5 in /usr/local/lib/python3.12/dist-packages (from torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (2025.3.0)
Requirement already satisfied: mpmath<1.4,>=1.1.0 in /usr/local/lib/python3.12/dist-packages (from sympy>=1.13.3->torch>=2.0.0->congrads==0.3.0->congrads[examples]==0.3.0) (1.3.0)
Requirement already satisfied: markupsafe>=2.1.1 in /usr/local/lib/python3.12/dist-packages (from werkzeug>=1.0.1->tensorboard>=2.18.0->congrads[examples]==0.3.0) (3.0.3)

Import the necesary functions and classes.

Define utility functions for plotting and other.

Show code cell content

Hide code cell content

def plot_decision_boundary(network: Module, dataset: Dataset):
    fig, ax = plt.subplots(figsize=(6, 4))

    # Create a meshgrid over the feature space
    x_min, x_max = 0, 1
    y_min, y_max = 0, 1
    xx, yy = np.meshgrid(np.linspace(x_min, x_max, 300), np.linspace(y_min, y_max, 300))
    grid = np.c_[xx.ravel(), yy.ravel()]
    grid_tensor = torch.tensor(grid, dtype=torch.float32).to(next(network.parameters()).device)

    # Model prediction
    network.eval()
    with torch.no_grad():
        probs = network({"input": grid_tensor})["output"]
        Z = probs[:, 1]  # probability of class 1 (= 1 - probability of class 0)
        Z = Z.cpu().numpy().reshape(xx.shape)

    # Plot filled contour
    contour = ax.contourf(xx, yy, Z, levels=50, cmap="coolwarm", alpha=0.6, vmin=0, vmax=1)
    ax.contour(xx, yy, Z, levels=[0.5], colors="k", linewidths=2)  # decision boundary
    ax.set_title("Decision Boundary")

    # Plot original data points for each class separately so they can have legend entries
    class0 = dataset.labels == 0
    class1 = dataset.labels == 1
    ax.scatter(
        dataset.data[class0, 0],
        dataset.data[class0, 1],
        c="blue",
        edgecolors="k",
        alpha=0.8,
        label="Class A",
    )
    ax.scatter(
        dataset.data[class1, 0],
        dataset.data[class1, 1],
        c="red",
        edgecolors="k",
        alpha=0.8,
        label="Class B",
    )

    ax.set_xlabel("x")
    ax.set_ylabel("y")

    # Add hatched region
    hatched_area = Rectangle(
        (0, 0),  # bottom-left corner
        0.25,  # width
        1,  # height
        facecolor="none",
        edgecolor="green",
        hatch="//",
        linewidth=1,
        alpha=0.7,
        label="Constrained region",
    )
    ax.add_patch(hatched_area)

    # Colorbar
    sm = ScalarMappable(cmap="coolwarm", norm=Normalize(vmin=0, vmax=1))
    sm.set_array([])
    cbar = fig.colorbar(sm, ax=ax)
    cbar.set_label("Probability of class B")
    cbar.set_ticks([0, 0.25, 0.5, 0.75, 1])

    # Add legend
    ax.legend(loc="upper right", frameon=True)

    # Set limits
    ax.set_xlim(0, 1)
    ax.set_ylim(0, 1)

    return ax

Define custom classes.

Before starting with the general training procedure, we fix the randomizer seeds and get the device on which we are training our model:

We have a built-in Seeder class that will pseudo-randomly fix the seeds of random number generators, Numpy and PyTorch.
If there is a GPU available, use it. Otherwise fall back to CPU.

Problem description

In this second example, the goal is to make a binary classification on some noisy training examples.

We aim to train a classifier that respects constraints put on the network. More specifically, for a certain part in the domain we want to enforce a high probability for class A.

Mathematically: \(x \le 0.25\), then \(P(\text{Blue}) \ge 0.7\)

Dataset

For this example we will use the built-in SyntheticClusters dataset and we will split the dataset into training, validation and test sets using another built-in utility function.

# Load and preprocess data
dataset = SyntheticClusters(
    cluster_centers=[
        (0.3, 0.70),
        (0.25, 0.25),
        (0.13, 0.45),
        (0.35, 0.5),
        (0.67, 0.6),
        (0.80, 0.55),
        (0.75, 0.35),
        (0.55, 0.15),
        (0.6, 0.85),
    ],
    cluster_sizes=[100, 50, 20, 50, 50, 15, 100, 50, 50],
    cluster_std=[0.06, 0.07, 0.04, 0.06, 0.07, 0.04, 0.07, 0.06, 0.06],
    cluster_labels=[0, 0, 1, 0, 1, 0, 1, 0, 0],
)
loaders = split_data_loaders(
    dataset,
    loader_args={"batch_size": 100, "shuffle": True},
    valid_loader_args={"shuffle": False},
    test_loader_args={"shuffle": False},
)

Network

For this example we will use a slightly modified fully connected MLPNetwork with a softmax layer on the output. For this use the prepared MLPNetworkWithSoftmax network.

# Instantiate an MLPNetworkWithSoftmax, configure the parameters
network = MLPNetworkWithSoftmax(n_inputs=2, n_outputs=2, n_hidden_layers=3, hidden_dim=50)

# Push the network to the current device
network = network.to(device)

Descriptor

Now that we have the dataset and the network defined, we can set up an important feature in the Congrads toolbox, called the Descriptor.

Please assign tags to all inputs and all outputs. Flag the input tags as constant.

Example:

descriptor = Descriptor()
descriptor.add("input", "t", 0, constant=True)   # Assigns tag 't' to input data tensor column 0

Refer to the descriptor documentation for more information.

# Instantiate descriptor
descriptor = Descriptor()

# Add constant tags for the inputs
descriptor.add("input", "x", 0, constant=True)
descriptor.add("input", "y", 1, constant=True)

# Add variable tags for the outputs
descriptor.add("output", "ProbA", 0)
descriptor.add("output", "ProbB", 1)

Constraints

With the help of the descriptor, we can easily reference certain parts of the neural network, and so we can now define our constraints.

We have numerous pre-defined constraints available that allow a variety of options. Some examples:

ScalarConstraint allows enforcing that data referenced by a tag should be above or below a certain scalar value
ImplicationConstraint allows conditionally enforcing constraints. If constraint X satisfies, then enforce constraint Y.

In this example, we want to train a classifier that respects constraints put on the network. More specifically, for a certain part in the domain we want to enforce a high probability for class A.

The objective: \(x \le 0.25\), then \(P(\text{Blue}) \ge 0.7\)

# Assign descriptor to constraint base
Constraint.descriptor = descriptor

# Assign device to constraint base
Constraint.device = device

# Define constraints
constraints = [
    ImplicationConstraint(
        head=ScalarConstraint("x", "<=", 0.25),
        body=ScalarConstraint("ProbA", ">=", 0.7),
    ),
]

Loss and optimizer

For this example, we will use a modified loss function that will first convert the predicted probabilities back into logits, and then apply an NLLLoss function to it. Use the prepared NNLLossFromProb for this.

We will stick to the Adam optimizer for this example.

# Instantiate loss criterion
criterion = NNLLossFromProb()

# Instantiate optimizer
optimizer = Adam(network.parameters(), lr=0.001)

Metric manager

To allow keeping track of constraint satisfaction rates for each individual constraints, as well as the losses and possibly other metrics, we instantiate a metric manager.

# Initialize metric manager
metric_manager = MetricManager()

Core

The CongradsCore is the brain of the toolbox. It orchestrates the functionality of all previously created objects, integrating descriptors, constraints, and optimization strategies to perform constraint-guided gradient descent. Essentially, it manages the full training or evaluation pipeline: preparing input and output tensors, applying constraints, computing gradients, updating model parameters, and generating predictions in a coordinated manner.

First, we define a callback that handles plotting per epoch.

Refer to the Congrads documentation for more info.

callback_manager = CallbackManager()

class PlottingCallback(Callback):
    def on_epoch_end(self, data, ctx):
        clear_output(wait=True)
        plot_decision_boundary(network, dataset)
        plt.show()
        plt.close()

callback_manager.add(PlottingCallback())

<CallbackManager callbacks=['PlottingCallback'] ctx_keys=[]>

# Instantiate core
core = CongradsCore(
    descriptor=descriptor,
    constraints=constraints,
    dataloader_train=loaders[0],
    dataloader_valid=loaders[1],
    dataloader_test=loaders[2],
    network=network,
    callback_manager=callback_manager,
    criterion=criterion,
    optimizer=optimizer,
    metric_manager=metric_manager,
    device=device,
    enforce_all=True,
)

Finally, we can start training by running the core.fit(...) function. This function allows setting the maximum epochs and callback functions and will start the training process.

# Start training
core.fit(max_epochs=350)

../_images/f9868c9c728332abcaa00797e7bf793a290f9a1b7bf78e48a8a26ea84a122707.png

Epoch: 100%|██████████| 350/350 [03:12<00:00,  1.82it/s]