Skip to main content

Comparing Experiments

The ClearML Web UI provides features for comparing experiments, allowing to locate, visualize, and analyze the differences in experiments’ results and their causes. You can view the differences in:

  • Details - Compare experiment source code, package versions, models, configuration objects, and other details.
  • Hyperparameters
  • Scalars - Compare experiment metrics:
    • Values - Compare minimal, maximal or last reported values in a concise comparison table
    • Graphs - Overlay compared experiments in a single graph per metric
  • Plots - Compare experiment plots
  • Debug samples - Compare debug samples by iteration

With these comparisons, you can investigate the impact of different setups on your experiment results, and gain insight for crafting future experiments.

Selecting Experiments to Compare

To select experiments to compare:

  1. Go to an experiments table that includes the experiments to be compared.
  2. Select the experiments to compare. Once multiple experiments are selected, the batch action bar appears.
  3. In the batch action bar, click COMPARE.

The comparison page opens in the DETAILS tab with the experiments compared side by side.

Modifying Experiment Selection

Click the EXPERIMENTS button to view your currently compared experiments. Click X on a listed experiment to remove it from the comparison.

Experiments list

You can add/remove experiments to your comparison:

  1. Click the + button in any of the comparison tabs. This opens up a window with an experiment table with the currently compared experiments at the top. Adding experiments
  2. Find the experiments to add by sorting and filtering the experiments with the appropriate column header controls. Alternatively, use the search bar to find experiments by name.
  3. Select experiments to include in the comparison (and/or clear the selection of any experiments you wish to remove).
  4. Click APPLY.

Sharing Comparison Page

To share a comparison page, copy the full URL from the address bar and send it to a teammate to collaborate. They will get the exact same page (including selected tabs etc.).

Embedding Comparison Visualization

To embed plots and debug samples from the comparison pages in your Reports, hover over the resource and click Embed code, which will copy to clipboard the embed code to put in your Reports. These visualizations are updated live as the experiments update. The Enterprise Plan and Hosted Service support embedding resources in external tools (e.g. Notion).

Comparison Modes

The comparison pages provide the following views:

Side-by-side Textual Comparison

In the Details and Hyperparameters (Values view) tabs, you can view differences in the experiments' parameters' nominal values. The Details tab displays the experiments' execution details (source code, uncommitted changes, python packages), models, artifacts, configuration objects, and additional general information. Hyperparameters (Values view) displays the experiments' hyperparameter and their values.

The experiments are laid out in vertical cards, so each field is lined up side-by-side. The experiment on the left is used as the base experiment, to which the other experiments are compared. You can set a new base experiment in one of the following ways:

  • Click on Switch base experiment on the top right of the experiment that will be the new base.
  • Click on Pan the new base experiment and drag it all the way to the left

The differences between the experiments are highlighted. Easily locate value differences by clicking click Up arrow (previous diff) or Down arrow (next diff) in the tab header. Obscure identical fields by switching on the Hide Identical Fields toggle.

Use the search bar to find any field names or values. Lines that match the search query are highlighted, and you can navigate between search results.

Side-by-side textual comparison

Tabular Scalar Comparison

The Scalars tab Values view lays out the experiments’ metric values in a table: a row per metric/variant and a column for each experiment. Select from the dropdown menu which metric values to display:

  • Last Values: The last reported values for each experiment
  • Min Values: The minimal value reported throughout the experiment execution
  • Max Values: The maximal value reported throughout the experiment execution

You can download the scalar comparison table as a CSV file by clicking Download.

Switch on the Show row extremes toggle to highlight each variant's maximum and minimum values.

side-by-side scalar comparison

Parallel Coordinates Mode

The Hyperparameters tab's Parallel Coordinates comparison shows experiments’ hyperparameter impact on a specific metric.

To compare by metric:

  1. Under Performance Metric, select a metric to compare for
  2. Select the metric values to use in the plot:
    • LAST - The final value, or the most recent value, for currently running experiments
    • MIN - Minimal value
    • MAX - Maximal value
  3. In Parameters, select the hyperparameters to compare.

For example, plot the metric/variant accuracy/total against the hyperparameters base_lr, dropout, and number_of_epochs.

Parallel coordinates

To focus on a specific experiment, hover over its name in the graph legend.

To hide an experiment, click its name in the graph legend (click again to bring back).

Plot Comparison

The Scalars (Graph view) and Plots tabs compare experiments’ plots.

The Scalars tab displays scalar values as time series line charts. The Plots tab compares the last reported iteration sample of each metric/variant combination per compared experiment.

Line, scatter, and bar graphs are compared by overlaying each metric/variant from all compared experiments' into a single comparative plot.

Use Group by to select how to group plots:

  • Metric - All variants for a metric appear on the same plot.
  • Metric+Variant (default) - Every variant appears on its own plot.

Scalar comparison

Other plot types are displayed separately for each experiment.

non-merged comparison

All single value scalars are plotted into a single clustered bar chart under the "Summary" title, where each cluster represents a reported metric, and each bar in the cluster represents an experiment.

Single scalar comparison

Hover over plots to access plot controls (see Scalar Plot Tools).

Side-by-side Debug Sample Comparison

Compare debug samples at different iterations to examine how your experiments perform throughout their execution.

You can view debug samples by metric in the reported iterations. Filter the samples by metric by selecting a metric from the dropdown menu above the samples. The most recent iteration appears first. To navigate between iterations, click Left arrow (older images),

Right arrow (newer images), or right arrow, newest image (newest images).

Click Sync selection in order to synchronize iteration and metric selection across experiments. For example, if you select a metric for one experiment's debug samples, the same metric will be automatically selected for the rest of the experiments in the comparison.

Debug sample comparison

Open a debug sample (image, audio, or video) in the viewer or player, by clicking the thumbnail.

Debug Sample viewer

To move to the same sample in another iteration, click Left arrow (previous), Right arrow (next), or move the slider.