Trackio documentation

Track

Hugging Face's logo
Join the Hugging Face community

and get access to the augmented documentation experience

to get started

Track

Introduction

Trackio helps you organize your experiments within a project. A project is a collection of runs, where each run represents a single execution of your code with a specific set of parameters and results.

Initialization

To start tracking an experiment with Trackio, you first need to initialize a project with the init() function:

import trackio

trackio.init(project_name="my_project")
  • If the project already exists, it will be loaded.
  • If not, Trackio will create a new one.

In both cases, a new run is started automatically, ready for you to log data.

Naming your run

It’s a good idea to give each run a meaningful name for easier organization and later reference. You can set a name using the run_name parameter:

trackio.init(project_name="my_project", run_name="my_first_run")

If no name is provided, Trackio generates a default one.

Logging Data

Once your run is initialized, you can start logging data using the log() function:

trackio.log({"loss": 0.05})

Each call to log() automatically increments the step counter. If you want to log multiple metrics at once, pass them together:

trackio.log({
    "loss": 0.05,
    "accuracy": 0.95,
})

Logging tables

You can log tabular data using the Table class. This is useful for tracking results like predictions, or any structured data.

import pandas as pd

df = pd.DataFrame(
    {
        "prompt": ["Trackio", "Logging is"],
        "completion": ["is great!", "easy and fun!"],
        "reward": [0.123, 0.456],
    }
)
trackio.log(
    {
        ...
        "texts": trackio.Table(dataframe=df),
    }
)

Logging images

You can log images using the Image class.

trackio.log({"image": trackio.Image(value="path/to/image.png", caption="Image caption")})

Images can be logged from a path, a numpy array, or a PIL Image.

Logging videos

You can log videos using the Video class.

import trackio
import numpy as np

# Create a simple video from numpy array
frames = np.random.randint(0, 255, (10, 3, 64, 64), dtype=np.uint8)
video = trackio.Video(frames, caption="Random video", fps=30)
trackio.log({"my_video": video})

# Create a batch of videos
batch_frames = np.random.randint(0, 255, (3, 10, 3, 64, 64), dtype=np.uint8)
batch_video = trackio.Video(batch_frames, caption="Batch of videos", fps=15)
trackio.log({"batch_videos": batch_video})

# Create video from file path
video = trackio.Video("path/to/video.mp4", caption="Video from file")
trackio.log({"file_video": video})

Videos can be logged from a file path or a numpy array.

Numpy array requirements:

  • Must be of type np.uint8 with RGB values in the range [0, 255]
  • Shape should be either:
    • (frames, channels, height, width) for a single video
    • (batch, frames, channels, height, width) for multiple videos (will be tiled into a grid)

Finishing a Run

When your run is complete, finalize it with finish(). This marks the run as completed and saves all logged data:

trackio.finish()

Resuming a Run

If you need to continue a run (for example, after an interruption), you can resume it by calling init() again with the same project and run name, and setting resume="must":

trackio.init(project_name="my_project", run_name="my_first_run", resume="must")

This will load the existing run so you can keep logging data.

For more flexibility, use resume="allow". This will resume the run if it exists, or create a new one otherwise.

Tracking Configuration

You can also track configuration parameters for your runs. This is useful for keeping track of hyperparameters or other settings used in your experiments. You can log configuration data using the config parameter in the init() function:

trackio.init(
    project_name="my_project",
    run_name="my_first_run",
    config={
        "learning_rate": 0.001,
        "batch_size": 32,
    }
)
< > Update on GitHub