
Tutorial on Configuration Files

This tutorial introduces how to configure and organize the modular components of AIGVE through configuration files. AIGVE uses MMEngine's Python-style configuration system, which follows a modular, inheritance-based design that makes it convenient to set up various evaluation experiments. All parameters are defined in one centralized location and can be accessed just like values in a Python dict. The configuration system also supports inheritance, enabling better organization and management of complex configurations.
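
Since configs are plain Python modules loaded through MMEngine, you can inspect them programmatically. Below is a minimal sketch of the dict-like access; the config path is hypothetical, while Config.fromfile and the access patterns are standard MMEngine APIs:

from mmengine.config import Config

# Load a Python-style config file (the path here is a placeholder).
cfg = Config.fromfile('aigve/configs/gstvqa.py')

# Read values like a Python dict, or via attribute access.
print(cfg['val_dataloader']['batch_size'])
print(cfg.val_evaluator.type)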


Config file content

AIGVE uses a modular design in which every functional module can be configured through the config file. Taking gstvqa as an example, we introduce each field in the config according to its function module.

Default Base configuration

These configurations define the default setup of the MMEngine runner system, including the evaluation loop class, the model wrapper, logging levels, and hook behavior. Since these configurations are not specific to a particular evaluation metric, they are defined in the base configuration for better modularity and reusability.

from mmengine.config import read_base
with read_base():
    from ._base_.default import *

where the default config is defined as:

from core import AIGVELoop, AIGVEModel

default_scope = None

log_level = 'INFO'

model = dict(type=AIGVEModel)

default_hooks = None # Execute default hook actions as https://github.com/open-mmlab/mmengine/blob/85c83ba61689907fb1775713622b1b146d82277b/mmengine/runner/runner.py#L1896

val_cfg = dict(type=AIGVELoop)

  • default_scope and log_level control default scoping and logging behaviors.

  • model is a required component of MMEngine's runner pipeline. Since AIGVE is an evaluation-only framework, it defaults to AIGVEModel, which lets each loaded data batch flow directly into the evaluator without model-specific logic.

  • default_hooks is set to None, so the runner falls back to MMEngine's default hook actions (see the link in the config comment above).

  • val_cfg defines the behavior of the whole evaluation pipeline. It is set to AIGVELoop, the core loop class in AIGVE that runs evaluations from dataset loading to metric computation. For more details about AIGVELoop, please refer to Tutorial on the AIGVE Loop.
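
Because the base values are imported into the child config's namespace, a metric-specific config can override any of them by simple reassignment. A minimal sketch of such a child config file (the log_level override is just an illustration):

from mmengine.config import read_base

with read_base():
    from ._base_.default import *

# Override an inherited value by reassigning it in the child config.
log_level = 'DEBUG'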

Dataset & Loader configuration

Following AIGVELoop, val_dataloader is required for batched data loading. Different AIGVE metrics may require distinct data sources, data loading manners, data sampling strategies, or data preprocessing pipelines. As such, they may use different dataset classes or set different parameters when building the dataloader. All customizable AIGVE datasets inherit from PyTorch's Dataset class. In gstvqa, val_dataloader is configured as:

from mmengine.dataset import DefaultSampler
from datasets import GSTVQADataset

val_dataloader = dict(
    batch_size=1,
    num_workers=4,
    persistent_workers=True,
    drop_last=False,
    sampler=dict(type=DefaultSampler, shuffle=False),
    dataset=dict(
        type=GSTVQADataset,
        # video_dir='AIGVE_Tool/aigve/data/toy/evaluate/', # it has 16 frames for each video, each frame is [512, 512, 3]
        # prompt_dir='AIGVE_Tool/aigve/data/toy/annotations/evaluate.json',
        video_dir='AIGVE_Tool/aigve/data/AIGVE_Bench/videos_3frame/', # it has 81 frames for each video, each frame is [768, 1360, 3]
        prompt_dir='AIGVE_Tool/aigve/data/AIGVE_Bench/annotations/test.json',
        model_name='vgg16',  # User can choose 'vgg16' or 'resnet18'
        max_len=3,
    )
)
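
For reference, the skeleton below sketches what such a customizable dataset looks like. The class and field handling are hypothetical, but the Dataset interface (__len__/__getitem__) is what the dataloader above expects:

import json
from torch.utils.data import Dataset

class MyVideoDataset(Dataset):
    """Hypothetical dataset skeleton; real AIGVE datasets follow this pattern."""

    def __init__(self, video_dir, prompt_dir, max_len=3):
        self.video_dir = video_dir
        self.max_len = max_len
        # Annotations pair each video with its prompt (structure assumed).
        with open(prompt_dir) as f:
            self.annotations = json.load(f)

    def __len__(self):
        return len(self.annotations)

    def __getitem__(self, index):
        ann = self.annotations[index]
        # Real implementations decode up to max_len video frames here;
        # we return the raw annotation as a placeholder.
        return ann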

Evaluator configuration

Following AIGVELoop, val_evaluator is required for batch data processing, evaluation, and final metric score computation. Each AIGVE metric generally defines its own evaluator class to perform metric-specific operations, such as model loading, dynamic feature extraction, per-sample evaluation, score aggregation, or flexible computational-resource management. All modular evaluator metrics inherit from MMEngine's BaseMetric class. In gstvqa, val_evaluator is configured as:

from metrics.video_quality_assessment.nn_based.gstvqa.gstvqa_metric import GstVqa

val_evaluator = dict(
    type=GstVqa,
    model_path="metrics/video_quality_assessment/nn_based/gstvqa/GSTVQA/TCSVT_Release/GVQA_Release/GVQA_Cross/models/training-all-data-GSTVQA-konvid-EXP0-best",
)

  • The gstvqa config sets the evaluator type to GstVqa.

  • Some metrics require downloading pretrained models manually. Make sure they are downloaded and placed at the paths specified in the configuration files. In this example, make sure model_path points to the pretrained model you downloaded.

  • For more details about customizing such metric evaluator, please refer to Tutorial on Modular Metrics.
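
For orientation, the skeleton below shows the two methods a BaseMetric subclass implements. The class name and scoring logic are hypothetical, but process() and compute_metrics() are the hooks MMEngine calls per batch and at the end of the loop:

from mmengine.evaluator import BaseMetric

class MyMetric(BaseMetric):
    """Hypothetical evaluator skeleton; real AIGVE metrics follow this pattern."""

    def process(self, data_batch, data_samples):
        # Called once per batch: compute per-sample scores and buffer them.
        # The 'score' key is assumed for illustration only.
        for sample in data_samples:
            self.results.append({'score': float(sample['score'])})

    def compute_metrics(self, results):
        # Called once after the loop: aggregate the buffered per-sample results.
        scores = [r['score'] for r in results]
        return {'mean_score': sum(scores) / max(len(scores), 1)}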


Tips for Configuration Files

  • Place shared/default settings in _base_ folders and reuse them across multiple configs to avoid duplication and keep configs clean.

  • Ensure path configurations such as video_dir, prompt_dir, and model_path are correct.

  • Keep each config modular. Separate different functional blocks into logical sections (e.g., dataloader, evaluator).

  • Use a toy-version dataset to test new configs and logic before scaling up.

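
When a config looks right, you can smoke-test it end to end with MMEngine's standard entry point. A minimal launch sketch, assuming placeholder paths (AIGVE may ship its own runner script; check the project README):

from mmengine.config import Config
from mmengine.runner import Runner

cfg = Config.fromfile('aigve/configs/gstvqa.py')  # hypothetical path
cfg.work_dir = './work_dirs/gstvqa'               # where logs are written

runner = Runner.from_cfg(cfg)
runner.val()  # run the evaluation loop defined by val_cfg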


What's Next?

After configuring the evaluation pipeline, you can proceed to: