Task-Guided Multi-Annotation Triplet Learning for Remote Sensing Representations
This repository implements a task-guided approach to multi-annotation triplet learning (TG-MATL) for learning robust embeddings from remote sensing imagery. The framework compares different loss functions and task-head architectures across multiple embedding types (CLIP, DINOv2, MAE).
Managed by Meilun Zhou through GatorSense.
- Python 3.8+
- 8-16 GB RAM (for training)
- GPU recommended (NVIDIA with CUDA support)
Option 1: Conda

```bash
conda env create -f environment.yml
conda activate matl
```

Option 2: pip

```bash
pip install -r requirements.txt
```

Before running experiments, prepare the following:
Create a `data.npz` file in the root directory containing:

```python
import numpy as np

data = {
    'rgb': rgb_images,            # Shape: (N, 300, 300, 3) - RGB images
    'thermal': thermal_images,    # Shape: (N, 300, 300, 1) - Thermal images
    'class_label': class_labels,  # Shape: (N,) - Class labels (0-2)
    'box_label': bounding_boxes,  # Shape: (N, 4) as [xmin, xmax, ymin, ymax]
}
np.savez('data.npz', **data)
```

Create an `./embeddings/` directory with pre-computed embeddings:
```
embeddings/
├── awir_clip_emb.npy    # CLIP embeddings (N, 512)
├── awir_dinov2_emb.npy  # DINOv2 embeddings (N, 768)
└── awir_mae_emb.npy     # MAE embeddings (N, 768)
```
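Before training, it is worth sanity-checking that `data.npz` and the embedding files describe the same number of samples. The sketch below builds a tiny placeholder dataset with the documented shapes and checks this alignment; all values are random stand-ins, and only the shapes follow the spec above (an in-memory buffer stands in for the files on disk):

```python
import io
import numpy as np

# Placeholder dataset with the documented shapes (values are random stand-ins).
N = 4
rng = np.random.default_rng(0)
data = {
    "rgb": rng.random((N, 300, 300, 3), dtype=np.float32),
    "thermal": rng.random((N, 300, 300, 1), dtype=np.float32),
    "class_label": rng.integers(0, 3, size=N),
    "box_label": rng.integers(0, 300, size=(N, 4)),
}

buf = io.BytesIO()            # in-memory stand-in for data.npz
np.savez(buf, **data)
buf.seek(0)
loaded = np.load(buf)

clip_emb = rng.random((N, 512))   # stand-in for awir_clip_emb.npy

# Every array must describe the same N samples, and labels must be in {0, 1, 2}.
assert loaded["rgb"].shape == (N, 300, 300, 3)
assert loaded["class_label"].min() >= 0 and loaded["class_label"].max() <= 2
assert clip_emb.shape[0] == loaded["rgb"].shape[0]
```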
The following modules are imported but not included; they should be provided separately:

- `triplet_loss`: standard triplet loss implementations
- `dual_triplet_loss_clf_aspect`: multi-annotation triplet loss
- `continuous_triplet_loss`: continuous-feature triplet loss
- `TripletNetwork_Online`: neural network architectures (projection/task heads)

Place these in the root directory alongside the scripts.
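As a rough illustration of what the external `triplet_loss` module computes, here is a minimal NumPy version of the standard triplet loss. This is a simplified sketch only; the repo's implementation presumably operates on batches inside the training graph:

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.1):
    """Standard triplet loss: pull anchor toward the positive,
    push it away from the negative by at least `margin`."""
    d_pos = np.sum((anchor - positive) ** 2, axis=-1)  # squared distance to positive
    d_neg = np.sum((anchor - negative) ** 2, axis=-1)  # squared distance to negative
    return np.maximum(d_pos - d_neg + margin, 0.0).mean()

a = np.array([[0.0, 0.0]])
p = np.array([[0.1, 0.0]])   # close to anchor
n = np.array([[1.0, 0.0]])   # far from anchor
loss = triplet_loss(a, p, n, margin=0.1)
# d_pos = 0.01, d_neg = 1.0, so max(0.01 - 1.0 + 0.1, 0) = 0: triplet satisfied
```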
Trains projection heads using four loss functions on three embedding types with 8-fold stratified cross-validation:
Loss Functions:
- DTL (Deep Triplet Loss): Standard triplet loss on class labels
- DTL-Hard: Hard negative mining variant
- MATL (Multi-Annotation Triplet Loss): Combines class + box triplet losses
- TG-MATL (Task-Guided MATL): MI-weighted sample selection
Embeddings: CLIP (512-dim), DINOv2 (768-dim), MAE (768-dim), all pre-computed
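One plausible reading of TG-MATL's MI-weighted sample selection is: rank samples by a per-sample score (e.g. a mutual-information-based score), keep the top fraction, and mix in a random fraction of the remainder. The sketch below illustrates only this top-plus-random split; the actual scoring rule lives in the repo's loss modules, and `select_samples` is a hypothetical helper:

```python
import numpy as np

def select_samples(scores, top_percent=0.5, random_percent=0.1, seed=0):
    """Keep the highest-scoring fraction of samples plus a random
    fraction of the remainder (hypothetical sketch of TG-MATL selection)."""
    rng = np.random.default_rng(seed)
    n = len(scores)
    order = np.argsort(scores)[::-1]           # indices sorted by descending score
    n_top = int(round(top_percent * n))
    top = order[:n_top]                        # high-MI samples
    rest = order[n_top:]
    n_rand = min(int(round(random_percent * n)), len(rest))
    rand = rng.choice(rest, size=n_rand, replace=False)  # random extras
    return np.concatenate([top, rand])

idx = select_samples(np.arange(100, dtype=float), top_percent=0.4, random_percent=0.1)
# 40 top-scoring indices plus 10 random others: 50 unique samples
```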
```bash
cd experiments
python igarss_exp1.py --margin 0.1 --batch_size 32 --test_size 0.7
cd ..
```

Outputs:

- Models: `results/trained_models/exp1_emb_proj/{embedding}/{method}_best_fold{i}.h5`
- Test embeddings: `results/exp1_test_projections/{embedding}/{method}_proj_val0.7_fold{i}.npy`
- Timings: `results/exp1_timings/timings_test0.7.csv`
Grid search over TG-MATL hyperparameters:
- `top_percent`: [40%, 50%, 60%, 70%] (high mutual-information samples)
- `random_percent`: [5%, 10%, 15%, 20%, 25%, 30%] (random samples)
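The sweep enumerates the cross product of the two hyperparameter lists, giving 4 × 6 = 24 TG-MATL configurations. The sketch below only makes the search space explicit; how `igarss_exp1_sweep.py` iterates it internally is an assumption:

```python
from itertools import product

# Cross product of the sweep's two hyperparameter grids.
top_percents = [0.40, 0.50, 0.60, 0.70]
random_percents = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30]
grid = list(product(top_percents, random_percents))  # 24 (top, random) pairs
```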
```bash
cd experiments
python igarss_exp1_sweep.py --margin 0.1 --batch_size 32 --test_size 0.7
cd ..
```

Trains task-specific heads on learned embeddings:
- Classification head: 3-way object classifier
- Regression head (box features): Normalized scale/aspect ratio
- Regression head (box location): Normalized center coordinates
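The two regression targets can be derived from the `box_label` arrays. The sketch below assumes 300×300 images and boxes stored as `[xmin, xmax, ymin, ymax]` (per the `data.npz` spec); the exact normalization used by the repo may differ:

```python
import numpy as np

def box_targets(boxes, img_size=300.0):
    """Derive normalized scale/aspect and center-coordinate regression
    targets from [xmin, xmax, ymin, ymax] boxes (illustrative sketch)."""
    xmin, xmax, ymin, ymax = boxes.T.astype(float)
    w = (xmax - xmin) / img_size
    h = (ymax - ymin) / img_size
    cx = (xmin + xmax) / (2 * img_size)        # normalized center x
    cy = (ymin + ymax) / (2 * img_size)        # normalized center y
    scale = np.sqrt(w * h)                     # normalized box scale
    aspect = w / np.maximum(h, 1e-8)           # width/height aspect ratio
    feats = np.stack([scale, aspect], axis=1)  # box-feature head targets
    locs = np.stack([cx, cy], axis=1)          # box-location head targets
    return feats, locs

boxes = np.array([[0, 150, 0, 150], [100, 200, 100, 300]])
feats, locs = box_targets(boxes)
```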
Evaluates on:
- Base embeddings (no projection)
- DTL, DTL-Hard, MATL, TG-MATL projections
```bash
cd experiments
python igarss_exp2.py --margin 0.1 --batch_size 32 --test_size 0.7
cd ..
```

Outputs:

- Task heads: `results/trained_models/exp2_task_heads/{embedding}/{method}/*_head_run*.h5`
- Metrics: `results/exp2_results/{embedding}/{method}_fold{i}_taskhead_metrics.npy`
Trains task heads using the TG-MATL models from `igarss_exp1_sweep.py`:

```bash
cd experiments
python igarss_exp2_sweep.py --margin 0.1 --batch_size 32 --test_size 0.7
cd ..
```

- 8-fold Stratified K-Fold on all samples
- Train/Val split: 30/70 ratio within training folds
- Test split: Fixed 30% held out from fold construction
- Reproducibility: Random seeds set for deterministic splits
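The CV protocol above can be sketched with scikit-learn's `StratifiedKFold`; the exact seed value below is an assumption, and the toy labels only stand in for the real 3-class annotations:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# Toy stand-in: 3 balanced classes, 48 samples total.
y = np.repeat([0, 1, 2], 16)
X = np.zeros((len(y), 4))   # placeholder features

# Fixed random_state makes the 8 folds deterministic across runs.
skf = StratifiedKFold(n_splits=8, shuffle=True, random_state=42)
folds = list(skf.split(X, y))
# 8 disjoint validation folds, each stratified across the 3 classes
```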
- Classification: Accuracy, Precision, Recall, F1-score
- Regression: MSE, R² score for box features and location
- Timing: Training duration per fold/method
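For reference, minimal NumPy versions of the reported metrics are shown below; the repo presumably uses scikit-learn's implementations, so these are illustrative only:

```python
import numpy as np

def accuracy(y_true, y_pred):
    """Fraction of correct class predictions."""
    return np.mean(y_true == y_pred)

def mse(y_true, y_pred):
    """Mean squared error for the regression heads."""
    return np.mean((y_true - y_pred) ** 2)

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return 1.0 - ss_res / ss_tot

y = np.array([0.0, 1.0, 2.0, 3.0])
acc = accuracy(np.array([0, 1, 2]), np.array([0, 1, 1]))  # 2 of 3 correct
err = mse(y, y + 0.5)                                     # constant 0.5 error
```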
```
results/
├── exp1_timings/
│   └── timings_test0.7.csv                # Per-fold training times
├── trained_models/
│   ├── exp1_emb_proj/
│   │   ├── clip/{method}_best_fold{i}.h5
│   │   ├── dinov2/{method}_best_fold{i}.h5
│   │   └── mae/{method}_best_fold{i}.h5
│   └── exp2_task_heads/
│       └── {embedding}/{method}/
│           ├── class_head_run{i}.h5
│           ├── reg_boxfeat_head_run{i}.h5
│           └── reg_boxloc_head_run{i}.h5
├── exp1_test_projections/
│   └── {embedding}/{method}_proj_val0.7_fold{i}.npy
└── exp2_results/
    └── {embedding}/{method}_fold{i}_taskhead_metrics.npy
```
This repository is designed for maximum reproducibility:

- Relative paths: all paths use `./data/`, `./embeddings/`, `./results/`
- Fixed random seeds: set before data splits and model initialization
- Deterministic CV: StratifiedKFold with fixed `random_state`
- Minimal dependencies: `requirements.txt` pins all package versions
- Documented imports: all dependencies clearly specified
To reproduce results:

1. Install dependencies: `pip install -r requirements.txt`
2. Place `data.npz` and the embeddings in the appropriate directories
3. Ensure the external modules (`triplet_loss`, `TripletNetwork_Online`, etc.) are available in the root directory
4. Run the experiments:

   ```bash
   cd experiments
   python igarss_exp1.py --margin 0.1 --batch_size 32 --test_size 0.7
   python igarss_exp2.py --margin 0.1 --batch_size 32 --test_size 0.7
   cd ..
   ```

5. Results are saved to `./results/` with the structure shown above
This project is licensed under the MIT License. See the LICENSE file for details.
For questions or issues, please contact: zhou.m@ufl.edu
Last Updated: March 2026