# Representation Learning and Dual Attention Fusion
> ⚠️ **Disclaimer:** This repo is a work in progress and may not run flawlessly out-of-the-box. We're sharing the code as-is for reference and reproducibility.
This repository focuses on multimodal pain assessment using both facial video and bio-physiological signals. The architecture consists of two main stages:
1. **Representation Learning**
   - Video Branch: Masked Autoencoder for facial expression modeling, based on MARLIN.
   - Signal Branch: Masked Autoencoder for multivariate time-series signals, based on MVTS Transformer.
2. **Classifier Training**
   - Attention-based fusion of video and signal features for pain-level classification.
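The actual fusion lives in `model/crossatten.py` and is not reproduced here; as a rough sketch of the idea, the following NumPy snippet shows single-head scaled dot-product cross-attention, where tokens from one modality query the other. The projection matrices are randomly initialised stand-ins for learned weights, and all dimensions are made up for illustration:

```python
import numpy as np

def cross_attention(q_feats, kv_feats, d_k=64, seed=0):
    """Minimal single-head cross-attention: queries from one modality
    attend to keys/values from the other (illustrative only)."""
    rng = np.random.default_rng(seed)
    d_q, d_kv = q_feats.shape[-1], kv_feats.shape[-1]
    # Random projections stand in for learned weight matrices.
    W_q = rng.standard_normal((d_q, d_k)) / np.sqrt(d_q)
    W_k = rng.standard_normal((d_kv, d_k)) / np.sqrt(d_kv)
    W_v = rng.standard_normal((d_kv, d_k)) / np.sqrt(d_kv)
    Q, K, V = q_feats @ W_q, kv_feats @ W_k, kv_feats @ W_v
    scores = Q @ K.T / np.sqrt(d_k)                   # (n_q, n_kv)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # softmax over kv tokens
    return weights @ V                                # (n_q, d_k)

# Video tokens attend to biosignal tokens and vice versa; the pooled
# attended features are concatenated before the classification head.
video = np.random.default_rng(1).standard_normal((8, 768))    # e.g. video features
signal = np.random.default_rng(2).standard_normal((32, 128))  # e.g. signal features
v2s = cross_attention(video, signal).mean(axis=0)   # (64,)
s2v = cross_attention(signal, video).mean(axis=0)   # (64,)
fused = np.concatenate([v2s, s2v])                  # (128,) vector for the classifier
```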
```bash
conda env create -f environment.yml
```

⚙️ The model was trained and evaluated on NVIDIA RTX 8000 GPUs. The video encoder fits within a 16 GB GPU VRAM budget during training with small batch sizes.
Publicly available third-party datasets:

- **BioVid Heat Pain Database**: available upon request from the [official website](https://www.nit.ovgu.de/BioVid.html), as described in [10.1109/CYBConf.2013.6617456](https://doi.org/10.1109/CYBConf.2013.6617456).
- **AI4Pain dataset**: available upon request from the [AI4Pain Challenge website](https://sites.google.com/view/ai4pain/challenge-details), as described in [10.1109/ACIIW63320.2024.00012](https://doi.org/10.1109/ACIIW63320.2024.00012).
To adapt your own dataset, or for shape/format changes:

- Modify `model/classifier.py` for the architecture.
- Modify the dataloaders in `src/dataset/celebv_hq.py`.
- The code follows PyTorch Lightning 1.7.7 modules.
| Component | Description | Path |
|---|---|---|
| Video MAE | Visual encoder | `src/marlin_pytorch/model` |
| Signal MAE | Time-series encoder | `mvts_transformer/src/models` |
| Cross Attention | Fusion between modalities | `model/crossatten.py` |
| Classifier | Final classification head | `model/classifier.py` |
```bash
python preprocess/celebvhq_preprocess.py --data_dir /path/to/videos
python preprocess/ytf_preprocess.py --data_dir
```

`train_set.csv` example for pre-training (from YoutubeFaces):
```csv
path,len
AJ_Cook/0,79
AJ_Cook/2,194
Aaron_Sorkin/0,70
Aaron_Sorkin/3,174
Aaron_Tippin/0,119
Aaron_Tippin/1,83
Abdel_Aziz_Al-Hakim/0,103
Abdel_Aziz_Al-Hakim/1,285
Abdel_Aziz_Al-Hakim/4,141
Abdul_Majeed_Shobokshi/1,624
Abdulaziz_Kamilov/4,195
```
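Each row is a clip directory relative to the data root plus its frame count. A quick sanity check of such a file, using only the Python standard library (a small inline sample stands in for the real `train_set.csv`; the column names follow the header above):

```python
import csv
import io

# In practice: open("train_set.csv") — an inline sample is used here.
sample = """path,len
AJ_Cook/0,79
AJ_Cook/2,194
Aaron_Sorkin/0,70
"""

with io.StringIO(sample) as f:
    rows = list(csv.DictReader(f))  # one dict per clip: {'path': ..., 'len': ...}

total_frames = sum(int(r["len"]) for r in rows)
print(len(rows), total_frames)  # 3 clips, 343 frames in total
```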
Directory layout for the `.csv`:

```
├── Train
│   ├── cropped
│   │   ├── id
│   │   ├── id
│   │   ├── ...
│   ├── face_parsing_images_DB
│   ├── train.txt
│   ├── val.txt
│   ├── ...
```
```bash
python train.py \
    --config config/pretrain/marlin_vit_base.yaml \
    --data_dir /path/to/youtube_faces \
    --n_gpus 4 \
    --num_workers 8 \
    --batch_size 16 \
    --epochs 2000 \
    --official_pretrained /path/to/checkpoint.pth
```

## 🧬 Signal Branch
Directory layout for the `.csv`:

```
├── Data
│   ├── id1
│   │   ├── video1
│   │   │   ├── frame1.jpg
│   │   │   ├── ...
│   │   ├── ...
│   ├── id2
│   ├── ...
```
Signal pre-training:

```bash
cd mvts_transformer
python src/main.py \
    --output_dir experiments \
    --comment "pretraining through imputation" \
    --name $1_pretrained \
    --records_file Imputation_records.xls \
    --data_dir /path/to/$1/ \
    --data_class pain \
    --pattern TRAIN \
    --val_ratio 0.2 \
    --epochs 700 \
    --lr 0.001 \
    --optimizer RAdam \
    --batch_size 32 \
    --pos_encoding learnable \
    --d_model 128
```

Prepare your pre-trained models from the previous pre-training steps, and modify `model/classifier.py` to load them.
Directory layout:

```
├── Train
│   ├── video
│   │   ├── id
│   │   │   ├── video1
│   │   │   │   ├── frame1.jpg
│   │   │   │   ├── ...
│   │   │   ├── ...
│   │   ├── ...
│   ├── biosignals_filtered
│   │   ├── id
│   │   │   ├── 1.csv
│   │   │   ├── ...
│   │   ├── ...
│   ├── celebvhq_info.json
│   ├── train.txt
│   ├── val.txt
│   ├── ...
```
Run evaluation / classification:

```bash
python3 evaluate.py
```