SERN-MIL

Official repository for the paper: SERN-MIL: Selective Embedding Retrieval and Nuclei-Feature Aggregation Based MIL for Prostate Gleason Grading.

This repository contains the codebase for preprocessing, feature extraction, training, and inference with a CLAM + CellViT dependent pipeline aligned with the paper methodology.

Method Overview

WSI preprocessing at 0.50 um/px
Tissue masking (Otsu + morphology)
Tiling (4096x4096) and patching (256x256)
CLAM-compatible tissue embedding extraction
CellViT-based nuclei feature extraction
Feature fusion with coordinate alignment
SER module for selective embedding retrieval
Region-aware MIL for GP classification and ISUP grading
Optional survival/BCR head

Repository Layout

configs/              # Pipeline, model, dataset, and experiment configs
docker/               # Dockerfile and compose
docs/                 # Architecture and pipeline docs
scripts/              # User-facing entry scripts
src/sern_mil/         # Core package
tests/                # Unit tests

Installation

Local

pip install -e .

Docker

docker build -f docker/Dockerfile -t sern-mil:latest .
docker run --rm -it -v $PWD:/workspace sern-mil:latest bash

Data Preparation

Place your slides in:

data/wsi/

Optionally keep labels/splits in:

data/labels/
data/splits/

Run Full Preprocessing + Feature Pipeline

Make targets

make preprocess
make features
make fuse

Equivalent explicit commands

python -m sern_mil.cli.main preprocess --config configs/default.yaml
python -m sern_mil.cli.main extract-clam --config configs/default.yaml
python -m sern_mil.cli.main extract-cellvit --config configs/default.yaml
python -m sern_mil.cli.main fuse --config configs/default.yaml

Generated manifests:

data_index/preprocess_manifest.jsonl
data_index/clam_features_manifest.jsonl
data_index/cellvit_features_manifest.jsonl
data_index/fused_features_manifest.jsonl

Training

Training pipeline is separated from inference.

Quick run

make train

Explicit command

python scripts/train_pipeline.py --config configs/training_pipeline.yaml

CLI alternative

python -m sern_mil.cli.main train-pipeline --config configs/training_pipeline.yaml

Training summary output:

artifacts/training_run_summary.json

Checkpoints directory:

checkpoints/

Inference

Single WSI

python scripts/infer_pipeline.py --config configs/inference_pipeline.yaml --slide data/wsi/sample.svs

Batch folder

python scripts/infer_pipeline.py --config configs/inference_pipeline.yaml --wsis-dir data/wsi

Make target

make infer

Inference output:

artifacts/inference_predictions.json

Main Config Files

configs/default.yaml: preprocessing + feature pipeline settings
configs/training_pipeline.yaml: training-specific settings
configs/inference_pipeline.yaml: inference-specific settings
configs/models/sern_mil.yaml: model-level defaults
configs/experiments/*.yaml: ablation toggles

Tests

PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 python -m pytest -q

Notes

Pipeline code is structured for paper-faithful components (SER, NFI, MIL), with clean separation between:
- preprocessing/feature generation
- training
- inference
For full experimental reproduction, ensure your dataset splits and checkpoints match your study protocol.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
configs		configs
docker		docker
docs		docs
scripts		scripts
src/sern_mil		src/sern_mil
tests		tests
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SERN-MIL

Method Overview

Repository Layout

Installation

Local

Docker

Data Preparation

Run Full Preprocessing + Feature Pipeline

Make targets

Equivalent explicit commands

Training

Quick run

Explicit command

CLI alternative

Inference

Single WSI

Batch folder

Make target

Main Config Files

Tests

Notes

About

Uh oh!

Releases

Packages

Languages

Quantimb-Lab/SERN-MIL

Folders and files

Latest commit

History

Repository files navigation

SERN-MIL

Method Overview

Repository Layout

Installation

Local

Docker

Data Preparation

Run Full Preprocessing + Feature Pipeline

Make targets

Equivalent explicit commands

Training

Quick run

Explicit command

CLI alternative

Inference

Single WSI

Batch folder

Make target

Main Config Files

Tests

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages