uniform_batch_correction

Standalone Nextflow pipeline for UniFORM-style batch normalization of cellmeasurement GeoJSON outputs, OME-TIFF/TIFF images, and AnnData .h5ad files.

Input

Provide a CSV/YAML samplesheet with the following columns:

sample: sample ID
geojson: path to input cellmeasurement GeoJSON (for feature-level mode)
ome_tiff: path to input OME-TIFF/TIFF image (for pixel-level mode)
adata: path to input AnnData .h5ad file (for feature-level matrix mode)

For adata mode, normalization is performed per group inside each .h5ad using an obs column (default: image). Each unique group value is treated as one sample for cohort alignment.
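
As a rough illustration of the grouping described above, the sketch below splits the rows of one table into groups by the value of an obs column. It uses a plain Python list in place of a real AnnData obs column; the function name group_rows_by_obs and the image labels are hypothetical and are not taken from the pipeline.

```python
from collections import defaultdict


def group_rows_by_obs(obs_values):
    """Map each unique obs value to the row indices that carry it."""
    groups = defaultdict(list)
    for row_index, value in enumerate(obs_values):
        groups[value].append(row_index)
    return dict(groups)


# Hypothetical obs["image"] column for six cells in a single .h5ad file.
image_column = ["img_A", "img_A", "img_B", "img_A", "img_B", "img_C"]

groups = group_rows_by_obs(image_column)
for name, rows in groups.items():
    # Each group would be treated as one sample for cohort alignment.
    print(name, rows)
```

With the default obs column (image), a file holding cells from three images would therefore contribute three samples to the cohort.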

Example CSV is provided in assets/samplesheet.csv.
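
A samplesheet with these columns can also be generated programmatically. The sketch below is a minimal example that assumes all four columns may appear in one file and that columns unused by the selected mode can be left empty; check assets/samplesheet.csv for the layout the pipeline actually expects. The sample IDs and paths are placeholders.

```python
import csv
import io

# Column names as listed in the Input section.
COLUMNS = ["sample", "geojson", "ome_tiff", "adata"]


def build_samplesheet(rows):
    """Render a list of row dicts as CSV text, leaving missing columns empty."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=COLUMNS, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Placeholder samples for feature-level (GeoJSON) mode.
rows = [
    {"sample": "sample_1", "geojson": "/data/sample_1.geojson"},
    {"sample": "sample_2", "geojson": "/data/sample_2.geojson"},
]

sheet = build_samplesheet(rows)
print(sheet)
```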

Usage

nextflow run . \
  -profile conda,large \
  --input assets/samplesheet.csv \
  --outdir results

The large profile targets SLURM large-memory nodes (default queue: regular) and increases process resources for large images.

Key parameters

--run_uniform (default: true)
--uniform_apply_to (default: geojson; options: geojson, ome_tiff, adata)
--uniform_num_bins (default: 1024)
--uniform_min_value (default: 1.0)
--uniform_exclude_pattern (default: ^(kronos_|emb_))
--uniform_output_suffix (default: _uniform)
--uniform_pixel_output_suffix (default: _unifrom)
--uniform_pixel_sample_size (default: 200000)
--uniform_pixel_group_by (default: image; options: image, batch)
--uniform_pixel_batch_map (default: empty; CSV/TSV with sample->batch mapping)
--uniform_pixel_batch_sample_column (default: sample)
--uniform_pixel_batch_column (default: batch)
--uniform_adata_group_by (default: image)
--uniform_adata_sample_size (default: 200000)
--uniform_adata_target (default: all; use cell_mean to target only *_Cell_Mean features)
--uniform_adata_filter_column (default: empty; e.g. statistic)
--uniform_adata_filter_regex (default: empty; only matching features are normalized)
--uniform_generate_plots (default: true)
--uniform_qc_top_n_keys (default: 12)
--uniform_qc_max_heatmap_keys (default: 40)
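
To preview which features a name-based filter would touch, the default exclude pattern and the *_Cell_Mean target can be tried on a list of feature names. This is only a sketch: it assumes the pattern is matched against feature names with regex search semantics, and the feature names below are invented for illustration. How the pipeline combines these options internally is not described here.

```python
import re

# Default value of --uniform_exclude_pattern.
EXCLUDE_PATTERN = re.compile(r"^(kronos_|emb_)")


def is_excluded(name):
    """True if the feature name matches the exclude pattern."""
    return EXCLUDE_PATTERN.search(name) is not None


def is_cell_mean(name):
    """True if the feature name has the *_Cell_Mean form."""
    return name.endswith("_Cell_Mean")


# Hypothetical feature names.
features = ["CD3_Cell_Mean", "CD3_Nucleus_Mean", "kronos_0", "emb_12", "CD8_Cell_Mean"]

kept = [name for name in features if not is_excluded(name)]
cell_mean = [name for name in features if is_cell_mean(name)]

print("not excluded:", kept)
print("cell_mean target:", cell_mean)
```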

Normalized GeoJSON and AnnData files are published to results/uniformnormalize/. QC files are published to results/uniformnormalize/qc/.

Pixel normalization by batch

To normalize OME-TIFF/TIFF images by batch instead of per image, provide a sample-to-batch table and set --uniform_pixel_group_by batch.

Example mapping file (CSV/TSV):

sample,batch
SOL2_0003_A12,Pilot
SOL2_0004_1N,Pilot
SOL2_0007_14MP,3
SOL2_0008,1

Run:

nextflow run . \
  -profile conda,large \
  --input assets/samplesheet.csv \
  --outdir results \
  --uniform_apply_to ome_tiff \
  --uniform_pixel_group_by batch \
  --uniform_pixel_batch_map /path/to/patient_batch_map.csv \
  --uniform_pixel_batch_sample_column sample \
  --uniform_pixel_batch_column batch
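
Before launching a batch-level run, it can help to check that every sample in the samplesheet has a batch assigned. The sketch below parses the example mapping shown earlier and groups samples by batch. The helper names and the delimiter detection are assumptions made for illustration, not the pipeline's own parsing logic.

```python
import csv
import io


def load_batch_map(text, sample_column="sample", batch_column="batch"):
    """Parse CSV or TSV text into a {sample: batch} dict."""
    header = text.splitlines()[0]
    delimiter = "\t" if "\t" in header else ","
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return {row[sample_column]: row[batch_column] for row in reader}


def group_samples_by_batch(batch_map):
    """Invert {sample: batch} into {batch: [samples]}."""
    groups = {}
    for sample, batch in batch_map.items():
        groups.setdefault(batch, []).append(sample)
    return groups


mapping_text = """sample,batch
SOL2_0003_A12,Pilot
SOL2_0004_1N,Pilot
SOL2_0007_14MP,3
SOL2_0008,1
"""

batch_map = load_batch_map(mapping_text)
batches = group_samples_by_batch(batch_map)

# Samples from the samplesheet that have no batch assigned (SOL2_0009 is made up).
samplesheet_samples = ["SOL2_0003_A12", "SOL2_0008", "SOL2_0009"]
missing = [s for s in samplesheet_samples if s not in batch_map]

print(batches)
print("missing from batch map:", missing)
```

Note that batch labels are read as strings, so 3 and Pilot are both valid labels in the same column.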
