ROBIN Quickstart¶
How to run ROBIN from this repository after installation: live BAM analysis with robin workflow, optional NiceGUI monitoring, and target panels.
For disclaimer, GUI password, and startup order, see What happens at startup. For a step-by-step guide to the web interface, see Using ROBIN.
For deeper detail (CLI flags, MinKNOW, troubleshooting), use the README and the command-line reference.
What ROBIN expects¶
ROBIN consumes aligned BAMs from Oxford Nanopore sequencing (typically written in real time by MinKNOW):
| Expectation | Notes |
|---|---|
| Basecalling | HAC is sufficient; SUP not required. |
| Methylation | 5hmC / 5mC calling enabled in MinKNOW if your analyses need methylation. |
| Alignment | Done in MinKNOW — ROBIN does not realign reads. |
| BAM rollover | Read-count–based chunks. Each BAM must be ≤ 50,000 reads; we recommend ~50,000 reads per file. Do not rely on time-only (e.g. hourly) rollover — see README — BAM read limit. |
| POD5 / FASTQ | Not required; you can turn them off if you only need BAM. |
Memory (≤ 64 GB RAM)¶
On smaller machines, restart between long runs or after switching flow-cell positions. Dorado can hold GPU/host memory; restarting Dorado or the instrument after a run reduces out-of-memory risk.
Reference and panel BEDs¶
Use robin utils sequencing-files to gather the processed panel BED (and optional source BED when packaged) plus a GRCh38 reference FASTA in one place for MinKNOW (adaptive sampling, alignment) and for robin workflow --reference. Run this after installation when you need a consistent reference path.
| Option | Purpose |
|---|---|
-p / --panel | Required. Same names as --target-panel (built-in panels such as rCNS2, AML, PanCan; run robin utils sequencing-files --help for the list on your install). |
-r / --reference | Reference FASTA: either an HTTPS URL to download, or a local path to .fa / .fa.gz. If omitted, ROBIN uses the default NCBI GRCh38 no-alt analysis set (UCSC-style contig names) — a large download; use -r to point at an existing file if you already have GRCh38. |
-o / --output-dir | Output folder (default: ./reference_files in the current directory). |
-y / --yes | Skip the confirmation prompt (for scripts). |
The command copies the panel BED(s) from ROBIN resources and materializes the reference at the chosen location. Use the same reference file for MinKNOW alignment and for robin workflow --reference.
Other robin utils commands¶
| Command | Purpose |
|---|---|
robin utils update-models | Download or refresh classification / model assets (see Installation). |
robin utils update-clinvar | Download or refresh ClinVar under ROBIN resources for annotation paths. |
robin utils mgmt | Summarise MGMT CpG methylation from existing mgmt_sorted.bam outputs (post-run inspection). |
Run robin utils --help for the full list.
Run a workflow¶
Typical command:
robin workflow <data_folder> --work-dir <output_folder> \
-w target,cnv,fusion,mgmt,sturgeon,nanodx,pannanodx,random_forest \
--reference ~/references/hg38_simple.fa \
--center <center_id>
| Argument | Meaning |
|---|---|
<data_folder> | Directory watched for incoming BAMs |
--work-dir | Root directory for outputs |
-w | Comma-separated analysis types |
--reference | Reference FASTA (needed for most steps) |
--center | Site label (e.g. Sherwood, Auckland) |
--target-panel | Panel name, e.g. rCNS2, PanCan |
Examples¶
# Broad workflow with a fixed panel
robin workflow ~/data/bam_files \
--work-dir ~/results \
-w target,cnv,fusion,mgmt,sturgeon,nanodx,pannanodx,random_forest \
--reference ~/references/hg38_simple.fa \
--center Sherwood \
--target-panel rCNS2
# Smaller selection
robin workflow ~/data/bam_files \
--work-dir ~/results \
-w mgmt,sturgeon \
--reference ~/references/hg38_simple.fa \
--center Auckland \
--target-panel PanCan
# More logging
robin workflow ~/data/bam_files \
--work-dir ~/results \
-w mgmt,cnv,sturgeon \
--reference ~/references/hg38_simple.fa \
--center New_York \
--target-panel rCNS2 \
--verbose \
--log-level INFO
Point --reference at the same GRCh38 FASTA you use for MinKNOW — for example the file produced under your sequencing-files output directory — not a different assembly or naming scheme.
Commands¶
list-job-types¶
Examples: preprocessing, bed_conversion, mgmt, cnv, target, fusion, sturgeon, nanodx, pannanodx, random_forest (see live output for your version).
workflow¶
Required:
-w/--workflow— e.g.mgmt,sturgeon, or queue-qualified plans (robin workflow --help).--center— site ID.
Useful options (full list: robin workflow --help):
--work-dir— output directory--reference— reference FASTA--verbose,--log-level,--job-log-level--no-process-existing— only new files after start--deduplicate-jobs— dedupe selected job types by sample--use-ray/--no-use-ray— Ray execution--with-gui/--no-gui— NiceGUI workflow monitor
Panel management¶
Built-in panels include rCNS2, AML, PanCan. Custom panels are registered from BED (at least four columns: chr, start, end, gene name(s)).
robin list-panels
robin add-panel /path/to/panel.bed MyCustomPanel
robin add-panel /path/to/panel.bed MyCustomPanel --validate-only
robin remove-panel MyCustomPanel
robin remove-panel MyCustomPanel --force
You cannot reuse reserved names: rCNS2, AML, PanCan.
Behaviour and limitations¶
- CNV — Heuristic calls; review visually before any clinical interpretation.
- Stop with Ctrl+C — Graceful shutdown is attempted but not guaranteed.
- Bugs / questions — Open an issue.
Performance¶
- Batched processing on heavy paths
- Optional
LJ_BAM_THREADSfor BAM threading where supported - Non-blocking GUI updates when the NiceGUI monitor is enabled
License and credits¶
Research use only; see LICENSE in the repository. ROBIN integrates tools such as Sturgeon, Rapid-CNS2, Readfish, cnv_from_bam, methylartist — see the repo and papers for full attribution.