mdoucet/analyzer

Neutron Reflectometry Analysis Tools

Python Tests codecov DOI

A toolbox of small, well-named CLI tools that an LLM agent (or a human) can chain together to analyze neutron reflectometry data end-to-end: partial data quality checks → model generation → refl1d fit → report. Built around refl1d/bumps for the math, with optional AuRE for LLM-driven model creation and fit evaluation.

Quick Start

  1. Install with pip install -e ".[dev]" (see Installation).

  2. (Optional) Set up a one-time user-global LLM config:

    mkdir -p ~/.config/analyzer
    cat > ~/.config/analyzer/.env <<'EOF'
    LLM_PROVIDER=openai
    LLM_MODEL=gpt-4o
    LLM_API_KEY=sk-...
    EOF
    check-llm
  3. cd into a sample folder containing reduced data and run the pipeline:

    analyze-sample sample.yaml

    The YAML uses the same shape as create-model --config (a describe: + states: list). A minimal example:

    describe: 50 nm Cu / 3 nm Ti on Si in D2O
    model_name: cu_d2o_218281
    states:
      - name: state_218281
        data:
          - rawdata/REFL_218281_1_218281_partial.txt
          - rawdata/REFL_218281_2_218282_partial.txt
          - rawdata/REFL_218281_3_218283_partial.txt

The pipeline runs partial-data checks, halts on bad reduction, then calls create-model → run-fit → assess-result, writing a Markdown report under reports/.

What you get

  • analyze-sample — One-shot pipeline for a single sample, with a reduction-issue gate that emits a reduction_batch.yaml manifest you review and dispatch yourself (reduction is never run automatically).
  • create-model — Generate a refl1d-ready Python script from a sample description (LLM/AuRE) or convert an AuRE problem JSON. Multi-state co-refinement is supported.
  • run-fit — Run a bumps DREAM fit on a refl1d script and produce parameter tables, plots, and a Markdown report.
  • assess-result — Re-render the report from a fit-output directory: reflectivity overlay (multi-experiment), per-state SLD profiles with 90% CL bands. Optionally appends an aure evaluate LLM verdict.
  • assess-partial — Overlap-χ² check on partial reflectivity files.
  • theta-offset — Compute or batch-compute incident-angle offsets for a Liquids Reflectometer run.
  • eis-intervals / eis-reduce-events — Time-resolved reduction helpers (Mantid-based; Docker recommended).
  • iceberg-packager — Package time-resolved data into Parquet/Iceberg.
  • analyzer-batch — Dispatch multiple analyzer-tool jobs from a YAML manifest.
  • check-llm — Verify that AuRE and the configured LLM endpoint are reachable.
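
To build intuition for what assess-partial's overlap-χ² check measures, here is a conceptual sketch (not the tool's actual implementation — the function name and exact statistic are assumptions): interpolate one partial curve onto the overlapping Q points of the next and compute a reduced χ² from the combined uncertainties. A large value flags disagreement in the overlap region, e.g. a scaling or angle problem in the reduction.

```python
import numpy as np

def overlap_chi2(q1, r1, dr1, q2, r2, dr2):
    """Reduced chi-squared between two reflectivity curves in their overlapping Q range."""
    lo, hi = max(q1.min(), q2.min()), min(q1.max(), q2.max())
    mask = (q2 >= lo) & (q2 <= hi)
    if not mask.any():
        return None  # the curves do not overlap in Q
    # Interpolate curve 1 (and its uncertainty) onto curve 2's overlap points
    r1i = np.interp(q2[mask], q1, r1)
    dr1i = np.interp(q2[mask], q1, dr1)
    return np.mean((r1i - r2[mask]) ** 2 / (dr1i ** 2 + dr2[mask] ** 2))

# Two consistent curves on slightly shifted Q grids -> small chi-squared
q = np.linspace(0.01, 0.1, 50)
r = np.exp(-50 * q)
q2 = q + 0.005
r2 = np.exp(-50 * q2)
print(overlap_chi2(q, r, 0.01 * r, q2, r2, 0.01 * r2))
```

Consistent curves give a χ² near zero; inconsistent ones give values well above one.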

Run analyzer-tools --list-tools for the full registry, or analyzer-tools --help-tool <name> for any single tool. Per-workflow documentation lives under skills/.

Installation

git clone <repository-url>
cd analyzer
python3 -m venv venv
source venv/bin/activate    # Windows: venv\Scripts\activate
pip install -e ".[dev]"

This gives you all analysis, fitting, and pipeline tools. The Mantid-based reduction commands (simple-reduction, eis-reduce-events) require Mantid and are skipped gracefully when it isn't installed; use Docker for the full stack — see docs/docker.md.

LLM features (create-model Mode B, aure evaluate augmentation) require AuRE installed in the same environment and a configured LLM endpoint. They degrade gracefully when unavailable.

Configuration

The analyzer needs a project root and five role-based directories (combined data, partial data, models, results, reports). The simplest setup is to cd into a sample folder — everything resolves under $PWD with lowercase defaults (rawdata/, models/, results/, reports/). A repo-level .env above the sample folders can rename those sub-folders without becoming the project root itself.

See docs/configuration.md for the full .env-cascade rules and variable reference.
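
As a rough mental model of that resolution (the variable names and the set of defaults below are illustrative assumptions; docs/configuration.md is authoritative), each role directory takes its name from the .env cascade when set, and otherwise falls back to the lowercase default under the project root:

```python
import os
from pathlib import Path

# Lowercase defaults that apply when no .env overrides them.
# Variable names here are illustrative, not the tool's actual ones.
DEFAULTS = {
    "ANALYZER_RAW_DIR": "rawdata",
    "ANALYZER_MODELS_DIR": "models",
    "ANALYZER_RESULTS_DIR": "results",
    "ANALYZER_REPORTS_DIR": "reports",
}

def resolve_dirs(project_root: Path, env: dict) -> dict:
    """Resolve each role directory: env override if present, else the default."""
    return {var: project_root / env.get(var, default)
            for var, default in DEFAULTS.items()}

# With no overrides, everything lands under the current working directory
dirs = resolve_dirs(Path.cwd(), dict(os.environ))
```

This is why simply cd-ing into a sample folder is enough: with an empty environment, every role directory resolves under $PWD.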

Batch processing

analyzer-batch runs many analyzer-tool invocations from a single YAML manifest. The manifest is pure orchestration — each job dispatches to one of the CLI tools (create-model, run-fit, assess-result, theta-offset, …) using the same flags you'd type by hand.

Manifest shape

# Optional top-level keys
data_location: ./rawdata     # prepended to bare data filenames in args
output_dir:    ./results     # injected as --output-dir on every job
theta_offset:  -0.005        # injected as --theta-offset (when not already set)

defaults:
  output_root: ./output      # each job's outputs written under <output_root>/<name>

jobs:
  - name: <unique label>     # used for logs and --jobs filter
    tool: <tool name>        # see analyzer-tools --list-tools
    args: [<argv …>]         # exactly as on the command line

Run it:

analyzer-batch manifest.yaml                 # run everything
analyzer-batch manifest.yaml --dry-run       # print commands only
analyzer-batch manifest.yaml --jobs cu_d2o   # run a subset by name
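
Conceptually, each job expands to a plain command line, and the top-level manifest keys just rewrite the argv before dispatch. A minimal sketch of that rewriting, under the assumption that "bare" means a .txt filename with no directory part (the real tool's rules may differ in detail):

```python
def build_argv(job, data_location=None, output_dir=None):
    """Expand one manifest job into the argv that would be executed.

    - bare data filenames (.txt, no directory part) get data_location prepended
    - --output-dir is injected only when the job doesn't already set it
    """
    argv = [job["tool"]]
    for arg in job["args"]:
        if data_location and arg.endswith(".txt") and "/" not in arg:
            arg = f"{data_location}/{arg}"
        argv.append(arg)
    if output_dir and "--output-dir" not in job["args"]:
        argv += ["--output-dir", output_dir]
    return argv

job = {"tool": "run-fit",
       "args": ["models/cu_thf_218281.py", "--name", "cu_thf_218281"]}
print(build_argv(job, data_location="./rawdata", output_dir="./results"))
```

A --dry-run would print exactly such expanded commands without executing them.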

A complete reference manifest covering theta-offset runs, partial checks, and fit jobs ships with the repository as manifest.example.yaml.

Example: batch many samples through analyze-sample

The recommended way to process many samples is to write one YAML per sample and dispatch them with analyzer-batch. Each per-sample YAML uses the same shape as create-model --config (the describe: + states: form — see skills/create-model/SKILL.md), and each analyze-sample job runs the full pipeline (partial-data check → reduction gate → create-model → run-fit → assess-result → optional AuRE evaluation) for that sample.

# samples/cu_thf_218281.yaml
describe: |
  2 nm CuOx / 50 nm Cu / 3 nm Ti on Si in 100 mM LiTFSI/THF.
  Neutrons enter from the silicon side.
model_name: cu_thf_218281
out: models/cu_thf_218281.py
states:
  - name: state_218281
    data:
      - REFL_218281_1_218281_partial.txt
      - REFL_218281_2_218282_partial.txt
      - REFL_218281_3_218283_partial.txt
    theta_offset:      {init: 0.0, min: -0.02, max: 0.02}
    sample_broadening: true
shared_parameters:
  - Cu.thickness
  - Cu.material.rho
  - Ti.thickness
  - Ti.material.rho

The batch manifest then dispatches each per-sample YAML:

# manifest.yaml
data_location: ./rawdata        # bare REFL_*.txt names resolve here

jobs:
  - name: pipeline_218281
    tool: analyze-sample
    args: [samples/cu_thf_218281.yaml]

  - name: pipeline_218386
    tool: analyze-sample
    args: [samples/cu_thf_218386.yaml]

  - name: pipeline_218430
    tool: analyze-sample
    args: [samples/cu_thf_218430.yaml, --skip-aure-eval]

analyzer-batch manifest.yaml --dry-run                 # verify commands first
analyzer-batch manifest.yaml                           # run all samples
analyzer-batch manifest.yaml --jobs pipeline_218281    # run a single one

Each job writes its own reports/sample_<tag>/ folder. Failures in one job don't stop the others, and the run summary at the end reports pass/fail counts. If a sample trips the reduction-issue gate, that single job halts and emits a reduction_batch.yaml for review.

Going lower-level

analyze-sample is one job per sample. When you need finer control — e.g. regenerating a model without rerunning the fit, or fitting an existing script multiple times with different settings — call the underlying tools directly from the manifest:

jobs:
  - name: build_cu_thf
    tool: create-model
    args: [--config, samples/cu_thf_218281.yaml]

  - name: fit_cu_thf
    tool: run-fit
    args: [models/cu_thf_218281.py, --name, cu_thf_218281]

  - name: assess_cu_thf
    tool: assess-result
    args: [results/cu_thf_218281]

Note that run-fit takes a refl1d-ready Python script as its single positional argument (typically the file create-model produced) and that assess-result takes the fit-output directory.

Documentation

Topic                                                      Where
End-to-end pipeline (analyze-sample)                       skills/pipeline/SKILL.md
create-model reference (Mode A & B)                        skills/create-model/SKILL.md
Fitting workflow (create-model → run-fit → assess-result)  skills/fitting/SKILL.md
Partial-data overlap checks                                skills/partial-assessment/SKILL.md
Theta-offset calculation                                   skills/theta-offset/SKILL.md
Time-resolved reduction                                    skills/time-resolved/SKILL.md, docs/time-resolved-eis.md
Data layout & file formats                                 skills/data-organization/SKILL.md
Reflectometry primer                                       skills/reflectometry-basics/SKILL.md
Configuration / .env cascade                               docs/configuration.md
Batch manifests (analyzer-batch)                           Batch processing, manifest.example.yaml
Docker (full stack with Mantid)                            docs/docker.md
Single-file skill summary (for downstream repos)           skills/distributable/SKILL.md

Citation

If this project helps your work, please cite via the Zenodo DOI (badge above) or the metadata in CITATION.cff.

About

LLM-enabled data analysis assistant for BL4B reflectometry data
