# CLI reference

Generated from the running `dlm --help` output. Auto-regeneration via
`typer-cli` is planned for a follow-up sprint; until then this file is
hand-maintained and gated by the test suite.

## Global options

Applied to every subcommand:

| Option | Env var | Default | Description |
|---|---|---|---|
| `--home PATH` | `DLM_HOME` | `~/.dlm` | Override the store root. |
| `-v, --verbose` | — | off | Emit plan / resolver diagnostics on stderr. |
| `-q, --quiet` | — | off | Suppress informational output. |
| `--version` | — | — | Print version and exit. |
| `--install-completion` | — | — | Install shell completion. |
| `--show-completion` | — | — | Print shell completion script. |
| `-h, --help` | — | — | Show command help. |

## Commands

### `dlm init`

Bootstrap a new `.dlm` file with a fresh ULID, create the per-store
directory, and persist the license-acceptance record (audit-05 B2).

```
dlm init <path> [--base <key>] [--template <name>] [--multimodal]
                [--i-accept-license] [--force]
```

| Option | Default | Notes |
|---|---|---|
| `--base <key>` | `qwen2.5-1.5b` | Registry key or `hf:org/name`. Ignored when `--template` is used (the template's `recommended_base` wins). With `--multimodal`, defaults to `paligemma-3b-mix-224`. |
| `--template <name>` | None | Bootstrap from a named gallery template. See `dlm templates list`. Mutually exclusive with `--multimodal`. |
| `--multimodal` | false | Scaffold a vision-language `.dlm` with an `::image::` section (schema v10). Flips `--base` to `paligemma-3b-mix-224` unless explicitly overridden; a non-VL `--base` is refused. See the [multimodal-training cookbook](../cookbook/multimodal-training.md). |
| `--i-accept-license` | false | Required for gated bases (Llama-3.2, PaliGemma). |
| `--force` | false | Overwrite an existing `.dlm` at `<path>`. |

Writes `<path>` with minimal frontmatter, provisions
`~/.dlm/store/<dlm_id>/` with an initial `manifest.json`, and (for
gated bases) stores the `LicenseAcceptance` record so `dlm train` /
`dlm export` don't re-prompt. Refuses if the `.dlm` file already
exists and `--force` wasn't passed.
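For example, a plain bootstrap and a multimodal one (filenames here are illustrative):

```bash
# Text document on the default base
dlm init tutor.dlm

# Vision-language scaffold; --base flips to paligemma-3b-mix-224,
# which is gated, so the license flag is needed up front
dlm init captioner.dlm --multimodal --i-accept-license
```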

### `dlm train`

Train / retrain the adapter.

```
dlm train <path> [--resume|--fresh] [--seed N] [--max-steps N]
                 [--i-accept-license] [--gpus SPEC] [--no-cache]
                 [--strict-lock|--update-lock|--ignore-lock]
                 [--watch [--watch-max-steps N] [--watch-debounce-ms N]]
```

| Option | Default | Notes |
|---|---|---|
| `--resume` | false | Continue from `training_state.pt`. Mutex with `--fresh`. |
| `--fresh` | false | Discard prior optimizer state; train from scratch. Mutex with `--resume`. Default when neither flag is set. |
| `--seed N` | frontmatter.training.seed | Override training seed. |
| `--max-steps N` | unlimited | Cap step count. |
| `--i-accept-license` | false | Required for gated bases (usually captured once at `dlm init` and persisted). |
| `--strict-lock` | false | Fail on any `dlm.lock` drift (even WARN). |
| `--update-lock` | false | Bypass validation; always write a fresh `dlm.lock`. |
| `--ignore-lock` | false | Bypass validation; don't write `dlm.lock`. |
| `--gpus SPEC` | single-process | Multi-GPU training via Accelerate. `all` uses every visible CUDA device; `N` uses the first N; `0,1` selects exact device ids. Dispatches to `accelerate launch` when >1 device is selected. Refused on MPS/CPU/ROCm; heterogeneous CUDA SMs refused. |
| `--watch` | false | Save-to-train mode (Sprint 25). After the initial train, block on filesystem events and re-run bounded-step retrains on each settled save. |
| `--watch-max-steps N` | 100 | Per-cycle step cap for `--watch`. Keeps cycles responsive. |
| `--watch-debounce-ms N` | 400 | Quiet interval before a burst of saves triggers a retrain. |
| `--repl` | false | With `--watch`: also run `dlm repl` in the same process. **Scaffolded only** — threading integration is a follow-up; today the flag refuses with exit 2. |
| `--no-cache` | false | Bypass the tokenized-section cache for this run. Default is cache-on when the `.dlm` declares `training.sources`. Use when debugging tokenization or cross-checking cached-vs-uncached determinism. Entries from prior runs stay on disk; the next run without the flag picks them back up. See [directive-cache](../cookbook/directive-cache.md). |

The three lock flags are mutually exclusive. See [Determinism](../determinism.md)
for the mismatch severity table.

`--gpus` multiplies the effective batch size by `world_size`; the
resulting lock records `world_size` and warns on drift between runs.
Multi-GPU + QLoRA on CUDA is permitted (bitsandbytes supports DDP);
multi-GPU + ROCm is out of scope for Sprint 23.
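The lock flags map to three distinct workflows; a sketch with a hypothetical document name:

```bash
# Day-to-day: validate against dlm.lock, fail hard on any drift
dlm train tutor.dlm --strict-lock

# After a deliberate config change: retrain and rewrite the lock
dlm train tutor.dlm --fresh --update-lock

# Save-to-train loop on two exact devices, capped per cycle
dlm train tutor.dlm --watch --watch-max-steps 50 --gpus 0,1
```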

### `dlm prompt`

Run inference against the current adapter.

```
dlm prompt <path> [query] [--max-tokens N] [--temp F] [--top-p F]
                  [--adapter NAME] [--gate {auto,off}] [--image PATH]...
                  [--backend {auto,pytorch,mlx}] [--verbose]
```

| Option | Default | Notes |
|---|---|---|
| `--max-tokens N` | 256 | Max new tokens to generate. |
| `--temp F` | 0.7 | Temperature. `0.0` = greedy decoding (deterministic). |
| `--top-p F` | None | Top-p sampling. |
| `--adapter NAME` | None | Select a named adapter from `training.adapters`. Required on multi-adapter documents; rejected on single-adapter ones. |
| `--gate {auto,off}` | `auto` | Learned adapter gate (Sprint 34). `auto` uses the trained gate when one exists in the store; `off` forces uniform weights across declared adapters. Silently ignored when `--adapter` pins a single adapter. See `docs/cookbook/learned-adapter-gate.md`. |
| `--image PATH` | none | Attach an image to the prompt. Repeat for multiple images; each expands to one `<image>` placeholder the processor slots pixels into. Required on vision-language bases; rejected on text bases. See the [multimodal-training cookbook](../cookbook/multimodal-training.md). |
| `--backend {auto,pytorch,mlx}` | `auto` | Inference backend. `auto` picks MLX on Apple Silicon (when `uv sync --extra mlx` is installed), else PyTorch. Ignored on VL bases (the VL path always uses PyTorch + `AutoModelForImageTextToText`). |
| `--verbose` | false | Print the resolved `InferencePlan` on stderr. |

Query is the positional argument. Omit it to read from stdin.
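A few representative invocations (document and image names are illustrative):

```bash
# Deterministic answer to a positional query
dlm prompt tutor.dlm "Explain list comprehensions" --temp 0.0

# Omit the query to read it from stdin
echo "Explain list comprehensions" | dlm prompt tutor.dlm --max-tokens 128

# Vision-language base: one --image per <image> placeholder
dlm prompt captioner.dlm "Compare these" --image a.png --image b.png
```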

### `dlm repl`

Interactive prompt-and-respond REPL against the trained adapter
(Sprint 24).

```
dlm repl <path> [--adapter NAME] [--backend {auto,pytorch,mlx}]
```

| Option | Default | Notes |
|---|---|---|
| `--adapter NAME` | None | Named adapter; required on multi-adapter docs. |
| `--backend {auto,pytorch,mlx}` | `auto` | Same contract as `dlm prompt --backend`. |

Slash commands inside the REPL: `/help`, `/exit`, `/clear`, `/save`,
`/adapter`, `/params`, `/model`, `/history`. Ctrl-D exits; Ctrl-C
cancels generation or input. Session history persists at
`~/.dlm/history`. See the [interactive-session cookbook](../cookbook/interactive-session.md).

### `dlm metrics`

Query the per-store SQLite metrics DB (Sprint 26).

```
dlm metrics <path> [--json|--csv] [--run-id N] [--phase PHASE] [--since WINDOW] [--limit N]
dlm metrics watch <path> [--poll-seconds N]
```

| Option | Default | Notes |
|---|---|---|
| `--json` | false | Emit a JSON object (`{runs: [...], steps: [...], evals: [...]}` when combined with `--run-id`). |
| `--csv` | false | Emit CSV of runs or (with `--run-id`) steps + evals. |
| `--run-id N` | None | Drill into one run; prints its step/eval counts. |
| `--phase PHASE` | None | Filter runs by phase (`sft`/`dpo`/`orpo`/`cpt`). |
| `--since WINDOW` | None | Time window (`24h`, `7d`, `30m`, `10s`). |
| `--limit N` | 20 | Cap the number of runs returned. |

`dlm metrics watch` polls the DB and tails new step/eval rows as
they arrive. See the [metrics cookbook](../cookbook/metrics.md) for
the full flow + optional TensorBoard / W&B sinks (`uv sync --extra
observability`).
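Typical queries, using an illustrative document name:

```bash
# Last 5 runs from the past week, machine-readable
dlm metrics tutor.dlm --json --since 7d --limit 5

# One run's steps + evals as CSV
dlm metrics tutor.dlm --csv --run-id 3

# Tail new rows during a live training run
dlm metrics watch tutor.dlm
```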

### `dlm templates`

Browse the starter template gallery (Sprint 27).

```
dlm templates list [--json] [--refresh] [--accept-unsigned]
```

| Option | Default | Notes |
|---|---|---|
| `--json` | false | Emit the full `TemplateMeta` for each entry as JSON. |
| `--refresh` | false | Refresh from the upstream gallery. **Currently a no-op** — upstream repo and signing key are pending (Sprint 27 deferred polish); the command warns and falls back to the bundled gallery. |
| `--accept-unsigned` | false | Reserved. Will bypass signed-tag verification once the live fetcher is wired. |

Pair with `dlm init --template <name>` to create a new `.dlm`:

```bash
dlm init mydoc.dlm --template coding-tutor
```

See the [template-gallery cookbook](../cookbook/template-gallery.md)
for the full walkthrough and the `TemplateMeta` schema.

### `dlm export`

Produce GGUF files + Modelfile + register with Ollama.

```
dlm export <path> [--quant Q] [--merged [--dequantize]]
                  [--name N] [--no-template] [--skip-ollama]
                  [--no-smoke] [--no-imatrix] [--verbose]
                  [--draft TAG | --no-draft]
                  [--adapter NAME | --adapter-mix SPEC [--adapter-mix-method {linear,svd}]]
```

| Option | Default | Notes |
|---|---|---|
| `--quant Q` | frontmatter.export.default_quant | `Q4_K_M` / `Q5_K_M` / `Q6_K` / `Q8_0` / `F16`. |
| `--merged` | false | Merge LoRA into base before quantizing. |
| `--dequantize` | false | Required with `--merged` on a QLoRA adapter (pitfall #3). |
| `--name N` | derived | Ollama model name. |
| `--no-template` | false | Skip writing `TEMPLATE` into the Modelfile (power users only — Ollama will fuzzy-match, which Sprint 12 deliberately works around). |
| `--skip-ollama` | false | Emit GGUFs but don't register. |
| `--no-smoke` | false | Register but skip the smoke prompt. |
| `--no-imatrix` | false | Opt out of imatrix-calibrated quantization. |
| `--verbose` | false | Surface preflight + conversion diagnostics. |
| `--draft TAG` | auto | Override the speculative-decoding draft model. |
| `--no-draft` | false | Disable speculative decoding. Mutex with `--draft`. |
| `--adapter NAME` | None | Export a single named adapter from `training.adapters`. Rejected on single-adapter documents. Mutex with `--adapter-mix`. |
| `--adapter-mix SPEC` | None | Weighted composition like `knowledge:1.0,tone:0.5`. Produces one Ollama model by merging the named adapters at export time. LoRA-only; QLoRA sources require `--dequantize --merged`. Mutex with `--adapter`. |
| `--adapter-mix-method` | `linear` | PEFT merge strategy: `linear` (default; fast weighted sum) or `svd` (higher fidelity, heavier compute). Only meaningful with `--adapter-mix`. |
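Common export shapes, with an illustrative document name:

```bash
# Default quant, register with Ollama, run the smoke prompt
dlm export tutor.dlm

# QLoRA adapter merged into base (pitfall #3: both flags required)
dlm export tutor.dlm --merged --dequantize --quant Q5_K_M

# Blend two named adapters into one Ollama model
dlm export tutor.dlm --adapter-mix knowledge:1.0,tone:0.5 --adapter-mix-method svd
```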

### `dlm pack`

Produce a portable `.dlm.pack` bundle.

```
dlm pack <path> [--out PATH] [--include-exports] [--include-base]
                [--include-logs] [--i-am-the-licensee URL]
```

| Option | Default | Notes |
|---|---|---|
| `--out PATH` | `<name>.dlm.pack` | Pack output. |
| `--include-exports` | false | Bundle all GGUF exports. |
| `--include-base` | false | Bundle the base model weights. Requires license acknowledgement for gated bases. |
| `--include-logs` | false | Bundle per-run JSONL logs. |
| `--i-am-the-licensee URL` | none | URL acknowledging separate base-license acceptance. |
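Two ends of the spectrum (filenames and the acknowledgement URL are illustrative):

```bash
# Bare pack: document, adapter, and manifest only
dlm pack tutor.dlm --out tutor.dlm.pack

# Everything, including gated base weights, with the acknowledgement URL
dlm pack tutor.dlm --include-exports --include-base \
    --i-am-the-licensee https://example.com/license-ack
```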

### `dlm unpack`

Install a `.dlm.pack` into the local store.

```
dlm unpack <pack> [--force] [--out DIR]
```

| Option | Default | Notes |
|---|---|---|
| `--force` | false | Overwrite an existing store with the same `dlm_id`. |
| `--out DIR` | pack parent | Where to place the restored `.dlm`. |

### `dlm push`

Upload a `.dlm` (auto-packs) or `.dlm.pack` to a sharing destination
(Sprint 28).

```
dlm push <path> --to <destination> [--sign] [pack flags]
```

| Option | Default | Notes |
|---|---|---|
| `--to <destination>` | required | `hf:<org>/<repo>`, an `https://...` URL endpoint, or a local path. |
| `--sign` | false | Sign the pack with `minisign` before upload (requires `minisign` on PATH + key at `~/.dlm/minisign.key`). |
| `--include-exports` | false | Forwarded to `dlm pack` when auto-packing a `.dlm`. |
| `--include-base` | false | Same. |
| `--include-logs` | false | Same. |
| `--i-am-the-licensee URL` | none | Required with `--include-base` on a non-redistributable base. |

**Destinations:**

- `hf:<org>/<repo>` — HuggingFace Hub. Uses `$HF_TOKEN` if set. Autogenerates a `README.md` with a `library_name: dlm` tag. Creates the repo if missing (your personal namespace needs no approval).
- `https://…` — any HTTPS endpoint that accepts a POST with an `application/octet-stream` body. Sets `Authorization:` from `$DLM_SHARE_AUTH` when present (e.g. `Bearer <token>`).
- `<local/path>` — copy the pack to a filesystem path.
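For instance (repo name, endpoint URL, and token value are all illustrative):

```bash
# HuggingFace Hub (uses $HF_TOKEN), signed pack
dlm push tutor.dlm --to hf:myorg/tutor --sign

# Private HTTPS endpoint, bundling GGUF exports
DLM_SHARE_AUTH="Bearer <token>" dlm push tutor.dlm \
    --to https://models.example.com/upload --include-exports
```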

### `dlm pull`

Download + verify + unpack a `.dlm.pack` from a remote source.

```
dlm pull <source> [--out DIR] [--force]
```

| Option | Default | Notes |
|---|---|---|
| `<source>` | required | `hf:<org>/<repo>`, `https://…`, `peer://host:port/<id>?token=…`, or a local path. |
| `--out DIR` | CWD | Directory for the restored `.dlm`. |
| `--force` | false | Overwrite an existing store with the same `dlm_id`. |

Pulls always verify sha256 checksums during unpack. If a `.minisig`
sidecar is served alongside the pack, `dlm pull` tries every key in
`~/.dlm/trusted-keys/*.pub` — match → `verified`; no match →
`unverified` warning (still installs, checksums are fine); no sidecar →
`unsigned` (still installs).

### `dlm serve`

Serve a `.dlm`'s pack over LAN for peers to pull.

```
dlm serve <path> [--port N] [--public --i-know-this-is-public]
                 [--max-concurrency N] [--rate-limit N]
                 [--token-ttl-minutes N]
```

| Option | Default | Notes |
|---|---|---|
| `--port N` | 7337 | Bind port. |
| `--public` | false | Bind `0.0.0.0` **only when paired with** `--i-know-this-is-public`. Without the confirmation flag, `--public` logs a refusal and binds `127.0.0.1`. |
| `--i-know-this-is-public` | false | Acknowledges the public bind. Meaningless without `--public`. |
| `--max-concurrency N` | 4 | Max concurrent connections per token. Excess returns HTTP 429. |
| `--rate-limit N` | 30 | Max requests per minute per token. |
| `--token-ttl-minutes N` | 15 | Issued token lifetime. Ctrl-C invalidates every outstanding token instantly — the session secret lives only in the serving process. |

On start, prints the `peer://` URL (with embedded token) that the
other side pastes into `dlm pull`. Ctrl-C cleanly stops the server
and deletes the temp pack.
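A two-machine sketch (the `peer://` URL below is a placeholder for whatever `dlm serve` prints):

```bash
# Machine A: serve on the LAN, public bind explicitly acknowledged
dlm serve tutor.dlm --port 7337 --public --i-know-this-is-public

# Machine B: paste the printed URL, token embedded
dlm pull "peer://<host>:7337/<id>?token=..." --out ./incoming
```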

### `dlm doctor`

Inspect hardware + print the resolved training plan.

```
dlm doctor [--json]
```

No network access: probes `torch` + `psutil` only and refuses to go
online.

### `dlm show`

Show training history, exports, and adapter state for a document.

```
dlm show <path> [--json]
```

Pretty-prints manifest + lock state. `--json` emits machine-readable
output.

### `dlm migrate`

Migrate a `.dlm` frontmatter to the current schema version.

```
dlm migrate <path> [--dry-run] [--no-backup]
```

| Option | Default | Notes |
|---|---|---|
| `--dry-run` | false | Print the migrated frontmatter without writing. |
| `--no-backup` | false | Skip the `.dlm.bak` backup. |
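A safe first pass, on an illustrative file:

```bash
# Inspect the migrated frontmatter without touching the file
dlm migrate old.dlm --dry-run
```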

### `dlm cache`

Inspect and manage the per-store tokenized-section cache (Sprint 31).
The cache speeds up re-training on directive-sourced codebases by
keying tokenized output on `(section_id, tokenizer_sha, sequence_len)`.

```
dlm cache show <path> [--json]
dlm cache prune <path> [--older-than DURATION]
dlm cache clear <path> [--force]
```

| Subcommand | Notes |
|---|---|
| `show` | Print entry count, size on disk, last-run hit rate. `--json` for machine-readable output. |
| `prune` | Delete entries not accessed within `--older-than` (e.g. `30d`, `12h`, `45m`). Default `90d`. |
| `clear` | Wipe the entire cache. Prompts for confirmation unless `--force` is passed. |

See `docs/cookbook/directive-cache.md` for tuning, invalidation
triggers, and maintenance patterns.
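A typical maintenance pass (document name illustrative):

```bash
# Hit rate + size on disk, machine-readable
dlm cache show tutor.dlm --json

# Drop entries untouched for a month (default window is 90d)
dlm cache prune tutor.dlm --older-than 30d

# Wipe everything without the confirmation prompt
dlm cache clear tutor.dlm --force
```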

### `dlm harvest`

Pull failing-probe results from a sway-style eval report back into the
document as `!probe`-tagged `::instruction::` sections for the next
retrain. See `docs/cookbook/probe-driven-training.md`.

```
dlm harvest <path> --sway-json <report> [--apply] [--dry-run]
                   [--tag NAME] [--min-confidence F]
                   [--strict | --lax]
dlm harvest <path> --revert
```

| Option | Default | Notes |
|---|---|---|
| `--sway-json PATH` | required | Path to the sway probe report JSON. |
| `--apply` | false | Write changes to disk. Without it, dry-run. |
| `--dry-run` | true | Print the diff; no writes. |
| `--revert` | — | Strip all `auto_harvest=True` sections; mutually exclusive with `--sway-json`. |
| `--tag NAME` | `auto-harvest` | Provenance tag written into `harvest_source`. |
| `--min-confidence F` | `0.0` | Skip candidates below this confidence. |
| `--strict` / `--lax` | lax | Strict: fail if any failing probe lacks a reference. Lax: skip + log. |

Exit codes: `0` success, `1` validation error (malformed JSON, strict
miss, mutual-exclusion violation), `2` no candidates to harvest.
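A dry-run-first flow, with illustrative file names:

```bash
# Preview what would be harvested from the report
dlm harvest tutor.dlm --sway-json report.json --min-confidence 0.6

# Same selection, written to disk with a custom provenance tag
dlm harvest tutor.dlm --sway-json report.json --apply --tag nightly-evals

# Undo every auto-harvested section
dlm harvest tutor.dlm --revert
```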

### `dlm train --listen-rpc`

During `--watch`, open a JSON-RPC endpoint that accepts `inject_probe`
pushes from external eval harnesses. Requires `DLM_PROBE_TOKEN` in the
environment. See `docs/cookbook/probe-driven-training.md` for the wire
protocol and security notes.

| Option | Default | Notes |
|---|---|---|
| `--listen-rpc HOST:PORT` | off | Bind the probe-RPC endpoint. Requires `--watch` or `--max-cycles`. |
| `--max-cycles N` | `0` | Bounded-loop alternative to `--watch` for convergence runs (scaffolded — currently refuses execution without `--watch`). |

## Exit codes

| Code | Meaning |
|---|---|
| 0 | Success. |
| 1 | Runtime failure (license refused, disk full, OOM, template drift, lock validation). |
| 2 | CLI misuse (mutex violation, missing argument). |

Domain errors are formatted consistently via bare `console.print`
calls in each subcommand (prefix convention: `<subject>: <message>`,
e.g. `lock: base_model_revision changed`). Uncaught exceptions escape
into `dlm.cli.reporter`, which picks a matching prefix from the
module the exception came from and renders a tier-3 generic message.
