ui.server — Training Endpoints¶

REST endpoints for triggering model training, feature building, and model management from the web UI.

Endpoints¶

`POST /api/train/start`¶

Start a LightGBM training job in the background. Only one job runs at a time; returns 409 if a job is already running.

Request body:

Field	Type	Default	Description
`date_from`	`str`	(required)	Start date (YYYY-MM-DD)
`date_to`	`str`	(required)	End date (YYYY-MM-DD, inclusive)
`num_boost_round`	`int`	`100`	Number of boosting rounds
`class_weight`	`str`	`"balanced"`	`"balanced"` or `"none"`
`synthetic`	`bool`	`false`	Use dummy features and labels

Response (202):

{
  "job_id": "a1b2c3d4e5f6",
  "status": "running",
  "step": "initializing",
  "progress_pct": 0,
  "message": "Starting…",
  "error": null,
  "metrics": null,
  "model_dir": null,
  "started_at": "2026-03-12T10:00:00+00:00",
  "finished_at": null
}

`POST /api/train/build-features`¶

Build features for a date range in the background. Runs build_features_for_date() for each date.

Request body:

Field	Type	Description
`date_from`	`str`	Start date (YYYY-MM-DD)
`date_to`	`str`	End date (YYYY-MM-DD, inclusive)

Response (202): Same schema as /api/train/start.

`GET /api/train/status`¶

Poll the current training job state. Returns the last job's status even after completion.

Response:

{
  "job_id": "a1b2c3d4e5f6",
  "status": "complete",
  "step": "done",
  "progress_pct": 100,
  "message": "Model saved to 2026-03-12_100000_run-0042",
  "error": null,
  "metrics": {"macro_f1": 0.85, "weighted_f1": 0.87},
  "model_dir": "/path/to/models/2026-03-12_100000_run-0042",
  "started_at": "2026-03-12T10:00:00+00:00",
  "finished_at": "2026-03-12T10:01:30+00:00"
}

Status values: "idle", "running", "complete", "failed".

`POST /api/train/cancel`¶

Best-effort cancel of the running training job. Returns 409 if no job is running.

`GET /api/train/models`¶

List available model bundles from the models directory.

Response:

[
  {
    "model_id": "2026-03-12_100000_run-0042",
    "path": "/path/to/models/2026-03-12_100000_run-0042",
    "valid": true,
    "invalid_reason": null,
    "macro_f1": 0.85,
    "weighted_f1": 0.87,
    "created_at": "2026-03-12T10:01:30"
  }
]

`GET /api/train/models/current/inspect`¶

Read-only bundle-saved validation metrics for the model currently loaded in the tray (same data as taskclf model inspect without replay). Requires get_tray_state from the tray process; otherwise returns {"loaded": false, "reason": "tray_state_unavailable"}.

When a model path is available but nothing is loaded, returns {"loaded": false, "reason": "no_model_loaded"}.

When loaded, response shape:

{
  "loaded": true,
  "bundle_path": "/path/to/models/run-id",
  "metadata": { "...": "..." },
  "bundle_saved_validation": {
    "macro_f1": 0.85,
    "weighted_f1": 0.87,
    "label_names": ["Build", "..."],
    "confusion_matrix": [[...]],
    "per_class_derived": { "...": "..." },
    "top_confusion_pairs": [
      {"true_label": "Write", "pred_label": "Build", "count": 12}
    ]
  }
}

Replay / held-out test metrics are not included; use the CLI taskclf model inspect with a date range for that.

`GET /api/train/models/{model_id}/inspect`¶

Bundle-saved validation metrics for a known bundle id under the server’s models directory (same bundle_path / metadata / bundle_saved_validation shape as above, without a loaded field). Returns 404 if the id is unknown, 422 if the bundle is invalid.

`GET /api/train/data-check`¶

Prepare data for a date range. For any date without an existing features.parquet, the endpoint automatically fetches events from the running ActivityWatch server and builds feature rows before reporting totals.

Query parameters:

Param	Type	Description
`date_from`	`str`	Start date (YYYY-MM-DD)
`date_to`	`str`	End date (YYYY-MM-DD, inclusive)

Response:

{
  "date_from": "2026-02-01",
  "date_to": "2026-03-01",
  "dates_with_features": ["2026-02-01", "2026-02-02", "2026-02-03"],
  "dates_missing_features": [],
  "total_feature_rows": 4320,
  "label_span_count": 25,
  "trainable_rows": 1800,
  "trainable_labels": ["coding", "meeting", "writing"],
  "dates_built": ["2026-02-03"],
  "build_errors": []
}

trainable_rows — feature rows that survive label projection (strict containment). If 0, training will fail. The UI disables the Train button when this is 0.
trainable_labels — distinct label classes present in the projected rows.
dates_built — dates where features were freshly built from AW during this call.
build_errors — per-date error messages if AW was unreachable or returned no data.

WebSocket Events¶

Training progress is streamed to connected WebSocket clients (/ws/predictions) via the shared EventBus.

`train_progress`¶

Emitted at each pipeline step during training.

{
  "type": "train_progress",
  "job_id": "a1b2c3d4e5f6",
  "step": "training",
  "progress_pct": 50,
  "message": "Training LightGBM (100 rounds, 500 train / 100 val)…"
}

Steps: initializing, loading_features, projecting_labels, splitting, training, saving, done, building_features.

`train_complete`¶

Emitted when training finishes successfully.

{
  "type": "train_complete",
  "job_id": "a1b2c3d4e5f6",
  "metrics": {"macro_f1": 0.85, "weighted_f1": 0.87},
  "model_dir": "/path/to/models/2026-03-12_100000_run-0042"
}

`train_failed`¶

Emitted when training fails or is cancelled.

{
  "type": "train_failed",
  "job_id": "a1b2c3d4e5f6",
  "error": "No feature data found for the given date range"
}

Troubleshooting¶

Error	Cause	Fix
Feature files exist but contain 0 rows	ActivityWatch is not running or has no data for the date range	Start AW, verify it's collecting events, then re-run "Prepare Data"
No label spans overlap the selected date range	Labels were created for dates outside the training range	Create labels that cover the same dates as your feature data
No labeled rows after projection	Labels exist and features exist, but no label span fully contains a feature window	Ensure label spans are at least 60 s long and cover times when AW recorded activity
No feature data found for the given date range	No `.parquet` files exist at all for the selected dates	Click "Prepare Data" first to build features from AW

Auto-Reload¶

When training completes via the web UI while a tray instance is running, the newly trained model is automatically loaded as the active suggester. No manual model-switch is required.

Frontend¶

The training UI is accessible via the "training" tab in the StatePanel. It provides:

Data Readiness — date range picker and "Prepare Data" button that auto-builds missing features from ActivityWatch before showing feature coverage and label span counts.
Train Form — configurable boost rounds, class weight, synthetic data toggle, and a "Train Model" button.
Progress Indicator — real-time step, message, and progress bar fed by train_progress WebSocket events.
Result Display — macro/weighted F1 on completion with the model directory path.
Model List — all valid bundles sorted by creation date with metrics.

ui.server — Training Endpoints¶

Endpoints¶

POST /api/train/start¶

POST /api/train/build-features¶

GET /api/train/status¶

POST /api/train/cancel¶

GET /api/train/models¶

GET /api/train/models/current/inspect¶

GET /api/train/models/{model_id}/inspect¶

GET /api/train/data-check¶

WebSocket Events¶

train_progress¶

train_complete¶

train_failed¶

Troubleshooting¶

Auto-Reload¶

Frontend¶

`POST /api/train/start`¶

`POST /api/train/build-features`¶

`GET /api/train/status`¶

`POST /api/train/cancel`¶

`GET /api/train/models`¶

`GET /api/train/models/current/inspect`¶

`GET /api/train/models/{model_id}/inspect`¶

`GET /api/train/data-check`¶

`train_progress`¶

`train_complete`¶

`train_failed`¶