PWM-Pilot-Audio
The first pilot (audio-first)
PWM-Pilot-Audio is the first audio-only instantiation of PWM-Bench. It does not test the full multimodal thesis. It tests whether longitudinal audio-derived evidence improves prospective, calibrated, person-specific forecasting over the population prior (R1) and the personal-routine baseline (R2).
Protocol documents
Audio-first protocol bundle (v0.1, pre-registration draft — planned study, no results).
Design
Systems under test
| System | Description | Evidence | Notes |
|---|---|---|---|
| A | Population prior | L0 | Reference baseline (R1). |
| B | Digital exhaust | L0 | Calendar + communications metadata. |
| C | Digital exhaust + chat history | L0–L1 | Adds text evidence. |
| D | Digital exhaust + chat history + passive observational stream | L0–L3 | Adds the multimodal passive stream. |
Each system is additionally scored against the personal routine baseline (R2) and under identity permutation.
Hypotheses
H1 alternative
Evidence-based systems beat the population prior and the routine baseline.
H2 alternative
Skill is non-decreasing with evidence richness.
H3 alternative
Skill collapses under identity permutation.
H0 null
No system beats the personal routine baseline.
PWM-Pilot is designed so that H0 is genuinely falsifiable: the routine baseline (R2) is deliberately strong, and a system earns a claim of person-specific skill only by beating it under sealed, permutation-gated conditions.