PWM-Pilot-Audio

The first pilot (audio-first)

PWM-Pilot-Audio is the first audio-only instantiation of PWM-Bench. It does not test the full multimodal thesis. It tests whether longitudinal audio-derived evidence improves prospective, calibrated, person-specific forecasting over the population prior (R1) and the personal-routine baseline (R2).

Protocol documents

Audio-first protocol bundle (v0.1, pre-registration draft — planned study, no results).

Full Protocol (PDF)Participant Summary (PDF)Advisor Summary (PDF)

Design

Participants: 5 consenting participants
Duration: 30 days
Observation: Up to 12 hours/day of possible passive observation stream
Forecasts: Prospective, sealed; several hundred to low-thousands of forecast instances
Comparison: Evidence-tier comparison across four systems

Systems under test

System	Description	Evidence	Notes
A	Population prior	L0	Reference baseline (R1).
B	Digital exhaust	L0	Calendar + communications metadata.
C	Digital exhaust + chat history	L0–L1	Adds text evidence.
D	Digital exhaust + chat history + passive observational stream	L0–L3	Adds the multimodal passive stream.

Each system is additionally scored against the personal routine baseline (R2) and under identity permutation.

Hypotheses

H1 alternative

Evidence-based systems beat the population prior and the routine baseline.

H2 alternative

Skill is non-decreasing with evidence richness.

H3 alternative

Skill collapses under identity permutation.

H0 null

No system beats the personal routine baseline.

PWM-Pilot is designed so that H0 is genuinely falsifiable: the routine baseline (R2) is deliberately strong, and a system earns a claim of person-specific skill only by beating it under sealed, permutation-gated conditions.