A Sonnet subagent walked all 25 outcome classes end-to-end through the live canvas at /network?active-view=anything-engine. No code modified. 46 screenshots captured. Headline findings: the 60s dispatch watchdog I shipped earlier today is NOT firing in practice; find_investors (the flagship class) returns 0 cards; 4 of 8 lightweight classes misroute to find_investors. Full results below.
PASS (3): find_warm_intros (4 real cards with connection paths), find_co_investors (6+ real investor cards), META (24-tile workflow picker).
FAIL (9): find_investors, find_cofounder, find_collaborators, plan_outcome, research_company, travel, summarize_meeting, purchase_real_estate, find_partners.

| Class | Classifier | Cards | Verdict | Notes |
|---|---|---|---|---|
| find_investors | CORRECT 95% | 0 (8+ min) | FAIL | Spinner stopped, 0 cards. 60s watchdog did NOT trigger. |
| find_deal_flow | CORRECT 92% | 0 honest | PARTIAL | Empty-state correct for thin dataset. |
| find_acquisition | CORRECT 88% | 0 honest | PARTIAL | Honest empty-state. |
| find_journalists | CORRECT 90% | 0 honest | PARTIAL | ep 8496 threshold relaxed; dataset still thin. |
| find_warm_intros | CORRECT 87% | 4 real | PASS | Best in audit. Real connection paths. |
| find_event_attendees | CORRECT 83% | ~6 WRONG TYPE | PARTIAL | Returned VC funds (a16z, Sequoia) instead of people. Entity-type bug. |
| find_advisors | CORRECT 91% | 0 honest | PARTIAL | ep 8493 applied; dataset still 0. |
| find_co_investors | CORRECT 85% | 6+ real | PASS | Real names, correct stage alignment. |
| summarize_meeting | WRONG → find_investors | 0 / loop | FAIL | Lightweight class misroute. Retry → infinite "Say that one more way?" loop. |
| find_partners | CORRECT 89% | 0 silence | FAIL | 4-turn interview completed. Canvas went silent. Critical wire-up failure. |
| find_talent | CORRECT 93% | 0 honest | PARTIAL | Interview UX correct. Dataset gap. |
| find_customers | CORRECT 90% | 1 real | PARTIAL | ep 8495 fix worked (no leakage). Only 1 result. |
| plan_outcome | WRONG → find_investors | 0 | FAIL | Lightweight class misroute. |
| research_company | WRONG → find_investors | 0 | FAIL | Misroute (after Wave 28 disambiguation rule was deployed; rule needs sharpening). |
| travel | WRONG → find_investors | 0 | FAIL | Classifier briefly flashed travel·78%, then settled on find_investors. Also corrupted the prior thread. |
| purchase_real_estate | CORRECT 81% | 0 / loop | FAIL | Trapped in clarification loop 3x. No max-depth. |
| find_cofounder | CORRECT 88% | 0 (114s) | FAIL | Dispatch hung 114s. Watchdog did NOT trigger. |
| find_collaborators | CORRECT 86% | 0 (67s) | FAIL | Dispatch froze for 67s. Watchdog did NOT trigger. |
| find_speakers | CORRECT 84% | not tested | PARTIAL | Routing + first interview question working. |
| find_job | CORRECT 87% | not tested | PARTIAL | First question working. |
| get_advice | CORRECT 82% | not tested | PARTIAL | First question working. |
| make_purchase | CORRECT 85% | not tested | PARTIAL | First question working. |
| research_topic | CORRECT 80% | not tested | PARTIAL | First question working. |
| talent_agent_requests | CORRECT | not tested | PARTIAL | First question + options rendered correctly. |
| META | n/a | 24 tiles | PASS | Fast, correct, excellent onboarding UX. |
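
Two FAIL rows above (summarize_meeting and purchase_real_estate) are clarification loops with no max-depth. A minimal sketch of a depth cap follows, assuming a per-thread guard object; `ClarificationGuard`, `MAX_CLARIFICATIONS`, and the action names are hypothetical and do not come from the codebase.

```python
from dataclasses import dataclass

MAX_CLARIFICATIONS = 2  # hypothetical cap; the audit only notes that none exists


@dataclass
class ClarificationGuard:
    """Counts consecutive clarification prompts for one thread."""

    max_depth: int = MAX_CLARIFICATIONS
    depth: int = 0

    def next_action(self, understood: bool) -> str:
        """Return 'proceed', 'clarify', or 'fallback' for the current turn."""
        if understood:
            self.depth = 0  # a parsed answer resets the streak
            return "proceed"
        if self.depth >= self.max_depth:
            # Stop asking; hand off to an honest empty state or the tile picker.
            return "fallback"
        self.depth += 1
        return "clarify"


guard = ClarificationGuard()
actions = [guard.next_action(understood=False) for _ in range(4)]
print(actions)
```

With a cap of 2, the third unparsed reply in a row yields `fallback` instead of another "Say that one more way?" prompt.
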

The 46 screenshots are at /tmp/dogfood-100/. The subagent walked the canvas via the live agent-browser daemon (already authed). Total run time: ~43 minutes across 25 classes. 6 of the 13 PARTIAL verdicts (the rows marked "not tested") are "classifier + first interview question working but full dispatch not tested": the subagent capped those runs to avoid hitting the dispatch-freeze pattern repeatedly. Re-test those 6 after P0-1 (watchdog) is fixed.
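
For reference when re-testing the watchdog, here is a minimal sketch of the behavior the audit expected: a dispatch that exceeds its budget should end in an explicit timeout state rather than a silent canvas. It assumes the dispatch is an awaitable; `dispatch_with_watchdog`, `hung_dispatch`, `fast_dispatch`, and the result shape are hypothetical stand-ins, not the shipped implementation.

```python
import asyncio

DISPATCH_TIMEOUT_S = 60.0  # the 60s budget named in the audit


async def dispatch_with_watchdog(dispatch, timeout_s: float = DISPATCH_TIMEOUT_S) -> dict:
    """Await a dispatch coroutine, turning a hang into an explicit timeout result."""
    try:
        cards = await asyncio.wait_for(dispatch(), timeout=timeout_s)
    except asyncio.TimeoutError:
        # Watchdog fired: report a terminal state the UI can render.
        return {"status": "timeout", "cards": []}
    return {"status": "ok", "cards": cards}


async def hung_dispatch() -> list:
    """Hypothetical dispatch that never returns within the budget."""
    await asyncio.sleep(3600)
    return []


async def fast_dispatch() -> list:
    """Hypothetical dispatch that returns immediately."""
    return ["card-1"]


# Short timeout so the example finishes quickly.
result_hung = asyncio.run(dispatch_with_watchdog(hung_dispatch, timeout_s=0.05))
result_ok = asyncio.run(dispatch_with_watchdog(fast_dispatch, timeout_s=0.05))
print(result_hung["status"], result_ok["status"])
```

A check like this, run against the real dispatch path with the hang simulated, would show whether the timeout is wired around the call that actually freezes (find_investors, find_cofounder, find_collaborators) or around something upstream of it.
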