| Paper | Finding | Key Number |
|---|---|---|
| #08 | Larger models LESS faithful | 7/8 tasks |
| #10 | Claude 3.7 Sonnet 25% faithful, DeepSeek R1 39% | 25-39% |
| #43 | 40-60% unfaithfulness rate; OOD → 74% unfaithful | 74% OOD |
| #62 | Faithfulness-accuracy tradeoff exists | GPT-4: lowest faith |
| #247 | "Reasoning Horizon" at 70-85% of chain | <20% causal |
| #257 | Post-hoc reasoning = forward reasoning quality | 66.1% > 64.6% |
Key Insight: Paper #257 provides the smoking gun — if post-hoc
reasoning is AS GOOD as forward reasoning, the "reasoning" is narrative construction,
not computation.