- Tested parallel 1-round, sequential 1-round, debate/parallel 3-round - 3 rounds is sweet spot: positions converge, meaningful evolution - Sequential most token-efficient; parallel 3-round best depth-to-cost - Debate and parallel 3-round mechanically identical (prompt tone differs) - Added cost profiles, recommended defaults by use case - Updated TODOs: unify flows, test 2-round, test mixed model tiers