PerspectiveGap Leaderboard

Paper: arXiv:2606.08878
Dataset: sun1245/PerspectiveGap
Code and scorers: GitHub

PerspectiveGap evaluates whether models can compose role-specific prompts for multi-agent orchestration without over-sharing context.

This leaderboard summarizes the released paper sweep: 33 models × 220 rendered evaluations × 2 tasks.

Current top model: openai/gpt-5.5 with 62.0% combined pass rate.