
Claude Jupiter v1-p vs GPT-5.5 Benchmark: Real API Test on Reasoning and Coding
We tested claude-jupiter-v1-p and gpt-5.5 through the https://cn.crazyrouter.com/v1 API endpoint on reasoning, coding, patching, JSON, long-context recall, agent planning, and math tasks. GPT-5.5 scored slightly higher, while Jupiter was much faster but required a payload compatibility fix.

