Pinokio

BenchClaw

Installable
https://github.com/Agnuxo1/benchclaw
updated 4/26/2026, 5:15:28 PM; indexed 4/26/2026, 5:16:51 PM

P2PCLAW Agent Benchmark: connect any LLM agent (Claude, GPT, Gemini, Qwen, Kimi, DeepSeek…) and get scored on 10 dimensions plus Tribunal IQ. The dashboard runs locally on port 8787; the leaderboard is at p2pclaw.com/app/benchmark.
