Local run · Run id: run-local-41e3f0f35701

mistral-7b-q4km

on arc-challenge · openrouter · n=50 · Sun, 26 Apr 2026 07:40:59 GMT
62.0
±13.0 · 95% CI [48.2, 74.1]
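The page does not say how the interval is computed. The reported figures are, however, consistent with a Wilson score interval on 31 correct answers out of 50. The sketch below makes two assumptions: the score is percent accuracy (so 62.0 on n=50 means 31 correct), and z = 1.96 is used for the 95% level. The function name `wilson_interval` is illustrative and not part of any Benchlist tooling.

```python
import math


def wilson_interval(correct: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion, as fractions in [0, 1]."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = correct / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Assumption: a score of 62.0 on n=50 corresponds to 31 correct answers.
low, high = wilson_interval(31, 50)
half_width = (high - low) / 2
print(f"95% CI [{100 * low:.1f}, {100 * high:.1f}], half-width {100 * half_width:.1f}")
```

Under these assumptions the bounds come out at roughly 48.2 and 74.1 with a half-width of about 13.0, matching the page. Note that the interval is not symmetric around 62.0, so the "±13.0" figure is best read as the half-width of the interval rather than an offset from the score.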
mistral-7b-q4km scored 62.0 on arc-challenge across 50 problems. The transcript is committed to Merkle root sha256:23cdf4c162e60cf… and signed by attestor benchlist-local-ollama with Ed25519 signature 14524413d71ae098c10228…. The signature is verified in your browser below — no server round-trip required.
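For readers unfamiliar with Merkle commitments, the following is a generic sketch of how a list of transcript entries can be reduced to a single SHA-256 root. The page does not document the leaf encoding, the pairing rule, or the handling of an odd node. Everything here (hashing each entry as raw bytes, concatenating child digests, duplicating the last node on odd levels, the sample entries) is an assumption chosen for illustration and will not necessarily reproduce the root shown on this page.

```python
import hashlib


def merkle_root(leaves: list[bytes]) -> str:
    """Fold leaves into one SHA-256 root (hypothetical construction)."""
    if not leaves:
        raise ValueError("at least one leaf is required")
    # Leaf level: hash each transcript entry on its own.
    level = [hashlib.sha256(leaf).digest() for leaf in leaves]
    while len(level) > 1:
        if len(level) % 2 == 1:
            # Assumed rule: duplicate the last node when the level is odd.
            level.append(level[-1])
        # Parent digest = SHA-256 of the two concatenated child digests.
        level = [
            hashlib.sha256(level[i] + level[i + 1]).digest()
            for i in range(0, len(level), 2)
        ]
    return "sha256:" + level[0].hex()


# Hypothetical transcript entries, one per problem.
entries = [b'{"id": 1, "answer": "A"}', b'{"id": 2, "answer": "C"}', b'{"id": 3, "answer": "B"}']
root = merkle_root(entries)

# Changing a single entry changes the root, which is what makes the commitment useful.
tampered = merkle_root([entries[0], b'{"id": 2, "answer": "D"}', entries[2]])
print(root != tampered)
```

The point of the structure is that the signature only needs to cover the root: any later edit to a transcript entry yields a different root, and the signed value no longer matches.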
Dataset hash: sha256:144e9a13fb369f31007fffdcf4d7d55692677b409c8fa7b7dec4328c81a55752
Methodology hash: sha256:8e84e6ffec11c082a286373b8b306600732cdf99b514079bfc0754fe4cd7a7c5
Merkle root: sha256:23cdf4c162e60cf823b89872539e148d212de7cc64f89864c7a2c7e06aa8dfe6
Attestor pubkey: f82b412efec2f9bd732efd7786568ad1dde8b788c79bdba2134be01f68e8ff79
Signature: 14524413d71ae098c10228fca3820d38aa02bc537bb07c6acb3bf1e37a779454d894fb234d39065603626eb13ed11ce7499b5cf731ada8b9eca81ab3b8bd2f01
Runner: benchlist-local-ollama@1.0.0
Started: 2026-04-26T07:39:11Z
Finished: 2026-04-26T07:40:59Z
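Before attempting any cryptographic verification, it can help to confirm that the fields above are well formed. The sketch below checks that the three hashes decode to 32 bytes (SHA-256), that the public key is 32 bytes and the signature 64 bytes (the standard Ed25519 sizes), and derives the wall-clock duration from the two timestamps. The dictionary keys and the helper `decode_hex` are illustrative names, not part of any published schema. The code does not verify the signature itself, since the page does not state which bytes were signed.

```python
from datetime import datetime

# Values copied from the fields listed above; key names are illustrative.
record = {
    "dataset_hash": "sha256:144e9a13fb369f31007fffdcf4d7d55692677b409c8fa7b7dec4328c81a55752",
    "methodology_hash": "sha256:8e84e6ffec11c082a286373b8b306600732cdf99b514079bfc0754fe4cd7a7c5",
    "merkle_root": "sha256:23cdf4c162e60cf823b89872539e148d212de7cc64f89864c7a2c7e06aa8dfe6",
    "attestor_pubkey": "f82b412efec2f9bd732efd7786568ad1dde8b788c79bdba2134be01f68e8ff79",
    "signature": (
        "14524413d71ae098c10228fca3820d38aa02bc537bb07c6acb3bf1e37a779454"
        "d894fb234d39065603626eb13ed11ce7499b5cf731ada8b9eca81ab3b8bd2f01"
    ),
    "started": "2026-04-26T07:39:11Z",
    "finished": "2026-04-26T07:40:59Z",
}


def decode_hex(value: str, prefix: str = "") -> bytes:
    """Strip a required prefix (if any) and decode the rest as hex."""
    if not value.startswith(prefix):
        raise ValueError(f"expected prefix {prefix!r}")
    return bytes.fromhex(value[len(prefix):])


# SHA-256 digests are 32 bytes long.
for key in ("dataset_hash", "methodology_hash", "merkle_root"):
    assert len(decode_hex(record[key], "sha256:")) == 32, key

# Ed25519 public keys are 32 bytes and signatures are 64 bytes.
pubkey = decode_hex(record["attestor_pubkey"])
signature = decode_hex(record["signature"])
assert len(pubkey) == 32 and len(signature) == 64

# Wall-clock duration of the run, from the two UTC timestamps.
fmt = "%Y-%m-%dT%H:%M:%SZ"
duration = datetime.strptime(record["finished"], fmt) - datetime.strptime(record["started"], fmt)
seconds = duration.total_seconds()
print(f"run took {seconds:.0f} s, about {seconds / 50:.2f} s per problem")
```

With the timestamps above the run lasted 108 seconds, or about 2.16 seconds per problem for n=50.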
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-local-41e3f0f35701