DeFiPunk'd

Snapshot of 4,102 live protocols · 254 reviewed · 806 model submissions.

Growth over time

361 commits on main

EVIDENCE: Per-protocol factual evidence crawled from public sources (audits, source code, adapter, control structure). This is the universe of protocols we have evidence for.
4,951 protocols tracked
20,175 .json files
REPORTS: Raw individual LLM runs from DEFI@home contributors, one file per (protocol, slice, run). Multiple runs accumulate per slice until ≥3 agree.
255 protocols tracked
804 .json files
VERDICTS: Quorum-merged verdicts, produced once ≥3 independent runs agree on the grade and cite overlapping evidence; one file per (protocol, slice).
34 protocols tracked
127 .json files
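
The quorum rule described under REPORTS and VERDICTS can be sketched roughly as follows. This is a minimal illustration, not the project's actual merge code: the `Run` record, the `merge_slice` function, and the evidence identifiers are all hypothetical, and "overlapping evidence" is read here as "at least one evidence item is cited by three or more of the agreeing runs", which is an assumption.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import List, Optional

QUORUM = 3  # from the text: a verdict needs at least 3 agreeing runs


@dataclass(frozen=True)
class Run:
    """One LLM run for a single (protocol, slice). Hypothetical shape."""
    model: str
    grade: str            # e.g. "green", "orange", "red"
    evidence: frozenset   # identifiers of the evidence items the run cited


def merge_slice(runs: List[Run]) -> Optional[dict]:
    """Return a merged verdict if some grade reaches quorum, else None."""
    by_grade = defaultdict(list)
    for run in runs:
        by_grade[run.grade].append(run)

    # Try the most popular grade first (an assumption about tie handling).
    for grade, group in sorted(by_grade.items(), key=lambda kv: len(kv[1]), reverse=True):
        if len(group) < QUORUM:
            continue
        # Assumed meaning of "overlapping evidence": some evidence item is
        # cited by QUORUM or more of the runs that agree on this grade.
        cited = Counter(item for run in group for item in run.evidence)
        shared = sorted(item for item, n in cited.items() if n >= QUORUM)
        if shared:
            return {"grade": grade, "runs": len(group), "shared_evidence": shared}
    return None


# Illustrative runs for one (protocol, slice); names are made up.
runs = [
    Run("model-a", "red", frozenset({"audit-1", "proxy-admin"})),
    Run("model-b", "red", frozenset({"proxy-admin"})),
    Run("model-c", "orange", frozenset({"timelock"})),
    Run("model-d", "red", frozenset({"proxy-admin", "multisig"})),
]
verdict = merge_slice(runs)
```

With the four illustrative runs, three agree on "red" and all three cite "proxy-admin", so a verdict is produced; with only the first three runs there is no quorum and the slice stays open.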

Grade distribution per slice

Across all live protocols. Verifiability and Autonomy are rule-based so they cover ~every protocol; Control / Ability to exit / Open Access only get a color once an AI consensus lands, so most are still unknown. When unknown dwarfs the colored portion, the unknown segment is capped with a ⫽ axis break.

  • Control
      • green: 3 of 4,102. Contracts can't be changed, or any change goes through a long delay (≥7 days) plus credible governance.
      • orange: 16 of 4,102. Upgrades go through a short delay or a small group with weak governance.
      • red: 80 of 4,102. A single key holder or small multisig can change the contracts immediately, with no delay.
      • unknown: 4,003 of 4,102. Couldn't tell who controls upgrades.
  • Ability to exit
      • green: 7 of 4,102. Anyone can withdraw at any time; pauses are limited and can't trap funds for long.
      • orange: 9 of 4,102. Withdrawals can be paused broadly or delayed beyond 7 days under certain governance actions.
      • red: 84 of 4,102. An admin can block withdrawals indefinitely, or there's no on-chain way to exit at all.
      • unknown: 4,002 of 4,102. Couldn't tell whether users can always exit.
  • Autonomy
      • green: 4 of 4,102. Works on its own: even if outside services (oracles, bridges, keepers) fail, user principal stays safe.
      • orange: 15 of 4,102. An outside dependency could pause withdrawals or hurt yields, but can't steal user principal.
      • red: 532 of 4,102. Failure of a single oracle, bridge, or operator could let someone take user funds.
      • unknown: 3,551 of 4,102. Couldn't audit the external dependencies.
  • Open Access
      • green: 11 of 4,102. No KYC, no allowlist, and reachable through more than just the official UI (SDKs, third-party apps, aggregators).
      • orange: 3 of 4,102. Permissionless on-chain, but in practice only the official UI can talk to it.
      • red: 82 of 4,102. KYC, allowlist, blocklist, or admin approval is required to use the protocol.
      • unknown: 4,006 of 4,102. Couldn't tell what restrictions apply.
  • Verifiability
      • green: 1,373 of 4,102. Source code is public, matches what's deployed, and was recently audited by a recognized firm.
      • orange: 936 of 4,102. Source or audit exists but is partly stale, partial, or only from minor firms.
      • red: 1,793 of 4,102. No source code, no audit, or the deployed code isn't verifiable on the explorer.
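
As a quick arithmetic check on the list above, the per-slice counts can be summed and turned into a "colored share" per slice. This is only a sketch: the counts are copied from the list, while the `SLICES` table and the `graded_share` helper are names made up for the illustration.

```python
TOTAL = 4102  # live protocols in this snapshot

# Counts copied from the list above: (green, orange, red, unknown).
SLICES = {
    "Control":         (3, 16, 80, 4003),
    "Ability to exit": (7, 9, 84, 4002),
    "Autonomy":        (4, 15, 532, 3551),
    "Open Access":     (11, 3, 82, 4006),
    "Verifiability":   (1373, 936, 1793, 0),
}


def graded_share(counts):
    """Fraction of protocols that have a color (anything but unknown)."""
    green, orange, red, _unknown = counts
    return (green + orange + red) / TOTAL


for name, counts in SLICES.items():
    # Every slice should account for all live protocols.
    assert sum(counts) == TOTAL
    print(f"{name:<16} colored share {graded_share(counts):6.1%}")
```

Each slice sums to 4,102, so the counts are internally consistent with the snapshot total.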

Most-reviewed protocols

Top 10 by total model submissions across all five slices.

  1. Uniswap V4: 26
  2. Aave: 23
  3. EigenCloud: 21
  4. Lido: 21
  5. Pendle: 21
  6. Railgun: 21
  7. Rocket Pool: 21
  8. Base Bridge: 19
  9. WBTC: 19
  10. Aave V3: 18

Model breakdown

Submissions per model.

  1. claude-haiku-4-5 (autorun) · Low (hallucination-prone), quorum weight ×0.05 (20× penalty) · tier 1 of 3 · 205 submissions
  2. claude-sonnet-4-6 (autorun) · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 113 submissions
  3. claude-opus-4-7 · High (thinking model), full quorum weight · tier 3 of 3 · 78 submissions
  4. gpt-5.5-thinking · High (thinking model), full quorum weight · tier 3 of 3 · 63 submissions
  5. gemini-3-flash-preview · Low (hallucination-prone), quorum weight ×0.05 (20× penalty) · tier 1 of 3 · 50 submissions
  6. gpt-5.5 · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 50 submissions
  7. claude-sonnet-4-6 · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 42 submissions
  8. grok-4 · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 29 submissions
  9. claude-sonnet-4-5 (autorun) · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 23 submissions
  10. GPT-5.5 Thinking · High (thinking model), full quorum weight · tier 3 of 3 · 21 submissions
  11. grok-3 · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 16 submissions
  12. chatgpt-5 · Low (hallucination-prone), quorum weight ×0.05 (20× penalty) · tier 1 of 3 · 15 submissions
  13. grok-xai · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 15 submissions
  14. claude-opus-4-6 (autorun) · High (thinking model), full quorum weight · tier 3 of 3 · 13 submissions
  15. grok-built-by-xai · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 12 submissions
  16. claude-opus-4-7 (autorun) · High (thinking model), full quorum weight · tier 3 of 3 · 9 submissions
  17. gemini-3.1-pro · High (thinking model), full quorum weight · tier 3 of 3 · 9 submissions
  18. GPT-5.5 Pro · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 7 submissions
  19. chatgpt-thinking-xhigh-5-5 · High (thinking model), full quorum weight · tier 3 of 3 · 5 submissions
  20. gemini-3-flash · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 5 submissions
  21. unknown (autorun) · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 5 submissions
  22. grok-2 · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 3 submissions
  23. gemini-3-pro · High (thinking model), full quorum weight · tier 3 of 3 · 2 submissions
  24. gpt-5.5-pro · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 2 submissions
  25. claude-haiku-4-5 · Low (hallucination-prone), quorum weight ×0.05 (20× penalty) · tier 1 of 3 · 1 submission
  26. claude-opus-4-20250514 · High (thinking model), full quorum weight · tier 3 of 3 · 1 submission
  27. claude-opus-4-5 (autorun) · High (thinking model), full quorum weight · tier 3 of 3 · 1 submission
  28. claude-opus-4-6 · High (thinking model), full quorum weight · tier 3 of 3 · 1 submission
  29. claude-opus-4-8 (autorun) · High (thinking model), full quorum weight · tier 3 of 3 · 1 submission
  30. deepseek-reasoner · High (thinking model), full quorum weight · tier 3 of 3 · 1 submission
  31. gpt-5-codex · Low (hallucination-prone), quorum weight ×0.05 (20× penalty) · tier 1 of 3 · 1 submission
  32. gpt-5.2-pro · Low (hallucination-prone), quorum weight ×0.05 (20× penalty) · tier 1 of 3 · 1 submission
  33. GPT-5.4 Pro · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 1 submission
  34. gpt-5.4-thinking · High (thinking model), full quorum weight · tier 3 of 3 · 1 submission
  35. grok-3-pro · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 1 submission
  36. grok-4-preview · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 1 submission
  37. grok-beta · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 1 submission
  38. grok-xai-4 · Medium (non-thinking), quorum weight ×0.2 (5× penalty) · tier 2 of 3 · 1 submission
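
The quorum weights listed above (×0.05 for Low, ×0.2 for Medium, full weight for High) suggest a weighted tally of votes. The page does not say how the weights enter the quorum, so the sketch below simply assumes that each submission contributes its tier weight to the grade it voted for. `TIER_WEIGHT` and `weighted_tally` are hypothetical names, and "full quorum weight" is taken to mean a weight of 1.

```python
from collections import defaultdict
from fractions import Fraction

# Weights as listed above; "full quorum weight" is assumed to mean 1.
TIER_WEIGHT = {
    "High": Fraction(1),
    "Medium": Fraction(1, 5),   # x0.2, a 5x penalty
    "Low": Fraction(1, 20),     # x0.05, a 20x penalty
}


def weighted_tally(votes):
    """Sum tier weights per grade for an iterable of (tier, grade) votes."""
    tally = defaultdict(Fraction)
    for tier, grade in votes:
        tally[grade] += TIER_WEIGHT[tier]
    return dict(tally)


# Illustrative votes for one slice: one High and one Medium run say "red",
# two Low runs say "orange".
votes = [("High", "red"), ("Medium", "red"), ("Low", "orange"), ("Low", "orange")]
tally = weighted_tally(votes)
```

Under this reading, twenty Low-tier submissions carry the same weight as a single High-tier submission, and five Medium-tier submissions do the same. Exact fractions are used so the penalties stay exact instead of accumulating floating-point error.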

Tier distribution

Protocols by medal tier across the registry. The Ungraded segment is capped — the 4,042 ungraded protocols would otherwise hide the graded tiers.

  • Gold 1
  • Silver 13
  • Bronze 11
  • Wood 35
  • Ungraded 4,042

TVL by tier

Total TVL across live protocols: $517.3B. Segment widths are proportional to dollars; the Ungraded segment is capped to keep the graded tiers visible.

  • Gold $4.3B
  • Silver $62.4B
  • Bronze $39.0B
  • Wood $303.2B
  • Ungraded $108.4B
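
The proportional-width-with-cap layout used by the tier bars can be sketched as below. The capping rule is not spelled out on the page, so this assumes one plausible version: the capped segment is drawn no larger than `max_ratio` times the sum of the other segments, and everything is then normalized. `segment_widths` and `max_ratio` are hypothetical names; the counts and dollar figures are the ones listed under Tier distribution and TVL by tier.

```python
def segment_widths(values, capped_key, max_ratio=1.0):
    """Return fractional bar widths, clamping one segment.

    Assumed rule: the capped segment is drawn no larger than max_ratio
    times the sum of all other segments; the rest stay proportional.
    """
    others = sum(v for k, v in values.items() if k != capped_key)
    drawn = dict(values)
    drawn[capped_key] = min(values[capped_key], max_ratio * others)
    total = sum(drawn.values())
    return {k: v / total for k, v in drawn.items()}


# Protocol counts per tier and TVL per tier in billions of dollars.
TIER_COUNTS = {"Gold": 1, "Silver": 13, "Bronze": 11, "Wood": 35, "Ungraded": 4042}
TVL_BILLIONS = {"Gold": 4.3, "Silver": 62.4, "Bronze": 39.0, "Wood": 303.2, "Ungraded": 108.4}

count_widths = segment_widths(TIER_COUNTS, "Ungraded")
tvl_widths = segment_widths(TVL_BILLIONS, "Ungraded")
```

With this rule the 4,042 ungraded protocols are clamped to the same width as the 60 graded ones combined, while the Ungraded TVL segment is left at its true proportion because it is already smaller than the graded total.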

Slice coverage matrix

For each protocol with at least one submission: which of the five slices have reached AI consensus.

  • Strong consensus
  • Weak consensus
  • Insufficient submissions
  • Models disagree
  • · No submissions
Protocol · Tier · Control · Ability to exit · Autonomy · Open Access · Verifiability