[new-plugin] clawvard-agent-eval#88
Conversation
Skill-only plugin that routes agents through Clawvard's entrance exam across 8 capability dimensions (Understanding, Execution, Retrieval, Reasoning, Reflection, Tooling, EQ, Memory). Supports authenticated retakes via persisted token with explicit user confirmation gate. Source: https://github.com/THEZIONLABS/clawvard-agent-eval
✅ Phase 1: Structure Validation — PASSED→ Proceeding to Phase 2: Build Verification |
📋 Phase 3: AI Code Review Report — Score: 86/100
1. Plugin Overview
Summary: This plugin guides an AI agent through the Clawvard entrance exam — a capability benchmark across 8 dimensions (Understanding, Execution, Retrieval, Reasoning, Reflection, Tooling, EQ, Memory). It walks through 16 questions in 8 batches via Clawvard's REST API, reports the result, and optionally saves an identity token after explicit user confirmation. Target Users: Agent developers and users who want to benchmark/grade an AI agent's capabilities and obtain a Clawvard report or identity token for authenticated retakes. 2. Architecture AnalysisComponents: Skill Structure: Data Flow:
Dependencies: 3. Auto-Detected Permissionsonchainos Commands Used
Wallet Operations
External APIs / URLs
Chains Operated OnNone — plugin does not interact with any blockchain. Overall Permission SummaryThe plugin sends agent identification, exam answers, and optional reasoning traces to a single external domain ( 4. onchainos API ComplianceDoes this plugin use onchainos CLI for all on-chain write operations?N/A — this plugin performs no on-chain operations. On-Chain Write Operations (MUST use onchainos)
Data Queries (allowed to use external sources)
External APIs / Libraries DetectedOnly Verdict: ✅ Fully CompliantPlugin has no on-chain functionality, so onchainos compliance does not apply. The Plugin Store guidelines explicitly state non-onchainos usage is acceptable. 5. Security AssessmentStatic Rule Scan (C01-C09, H01-H09, M01-M08, L01-L02)
No other static rules match. C01-C09 (command injection, prompt injection, obfuscation, credential exfiltration, etc.): not detected. H01-H09: no hardcoded secrets, no credential output, no persistence, no sensitive-path access, no destructive operations, no plaintext .env writes, no credential solicitation, no signed-tx CLI parameter. M01/M02/M04-M06/M08: no install commands, no resource exhaustion, no dynamic execution, no skill chaining. L01/L02: no tool enumeration or undeclared network endpoints. LLM Judge Analysis (L-PINJ, L-MALI, L-MEMA, L-IINJ, L-AEXE, L-FINA, L-FISO)
Toxic Flow Detection (TF001-TF006)No toxic flows detected. The plugin does not combine sensitive-path access + network exfiltration (TF001), prompt-injection + persistence (TF002), unverified-deps + malicious intent (TF004), curl|sh + financial API (TF005), or external data + financial operation (TF006). Prompt Injection ScanNo instruction-override patterns, no DAN/jailbreak language, no pseudo-system tags, no HTML comments with hidden instructions, no base64 payloads, no unicode/hex obfuscation, no backtick command substitution. Result: ✅ Clean Dangerous Operations CheckThe plugin does not transfer funds, sign blockchain transactions, broadcast transactions, or call smart contracts. The only state-changing operation is persisting a Clawvard token to local storage, which the SKILL.md explicitly gates behind user confirmation with disclosure. Result: ✅ Safe Data Exfiltration RiskThe plugin sends data only to one declared domain ( Result: ✅ No Risk (with the guidance provided) Overall Security Rating: 🟢 Low Risk6. Source Code Security (if source code is included)Skipped — plugin has no source code or build section. 7. Code ReviewQuality Score: 86/100
Strengths
Issues Found
8. Language Check
9. SUMMARY.md Review
11. Recommendations
12. Reviewer SummaryOne-line verdict: A clean, low-risk utility skill that benchmarks an agent against the Clawvard exam; well-scoped, with strong consent gating around token persistence, and only minor documentation gaps. Merge recommendation: Blockers (if any — list every issue that MUST be fixed before merge, each prefixed with ❌): No blockers found. Non-blocking improvements to address:
Generated by Claude AI via Anthropic API — review the full report before approving. |
✅ Phase 4: Publish CompletePlugins:
Published by Plugin Store CI |
Add the Clawvard agent entrance-exam evaluator plugin (skill-only) to the production okx/plugin-store. Originally landed in the staging mirror mig-pre#88; this PR ports it over with all internal references retargeted to okx/plugin-store. Changes: - skills/clawvard-agent-eval/ — full plugin tree (5 files) - registry.json — +1 entry, alphabetically inserted before compound-v3-plugin (count 36 → 37) - .claude-plugin/marketplace.json — +1 entry, same alphabetical slot (count 36 → 37) mig-pre → okx replacements applied in SKILL.md (2 occurrences): - Pre-flight version-check curl URL: raw.githubusercontent.com/mig-pre → raw.githubusercontent.com/okx - Auto-update install command: npx skills add mig-pre/plugin-store → npx skills add okx/plugin-store The other 4 plugin files (LICENSE, SUMMARY.md, plugin.yaml, plugin.json) are byte-for-byte identical to mig-pre — verified via SHA-256.
Plugin Submission
Plugin name: clawvard-agent-eval
Version: 0.1.0
Type: new-plugin
Checklist
plugin-store lintpasses locally with no errorsokx-,official-,plugin-store-)nameanddescriptionWhat does this plugin do?
Clawvard Agent Evaluation is a skill-only plugin that routes AI agents through Clawvard's entrance exam across 8 capability dimensions: Understanding, Execution, Retrieval, Reasoning, Reflection, Tooling, EQ, and Memory.
The exam has 16 questions in 8 batches. After completion, the agent can persist its Clawvard identity token for authenticated retakes — but only after explicit user confirmation.
Which onchainos commands does it use?
None — this is a pure skill plugin (no binary, no on-chain transactions).
Security Considerations
clawvard.school.Source: https://github.com/THEZIONLABS/clawvard-agent-eval