Founder, WAITDEAD · vertical intelligence studio · Mendoza, Argentina
Shipping doesn't sleep. Decisions, not dashboards.
- WAITDEAD — vertical intelligence studio. Eight catalogs, one engine. PriceIntel live for retailers and distributors; LeadRadar, CompetiSignal, RateScope, EstateLens, HireSignal, RegWatch, SupplyPulse on demand (7–14 day build). Free 48-hour sample brief, USD 600–1,500/mo per product, cancel any time.
- REVCLI — governed AI workflow platform with paying customers in production. Human approval gates, audit trails, policy engine.
no-vibes empirical baseline against the MAST (Multi-Agent System failure Taxonomy, Cemri et al., NeurIPS 2025 Datasets & Benchmarks spotlight) MAD dataset:
- F1 0.815 (95% CI [0.615, 0.941]) on the n=19 human-labelled subset against MAST mode 3.3 ("No or Incorrect Verification")
- Fleiss κ 1.000 inter-annotator agreement on mode 3.3 specifically (paper's overall κ=0.88 is on a different 150-trace taxonomy-development set)
- Per-MAS breakdown: result is not driven by a single framework
- Productized as
crewai-no-vibes—pip install crewai-no-vibes, pure-Python CrewAI Task guardrail, zero runtime deps
Full empirical write-up with bootstrap methodology, per-MAS breakdown, and per-mode Fleiss kappa: evaluation/MAST-RESULTS.md.
The discipline that makes this credible: of 13 hooks conceptually mapped to MAST modes, only no-vibes produces measurable F1 > 0 at the trace-level baseline. The other 12 are documented honestly as "conceptually mapped, no measured signal yet" rather than overclaimed. Honest demotion compounds.
| Repo | What it does |
|---|---|
llm-dark-patterns |
Apache-2.0 28-hook suite for runtime enforcement of LLM dark patterns at the Claude Code Stop event. Out-of-band bash + jq judge, deterministic; the model that produced the closeout cannot rewrite the verdict from inside its own output. |
agent-closeout-bench |
Rust YAML rule-pack engine + 175 fixtures + 73 tests. Powers the MAST-EVAL + DarkBench v1.5-rust head-to-head. |
crewai-no-vibes |
Pure-Python port of the evidence_claims rule pack as a CrewAI Task.guardrails function. Apache-2.0, zero dependencies, on PyPI. |
minmaxing |
Production Claude Code harness — /opusworkflow, governed model profiles, 5-tier memory, parallel orchestration. |
claude-plugins |
Self-hosted Claude Code plugin marketplace. Bypasses the stalled Anthropic community pipeline. |
no-vibes, no-sycophancy, no-cliffhanger |
Standalone single-file Stop hooks for users who want one hook without the suite. |
- MAST — Cemri et al., NeurIPS 2025 Datasets & Benchmarks spotlight
- DarkBench — Kran et al., ICLR 2025
- DarkPatterns-LLM
- ELEPHANT sycophancy taxonomy — Cheng et al.
- ACM IUI 2025 false-memory recall
- restlessmachine.com — personal site
- waitdead.com — WAITDEAD brand
- @flazzar1n on X
fernando@waitdead.com
8+ years RevOps · 50+ audits · 13 industries · 9 CRM platforms. Operated from Mendoza, Argentina under SmoothRevenue LLC (Wyoming).


