Fernando Lazzarin waitdeadai

Fernando Lazzarin

Founder, WAITDEAD · vertical intelligence studio · Mendoza, Argentina

Shipping doesn't sleep. Decisions, not dashboards.

What I'm building

WAITDEAD — vertical intelligence studio. Eight catalogs, one engine. PriceIntel live for retailers and distributors; LeadRadar, CompetiSignal, RateScope, EstateLens, HireSignal, RegWatch, SupplyPulse on demand (7–14 day build). Free 48-hour sample brief, USD 600–1,500/mo per product, cancel any time.
REVCLI — governed AI workflow platform with paying customers in production. Human approval gates, audit trails, policy engine.

Just shipped

Empirical baseline for no-vibes against the MAD dataset from MAST (Multi-Agent System Failure Taxonomy; Cemri et al., NeurIPS 2025 Datasets & Benchmarks spotlight):

F1 0.815 (95% CI [0.615, 0.941]) on the n=19 human-labelled subset against MAST mode 3.3 ("No or Incorrect Verification")
Fleiss κ 1.000 inter-annotator agreement on mode 3.3 specifically (paper's overall κ=0.88 is on a different 150-trace taxonomy-development set)
Per-MAS breakdown: result is not driven by a single framework
Productized as crewai-no-vibes — pip install crewai-no-vibes, pure-Python CrewAI Task guardrail, zero runtime deps
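
For readers unfamiliar with how a confidence interval on F1 is obtained from a small labelled set, here is a minimal sketch of a percentile bootstrap. It is not the methodology from the write-up, which may differ in resampling scheme and interval type. The function names, the resample count, and the toy labels are all assumptions made for illustration; the labels are not the n=19 subset.

```python
import random


def f1_score(y_true, y_pred):
    """Binary F1 from parallel lists of 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def bootstrap_f1_ci(y_true, y_pred, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap: resample traces with replacement, recompute F1."""
    rng = random.Random(seed)
    n = len(y_true)
    scores = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        scores.append(f1_score([y_true[i] for i in idx],
                               [y_pred[i] for i in idx]))
    scores.sort()
    lo = scores[int((alpha / 2) * n_resamples)]
    hi = scores[min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))]
    return f1_score(y_true, y_pred), lo, hi


# Toy labels, purely illustrative (NOT the MAST data).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
point, lo, hi = bootstrap_f1_ci(y_true, y_pred)
print(f"F1 {point:.3f} (95% CI [{lo:.3f}, {hi:.3f}])")
```

With a sample this small the interval is wide, which is why the reported result carries its CI alongside the point estimate.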

Full empirical write-up with bootstrap methodology, per-MAS breakdown, and per-mode Fleiss kappa: evaluation/MAST-RESULTS.md.
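
As a reference for the per-mode agreement figure, the following is a minimal sketch of the standard Fleiss kappa formula. The count-matrix input layout and the three-rater toy example are assumptions for illustration; the actual annotation counts live in the write-up.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa.

    counts[i][j] is the number of raters who put item i in category j.
    Every item must be rated by the same number of raters (at least 2).
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each item needs the same number (>= 2) of ratings")
    n_cats = len(counts[0])

    # Share of all ratings that fell in each category.
    p_j = [sum(row[j] for row in counts) / (n_items * n_raters)
           for j in range(n_cats)]
    # Observed agreement per item, then averaged.
    p_i = [(sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
           for row in counts]
    p_bar = sum(p_i) / n_items
    # Agreement expected by chance.
    p_e = sum(p * p for p in p_j)
    if p_e == 1.0:
        return float("nan")  # undefined when every rating is in one category
    return (p_bar - p_e) / (1 - p_e)


# Toy example (hypothetical): 3 raters, binary label, full agreement.
perfect = [[3, 0], [0, 3], [3, 0], [0, 3]]
print(fleiss_kappa(perfect))
```

A value of 1.000 means every rater assigned the same label to every item for that mode. It says nothing about agreement on other modes.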

The discipline that makes this credible: of 13 hooks conceptually mapped to MAST modes, only no-vibes produces measurable F1 > 0 at the trace-level baseline. The other 12 are documented honestly as "conceptually mapped, no measured signal yet" rather than overclaimed. Honest demotion compounds.

Open-source highlights

Repo	What it does
`llm-dark-patterns`	Apache-2.0 suite of 28 hooks for runtime enforcement against LLM dark patterns at the Claude Code Stop event. The judge is out-of-band bash + jq and deterministic, so the model that produced the closeout cannot rewrite the verdict from inside its own output.
`agent-closeout-bench`	Rust YAML rule-pack engine + 175 fixtures + 73 tests. Powers the MAST-EVAL + DarkBench v1.5-rust head-to-head.
`crewai-no-vibes`	Pure-Python port of the `evidence_claims` rule pack as a CrewAI Task.guardrails function. Apache-2.0, zero dependencies, on PyPI.
`minmaxing`	Production Claude Code harness — `/opusworkflow`, governed model profiles, 5-tier memory, parallel orchestration.
`claude-plugins`	Self-hosted Claude Code plugin marketplace. Bypasses the stalled Anthropic community pipeline.
`no-vibes`, `no-sycophancy`, `no-cliffhanger`	Standalone single-file Stop hooks for users who want one hook without the suite.
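
To make the guardrail idea in the `crewai-no-vibes` row concrete, here is a rough sketch of what a dependency-free guardrail function can look like. It assumes the CrewAI convention of a callable that receives the task output (an object with a `raw` text attribute) and returns a `(passed, payload)` tuple. The two regexes are hypothetical stand-ins and are not the actual `evidence_claims` rule pack; `evidence_guardrail` is likewise an illustrative name.

```python
import re
from types import SimpleNamespace

# Hypothetical rules for illustration only (not the real evidence_claims pack).
CLAIM = re.compile(
    r"\b(tests? pass(ed|es)?|verified|works as expected|all green)\b", re.I)
EVIDENCE = re.compile(r"(```|exit code \d+|\b\d+ passed\b)", re.I)


def evidence_guardrail(output):
    """Reject output that claims verification but shows no evidence."""
    text = getattr(output, "raw", str(output))
    if CLAIM.search(text) and not EVIDENCE.search(text):
        return (False, "Verification claimed without evidence; "
                       "include the command output that supports it.")
    return (True, text)


# Stand-in for a task output object; only the `raw` attribute is used.
unsupported = SimpleNamespace(raw="All tests pass, shipping.")
supported = SimpleNamespace(raw="Tests passed.\n```\n12 passed in 0.4s\n```")

print(evidence_guardrail(unsupported)[0])
print(evidence_guardrail(supported)[0])
```

In CrewAI versions that accept a guardrail callable on `Task`, a function of this shape would be passed when constructing the task. Because the check is plain regex matching, the same input always produces the same verdict.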

Research anchors

MAST — Cemri et al., NeurIPS 2025 Datasets & Benchmarks spotlight
DarkBench — Kran et al., ICLR 2025
DarkPatterns-LLM
ELEPHANT sycophancy taxonomy — Cheng et al.
ACM IUI 2025 false-memory recall

Where to find me

restlessmachine.com — personal site
waitdead.com — WAITDEAD brand
@flazzar1n on X
LinkedIn
fernando@waitdead.com

8+ years RevOps · 50+ audits · 13 industries · 9 CRM platforms. Operated from Mendoza, Argentina under SmoothRevenue LLC (Wyoming).
