claude-skill

Validador de progresión de stage para Salesforce

Dificultad

intermedio

Tiempo de setup

60min

Para

revops

RevOps

Stack

Un Claude Skill que audita qué oportunidades de Salesforce cumplen genuinamente los criterios de salida del stage al que acaban de moverse. Para cada opp que progresó la semana anterior, el Skill verifica las reglas determinísticas (campos requeridos, actividades registradas, roles de stakeholders), y luego cruza las afirmaciones cualitativas del rep contra las transcripciones de calls de Gong. El output es una cola de coaching para la revisión semanal de RevOps, no un mecanismo de enforcement que tire los deals atrás automáticamente.

El bundle del artifact se entrega en apps/web/public/artifacts/stage-progression-validator-skill/ y contiene SKILL.md más tres templates de referencia: references/1-stage-criteria-template.md (la rúbrica de stages de tu equipo), references/2-methodology-mapping-template.md (cómo MEDDPICC, MEDDIC, SPICED, BANT, o un framework custom se mapea a tus campos de Salesforce y a patrones de frases en Gong), y references/3-sample-output-format.md (el Markdown exacto que el Skill emite).

Cuándo usarlo

Córrelo con la cadencia de tu reunión de forecast. El patrón canónico es un batch del domingo en la noche con clave week_ending, dejando el reporte en un canal de Slack antes del huddle de managers del lunes en la mañana. El modo single-opp también es válido — un revisor del deal desk puede correr el Skill contra un solo Opportunity.Id antes de una reunión de aprobación de pricing, o un manager puede correrlo contra un deal antes de un 1:1 para aterrizar la conversación en los gaps específicos en lugar de en una sensación vaga de “esto se siente atascado”.

El check de la afirmación cualitativa es la parte que paga sola. Salesforce ya hace cumplir las validation rules de campos requeridos; lo que no puede hacer es notar que el rep declaró “el buyer aceptó los criterios de éxito” y que ninguna call de Gong de los últimos 45 días capturó realmente esa conversación. El Skill es methodology-aware al buscar — para el economic buyer de MEDDPICC, busca el nombre del buyer dentro de doce tokens de lenguaje de decisión (“aprobar”, “firmar”, “dueño del presupuesto”) en lugar de cualquier mención del nombre. Esa distinción es lo que separa un flag útil de un falso positivo que los reps aprenden a ignorar.

Cuándo NO usarlo

Auto-rollback. No conectes el output del Skill a un update de Salesforce que degrade deals cuando el veredicto sea fail. El veredicto es un input entre varios; el manager es dueño de la decisión de degradar con el contexto completo que el Skill no puede ver (reuniones fuera de Gong, compromisos por side channel, particularidades del procurement del lado del cliente).
Performance management. Un solo fail en un solo deal es ruido. La señal son los patrones a lo largo de semanas — el rep cuya tasa de fail sube de 5% a 30% en un trimestre mientras sus pares se mantienen estables. Usar un veredicto puntual en un PIP rompe la confianza del rep y el Skill deja de funcionar.
Inputs de comp. El stage maneja el forecast, a veces maneja accelerators. Si el output del validador entra en cálculos de comp, creaste un incentivo directo para que los reps manipulen los inputs — rechazar la grabación de Gong, omitir notas, guardar datos en spreadsheets paralelos. Mantén el output del validador en el canal de coaching y fuera del pipeline de comp.
Stages sin rúbrica documentada. Si references/1-stage-criteria-template.md no tiene entrada para el stage validado, el Skill emite needs_methodology en lugar de adivinar. No “tunees” el Skill para calificar esos stages con un default — arregla la rúbrica.
Equipos que no guardan nada estructurado. Un equipo corriendo MEDDPICC en slides y no en Salesforce va a fallar cada check cualitativo. Corre el Skill en modo dry-run durante dos semanas; si más del 40% de las opps cae en needs_methodology o saca menos de 0.2 en checks cualitativos transversalmente, el doc de methodology mapping es ficción. Arregla el doc o instrumenta los campos faltantes antes de salir live.

Setup

Documenta los stages. Abre references/1-stage-criteria-template.md y reemplaza el contenido del template con la rúbrica real de tu equipo, stage por stage. Cada stage tiene tres buckets de reglas: field_rules (un campo de Salesforce debe tener un valor distinto al default), activity_rules (una actividad registrada de un tipo específico debe existir dentro de una ventana de recencia), y stakeholder_rules (OpportunityContactRole debe incluir un contacto con un rol que matchee una regex). Marca los campos como evidence_required: gong cuando quieras un cruce contra la transcripción de Gong sobre la afirmación cualitativa.
Mapea la metodología. Edita references/2-methodology-mapping-template.md para que coincida con el framework de tu equipo. El archivo viene con ejemplos trabajados para MEDDPICC, MEDDIC y SPICED — copia el que aplique y ajusta los nombres de campos de Salesforce a los API names reales de tu org. La columna de patrones de frases es la que le dice al Skill qué cuenta como evidencia de Gong; no la dejes con el default del template a menos que tus campos realmente coincidan con los mappings de ejemplo.
Instala el Skill. Pon el bundle en ~/.claude/skills/stage-progression-validator/. Configura SFDC_TOKEN (solo lectura sobre Opportunity, OpportunityFieldHistory, Task, Event, OpportunityContactRole) y GONG_API_KEY (con scopes calls/extensive y deals). Solo lectura es el scope correcto; el Skill no debe escribir de vuelta a Salesforce.
Agenda el run semanal. Un cron simple alcanza — claude run stage-progression-validator week_ending=$(date -d 'sunday' +%F) los domingos a las 22:00. Pipea el output a tu canal de Slack o a un email de digest semanal.
Acompáñalo con un ritual de coaching. La cola de veredictos es inútil si nadie la abre. Slot fijo de 30 minutos el lunes, el manager recorre las filas fail y needs_manager_review con cada rep. Después de ocho semanas, el volumen en esos buckets debería bajar — esa es la métrica de éxito.

Qué hace el skill realmente

Para cada progresión en la ventana, el Skill calcula dos scores. El score determinístico es la fracción de reglas de metodología satisfechas — cinco reglas, tres pasan, el score es 0.6. Esto es rúbrica estructurada por diseño en lugar de lenguaje natural libre: los criterios libres fuerzan al modelo a interpretar casos límite de manera inconsistente entre runs y los reps no pueden predecir qué va a disparar un fail, lo cual mata la confianza de la que depende la herramienta.

El score cualitativo es la fracción de afirmaciones evidence_required: gong que encuentran evidencia de soporte en la transcripción dentro de la ventana relevante. El matching de frases es methodology-aware. Para el economic buyer de MEDDPICC, el Skill busca el nombre del buyer dentro de doce tokens de lenguaje de decisión. Para el critical event de SPICED, busca lenguaje de urgencia acotado por fecha con verbos de consecuencia (“perder”, “deslizar”, “arriesgar”) cerca. Un check ingenuo de “cualquier mención del nombre del buyer cuenta” produce demasiados falsos pases — el rep mencionando al buyer de pasada en una call con otro stakeholder no es evidencia del compromiso del buyer.

Los dos scores se combinan en uno de cinco veredictos: pass (ambos en 1.0), flag (un bucket fuerte, el otro débil), fail (ambos por debajo del umbral límite, default 0.6), needs_manager_review (la banda fronteriza entre flag y fail — ningún score claramente malo ni claramente bueno), o needs_methodology (la rúbrica no tiene entrada para este stage). El bucket needs_manager_review existe porque forzar cada deal fronterizo a un binario flag versus fail produce ruido que los reps aprenden a descartar; las filas fronterizas van a una cola separada que el manager resuelve a mano, lo que preserva la señal en los otros buckets.

Realidad de costos

Claude Sonnet 4 al pricing actual sale aproximadamente entre 15-25 centavos por oportunidad validada, dominado por la lectura de transcripciones de Gong (una ventana típica de 30 días cubre 4-8 calls por deal activo a 5-15K tokens cada una, más unos cientos de tokens de rúbrica de metodología cargados desde references). Un batch semanal de 50 deals cuesta alrededor de 7-12 USD en API spend.

El tiempo ahorrado es el caso a favor del Skill. Un lead de RevOps haciendo esta auditoría a mano gasta 20-30 minutos por deal — sacando el historial de stage, abriendo cada call de Gong, escaneando por el nombre del buyer y la conversación de criterios de éxito. A 50 deals son dos días completos de auditoría manual por semana, razón por la cual casi ningún equipo realmente lo hace. El Skill colapsa eso a un pase de revisión del reporte de 4-6 minutos sobre el digest, con inspección más profunda solo en las filas de los buckets fail y needs_manager_review — típicamente 5-10 deals de 50, así que 30-60 minutos de revisión enfocada. Neto: 12-15 horas de RevOps por semana de vuelta, por menos de 15 USD en costo de API.

Métrica de éxito

Trackea dos métricas a lo largo de una rampa de ocho semanas. Primero, la tasa de fail — la proporción de progresiones semanales que aterrizan en fail. Una rampa sana muestra una caída desde un baseline (frecuentemente 25-40% en el primer run) a menos de 10% a medida que los reps internalizan lo que la rúbrica exige antes de avanzar un deal. Si no baja, o la rúbrica es demasiado estricta (los reps físicamente no pueden satisfacerla sin conversaciones con el buyer para las que el deal aún no está listo) o el loop de coaching no está pasando. Segundo, la edad mediana en el stage inmediatamente anterior al gate más estricto. Si esa edad se infla — o sea, los reps están estacionando deals un stage por debajo de su realidad para esquivar el gate — la rúbrica está equivocada, no los reps. Afloja la rúbrica antes de seguir corriendo el Skill.

vs alternativas

Validation rules de Salesforce — hacen cumplir la presencia de campos a nivel de registro (no puedes guardar una opp en Stage 4 sin Economic_Buyer__c poblado). No pueden hacer el check cualitativo: un rep puede escribir cualquier nombre en el campo, las validation rules pasan, el Skill detecta que ninguna call de Gong soporta la afirmación. Las validation rules también son una herramienta tosca porque rechazan el save de cuajo; el Skill produce un veredicto graduado con el que el manager trabaja.
Clari, Gong Forecast, y herramientas similares de AI-forecasting — hacen validación de stage como parte de una superficie de producto mucho más grande (forecast, deal review, conversation analytics, coaching). Espera entre 50-150 USD por rep por mes versus el costo de API aproximado de 10-15 USD por semana de este Skill. Elige la plataforma si también necesitas su capa de forecasting y conversation analytics; elige este Skill si tu gap es específicamente la auditoría de progresión de stage y ya tienes Salesforce y Gong.
Revisiones manuales de deal desk — un lead de RevOps humano leyendo cada progresión. La herramienta correcta para equipos enterprise de high-ACV donde los deals son pocos y trascendentes. Herramienta equivocada para SMB o midmarket de volumen donde el costo de la auditoría (12-15 horas por semana) significa que no pasa para nada y las malas progresiones se cuelan al forecast.
No hacer nada — el baseline real en la mayoría de los equipos. La precisión del forecast en la mayoría de las orgs B2B SaaS está entre mediocre y vergonzosa precisamente porque los stages sobre los que se construye el forecast no están validados. El costo de no hacer nada aparece en la reacción del CFO frente a un cierre de trimestre malo, que es un peor momento para descubrir que los datos de input no eran confiables.

Cosas para cuidar

Una validación excesivamente estricta empuja a los reps a manipular los stages. Guardrail: instrumenta la edad mediana del stage inmediatamente anterior al gate más estricto. Si se dispara después de que el Skill sale live, la rúbrica está mal; afloja antes de continuar.
Mismatch de metodología entre slides y Salesforce. Guardrail: corre en dry-run dos semanas. Si needs_methodology más scores cualitativos bajos cubre más del 40% de las opps, arregla el methodology mapping o la instrumentación de campos subyacente antes de tratar cualquier veredicto como accionable.
Drift del validador respecto a los criterios reales de salida. Los líderes de ventas redefinen silenciosamente el significado de los stages en QBRs; el archivo de rúbrica no se actualiza. Guardrail: la rúbrica lleva un campo last_reviewed; el Skill antepone un warning a cada reporte cuando la fecha es mayor a 90 días.
Gaps en la cobertura de grabación de Gong parecen deshonestidad del rep. Guardrail: el archivo de methodology mapping declara un recording_coverage_floor por stage. Los deals por debajo del piso aterrizan en needs_manager_review con el gap de cobertura surfaceado explícitamente, no en fail.
Pushback del rep ante un veredicto de fail. Guardrail: el reporte incluye textualmente los misses de reglas determinísticas y los patrones de frases no matcheados. La conversación se aterriza en el gap específico, que el rep puede arreglar actualizando el campo y volviendo a correr, o rebatir con evidencia fuera de Gong que el manager acepte.

Stack

Salesforce — historial de stage, campos del deal, contact roles, actividades registradas
Gong — transcripciones de conversaciones grabadas, listas de calls a nivel deal
Claude (Sonnet 4) — matching de frases methodology-aware contra transcripciones, síntesis de veredictos
Cron / scheduler de tu elección — el trigger semanal
Slack o email — el canal de digest donde aterriza el reporte antes del huddle del manager

Editar esta página en GitHub

Archivos de este artefacto

Descargar todo (.zip)

---
name: stage-progression-validator
description: Validate that a Salesforce opportunity genuinely meets its claimed stage's exit criteria. For each opp that progressed in a window, the skill checks deterministic field rules, cross-references rep-claimed milestones against Gong call evidence, and emits a pass/flag/fail verdict with the specific gap. Designed as a coaching trigger for RevOps weekly reviews, not as an enforcement gate.
---

# Stage progression validator

## When to invoke

Whenever you need to audit deals that progressed between Salesforce stages and want to know which progressions were buyer-driven versus rep-optimistic. Typical cadence: a weekly batch keyed to the forecast meeting (run Sunday night, review Monday morning). Also valid: a one-shot run on a single opportunity ID before a deal-desk review or before a manager 1:1.

Take an `Opportunity.Id` (single mode) or a window expressed as `week_ending=YYYY-MM-DD` (batch mode), plus a path to the methodology rubric. Produce a structured Markdown report with a row per progression and a verdict per row.

Do NOT invoke this skill for:

- **Auto-stage rollback.** The skill emits verdicts; it must not write back to Salesforce. A "fail" verdict is a coaching input, not an instruction to demote the deal — that decision is the manager's, with rep context the skill cannot see.
- **Performance management of reps.** Verdicts are noisy at the per-deal level and only meaningful as patterns over weeks. Using a single "fail" in a PIP is misuse and will collapse rep trust in the tool.
- **Comp implications.** Stage assignments drive forecast, sometimes accelerators. Routing this skill's output into comp calculations creates a direct incentive for reps to game the validator (refusing Gong recording, omitting rep notes, etc.). Keep this output separate from comp data flows.
- **Deals in stages without documented exit criteria.** Garbage in, garbage out. If the methodology doc has no rubric for the stage being validated, return `needs_methodology` rather than guessing a verdict.

## Inputs

- Required: `opp_id` OR `week_ending` — single opportunity or a Sunday-anchored ISO date for the batch window
- Required: `methodology_path` — path to the team's stage exit-criteria rubric (see `references/stage-criteria-template.md`)
- Required: `sfdc_token` — Salesforce session token with read on `Opportunity`, `OpportunityFieldHistory`, `Task`, `Event`, `OpportunityContactRole`
- Required: `gong_api_key` — Gong key with `calls/extensive` and `deals` scopes
- Optional: `methodology_mapping` — path to a methodology-mapping doc if the team uses MEDDPICC, MEDDIC, SPICED, or a custom framework (see `references/methodology-mapping-template.md`)
- Optional: `borderline_threshold` — float in `[0, 1]`, default `0.6`. Verdicts where the deterministic-criteria score falls between the threshold and `1.0 - threshold` are emitted as `needs_manager_review` rather than `flag`/`fail`.

## Reference files

Always read these from `references/` before scoring. Without them, the verdicts collapse to checking Salesforce required-field logic, which Salesforce itself already enforces.

- `references/stage-criteria-template.md` — the team's stage-by-stage exit criteria. Replace the template contents with the team's real rubric.
- `references/methodology-mapping-template.md` — maps the team's chosen sales methodology (MEDDPICC, MEDDIC, SPICED, BANT, custom) onto fields in Salesforce. The skill uses this to know which field holds the economic-buyer name, which holds the metric, etc.
- `references/sample-output-format.md` — the exact Markdown format for the report. The renderer downstream (Slack digest, email) parses this format.

## Method

Run the steps in order. Steps 3 and 4 are where the engineering choices matter; do not skip them.

### 1. Pull the candidate set

For batch mode, query `OpportunityFieldHistory` where `Field = 'StageName'` and `CreatedDate` falls inside the window. For single mode, query the same table filtered to the supplied `opp_id` and take the most recent `StageName` change. Skip progressions where the new stage has no entry in the methodology rubric — emit those as `needs_methodology`, not as `fail`.

### 2. Score deterministic criteria

For each candidate, compute a deterministic score in `[0, 1]` from the methodology rubric. Each rule in the rubric is one of three types:

- **Field rule** — a Salesforce field must hold a non-default value (e.g. `Economic_Buyer__c IS NOT NULL`).
- **Activity rule** — a logged activity of a specified type must exist in the prior 30 days (e.g. `Task.Type = 'Demo'`).
- **Stakeholder rule** — `OpportunityContactRole` must contain a contact with a role matching a regex (e.g. `Role MATCHES /^(VP|Director|C.+O)/`).

The score is the fraction of rules satisfied. This is structured-rubric, not free-form, by design: free-form natural-language criteria force the skill to interpret edge cases inconsistently across runs and produce verdicts that reps cannot predict or trust.

### 3. Cross-reference qualitative claims with Gong

The methodology mapping flags certain fields as `evidence_required: gong`. For each such field that holds a non-default value, the skill must find a Gong call within 30 days where the relevant phrase appears in the transcript.

Phrase matching is methodology-aware, not methodology-agnostic. For MEDDPICC's `Economic Buyer`, the skill searches transcripts for the buyer's name within 12 tokens of decision-language ("approve", "sign off", "budget owner", "final say"). For SPICED's `Critical Event`, it searches for date-bounded urgency language. The mapping doc names the phrase patterns per field — if the mapping says `evidence_required: gong` but provides no patterns, the skill emits `needs_methodology` rather than guessing what counts as evidence.

Why methodology-aware: a generic "look for any mention of the buyer name" check produces too many false passes (the rep mentioning the buyer in a call to a different stakeholder is not evidence of buyer commitment).

### 4. Combine scores into a verdict

Let `D` be the deterministic score from step 2 and `Q` be the fraction of qualitative claims with Gong evidence from step 3. Combine:

- `pass` — `D == 1.0` and `Q == 1.0`
- `flag` — `D >= 0.8` or `Q >= 0.8`, but not both at `1.0`
- `fail` — `D < borderline_threshold` and `Q < borderline_threshold`
- `needs_manager_review` — neither `pass`, `flag`, nor `fail`. The deal sits in the borderline band where false positives and false negatives both have non-trivial cost.

The `needs_manager_review` band exists because the alternative — forcing a binary `flag` versus `fail` on every borderline deal — produces noise that reps learn to dismiss. The borderline bucket goes to a separate queue that the manager hand-resolves, which preserves the signal in the `flag` and `fail` queues.

### 5. Emit the report

Write the report to stdout in the exact format from `references/sample-output-format.md`. Include the deterministic-rule misses verbatim (which rule failed) and the qualitative-claim misses with the field name and the phrase pattern that did not match. Do not paraphrase Salesforce field names or rep notes — the manager will compare the report against the Salesforce UI.

## Output format

```markdown
# Stage progression validation — week ending 2026-05-02

Window: 2026-04-26 → 2026-05-02
Opportunities scored: 18
- pass: 9
- flag: 4
- fail: 3
- needs_manager_review: 2
- needs_methodology: 0

## fail (3)

### Acme Corp — Stage 4 Negotiation
- Owner: jane.doe@example.com
- Progressed: 2026-04-29
- Deterministic score: 0.40 (2 of 5 rules satisfied)
- Qualitative score: 0.00 (0 of 2 claims supported)

Deterministic misses:
- `Economic_Buyer__c` is NULL
- `Decision_Criteria__c` is NULL
- `OpportunityContactRole` has no role matching `/^(VP|Director|C.+O)/`

Qualitative misses:
- `Economic_Buyer__c` claim: no Gong call in last 30 days references claimed buyer "Pat Ellis" within 12 tokens of decision-language pattern
- `Success_Criteria__c` claim: no Gong call in last 30 days contains success-criteria pattern

### {next fail row}
...

## flag (4)
...

## needs_manager_review (2)
...

## pass (9)
| Opp | Owner | New stage | Deterministic | Qualitative |
|---|---|---|---|---|
| ... | ... | ... | 1.00 | 1.00 |
```

## Watch-outs

- **Over-strict validation pushes reps to game stages.** If the rubric demands more than reps can plausibly satisfy without a buyer conversation that isn't yet warranted, reps will park deals one stage below their reality. Guard: instrument a "stage age" metric; if median stage age in the stage just before the strict gate balloons after the skill ships, the rubric is wrong, not the reps. Tune the rubric down before keeping the skill running.
- **Methodology mismatch.** A team that runs MEDDPICC in slides but stores nothing structured in Salesforce will fail every qualitative check. Guard: run the skill in `dry_run` mode for two weeks first; if more than 40% of opps emit `needs_methodology` or score `Q < 0.2` across the board, the methodology mapping doc is fictional — fix the doc or instrument the missing fields before going live.
- **Validator drift from real exit criteria.** Sales leaders quietly change what "Stage 3" means in QBRs; the rubric file does not get updated. Guard: append a `last_reviewed` field at the top of `references/stage-criteria-template.md` and have the skill emit a warning at the top of every report if `last_reviewed` is more than 90 days old. Stale rubrics produce confidently wrong verdicts, which is worse than no verdicts.
- **Gong recording-coverage gaps look like rep dishonesty.** Some calls genuinely happen off-Gong (in-person meetings, customer-side dial-in policies). Guard: the methodology mapping must include a `recording_coverage_floor` per stage; if a deal's recorded-call count is below the floor, emit `needs_manager_review` and surface the coverage gap explicitly rather than emitting `fail`.
- **Single-deal rage at a `fail` verdict.** A "fail" on a deal a rep is confident in will trigger pushback. Guard: the report must include the deterministic-rule misses and the unmatched phrase patterns verbatim. The rep can then either (a) update the field/log the activity and re-run, or (b) point to off-Gong evidence the manager accepts. Either way, the conversation is grounded in the specific gap, not in the verdict label.

# Stage exit-criteria rubric — TEMPLATE

> Replace this template's contents with the team's real stage-by-stage rubric.
> The stage-progression-validator skill reads this file on every run.
> Without your real rules, the verdicts are meaningless.

## Last reviewed

YYYY-MM-DD — bump this date every time the rubric is materially changed. The skill warns at the top of the report if this date is more than 90 days old.

## Methodology in use

One of: `MEDDPICC`, `MEDDIC`, `SPICED`, `BANT`, `custom`. Keep this string in sync with `methodology-mapping-template.md` so the skill loads the right phrase patterns.

## Stages

For each stage that the skill should validate, list rules under three buckets: `field_rules`, `activity_rules`, `stakeholder_rules`. Stages omitted from this file are emitted as `needs_methodology` rather than scored.

### Stage 2 — Discovery confirmed

field_rules:
- `Pain_Point__c IS NOT NULL`
- `Decision_Timeline__c IS NOT NULL`
- `Budget_Range__c IS NOT NULL`

activity_rules:
- `Task.Type = 'Discovery Call'` in last 30 days

stakeholder_rules:
- `OpportunityContactRole` includes a contact with role matching `/^(Manager|Director|VP)/`

evidence_required (qualitative — checked against Gong):
- `Pain_Point__c`
- `Decision_Timeline__c`

### Stage 3 — Solution validated

field_rules:
- `Success_Criteria__c IS NOT NULL`
- `Technical_Validation_Complete__c = true`
- `Decision_Criteria__c IS NOT NULL`

activity_rules:
- `Task.Type = 'Demo'` in last 45 days
- `Task.Type = 'Technical Deep Dive'` in last 30 days

stakeholder_rules:
- `OpportunityContactRole` includes a contact with role matching `/^(VP|Director)/`
- At least one contact with `Is_Technical_Buyer__c = true`

evidence_required (qualitative):
- `Success_Criteria__c`

### Stage 4 — Negotiation

field_rules:
- `Economic_Buyer__c IS NOT NULL`
- `Decision_Criteria__c IS NOT NULL`
- `Paper_Process__c IS NOT NULL`
- `Close_Plan__c IS NOT NULL`
- `Competitive_Landscape__c IS NOT NULL`

activity_rules:
- `Task.Type = 'Pricing Discussion'` in last 30 days

stakeholder_rules:
- `OpportunityContactRole` includes a contact with role matching `/^(VP|Director|C.+O)/`

evidence_required (qualitative):
- `Economic_Buyer__c`
- `Close_Plan__c`

### Stage 5 — Verbal commit

field_rules:
- `Verbal_Commit_Date__c IS NOT NULL`
- `Procurement_Engaged__c = true`
- `MSA_Status__c IN ('In review', 'Approved')`

activity_rules:
- `Task.Type = 'Procurement Call'` in last 21 days

stakeholder_rules:
- `OpportunityContactRole` includes one contact with role `Procurement`
- `OpportunityContactRole` includes one contact with role `Legal` if `MSA_Status__c = 'In review'`

evidence_required (qualitative):
- `Verbal_Commit_Date__c`

## Recording-coverage floor (per stage)

Minimum recorded calls in the prior 30 days for the deal. If the deal is below the floor, the skill emits `needs_manager_review` and surfaces the coverage gap rather than scoring qualitative checks.

| Stage | Min recorded calls in last 30 days |
|---|---|
| Stage 2 | 1 |
| Stage 3 | 2 |
| Stage 4 | 2 |
| Stage 5 | 1 |

# Methodology mapping — TEMPLATE

> Replace this template's contents with the team's real mapping. The skill uses
> this to translate methodology concepts (e.g. MEDDPICC's "Economic Buyer")
> into the Salesforce field that holds the answer and into Gong phrase patterns
> that count as supporting evidence.

## Methodology in use

`MEDDPICC` (replace if your team uses a different framework — see worked examples for MEDDIC, SPICED, and a custom framework below).

## MEDDPICC mapping (replace contents with team's real fields)

| MEDDPICC concept | Salesforce field | Evidence required | Phrase patterns |
|---|---|---|---|
| Metric | `Success_Metric__c` | gong | quantitative-language pattern (numbers, units, deltas) within 20 tokens of the field value |
| Economic Buyer | `Economic_Buyer__c` | gong | the buyer's name within 12 tokens of decision-language: `approve`, `sign off`, `budget owner`, `final say`, `the call is mine` |
| Decision Criteria | `Decision_Criteria__c` | none | n/a |
| Decision Process | `Decision_Process__c` | gong | step-language pattern: ordinal markers (`first`, `then`, `after that`) with named owners |
| Paper Process | `Paper_Process__c` | gong | procurement or legal entity name within 30 tokens of `MSA`, `redline`, `security review`, `vendor onboarding` |
| Identify Pain | `Pain_Point__c` | gong | the rep-claimed pain phrase or a synonym in customer's own voice (not the rep's) |
| Champion | `Champion__c` | gong | the named contact speaking on the customer's behalf in at least one call where the rep is mostly listening |
| Competition | `Competitive_Landscape__c` | none | n/a |

## MEDDIC mapping (worked example for teams on MEDDIC, not MEDDPICC)

Replace your `methodology in use` above with `MEDDIC` and use this table instead:

| MEDDIC concept | Salesforce field | Evidence required | Phrase patterns |
|---|---|---|---|
| Metrics | `Success_Metric__c` | gong | quantitative-language pattern |
| Economic Buyer | `Economic_Buyer__c` | gong | name within 12 tokens of decision-language |
| Decision Criteria | `Decision_Criteria__c` | none | n/a |
| Decision Process | `Decision_Process__c` | gong | step-language pattern |
| Identify Pain | `Pain_Point__c` | gong | pain phrase in customer's voice |
| Champion | `Champion__c` | gong | customer-led call segment |

## SPICED mapping (worked example)

| SPICED concept | Salesforce field | Evidence required | Phrase patterns |
|---|---|---|---|
| Situation | `Current_State__c` | none | n/a |
| Pain | `Pain_Point__c` | gong | pain phrase in customer's voice |
| Impact | `Quantified_Impact__c` | gong | quantified-cost language: currency or time units within 20 tokens of pain |
| Critical Event | `Critical_Event__c` | gong | date-bounded urgency: a specific date or quarter within 15 tokens of consequence-language (`miss`, `slip`, `risk`) |
| Decision | `Decision_Process__c` | gong | named decision steps with owners |

## Custom framework template

If the team uses a homegrown rubric, list each concept on its own row with the same four columns. The skill treats `Salesforce field` as the ground truth for "what was claimed" and the `Phrase patterns` as the ground truth for "what counts as supporting evidence in Gong."

| Custom concept | Salesforce field | Evidence required | Phrase patterns |
|---|---|---|---|
| {concept} | {field} | `gong` or `none` | {regex or natural-language phrase rule} |

## Last reviewed

YYYY-MM-DD

# Sample output format — REFERENCE

> The stage-progression-validator skill must emit the report in this exact
> format. Downstream renderers (Slack digest job, weekly email) parse this
> Markdown — keep section headings and ordering stable.

## Report header

```markdown
# Stage progression validation — week ending YYYY-MM-DD

Window: YYYY-MM-DD → YYYY-MM-DD
Methodology: MEDDPICC (rubric last reviewed YYYY-MM-DD)
Opportunities scored: N
- pass: N
- flag: N
- fail: N
- needs_manager_review: N
- needs_methodology: N
```

If the rubric `last_reviewed` is more than 90 days old, prepend a single line: `> WARNING: stage-criteria rubric last reviewed YYYY-MM-DD (over 90 days).`

## fail section

One block per failed deal. Order by deterministic-score ascending (worst first), tie-break by qualitative-score ascending.

```markdown
## fail (N)

### {Account name} — {New stage label}
- Opp ID: 006xxxxxxxxxxxxxxx
- Owner: owner@example.com
- Progressed: YYYY-MM-DD
- Deterministic score: D.DD (X of Y rules satisfied)
- Qualitative score: D.DD (X of Y claims supported)

Deterministic misses:
- `{field}` is NULL
- `OpportunityContactRole` has no role matching `/{regex}/`
- `Task.Type = '{type}'` not found in last {N} days

Qualitative misses:
- `{field}` claim: no Gong call in last 30 days matches pattern `{pattern_name}`
- `{field}` claim: no Gong call in last 30 days contains `{pattern}` near claimed value

Recording coverage: {N} recorded calls in last 30 days (floor: {M}).
```

## flag section

Same block format as `fail`. Order by combined score ascending.

## needs_manager_review section

Same block format. Add a one-line `Reason:` field naming why the deal landed in the borderline band — `low recording coverage`, `one rule short`, `mixed signal across deterministic and qualitative`, etc.

## needs_methodology section

```markdown
## needs_methodology (N)

| Opp | Owner | New stage | Reason |
|---|---|---|---|
| {Opp ID} | {owner} | {stage label} | no rubric entry for stage |
```

## pass section

Tabular, no per-deal block — passes are not interesting in the digest.

```markdown
## pass (N)

| Opp | Owner | New stage | Deterministic | Qualitative |
|---|---|---|---|---|
| {Opp ID} | {owner} | {stage label} | 1.00 | 1.00 |
```

## Footer

```markdown
---
Generated by stage-progression-validator skill at YYYY-MM-DDTHH:MM:SSZ
Inputs: methodology_path={path}, borderline_threshold={float}
```