Un Claude Skill que ingiere las últimas diez llamadas de Gong de un AE y entrega una nota de coaching con una forma fija de tres-cosas-funcionando / dos-cosas-para-apretar / un-ejercicio-específico. Construido para managers que quieren coaching semanal consistente pero no tienen diez horas a la semana para volver a escuchar llamadas. El skill escribe el draft; el manager edita, lo contextualiza con señales fuera de las llamadas y lo entrega en el siguiente uno-a-uno. Auto-envío está explícitamente fuera de scope.
Cuándo usarlo
Eres un sales manager con 4-10 AE reportando directo a ti. Quieres que cada rep reciba la misma profundidad de coaching cada semana, no solo los ruidosos con deals enredados. Tienes Gong, tu equipo tiene una rúbrica de llamadas y puedes apartar treinta minutos por semana por rep para revisar y editar el draft. El skill colapsa el loop de “escuchar diez llamadas más tomar notas más estructurar feedback” de ~3 horas por rep a ~30 minutos.
Úsalo semanalmente. Diario es demasiado (ningún rep cambia comportamiento en 24 horas). Mensual es demasiado raro (las llamadas que se citan están viejas y los patrones son más difíciles de recordar). La entrega del viernes en la tarde es el punto dulce — el AE tiene el fin de semana para absorberlo sin que se le coma su día de venta.
Cuándo NO usarlo
Performance Improvement Plans, procesos formales de HR, decisiones de comp o papeleo de terminación. Esto es una nota de coaching, no un registro de personnel. Los PIPs requieren involucramiento de HR, acuse firmado y un nivel de defensibilidad que este Skill no está diseñado para cumplir. El archivo references/03-escalation-criteria.md del bundle define las señales binarias que sacan a un rep del path de coaching por completo.
Un rep que no manejas directamente. El Skill verifica el manager-of-record contra Gong y se rehúsa si no hay match. La salida con el manager equivocado expondría contenido de llamadas que el invocador no debería ver — el modo de falla de mayor impacto que este Skill podría habilitar si no estuviera blindado.
Reps recién contratados en sus primeros 30 días de ramp. El coaching de ramp es su propia conversación: shadowing, role-play, scoring de certificación. Una nota semanal scoreada por rúbrica sobre diez llamadas delgadas produce ruido. Espera hasta que el rep tenga tres semanas de volumen real de llamadas con clientes.
Una semana con menos de tres llamadas usables. El Skill devuelve Insufficient call data en vez de rellenar. Dos llamadas no establecen un patrón; pretender que sí es sesgo retrospectivo lavado a través de markdown.
Setup
Pega el bundle. El Skill completo está en apps/web/public/artifacts/ae-rep-coaching-skill/SKILL.md junto con tres templates de referencia. Copia el directorio a ~/.claude/skills/ae-rep-coaching/ (o al .claude/skills/ a nivel de proyecto de tu equipo) para que Claude Code lo recoja.
Autentica Gong. API key con acceso de lectura a las llamadas y transcripts del AE. El Skill respeta el modelo de permisos de Gong; si las llamadas del AE están restringidas, el Skill no las puede ver, lo cual es el comportamiento correcto.
Reemplaza los templates con tus artefactos reales. El bundle viene con tres archivos de referencia placeholder. Cada uno es genérico hasta que lo llenes con el contenido real de tu equipo:
references/01-coaching-rubric-template.md — tu rúbrica por tipo de llamada (discovery, demo, negociación, cierre). El piloto viene con cinco criterios por tipo como forma de arranque.
references/02-coaching-note-format.md — la forma exacta en Markdown que usa cada nota semanal. El formato fijo es deliberado para que el AE pueda escanear qué cambió semana-a-semana.
references/03-escalation-criteria.md — las señales binarias que hacen que el Skill se rehúse a escribir una nota de coaching y en su lugar levante un flag de conversación de desempeño.
Configura el rep ID y el rango de fechas. El default son las últimas diez llamadas o los últimos catorce días, lo que sea más corto. Tope la ventana en treinta días; las llamadas más viejas reflejan una versión distinta del rep.
Decide la cadencia y el canal de entrega. Semanal, viernes en la tarde, en el siguiente 1:1 o como un DM que el rep pueda leer antes del 1:1. Consigue el buy-in del AE sobre el canal antes de que aterrice la primera nota en frío.
Lo que el skill hace en realidad
Seis pasos, en orden, sin paralelización:
Verifica el manager-of-record. Rechazo duro si el usuario que invoca no es el manager directo del rep en Gong.
Jala llamadas recientes filtradas por duración (≥5 min), número de asistentes (≤5 externos) y calidad de audio. Detente con menos de tres llamadas usables.
Clasifica cada llamada como discovery / demo / negociación / cierre. La clasificación por llamada (no una asunción general) es la decisión de ingeniería que previene el modo de falla más ruidoso del arranque: aplicar una rúbrica de discovery a una llamada de negociación.
Scorea contra la sección de rúbrica que coincida con una cita del transcript por cada score. Sin score sin cita — esta protección descarta feedback de instinto disfrazado de análisis.
Corre el check de escalación contra 03-escalation-criteria.md. Si cualquier criterio se activa (tergiversación de pricing, tono hostil, promesas de side-deal, etc.) el Skill se detiene y devuelve el bloque de escalación en vez de la nota de coaching.
Agrega en tres / dos / uno con un render de formato fijo. Los patrones requieren ≥2 llamadas que los respalden cada uno; el Skill devuelve un solo “apretar” en lugar de dos cuando solo uno está soportado. Nunca rellena.
El ejercicio al final es el output más importante. “Mejora el discovery” es inútil. “En tus próximas tres llamadas de discovery, haz la pregunta de presupuesto antes del minuto veinte” es accionable, observable en Gong la siguiente semana y auditable. El Skill está prompteado con dureza para hacer el ejercicio concreto y medible.
Realidad de costos
Por nota de coaching (un rep, diez llamadas, ~10k tokens de transcript por llamada después de filtrar, Claude Sonnet 4.5):
~120k tokens de input (transcripts + rúbrica + formato + referencia de escalación) → ~$0.36 al pricing actual de Sonnet.
~2k tokens de output (la nota misma) → ~$0.03.
~$0.40 por rep por semana, $1.60 por rep por mes.
Para un manager con ocho reps, son ~$13/mes en costo de tokens.
Tiempo ahorrado por manager por semana: alrededor de 2.5 horas por rep colapsa a 30 minutos (revisión + edición). Para ocho reps, son ~16 horas de regreso por semana. El piso realista está más cerca de ~10 horas de regreso una vez que cuentas el tiempo adicional de 1:1 hablando la nota con cada rep — que es justamente el punto.
El costo de Gong no es incremental; si ya tienes Gong (la dependencia dura del Skill) el acceso a la API viene incluido.
Métrica de éxito
Mira un solo número durante un trimestre: el porcentaje de 1:1s semanales en los que el AE menciona sin que se le pregunte el ejercicio de la semana anterior. Si ese número no está por encima de 50% para la semana 6, los ejercicios no son lo suficientemente específicos — regresa y aprieta el wording en 02-coaching-note-format.md para que “ejercicio” no pueda colapsar a “consejo general”.
Señales secundarias (más lentas, más ruidosas): movimiento del close-rate en el tipo de llamada específico en el que se está coacheando al rep, tiempo de ramp para reps nuevos una vez que el loop está establecido, calidad del 1:1 calificada por el manager.
vs alternativas
Notas escritas por el manager desde cero. Mejor fidelidad si el manager realmente lo hace. La trampa es la consistencia: bajo carga, las notas escritas por el manager o se saltan una semana, se reducen a “buen trabajo, sigue así”, o se escriben para el rep con el deal en problemas mientras los performers estables se quedan sin coaching. El Skill no produce mejores notas que un gran manager con tiempo infinito; produce mejores notas que el mismo manager bajo carga realista.
Los scorecards y features de coaching nativos de Gong. Gong scorea llamadas individuales contra una rúbrica y hace visibles tendencias agregadas. Útil, complementario y más barato que este skill si solo necesitas scoring. Lo que no hace bien es sintetizar entre llamadas en la forma tres / dos / uno con un ejercicio específico. Puedes apilar: usa los scorecards de Gong para calificar llamada por llamada, usa este Skill para la síntesis semanal.
Membrain, Spekit u otras plataformas dedicadas de coaching de ventas. Setup más pesado, licencia adicional, capacidad más amplia (librerías de skills, learning paths). La respuesta correcta para orgs de ventas enterprise grandes con headcount dedicado de enablement. La respuesta equivocada para un solo sales manager que solo quiere notas semanales que no se coman su viernes.
Status quo. “Voy a llegar al coaching después del pipeline review.” El pipeline review nunca se termina.
Cuídate de
Amplificación de sesgo. Scorear con rúbrica contra transcripts puede codificar sesgo del revisor — los reps verbosos scorean más alto en “rapport”, los hablantes no nativos de inglés scorean más bajo en “flujo de discovery”, los reps con clientes más ruidosos scorean más alto en “pide reacción” incluso cuando el rep no hizo nada. Protección: cada score requiere una cita del transcript, no un vibe; la rúbrica se revisa trimestralmente con el equipo completo en lugar de mantenerla un solo manager en aislamiento; el Skill avisa cuando la rúbrica tiene más de 90 días. Ver apps/web/public/artifacts/ae-rep-coaching-skill/references/01-coaching-rubric-template.md.
Fuga de datos por manager equivocado. Un manager jala una nota de coaching de un rep que no maneja y termina leyendo contenido de llamadas que no debería ver. Protección: el paso 1 del Skill verifica el manager-of-record contra el modelo de permisos de Gong antes de cargar cualquier transcript. Rechazo duro en mismatch — sin output parcial, sin sugerencia de workaround.
Rúbrica vieja. Una rúbrica escrita hace 18 meses para discovery outbound en frío aplicada a demos inbound product-led de hoy produce feedback irrelevante que el AE silenciosamente deja de leer. Protección: cada archivo de rúbrica carga una fecha last_edited; el Skill antepone un warning a la nota de coaching cuando la sección de rúbrica que coincide tiene más de 90 días, y una revisión trimestral de la rúbrica es parte del plan de rollout, no opcional.
Calibración de tono. Las notas de coaching que se leen como una advertencia escrita se ignoran o escalan la relación. Protección: el Skill enforza la voz de “peer de confianza” en 02-coaching-note-format.md, el check de escalación en el paso 5 saca señales de conversación de desempeño del path de coaching por completo, y el plan de rollout pide probar el tono con uno de tus AEs más fuertes antes de mandar uno en frío.
Dependencia del manager-of-record. Esta es una herramienta para el manager, no un reemplazo. El output es un draft. El manager edita, contextualiza con observaciones fuera de la llamada (historia del 1:1, etapa de ramp, contexto del deal que la llamada no mostró) y entrega en persona. El auto-envío está intencionalmente fuera del bundle.
Stack
Gong — fuente de llamadas, capa de transcript, verificación de manager-of-record, scorecards opcionales por llamada como capa complementaria
Claude (Sonnet 4.5 o superior) — scoring de rúbrica, check de escalación, síntesis en la forma tres / dos / uno
Rúbrica de coaching, formato de nota, criterios de escalación — los tres archivos de referencia en apps/web/public/artifacts/ae-rep-coaching-skill/references/ que convierten un genérico “scorea este transcript” en un loop de coaching que tu equipo realmente posee
---
name: ae-rep-coaching
description: Generate a weekly coaching note for a single AE from their last ten Gong calls. Output is a three-things-working / two-things-to-tighten / one-specific-exercise note that the manager edits and delivers — never auto-sent. Use weekly per direct report.
---
# AE rep coaching
## When to invoke
Invoke when a sales manager wants a structured coaching draft for one direct report based on recent call activity. Take a Gong rep ID and a date window as input and produce a Markdown coaching note grounded in specific call moments.
Do NOT invoke for:
- Performance Improvement Plans (PIPs), formal HR processes, or any document that becomes part of an employee's official record. This skill writes a coaching note, not a personnel file. PIPs require HR involvement, signed acknowledgment, and a defensibility bar this skill is not designed to meet.
- Compensation, promotion, or termination decisions. The skill scores call behaviors against a rubric. It does not assess overall contribution, ramp trajectory, or pipeline coverage.
- Peer-to-peer feedback (no manager-of-record context, no authorization to read calls).
- A rep the invoking user does not directly manage. The Skill checks the requesting user's manager-of-record status against the rep ID and refuses if there is no match. Wrong-manager data leakage is the highest-impact failure mode this skill could enable.
- Calls that are not internal sales activity (customer-success QBRs, partner calls, internal syncs miscategorized in Gong).
## Inputs
- Required: `rep_id` — the Gong user ID for the AE being coached.
- Required: `manager_id` — the Gong user ID of the invoking manager. Used to verify the manager-of-record relationship before any transcript is read.
- Optional: `window_days` — how far back to pull. Default 14. Cap at 30; older calls reflect a different version of the rep.
- Optional: `max_calls` — cap on number of calls analyzed. Default 10. Set lower (5-7) for high-volume SDRs to keep token cost bounded.
- Optional: `call_types` — restrict to one or more of `discovery|demo|negotiation|closing`. Default: all four.
## Reference files
Read all of the following from `references/` before generating the note. These are the user's own coaching artifacts. Without them, the output is generic feedback that any sales-coaching blog could produce.
- `references/01-coaching-rubric-template.md` — the per-call-type rubric the Skill scores against. Replace the template with your team's actual rubric.
- `references/02-coaching-note-format.md` — the literal Markdown format and tone the manager wants. The Skill matches this style rather than inventing one per run.
- `references/03-escalation-criteria.md` — the signals that mean "stop, this is not a coaching moment, this is a performance conversation." When any criterion fires, the Skill refuses to produce the coaching note and surfaces the criteria instead.
## Method
Run these steps in order. Do not parallelize — each step depends on data from the previous one, and the escalation check must run before any analysis is committed to the output.
### 1. Verify manager-of-record
Query Gong for the rep's manager-of-record. If it does not match `manager_id`, refuse the request and return:
```
Refused: <manager_id> is not the manager of record for <rep_id>. Coaching notes are written by the direct manager only.
```
This is a hard refusal. Do not produce a partial note, do not suggest workarounds. Wrong-manager output is the most damaging failure mode this Skill could enable.
### 2. Pull recent calls
Use `pull_recent_calls(rep_id, window_days, max_calls, call_types)`. Filter out:
- Calls under five minutes (no signal, mostly logistics).
- Calls with more than five external attendees (group dynamics drown out the rep's behavior).
- Calls flagged in Gong as `bad_audio` or `transcription_failed`.
If fewer than three usable calls remain after filtering, stop and return: `Insufficient call data: <N> usable calls in window. Extend window_days or wait for more activity.` Producing a coaching note from one or two calls is hindsight bias amplification.
### 3. Classify call type
For each remaining call, take the Gong stage tag if present. If absent, classify the call into `discovery|demo|negotiation|closing` based on transcript content. Engineering choice: classification is explicit and per-call rather than blanket because applying a discovery rubric to a negotiation call produces irrelevant scoring (the noisiest failure mode in early rollouts).
### 4. Score against rubric
For each call, load the matching rubric section from `01-coaching-rubric-template.md`. Score each criterion 1-5 with a specific transcript citation (timestamp + 1-2 sentence quote). No score without a citation — this guard rules out gut-feel feedback dressed up as analysis.
### 5. Run escalation check
Before aggregating, evaluate every criterion in `03-escalation-criteria.md` against the scored calls. If any criterion fires (e.g. repeated misrepresentation of pricing, deal terms invented on the fly, hostile tone toward customer), stop and return the escalation block instead of the coaching note. Coaching is the wrong intervention for these signals; a performance conversation with HR involvement is.
### 6. Aggregate into three / two / one
Across the scored calls, identify:
- **Three patterns working.** Behaviors that scored 4-5 on multiple calls. Cite at least two calls per pattern.
- **Two patterns to tighten.** Behaviors that scored 1-2 on multiple calls. Cite at least two calls per pattern. If only one weak pattern is supported by ≥2 calls, return one — never pad to two.
- **One specific exercise.** A concrete, measurable behavior change for the upcoming week, tied to the strongest "tighten" pattern.
### 7. Render the note
Use the format in `02-coaching-note-format.md` exactly. Engineering choice: the format is fixed (not regenerated per run) so the AE sees the same shape every week and can scan for what changed.
## Output format
```markdown
# Coaching note — {Rep name}, week of {YYYY-MM-DD}
Calls analyzed: {N} ({list of call types and dates})
Window: last {window_days} days
## Three things working
1. **{Pattern}.** Cited in: {Call A — timestamp}, {Call B — timestamp}.
"{short quote}"
2. **{Pattern}.** Cited in: {Call C — timestamp}, {Call D — timestamp}.
"{short quote}"
3. **{Pattern}.** Cited in: {Call E — timestamp}, {Call F — timestamp}.
"{short quote}"
## Two things to tighten
1. **{Pattern}.** Cited in: {Call G — timestamp}, {Call H — timestamp}.
"{short quote}". Why it matters: {one sentence linked to deal outcomes}.
2. **{Pattern}.** Cited in: {Call I — timestamp}, {Call J — timestamp}.
"{short quote}". Why it matters: {one sentence linked to deal outcomes}.
## One exercise for next week
On your next {N} {call type} calls, {specific measurable behavior}.
Success looks like: {observable outcome the manager can verify in Gong}.
---
Draft by ae-rep-coaching skill. Manager edits and delivers; this note
is not auto-sent.
```
## Watch-outs
- **Coaching is not a performance review.** A coaching note that reads like a written warning gets ignored or escalates the relationship. Guard: the prompt forces "trusted peer" voice and the escalation check in step 5 routes performance issues out of the coaching path entirely.
- **Wrong-manager data leakage.** If a manager pulls a coaching note on a rep they do not manage, the Skill exposes call content the invoker should not see. Guard: step 1 verifies manager-of-record against Gong's permission model before any transcript is loaded; hard refusal on mismatch.
- **Bias amplification.** Rubric scoring against transcripts can encode reviewer bias (verbose reps score higher; non-native English speakers score lower on "discovery rapport"). Guard: every score requires a transcript citation, not a vibe; the rubric is reviewed quarterly with the full team, not maintained by one manager in isolation.
- **Stale rubric.** A rubric written for cold outbound applied to warm inbound demos produces irrelevant feedback. Guard: each rubric file carries a `last_edited` date; the Skill prepends a warning to the coaching note if the matching rubric section is older than 90 days.
- **Hindsight bias.** Two calls do not establish a pattern. Guard: step 2 refuses to produce a note with fewer than three usable calls; "patterns" require ≥2 supporting calls each.
- **Manager-of-record dependency.** This is a tool for the manager, not a replacement. The output is a draft. The manager edits, contextualizes with off-call observations (1:1 history, ramp stage, deal context), and delivers in person. Auto-sending is explicitly out of scope.
# Coaching rubric — TEMPLATE
> Replace this template's contents with your team's actual rubric per
> call type. The ae-rep-coaching skill loads the matching section
> based on each call's classification. Without your real rubric, the
> coaching note reflects generic sales-coaching wisdom rather than
> what your team has decided "good" looks like.
## How to use this rubric
Each call type has 4-6 criteria. Each criterion is scored 1-5 against a transcript with a citation (timestamp + 1-2 sentence quote). The Skill aggregates scores across calls; this file defines what is being scored, not how the aggregation works.
Update `last_edited` at the bottom every time you change the rubric. The Skill warns the manager when the rubric is older than 90 days.
## Discovery rubric
| # | Criterion | What 1-2 looks like | What 4-5 looks like |
|---|---|---|---|
| 1 | Opens with explicit agenda | No agenda; jumps to demo or pitch | States agenda, asks for additions, gets explicit buy-in |
| 2 | Uncovers measurable pain | Asks "any pain points?" generically | Pain quantified — money, time, headcount, risk |
| 3 | Maps the buying committee | Talks only to the loudest person | Names roles + economic buyer, asks who else weighs in |
| 4 | Tests budget and timeline early | Avoids both, hopes for the best | Both surfaced before minute 25, no awkwardness |
| 5 | Confirms next step before ending | "We'll be in touch" | Specific next step + date + named owner on each side |
## Demo rubric
| # | Criterion | What 1-2 looks like | What 4-5 looks like |
|---|---|---|---|
| 1 | Demo is anchored to discovery | Generic feature tour | Tied to ≥3 pains surfaced in discovery, named explicitly |
| 2 | Talks less than the customer overall | Monologue, ratio > 70/30 rep | Ratio ~50/50 or customer-heavier |
| 3 | Asks for reaction every 5-7 min | Talks for 20 min uninterrupted | Asks "does this match what you described?" repeatedly |
| 4 | Surfaces objections proactively | Hopes objections do not come up | Names the likely objection before the buyer does |
| 5 | Closes with mutual action plan | "Let me know what you think" | Confirms next step, decision criteria, and decision date |
## Negotiation rubric
| # | Criterion | What 1-2 looks like | What 4-5 looks like |
|---|---|---|---|
| 1 | Anchors on value before price | Leads with discount | Recaps quantified pain before discussing terms |
| 2 | Trades, never gives | Gives discount unilaterally | Every concession paired with an ask (term length, refs, timing) |
| 3 | Confirms the real decision criteria | Assumes price is the issue | Surfaces and confirms the actual blocker |
| 4 | Multi-threads the close | Single contact carrying the deal | Engages economic buyer + at least one other |
| 5 | Documents agreement same-day | Verbal only | Recap email out within 24h with terms + next step |
## Closing rubric
| # | Criterion | What 1-2 looks like | What 4-5 looks like |
|---|---|---|---|
| 1 | Confirms the path-to-signature | Asks "are we good?" | Walks through procurement, legal, security as a sequence |
| 2 | Owns the close date | Lets the buyer set the timeline | Names the date and gets explicit agreement |
| 3 | Pre-empts last-minute asks | Surprised by procurement requests | Has surfaced the redlines and asks before this call |
| 4 | Maintains champion engagement | Champion goes silent in week 2 | Champion is co-steering with the rep, has internal coverage |
## Last edited
{YYYY-MM-DD}
# Coaching note format — TEMPLATE
> The ae-rep-coaching skill renders its weekly note in this exact
> shape. Adjust the wording (tone, signoff, emoji policy) to match
> how your team writes; do not change the section count or order
> without updating the Skill's `Output format` section in lockstep.
## Why a fixed format
The AE reads one of these every week. A fixed shape lets them scan for what changed week over week instead of re-parsing the structure each time. The same reason quarterly business reviews use a fixed deck template — the cognitive load belongs on the content, not the container.
## The literal format
```markdown
# Coaching note — {Rep first name}, week of {YYYY-MM-DD}
Calls analyzed: {N} ({list of call dates and types})
Window: last {window_days} days. Rubric version: {YYYY-MM-DD}.
## Three things working
1. **{Crisp one-line pattern name}.** Cited in: {Call name — HH:MM},
{Call name — HH:MM}.
> "{1-2 sentence transcript quote}"
What you did: {one sentence in trusted-peer voice}.
2. **{Pattern}.** Cited in: {Call — HH:MM}, {Call — HH:MM}.
> "{quote}"
What you did: {one sentence}.
3. **{Pattern}.** Cited in: {Call — HH:MM}, {Call — HH:MM}.
> "{quote}"
What you did: {one sentence}.
## Two things to tighten
1. **{Crisp one-line pattern name}.** Cited in: {Call — HH:MM},
{Call — HH:MM}.
> "{quote}"
Why it matters: {one sentence linked to deal outcome — cycle
length, conversion, expansion potential, churn risk}.
The shift: {one sentence describing the alternative behavior}.
2. **{Pattern}.** Cited in: {Call — HH:MM}, {Call — HH:MM}.
> "{quote}"
Why it matters: {one sentence}.
The shift: {one sentence}.
## One exercise for next week
On your next {N} {call type} calls, {specific measurable behavior —
e.g. "ask the budget question before minute 20" or "surface one
objection proactively before the buyer raises it"}.
Success looks like: {one sentence — observable signal in Gong the
manager will check next week, not "feel more confident"}.
---
Draft generated by the ae-rep-coaching skill from the last
{window_days} days of calls. Your manager edits this before
delivering. If anything in here does not match your read of the
calls, push back — the rubric serves the conversation, not the
other way around.
```
## Voice rules
- Trusted peer, not performance reviewer. Read aloud — if it sounds like a written warning, rewrite.
- Specific over flattering. "You opened with an agenda on three of four calls" beats "great job on agendas."
- No corporate hedging. "You did" / "tighten this" / "try this." No "you might consider perhaps."
- One exercise, not five. Five exercises is zero exercises.
## Last edited
{YYYY-MM-DD}
# Escalation criteria — TEMPLATE
> Replace this template's contents with the criteria your team has
> agreed mark a "stop coaching, start a performance conversation"
> moment. The ae-rep-coaching skill evaluates every criterion before
> producing the weekly note. If any criterion fires, the Skill
> refuses to render the coaching note and returns this list with the
> matching evidence instead.
## Why a hard separation
Coaching notes are formative, low-stakes, weekly. Performance conversations are summative, high-stakes, on-record, and involve HR. Confusing the two damages the rep both ways: small issues become existential, real issues get softened into "patterns to tighten" and go unaddressed. The Skill enforces the separation by refusing to write a coaching note when any of the criteria below fires.
## Criteria
Each criterion is binary (fires or does not). The Skill quotes the transcript evidence when reporting a fire, so the manager can verify before acting.
### 1. Misrepresentation of product capability
The rep claims a capability the product does not have, in a way a reasonable buyer would rely on. Example: "Yes, we support SOC 2 Type 2 out of the box" when the product has Type 1 only.
### 2. Misrepresentation of pricing or contract terms
The rep states pricing, discount authority, term length, or termination terms inconsistent with what is in the actual contract template or pricing book. One-off slips happen; a pattern across multiple calls is escalation.
### 3. Hostile or disrespectful tone toward the customer
Sarcasm, dismissiveness, raised voice, interrupting repeatedly. The Skill cites timestamps and quotes; the manager makes the judgment call on intent, but a pattern triggers the criterion regardless of intent.
### 4. Side deal or off-contract promise
The rep promises something — feature, timeline, credit, side letter — that is not part of the standard contract and was not approved by deal desk or the manager. This is a legal exposure issue, not a coaching issue.
### 5. Discovery of harassment, discrimination, or compliance issue
Anything that surfaces in a call (rep behavior or customer behavior) that triggers HR or legal review. The Skill never tries to write this up itself; it surfaces the timestamp and stops.
### 6. Visible signs of burnout or distress
Repeated mentions of being overwhelmed, audible distress on calls, patterns suggesting a mental-health concern. Coaching is not the right intervention; a 1:1 conversation, EAP referral, or workload review is.
## What the Skill returns when a criterion fires
```markdown
# Escalation — not a coaching moment
The ae-rep-coaching skill stopped before writing a coaching note for
{Rep name} because the following criterion fired:
**{Criterion name}**
Evidence:
- {Call name — HH:MM}: "{quote}"
- {Call name — HH:MM}: "{quote}"
Recommended next step: {appropriate path — HR conversation, deal
desk review, EAP referral, etc.}. Do not paste this output into a
performance document; it is a flag, not a finding. Verify the
evidence yourself before acting.
```
## Last edited
{YYYY-MM-DD}