n8n-flow

Auto-track competitor mentions and changes with n8n and Claude

Difficulty

intermediate

Setup time

60min

For

revops · sales-enablement

RevOps

Most competitive intelligence inside B2B sales teams arrives the wrong way: a rep loses a deal, posts in #lost-deals that the prospect mentioned a competitor's new pricing tier, and the rest of the team finds out three weeks later. The cost of late discovery compounds: every deal that closes in that window goes into the conversation underprepared. This flow is the cheap, boring fix. A daily cron crawls a list of competitor pages you actually care about, normalizes the HTML to discard deploy noise, asks Claude to summarize what changed materially (and to return NO_CHANGE when the diff is cosmetic), and posts a single weekly digest to Slack so the channel stays signal-dense enough that reps still open it after a month.

The artifact bundle in apps/web/public/artifacts/competitive-intel-tracker-n8n/ contains the importable n8n workflow (competitive-intel-tracker-n8n.json, 20 nodes across three triggers) and _README.md, which covers credential setup, the two Postgres tables you need to create, and a first-run verification that exercises six branches, including the skip-on-non-materiality branch and the on-demand Slack slash command.

When to use it

You have between five and fifteen competitors you actively position against, you can name three to five public pages per competitor that change in ways that matter (pricing, product positioning, hiring signal that hints at strategy), and you have at least one Slack channel the sales team genuinely opens. You are willing to maintain a list of tracked URLs as competitors restructure their sites. You have a Postgres database (or another store you can adapt the queries to) and an n8n instance reachable from the public internet if you want the on-demand slash command to work.

This is also the right shape if you previously tried an RSS contraption of the “Slack alert on every competitor blog post” kind and the team muted it within a week. The materiality filter and the weekly cadence here are direct responses to that failure mode.

When NOT to use it

Do not set this up if your competitive set is dominated by JS-heavy review aggregators such as G2, Capterra, or TrustRadius. Their public HTML is a shell: the actual review content is rendered client-side or sits behind authentication, and crawling them respectfully will return almost nothing. Pay for a vendor that handles them (Crayon, Klue, Kompyte) or skip those sources entirely.

Do not use this if your team needs the intel in real time, for example a deal cycle that turns over within a week and whose discovery calls depend on yesterday's competitor pricing change. The cadence here is daily fetch, weekly digest. If you need sub-hour latency, you are either buying a different product (Klue alerts) or building a different workflow (per-page change webhooks fed into the rep's Slack DMs, not a digest).

Do not use this against private competitor surfaces (gated trials, paid customer portals, anything behind a login). Crawling those is in a different ethical and legal class from checking public marketing pages, and this flow is not the right substrate for it.

Do not use this for fewer than three competitors. The setup cost (twenty to thirty tracked-page rows, schema, credentials, materiality tuning) does not pay off if you are watching one or two; a Google Alert and a calendar reminder is the right answer at that scale.

Setup

Read apps/web/public/artifacts/competitive-intel-tracker-n8n/_README.md end to end before importing. The short version: import competitive-intel-tracker-n8n.json via n8n's Import from File, create the two Postgres tables (competitor_tracked_pages and competitor_change_log) with the DDL from the README, wire up three credentials (PLACEHOLDER_POSTGRES_CRED_ID, PLACEHOLDER_ANTHROPIC_CRED_ID, PLACEHOLDER_SLACK_CRED_ID) plus the optional Slack slash-command webhook URL, set the workflow timezone explicitly in Settings, seed the tracked-pages table with twenty to thirty rows, and walk through the first-run verification before activating. The verification deliberately exercises the no-prior-snapshot path, the cheap no-change path, the forced-diff path, the skip-on-non-materiality path, the digest path, and the on-demand webhook: six branches, six small inputs.

What the flow actually does

The crawler is a splitInBatches loop with batchSize: 1 so that a single page's failure does not abort the run. Each iteration sleeps four seconds before the HTTP fetch. That spreads thirty pages over two minutes, which keeps you well below any reasonable per-host rate limit and reads as a polite bot in server logs. The httpRequest node sets neverError: true because a 403 from anti-bot defenses should be recorded and skipped, not crash the workflow.

Normalization happens in a Code node that strips <script>, <style>, <noscript>, and HTML comments entirely, reduces the remaining markup to plain text, then masks five classes of volatile content: ISO timestamps, US-format dates, four-digit years (20xx), any hex string of 32 or more characters (build IDs, asset hashes), and uppercase alphanumeric tokens of 16 or more characters. Without this step, every Astro/Next/Hugo deploy that re-renders a “© 2026” footer or a refreshed og:updated_time would register as a change, the weekly digest would fire with twenty meaningless entries, and the channel would die.
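
As a rough illustration of why the masking matters, the sketch below reuses the regexes from the bundled Normalize + Hash node (entity decoding is omitted for brevity) and hashes three made-up HTML snippets. The sample markup, the prices, and the `contentHash` helper name are assumptions for the example, not part of the bundle.

```javascript
const crypto = require('node:crypto');

// Same replacement chain as the bundled node, minus entity decoding.
function stripNoise(html) {
  return html
    // Drop blocks that never carry visible content
    .replace(/<script[\s\S]*?<\/script>/gi, '')
    .replace(/<style[\s\S]*?<\/style>/gi, '')
    .replace(/<noscript[\s\S]*?<\/noscript>/gi, '')
    .replace(/<!--[\s\S]*?-->/g, '')
    // Reduce the remaining markup to plain text
    .replace(/<[^>]+>/g, ' ')
    // Mask volatile values so a redeploy does not change the hash
    .replace(/\b\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(?:\.\d+)?Z?\b/g, '<TS>')
    .replace(/\b(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)[a-z]*\s+\d{1,2},?\s+20\d{2}\b/g, '<DATE>')
    .replace(/\b20\d{2}\b/g, '<YEAR>')
    .replace(/[a-f0-9]{32,}/gi, '<HASH>')
    .replace(/\b[A-Z0-9]{16,}\b/g, '<TOKEN>')
    // Collapse whitespace
    .replace(/\s+/g, ' ')
    .trim();
}

function contentHash(html) {
  return crypto.createHash('sha256').update(stripNoise(html)).digest('hex');
}

// Hypothetical snapshots: same content, different footer year and build ID...
const before = '<main><p>Pro plan: $49 per seat</p></main><footer>© 2025 Acme, build 0123456789abcdef0123456789abcdef</footer>';
const redeploy = '<main><p>Pro plan: $49 per seat</p></main><footer>© 2026 Acme, build fedcba9876543210fedcba9876543210</footer>';
// ...and one with a real price change.
const priceChange = '<main><p>Pro plan: $59 per seat</p></main><footer>© 2026 Acme, build fedcba9876543210fedcba9876543210</footer>';

console.log(stripNoise(before)); // Pro plan: $49 per seat © <YEAR> Acme, build <HASH>
console.log(contentHash(before) === contentHash(redeploy)); // true
console.log(contentHash(before) === contentHash(priceChange)); // false
```

The redeploy hashes identically to the original because the only differences sit behind the `<YEAR>` and `<HASH>` masks; the price edit survives normalization and changes the hash.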

The materiality gate is an AND of four conditions: the fetch succeeded, the hash differs from the previous snapshot, a previous snapshot exists, and the length delta is at least 0.5%. The length-delta term is the cheap pre-filter that saves Claude calls: single-character or whitespace-only edits never reach the model. The “had-prior-snapshot” term is what makes the very first run cheap: a brand-new tracked page captures its baseline hash and skips the diff entirely.
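
The gate can be sketched as a pure function over the fields the Normalize + Hash node emits (fetch_ok, hash_changed, old_text, length_delta_pct). The function names and sample values here are illustrative; the bundle implements the gate as an IF node, not as code.

```javascript
// Percent change in length, rounded to one decimal, as in the bundled node.
function lengthDeltaPct(oldText, newText) {
  if (oldText.length === 0) return 100; // no baseline: the bundle uses a ratio of 1
  const ratio = Math.abs(newText.length - oldText.length) / oldText.length;
  return Math.round(ratio * 1000) / 10;
}

// All four conditions must hold before a page is worth a Claude call.
function isMaterialCandidate(item, thresholdPct = 0.5) {
  return (
    item.fetch_ok === true &&
    item.hash_changed === true &&
    item.old_text !== '' &&
    item.length_delta_pct >= thresholdPct
  );
}

// Hypothetical helper that builds an item the way the crawler would.
function makeItem(oldText, newText, fetchOk = true) {
  return {
    fetch_ok: fetchOk,
    hash_changed: oldText !== newText,
    old_text: oldText,
    length_delta_pct: lengthDeltaPct(oldText, newText),
  };
}

const base = 'x'.repeat(1000);
console.log(isMaterialCandidate(makeItem('', base)));                  // false: first run, no baseline
console.log(isMaterialCandidate(makeItem(base, base + 'y')));          // false: 0.1% delta
console.log(isMaterialCandidate(makeItem(base, base + 'y'.repeat(60)))); // true: 6% delta
console.log(isMaterialCandidate(makeItem(base, base + 'y'.repeat(60), false))); // false: fetch failed
```

One consequence worth knowing under the gate as written: an edit that leaves the text length unchanged (for example, $49 becoming $59) has a length delta of 0 and does not pass, even though the hash differs. If same-length edits matter on your pricing pages, that is a reason to revisit the threshold for that page type.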

The Claude call sends both snapshots truncated to 6,000 characters each (roughly 1,500 tokens each; with the system prompt and overhead, around 3,500 input tokens per material page). The system prompt forces a binary choice: return NO_CHANGE if the diff is cosmetic, navigation-only, footer-only, or unidentifiable, or return exactly two sentences covering what changed and why a salesperson should care. The Parse node treats NO_CHANGE as a sentinel and flips is_material to false, so the row is still logged for audit but never reaches the digest.
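
A minimal sketch of the sentinel handling, following the logic of the bundled Parse Claude Response node. The response objects below are hand-written stand-ins shaped like an Anthropic Messages API response, and the summary text is invented.

```javascript
// Extract the model's text and decide whether it counts as a material change.
function parseSummary(anthropicResponse) {
  const summary = (anthropicResponse?.content?.[0]?.text || '').trim();
  // Empty output and anything starting with NO_CHANGE (any case) are non-material.
  const isNoChange = summary === '' || /^NO_CHANGE\b/i.test(summary);
  return { summary, is_material: !isNoChange };
}

const cosmetic = { content: [{ type: 'text', text: 'NO_CHANGE\n' }] };
const lowercase = { content: [{ type: 'text', text: 'no_change' }] };
const realChange = {
  content: [{
    type: 'text',
    text: 'The Pro tier rose from $49 to $59 per seat. Expect less price pressure in mid-market deals.',
  }],
};

console.log(parseSummary(cosmetic).is_material);   // false
console.log(parseSummary(lowercase).is_material);  // false
console.log(parseSummary(realChange).is_material); // true
console.log(parseSummary({}).is_material);         // false: malformed or empty response
```

Treating an empty or malformed response as non-material means an API hiccup produces a quiet audit row rather than a bogus digest entry.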

The Monday 14:30 digest aggregator runs a single SQL query that groups the last seven days of material changes by competitor, then renders one Slack Block Kit message per competitor, not a mega-post. Sales reps mute long, unbroken digests; per-competitor messages are scannable and threadable. Quiet weeks (no material changes anywhere) post nothing. The on-demand webhook is a third, fully independent trigger: it consumes a Slack slash-command POST, runs a LIKE match query against the last 90 days of the change log, and replies ephemerally to the requesting user with up to ten formatted blocks.
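
The bundle does the grouping in SQL; the sketch below shows the same idea in plain JavaScript so the shape of the per-competitor output is easier to see. The function name `buildDigestMessages`, the sample rows, and the exact block layout are assumptions, not the bundle's actual output.

```javascript
// Group material change-log rows by competitor and build one
// Block Kit payload per competitor. Quiet weeks yield an empty array.
function buildDigestMessages(rows) {
  const byCompetitor = new Map();
  for (const row of rows) {
    if (!row.is_material) continue; // audit-only rows never reach the digest
    if (!byCompetitor.has(row.competitor_name)) byCompetitor.set(row.competitor_name, []);
    byCompetitor.get(row.competitor_name).push(row);
  }
  return [...byCompetitor.entries()].map(([name, changes]) => ({
    text: `${name}: ${changes.length} material change(s) this week`, // notification fallback
    blocks: [
      { type: 'header', text: { type: 'plain_text', text: name } },
      ...changes.map((c) => ({
        type: 'section',
        text: { type: 'mrkdwn', text: `*${c.page_type}* <${c.url}|open page>\n${c.summary}` },
      })),
    ],
  }));
}

// Invented rows shaped like competitor_change_log.
const sampleRows = [
  { competitor_name: 'Acme', page_type: 'pricing', url: 'https://acme.example/pricing', summary: 'Pro tier rose from $49 to $59 per seat. Expect less price pressure.', is_material: true },
  { competitor_name: 'Acme', page_type: 'docs', url: 'https://acme.example/docs', summary: 'SSO moved to the Enterprise plan. Mid-market buyers lose SSO on Pro.', is_material: true },
  { competitor_name: 'Globex', page_type: 'blog', url: 'https://globex.example/blog', summary: 'NO_CHANGE', is_material: false },
  { competitor_name: 'Initech', page_type: 'hiring', url: 'https://initech.example/jobs', summary: 'Three enterprise AE roles were posted. They may be moving upmarket.', is_material: true },
];

const messages = buildDigestMessages(sampleRows);
console.log(messages.length);           // 2 (Globex had nothing material)
console.log(messages[0].blocks.length); // 3 (header + two sections for Acme)
console.log(buildDigestMessages([]).length); // 0, so nothing is posted
```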

Cost reality

Per crawl run, with 30 tracked pages and a typical 3-5 of them changing materially: roughly 11,000 input tokens (about three material pages at ~3,500 tokens each, the low end of that range) and 1,000 output tokens against claude-sonnet-4-6, which lands at about $0.05 per run. Daily for 30 days: ~$1.50/month in Claude spend. Self-hosted n8n: $0 incremental; n8n Cloud Starter: $20/month standalone or $0 if you already run it for other flows. Postgres: a few megabytes of storage even if you keep the change log indefinitely (the last_content_text column is the heavy one: 30 rows × ~50KB ≈ 1.5MB total, growing slowly).
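
The per-run figure can be reproduced with a few lines of arithmetic. The per-million-token prices below are an assumption, chosen because they reproduce the ~$0.05 figure; check Anthropic's current pricing for the model you actually route to.

```javascript
// ASSUMPTION: USD per million tokens. Not taken from the bundle.
const PRICE_PER_MTOK = { input: 3.0, output: 15.0 };

function estimateCost({ inputTokens, outputTokens, runsPerMonth }) {
  const perRun =
    (inputTokens * PRICE_PER_MTOK.input + outputTokens * PRICE_PER_MTOK.output) / 1e6;
  return { perRun, perMonth: perRun * runsPerMonth };
}

// Token counts from the paragraph above; one crawl per day for 30 days.
const estimate = estimateCost({ inputTokens: 11000, outputTokens: 1000, runsPerMonth: 30 });

console.log(estimate.perRun.toFixed(3));   // 0.048
console.log(estimate.perMonth.toFixed(2)); // 1.44
```

At five material pages per run instead of three, the same function with 17,500 input tokens gives a higher but still small monthly figure, so the conclusion holds across the stated range.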

Wall-clock per run: ~2.5 minutes (30 pages × 4s throttle + Claude latency for the material ones). Slack digest: under 5 seconds. On-demand webhook: under 2 seconds to respond.

Operator time: 30-60 minutes once a quarter to refresh the tracked-pages list when competitors restructure their sites, plus ~5 minutes the first time someone reports a false positive (“the digest said pricing changed but it didn't”) to tune the materiality threshold or add a noise-mask pattern.

What success looks like

A concrete metric to watch for the first eight weeks: the digest's open rate, or the Slack equivalent of a read receipt (you can proxy it by reaction count or by manually polling reps). If less than 30% of the channel reads the digest, the signal-to-noise ratio is too low: tighten the materiality threshold (raise the length-delta gate from 0.5% to 1%), drop the lowest-signal page types (competitor hiring pages with a permanent open-jobs page that rotates weekly are usually noise), or merge low-frequency competitors into a “long tail” digest section. If more than 60% read it consistently, you built the right thing, and the next move is to promote the on-demand route for the discovery-call use case (it is already wired; just publicize the slash command).

A second metric: the number of times in a quarter that a rep cites the digest in a #won-deals or #lost-deals thread. Five citations per quarter from a team of 20 reps is a good sign; zero citations after two months means either the digest is not being read or the content is not actionable.

Versus the alternatives

Klue or Crayon ($30k-$80k/year for either one's SMB tier, last checked Q1 2026) handle the JS-heavy review-aggregator sources you cannot crawl yourself, ship a polished consumer experience for the sales team (battlecards, win/loss themes, an intel hub), and include a human curation layer that catches nuance Claude misses. If competitive intel is central enough to your deal cycle that you have a full-time competitive intelligence person, buy Klue or Crayon. This flow is the right answer when you are running a 20-rep org without a dedicated CI hire and need to stop discovering competitor pricing changes from your own lost-deals threads: it gets you 70% of the value at 1% of the cost.

Visualping or Distill.io (under $10/month) do the page change-detection layer well, but they stop at “this page changed” and dump the diff in your inbox. The interesting work, turning a diff into “here is what your sales team needs to say differently,” is exactly what Claude does here. You could wire Visualping into n8n and bypass the crawler/hasher half of this flow if you wanted to outsource the polite-crawler concern; the materiality filter and the Claude diff stage are the parts that really matter.

A single Google Alerts feed is what most teams default to and what most teams quietly stop reading after a month. Google Alerts fires on press mentions, not page changes; it misses pricing-page edits entirely (the page does not get a new news-index entry); and the volume is dominated by syndicated press-release noise. Use Alerts as a complement to this flow for press signal, not as a replacement for the page-monitoring substrate.

A bespoke Python crawler on a cron job in your data warehouse is what every staff engineer wants to build. They will have it working in one sprint, the diff layer working a sprint later, the Slack formatting working a sprint after that, and then nobody will own it when the engineer changes teams. The reason to use n8n here is that it makes the workflow visible (the graph is the documentation), editable by a non-engineer (the marketing ops person can add a tracked page without a PR), and boring enough to outlive the person who built it.

Watch-outs

Anti-bot blocks return 403/503 and your hash silently goes stale. Guard: the Fetch Page HTML node sets neverError: true, and the materiality gate's fetch_ok condition (status 200-399 AND body length > 200 characters) routes failed fetches to the false branch, so they are logged but never reach Claude or the digest. Add a weekly query against competitor_tracked_pages for pages whose last_seen_at is more than 7 days old and treat that as the “stale tracked pages” report.
Claude hallucinates a change when the normalized diff is dirty (for example, a CSS class rename touched every <div> and the stripped text did not fully recover). Guard: the system prompt's escape hatch is the literal string NO_CHANGE, and the parser treats anything matching ^NO_CHANGE\b (case-insensitive) as non-material. When you see an obviously wrong digest entry, the fix is to add a noise-mask pattern in the Normalize + Hash Code node, not to lower the model temperature.
The Slack channel gets muted within four weeks of going live if even 20% of digest entries are non-material. Guard: weekly rather than daily cadence (the bundled digest cron is 30 14 * * 1, Mondays at 14:30 only), the 0.5% length-delta materiality floor, Claude's NO_CHANGE sentinel, and the quiet-weeks-stay-quiet IF gate that suppresses the digest entirely when no competitor has material changes. If reps still mute it, the next dial to turn is dropping the lowest-signal page_type values from the tracked-pages list, usually hiring pages.
Long competitor names or large change volumes exceed Slack's limit of 50 blocks per message. Guard: one message per competitor (not a mega-post), so the cap applies per competitor rather than per week. If a single competitor genuinely has more than ~15 material changes in a week, that is itself a sign the materiality threshold needs to go up for that competitor specifically.
The on-demand slash command leaks competitive intelligence to anyone in the workspace, because Slack slash commands do not enforce channel membership. Guard: the respondToWebhook node returns response_type: "ephemeral" so only the requesting user sees the result, and the query is scoped to the change log (no raw page text is returned). If you need stricter access control, gate the slash command on a Slack user-group ID in the Parse Slash Command Code node before running the SQL query.
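
One way the stricter slash-command gate could look inside a Code node. This is a hypothetical sketch: the bundle does not ship it, `ALLOWED_USER_IDS` stands in for a Slack user-group membership lookup you would resolve separately, and the user IDs are made up. The only payload fields relied on are `user_id` and `text`, which Slack includes in slash-command POSTs.

```javascript
// HYPOTHETICAL allow-list; in practice this would come from a user-group lookup.
const ALLOWED_USER_IDS = new Set(['U012ABCDEF', 'U034GHIJKL']);

function parseSlashCommand(body) {
  const userId = body.user_id || '';
  const term = (body.text || '').trim();

  if (!ALLOWED_USER_IDS.has(userId)) {
    return {
      allowed: false,
      reply: { response_type: 'ephemeral', text: 'You do not have access to competitive intel.' },
    };
  }
  if (term === '') {
    return {
      allowed: false,
      reply: { response_type: 'ephemeral', text: 'Usage: /whatsnew <competitor name>' },
    };
  }
  // Escape LIKE wildcards so user input cannot widen the match.
  // Assumes backslash is the LIKE escape character (the Postgres default).
  const likePattern = `%${term.replace(/[\\%_]/g, '\\$&')}%`;
  return { allowed: true, likePattern };
}

console.log(parseSlashCommand({ user_id: 'U012ABCDEF', text: ' acme ' }).likePattern); // %acme%
console.log(parseSlashCommand({ user_id: 'U999ZZZZZZ', text: 'acme' }).allowed);       // false
```

Downstream, an IF node on `allowed` would route denied requests straight to the webhook response with the `reply` payload, skipping the SQL query entirely.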

Stack

n8n: three triggers (daily fetch cron, weekly digest cron, on-demand webhook), HTTP fetcher, normalizer, materiality gate, persistence
Postgres: competitor_tracked_pages (the source-of-truth list, 20-30 rows) and competitor_change_log (audit trail of every detected change, material or not)
Claude Sonnet 4.6: the diff-and-summarize stage, with the NO_CHANGE sentinel as the escape hatch
Slack: the distribution channel for the digest and the surface for the on-demand slash command

Files in this artifact

# Competitive intel tracker — n8n bundle

## What this flow does

A daily cron pulls a list of tracked competitor pages from Postgres, fetches each one with a real user-agent and a 4-second throttle, normalizes the HTML by stripping volatile noise (script blocks, build IDs, server-rendered timestamps, current-year strings), hashes the result, and compares it to the previously stored hash. Pages whose hash and length-delta both clear a materiality threshold get diffed by Claude Sonnet against the prior snapshot; the model is instructed to return the literal string `NO_CHANGE` when the diff is cosmetic. Material summaries land in a `competitor_change_log` table. A second cron fires Mondays at 14:30 and aggregates the last seven days of material changes into one Slack Block Kit message per competitor — silent weeks stay silent. A third trigger (a Slack slash command webhook) lets sales reps query the same change log on demand for a single competitor over the last 90 days.

## Import

1. In n8n, open the workflow list and click **Import from File** in the top-right kebab menu.
2. Select `competitive-intel-tracker-n8n.json`.
3. Confirm the workflow opens with 20 nodes across three triggers (the daily crawler, the weekly digest, and the on-demand webhook). The graph should read left-to-right with the digest below the crawler and the webhook below that.
4. Open **Settings** on the workflow and confirm `executionOrder: v1` and a sensible `timezone` (the bundle ships `Europe/London` — change it to your team's working timezone before activating; Cron expressions are interpreted in this zone).
5. Do **not** activate yet. Wire credentials and create the database tables first (next two sections).

## Credentials

The flow references three credential placeholders by name, plus an optional slash-command webhook that needs no credential. Each placeholder must be replaced with a real n8n credential of the matching type before the workflow will execute.

### `PLACEHOLDER_POSTGRES_CRED_ID` — Postgres (read/write)

Used by five nodes (`Pull Tracked Pages`, `Persist Change + Update Snapshot`, `Touch Snapshot (No Material Change)`, `Aggregate Last 7 Days Of Material Changes`, `Fetch On-Demand History`). Create an n8n **Postgres** credential pointing at the database that holds your tracked pages and change log. The bundle assumes two tables — create them with:

```sql
CREATE TABLE competitor_tracked_pages (
  page_id bigserial PRIMARY KEY,
  competitor_name text NOT NULL,
  page_type text NOT NULL, -- 'pricing' | 'blog' | 'hiring' | 'reviews' | 'docs'
  url text NOT NULL UNIQUE,
  active boolean NOT NULL DEFAULT true,
  last_content_hash text,
  last_content_text text,
  last_seen_at timestamptz
);

CREATE TABLE competitor_change_log (
  id bigserial PRIMARY KEY,
  page_id bigint REFERENCES competitor_tracked_pages(page_id) ON DELETE CASCADE,
  competitor_name text NOT NULL,
  page_type text NOT NULL,
  url text NOT NULL,
  content_hash text NOT NULL,
  summary text NOT NULL,
  is_material boolean NOT NULL,
  detected_at timestamptz NOT NULL DEFAULT now()
);

CREATE INDEX ON competitor_change_log (competitor_name, detected_at DESC);
CREATE INDEX ON competitor_change_log (detected_at DESC) WHERE is_material;
```

Seed `competitor_tracked_pages` with twenty to thirty rows before the first run. The recommended starter set per competitor: pricing page, two recent blog posts, careers/jobs index, docs landing page. Skip JS-heavy review sites (G2, Capterra, TrustRadius) unless you have a rendering service — the raw HTML they ship is mostly empty.

### `PLACEHOLDER_ANTHROPIC_CRED_ID` — Anthropic API key

Used by `Claude — Diff + Summarize`. Create an n8n **Header Auth** credential with header name `x-api-key` and value set to your Anthropic API key (find it at console.anthropic.com → API Keys). The flow uses `claude-sonnet-4-6`; change the model in the JSON if your account routes elsewhere. Token budget per run: roughly `material pages × (~3500 input tokens + ~200 output tokens)`. Pages that fail the materiality gate never reach Claude and cost nothing. See the cost-reality section in the page body for absolute numbers.

### `PLACEHOLDER_SLACK_CRED_ID` — Slack bot token

Used by `Slack — Post Weekly Digest`. Create a Slack app at api.slack.com/apps, add the bot scopes `chat:write` and `chat:write.public` (the latter so the bot can post to public channels it has not joined), install the app to your workspace, and copy the **Bot User OAuth Token** (starts with `xoxb-`). Create an n8n **Header Auth** credential with header name `Authorization` and value `Bearer xoxb-...`. Update the channel name in the `Slack — Post Weekly Digest` node from `#competitive-intel` to whatever channel your sales team actually reads.

### Slash command (optional, no credential — webhook URL only)

The `On-Demand Webhook` node exposes a path at `/webhook/intel-on-demand`. To wire a Slack slash command to it: in your Slack app config, add a slash command (e.g. `/whatsnew`), set the request URL to your n8n public URL plus that path, and grant the `commands` scope. No n8n credential is needed because Slack POSTs to the webhook directly. If your n8n is not internet-reachable, either expose it via a tunnel or skip this trigger and run the on-demand query manually from the n8n editor.

## First-run verification

Run these in order. Each step proves a different branch of the flow.

1. **Insert one tracked page that you know changes daily** (a competitor's blog index works well). Verify with `SELECT * FROM competitor_tracked_pages;` that the row exists with `last_content_hash IS NULL`.
2. **Manually execute the `Daily Cron — 5am UTC` trigger** from the n8n editor. The first run should: fetch the page, compute a hash, *fail* the `Material Change?` IF (because there is no prior snapshot to compare — the `had-prior-snapshot` condition is false), and route to `Touch Snapshot (No Material Change)` which writes the initial hash. Confirm `competitor_tracked_pages.last_content_hash` is now populated and `competitor_change_log` is still empty.
3. **Manually execute the trigger a second time, immediately.** The hash should match (page didn't change in two minutes), the IF fails, no Claude call. This proves the cheap path.
4. **Edit the row to force a diff.** Run `UPDATE competitor_tracked_pages SET last_content_text = 'lorem ipsum placeholder', last_content_hash = 'force-diff' WHERE page_id = <id>;` and re-execute the trigger. The IF should now pass, Claude should be called, and you should see a row appear in `competitor_change_log`. Open the row and read the summary: it should describe the page in two sentences. If it returned `NO_CHANGE` despite the forced diff, check the system prompt and the 6000-character truncation in the `Claude — Diff + Summarize` node; the materiality threshold sits upstream of the Claude call and cannot be the cause.
5. **Test the no-op materiality filter.** Insert a row pointing at a page that has trivial dynamic content (e.g. a homepage with rotating testimonials). After the first snapshot is captured, re-run the cron. The hash will likely differ but the length delta should be small — confirm it routes to the false branch and does not spend a Claude call.
6. **Test the weekly digest.** Manually execute `Weekly Digest Cron — Mon 14:30`. If `competitor_change_log` has at least one `is_material = true` row from the last 7 days, you should see a Slack message land in the configured channel. If the table is empty for the window, no message fires — that is correct behavior, not a bug.
7. **Test the on-demand webhook.** From a terminal, `curl -X POST https://<your-n8n>/webhook/intel-on-demand -d 'text=acme'` (or trigger your wired Slack slash command). Expect a JSON response with up to 10 of the most recent material changes for any competitor whose name contains `acme`. With an empty change log, expect the "No material changes recorded" fallback.
8. **Activate the workflow** only after all six branches above behaved as described.

{
  "name": "Competitive intel tracker",
  "nodes": [
    {
      "parameters": {
        "rule": {
          "interval": [
            {
              "field": "cronExpression",
              "expression": "0 5 * * *"
            }
          ]
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000001",
      "name": "Daily Cron — 5am UTC",
      "type": "n8n-nodes-base.scheduleTrigger",
      "typeVersion": 1,
      "position": [240, 300],
      "notesInFlow": true,
      "notes": "Crawl runs daily at 05:00 in the workflow timezone (set in Settings). The weekly digest runs from its own Monday 14:30 cron trigger."
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "SELECT\n  page_id,\n  competitor_name,\n  page_type,\n  url,\n  last_content_hash,\n  last_content_text,\n  last_seen_at\nFROM competitor_tracked_pages\nWHERE active = true\nORDER BY competitor_name, page_type\nLIMIT 200;",
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000002",
      "name": "Pull Tracked Pages",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [460, 300],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — competitive-intel"
        }
      },
      "notesInFlow": true,
      "notes": "Source-of-truth table for the tracked-pages list. Twenty to thirty rows is typical; cap at 200 to fail closed if the list grows unmanageably."
    },
    {
      "parameters": {
        "batchSize": 1,
        "options": {
          "reset": false
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000003",
      "name": "Iterate One Page At A Time",
      "type": "n8n-nodes-base.splitInBatches",
      "typeVersion": 3,
      "position": [680, 300],
      "notesInFlow": true,
      "notes": "Batch size 1 — each iteration handles one URL so per-page failure does not abort the run. Pair with a Wait node downstream to throttle."
    },
    {
      "parameters": {
        "amount": 4,
        "unit": "seconds"
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000004",
      "name": "Throttle — 4s Between Fetches",
      "type": "n8n-nodes-base.wait",
      "typeVersion": 1.1,
      "position": [900, 300],
      "notesInFlow": true,
      "notes": "Spreads ~30 fetches over ~2 minutes. Combined with one-request-per-page-per-day this keeps us well under any reasonable rate limit."
    },
    {
      "parameters": {
        "method": "GET",
        "url": "={{ $json.url }}",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "User-Agent", "value": "ooligo-intel-bot/1.0 (+https://ooligo.com/bots)" },
            { "name": "Accept", "value": "text/html,application/xhtml+xml" }
          ]
        },
        "options": {
          "timeout": 20000,
          "redirect": {
            "redirect": {
              "followRedirects": true,
              "maxRedirects": 3
            }
          },
          "response": {
            "response": {
              "fullResponse": true,
              "neverError": true
            }
          }
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000005",
      "name": "Fetch Page HTML",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1120, 300],
      "notesInFlow": true,
      "notes": "neverError:true so a 403/503 from anti-bot does not kill the batch — we record it and move on."
    },
    {
      "parameters": {
        "jsCode": "// Strip noise from the HTML, normalize, and hash. The 'noise' is anything that\n// re-renders on every deploy without representing a content change: build IDs,\n// CSRF tokens, current-year strings, server-rendered timestamps, CDN cache\n// busters in asset URLs. Without this filter the digest fires every day with\n// nothing actually changed and the Slack channel gets muted within a week.\n\nconst page = $('Iterate One Page At A Time').item.json;\nconst response = $json;\nconst statusCode = response.statusCode || response.status || 0;\nconst rawBody = typeof response.body === 'string' ? response.body : JSON.stringify(response.body || '');\n\nfunction stripNoise(html) {\n  return html\n    // Remove <script> and <style> blocks entirely\n    .replace(/<script[\\s\\S]*?<\\/script>/gi, '')\n    .replace(/<style[\\s\\S]*?<\\/style>/gi, '')\n    .replace(/<noscript[\\s\\S]*?<\\/noscript>/gi, '')\n    .replace(/<!--[\\s\\S]*?-->/g, '')\n    // Strip all tags to plain text\n    .replace(/<[^>]+>/g, ' ')\n    // Decode common entities\n    .replace(/&nbsp;/g, ' ').replace(/&amp;/g, '&').replace(/&lt;/g, '<').replace(/&gt;/g, '>').replace(/&quot;/g, '\"')\n    // Mask volatile values\n    .replace(/\\b\\d{4}-\\d{2}-\\d{2}T\\d{2}:\\d{2}:\\d{2}(?:\\.\\d+)?Z?\\b/g, '<TS>')\n    .replace(/\\b(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)[a-z]*\\s+\\d{1,2},?\\s+20\\d{2}\\b/g, '<DATE>')\n    .replace(/\\b20\\d{2}\\b/g, '<YEAR>')\n    .replace(/[a-f0-9]{32,}/gi, '<HASH>')\n    .replace(/\\b[A-Z0-9]{16,}\\b/g, '<TOKEN>')\n    // Collapse whitespace\n    .replace(/\\s+/g, ' ')\n    .trim();\n}\n\nconst normalized = stripNoise(rawBody);\n\nconst crypto = require('crypto');\nconst contentHash = crypto.createHash('sha256').update(normalized).digest('hex');\n\n// Materiality pre-filter: very small diffs are not worth a Claude call.\nconst prevText = page.last_content_text || '';\nconst lengthDelta = Math.abs(normalized.length - prevText.length);\nconst lengthRatio = prevText.length === 0 ? 1 : lengthDelta / prevText.length;\n\nreturn [{\n  json: {\n    page_id: page.page_id,\n    competitor_name: page.competitor_name,\n    page_type: page.page_type,\n    url: page.url,\n    fetch_status: statusCode,\n    fetched_at: new Date().toISOString(),\n    new_hash: contentHash,\n    old_hash: page.last_content_hash || null,\n    new_text: normalized,\n    old_text: prevText,\n    hash_changed: contentHash !== (page.last_content_hash || ''),\n    length_delta_pct: Math.round(lengthRatio * 1000) / 10,\n    fetch_ok: statusCode >= 200 && statusCode < 400 && rawBody.length > 200\n  }\n}];"
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000006",
      "name": "Normalize + Hash",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [1340, 300]
    },
    {
      "parameters": {
        "conditions": {
          "options": {
            "caseSensitive": true,
            "leftValue": "",
            "typeValidation": "strict"
          },
          "conditions": [
            {
              "id": "fetch-ok",
              "leftValue": "={{ $json.fetch_ok }}",
              "rightValue": true,
              "operator": { "type": "boolean", "operation": "equal" }
            },
            {
              "id": "hash-changed",
              "leftValue": "={{ $json.hash_changed }}",
              "rightValue": true,
              "operator": { "type": "boolean", "operation": "equal" }
            },
            {
              "id": "had-prior-snapshot",
              "leftValue": "={{ $json.old_text }}",
              "rightValue": "",
              "operator": { "type": "string", "operation": "notEmpty" }
            },
            {
              "id": "non-trivial-delta",
              "leftValue": "={{ $json.length_delta_pct }}",
              "rightValue": 0.5,
              "operator": { "type": "number", "operation": "gte" }
            }
          ],
          "combinator": "and"
        },
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000007",
      "name": "Material Change?",
      "type": "n8n-nodes-base.if",
      "typeVersion": 2.2,
      "position": [1560, 300],
      "notesInFlow": true,
      "notes": "Four-part gate: fetch succeeded, hash differs, we have a prior snapshot to compare against, and length delta exceeds 0.5% (filters out single-character or whitespace-only edits)."
    },
    {
      "parameters": {
        "method": "POST",
        "url": "https://api.anthropic.com/v1/messages",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "anthropic-version", "value": "2023-06-01" },
            { "name": "content-type", "value": "application/json" }
          ]
        },
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpHeaderAuth",
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={\n  \"model\": \"claude-sonnet-4-6\",\n  \"max_tokens\": 400,\n  \"system\": \"You compare two snapshots of a competitor's public web page and report what changed in a way that helps a B2B sales team. Output rules: (1) If the diff is cosmetic, navigation-only, footer-only, or you cannot identify a specific factual delta, return exactly the string NO_CHANGE on a single line. Nothing else. (2) Otherwise return two short sentences. Sentence one: what changed (a price, a feature, a target customer, a hire, a positioning shift). Sentence two: why a salesperson should care (a new objection to pre-empt, a new wedge to use, a new threat to flag). Do not invent details that are not in the diff. Do not speculate about strategy. Do not pad with generic commentary.\",\n  \"messages\": [\n    {\n      \"role\": \"user\",\n      \"content\": \"Competitor: {{ $json.competitor_name }}\\nPage type: {{ $json.page_type }}\\nURL: {{ $json.url }}\\n\\n--- PREVIOUS SNAPSHOT ---\\n{{ $json.old_text.slice(0, 6000) }}\\n\\n--- CURRENT SNAPSHOT ---\\n{{ $json.new_text.slice(0, 6000) }}\"\n    }\n  ]\n}",
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000008",
      "name": "Claude — Diff + Summarize",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1780, 200],
      "credentials": {
        "httpHeaderAuth": {
          "id": "PLACEHOLDER_ANTHROPIC_CRED_ID",
          "name": "Anthropic — x-api-key"
        }
      },
      "notesInFlow": true,
      "notes": "Snapshots truncated to 6000 chars each — keeps input ≤ ~3k tokens per page. NO_CHANGE sentinel is the model's escape hatch when the diff is noisy."
    },
    {
      "parameters": {
        "jsCode": "// Pull the model's text out of the Anthropic response and decide whether to keep it.\nconst page = $('Material Change?').item.json;\nconst resp = $json;\nconst summary = (resp?.content?.[0]?.text || '').trim();\nconst isNoChange = summary === '' || summary === 'NO_CHANGE' || /^NO_CHANGE\\b/i.test(summary);\n\nreturn [{\n  json: {\n    page_id: page.page_id,\n    competitor_name: page.competitor_name,\n    page_type: page.page_type,\n    url: page.url,\n    new_hash: page.new_hash,\n    new_text: page.new_text,\n    summary,\n    is_material: !isNoChange,\n    summarized_at: new Date().toISOString(),\n    input_tokens: resp?.usage?.input_tokens || null,\n    output_tokens: resp?.usage?.output_tokens || null\n  }\n}];"
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000009",
      "name": "Parse Claude Response",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [2000, 200]
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "INSERT INTO competitor_change_log (\n  page_id, competitor_name, page_type, url,\n  content_hash, summary, is_material, detected_at\n) VALUES ($1, $2, $3, $4, $5, $6, $7, now())\nRETURNING id;\n\nUPDATE competitor_tracked_pages\nSET\n  last_content_hash = $5,\n  last_content_text = $8,\n  last_seen_at = now()\nWHERE page_id = $1;",
        "options": {
          "queryReplacement": "={{ $json.page_id }},{{ $json.competitor_name }},{{ $json.page_type }},{{ $json.url }},{{ $json.new_hash }},{{ JSON.stringify($json.summary) }},{{ $json.is_material }},{{ JSON.stringify($json.new_text) }}"
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-00000000000a",
      "name": "Persist Change + Update Snapshot",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [2220, 200],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — competitive-intel"
        }
      },
      "notesInFlow": true,
      "notes": "Two statements: append to the change log (audit trail), then advance the snapshot. is_material flag drives the weekly digest filter."
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "UPDATE competitor_tracked_pages\nSET\n  last_content_hash = COALESCE($2, last_content_hash),\n  last_content_text = COALESCE($3, last_content_text),\n  last_seen_at = now()\nWHERE page_id = $1;",
        "options": {
          "queryReplacement": "={{ $json.page_id }},{{ $json.fetch_ok ? $json.new_hash : null }},{{ $json.fetch_ok ? JSON.stringify($json.new_text) : null }}"
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-00000000000b",
      "name": "Touch Snapshot (No Material Change)",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [1780, 400],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — competitive-intel"
        }
      },
      "notesInFlow": true,
      "notes": "False branch: still advances the stored hash so the next run compares against the latest content, but does NOT spend a Claude call or write to the change log."
    },
    {
      "parameters": {
        "rule": {
          "interval": [
            {
              "field": "cronExpression",
              "expression": "30 14 * * 1"
            }
          ]
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-00000000000c",
      "name": "Weekly Digest Cron — Mon 14:30",
      "type": "n8n-nodes-base.scheduleTrigger",
      "typeVersion": 1,
      "position": [240, 700],
      "notesInFlow": true,
      "notes": "Independent trigger. Mondays at 14:30 in the workflow timezone (Europe/London, see settings.timezone): mid-afternoon for the EU, around 09:30 on the US east coast, and waiting in the channel for APAC on Tuesday morning."
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "SELECT\n  competitor_name,\n  json_agg(\n    json_build_object(\n      'page_type', page_type,\n      'url', url,\n      'summary', summary,\n      'detected_at', detected_at\n    ) ORDER BY detected_at DESC\n  ) AS changes\nFROM competitor_change_log\nWHERE is_material = true\n  AND detected_at >= now() - interval '7 days'\nGROUP BY competitor_name\nORDER BY competitor_name;",
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-00000000000d",
      "name": "Aggregate Last 7 Days Of Material Changes",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [460, 700],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — competitive-intel"
        }
      }
    },
    {
      "parameters": {
        "conditions": {
          "options": {
            "caseSensitive": true,
            "leftValue": "",
            "typeValidation": "strict"
          },
          "conditions": [
            {
              "id": "have-changes",
              "leftValue": "={{ $json.competitor_name }}",
              "rightValue": "",
              "operator": { "type": "string", "operation": "notEmpty" }
            }
          ],
          "combinator": "and"
        },
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-00000000000e",
      "name": "Anything To Report?",
      "type": "n8n-nodes-base.if",
      "typeVersion": 2.2,
      "position": [680, 700],
      "notesInFlow": true,
      "notes": "Silent weeks stay silent — no 'no updates this week' filler messages. The channel never fires unless there is something actually worth reading."
    },
    {
      "parameters": {
        "jsCode": "// Render one Slack Block Kit payload per competitor with material changes this week.\nconst c = $json;\nconst changes = c.changes || [];\nconst blocks = [\n  {\n    type: 'header',\n    text: { type: 'plain_text', text: `Competitor update — ${c.competitor_name}`, emoji: false }\n  },\n  {\n    type: 'context',\n    elements: [\n      { type: 'mrkdwn', text: `${changes.length} material change${changes.length === 1 ? '' : 's'} in the last 7 days` }\n    ]\n  },\n  { type: 'divider' }\n];\nfor (const ch of changes) {\n  blocks.push({\n    type: 'section',\n    text: {\n      type: 'mrkdwn',\n      text: `*${ch.page_type}* — <${ch.url}|view page>\\n${ch.summary}`\n    }\n  });\n}\nreturn [{\n  json: {\n    competitor_name: c.competitor_name,\n    blocks,\n    fallback_text: `Competitor update — ${c.competitor_name} (${changes.length} material change${changes.length === 1 ? '' : 's'} this week)`\n  }\n}];"
      },
      "id": "2d2d2d2d-0002-0000-0000-00000000000f",
      "name": "Compose Slack Blocks",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [900, 700]
    },
    {
      "parameters": {
        "method": "POST",
        "url": "https://slack.com/api/chat.postMessage",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "httpHeaderAuth",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "content-type", "value": "application/json; charset=utf-8" }
          ]
        },
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={\n  \"channel\": \"#competitive-intel\",\n  \"text\": {{ JSON.stringify($json.fallback_text) }},\n  \"blocks\": {{ JSON.stringify($json.blocks) }},\n  \"unfurl_links\": false,\n  \"unfurl_media\": false\n}",
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000010",
      "name": "Slack — Post Weekly Digest",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1120, 700],
      "credentials": {
        "httpHeaderAuth": {
          "id": "PLACEHOLDER_SLACK_CRED_ID",
          "name": "Slack — bot token"
        }
      },
      "notesInFlow": true,
      "notes": "One message per competitor, not one mega-post — sales reps mute long unbroken digests. Update channel name to your team's actual channel."
    },
    {
      "parameters": {
        "httpMethod": "POST",
        "path": "intel-on-demand",
        "responseMode": "responseNode",
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000011",
      "name": "On-Demand Webhook (Slack Slash Command)",
      "type": "n8n-nodes-base.webhook",
      "typeVersion": 2,
      "position": [240, 1100],
      "notesInFlow": true,
      "notes": "Wire a Slack slash command (e.g. /whatsnew acme) to this URL. Slack POSTs form-encoded body with text=<competitor query>."
    },
    {
      "parameters": {
        "jsCode": "// Parse Slack slash command payload, normalize the competitor name.\nconst body = $json.body || $json;\nconst raw = (body.text || '').trim();\nif (!raw) {\n  return [{ json: { error: 'Usage: /whatsnew <competitor>', _respond_immediately: true } }];\n}\nreturn [{ json: { query: raw.toLowerCase(), response_url: body.response_url || null } }];"
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000012",
      "name": "Parse Slash Command",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [460, 1100]
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "SELECT\n  competitor_name,\n  page_type,\n  url,\n  summary,\n  detected_at\nFROM competitor_change_log\nWHERE is_material = true\n  AND lower(competitor_name) LIKE '%' || $1 || '%'\n  AND detected_at >= now() - interval '90 days'\nORDER BY detected_at DESC\nLIMIT 10;",
        "options": {
          "queryReplacement": "={{ $json.query }}"
        }
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000013",
      "name": "Fetch On-Demand History",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [680, 1100],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_POSTGRES_CRED_ID",
          "name": "Postgres — competitive-intel"
        }
      }
    },
    {
      "parameters": {
        "respondWith": "json",
        "responseBody": "={\n  \"response_type\": \"ephemeral\",\n  \"text\": {{ JSON.stringify(($input.all().filter(i => i.json.detected_at).length === 0 ? 'No material changes recorded in the last 90 days.' : 'Last ' + $input.all().filter(i => i.json.detected_at).length + ' material changes:')) }},\n  \"blocks\": {{ JSON.stringify($input.all().filter(i => i.json.detected_at).map(i => ({ type: 'section', text: { type: 'mrkdwn', text: '*' + i.json.competitor_name + ' — ' + i.json.page_type + '* (' + new Date(i.json.detected_at).toISOString().slice(0,10) + ')\\n' + i.json.summary + '\\n<' + i.json.url + '|view page>' } }))) }}\n}",
        "options": {}
      },
      "id": "2d2d2d2d-0002-0000-0000-000000000014",
      "name": "Respond To Slack",
      "type": "n8n-nodes-base.respondToWebhook",
      "typeVersion": 1.1,
      "position": [900, 1100]
    }
  ],
  "connections": {
    "Daily Cron — 5am UTC": {
      "main": [
        [{ "node": "Pull Tracked Pages", "type": "main", "index": 0 }]
      ]
    },
    "Pull Tracked Pages": {
      "main": [
        [{ "node": "Iterate One Page At A Time", "type": "main", "index": 0 }]
      ]
    },
    "Iterate One Page At A Time": {
      "main": [
        [{ "node": "Throttle — 4s Between Fetches", "type": "main", "index": 0 }]
      ]
    },
    "Throttle — 4s Between Fetches": {
      "main": [
        [{ "node": "Fetch Page HTML", "type": "main", "index": 0 }]
      ]
    },
    "Fetch Page HTML": {
      "main": [
        [{ "node": "Normalize + Hash", "type": "main", "index": 0 }]
      ]
    },
    "Normalize + Hash": {
      "main": [
        [{ "node": "Material Change?", "type": "main", "index": 0 }]
      ]
    },
    "Material Change?": {
      "main": [
        [{ "node": "Claude — Diff + Summarize", "type": "main", "index": 0 }],
        [{ "node": "Touch Snapshot (No Material Change)", "type": "main", "index": 0 }]
      ]
    },
    "Claude — Diff + Summarize": {
      "main": [
        [{ "node": "Parse Claude Response", "type": "main", "index": 0 }]
      ]
    },
    "Parse Claude Response": {
      "main": [
        [{ "node": "Persist Change + Update Snapshot", "type": "main", "index": 0 }]
      ]
    },
    "Persist Change + Update Snapshot": {
      "main": [
        [{ "node": "Iterate One Page At A Time", "type": "main", "index": 0 }]
      ]
    },
    "Touch Snapshot (No Material Change)": {
      "main": [
        [{ "node": "Iterate One Page At A Time", "type": "main", "index": 0 }]
      ]
    },
    "Weekly Digest Cron — Mon 14:30": {
      "main": [
        [{ "node": "Aggregate Last 7 Days Of Material Changes", "type": "main", "index": 0 }]
      ]
    },
    "Aggregate Last 7 Days Of Material Changes": {
      "main": [
        [{ "node": "Anything To Report?", "type": "main", "index": 0 }]
      ]
    },
    "Anything To Report?": {
      "main": [
        [{ "node": "Compose Slack Blocks", "type": "main", "index": 0 }],
        []
      ]
    },
    "Compose Slack Blocks": {
      "main": [
        [{ "node": "Slack — Post Weekly Digest", "type": "main", "index": 0 }]
      ]
    },
    "On-Demand Webhook (Slack Slash Command)": {
      "main": [
        [{ "node": "Parse Slash Command", "type": "main", "index": 0 }]
      ]
    },
    "Parse Slash Command": {
      "main": [
        [{ "node": "Fetch On-Demand History", "type": "main", "index": 0 }]
      ]
    },
    "Fetch On-Demand History": {
      "main": [
        [{ "node": "Respond To Slack", "type": "main", "index": 0 }]
      ]
    }
  },
  "active": false,
  "settings": {
    "executionOrder": "v1",
    "timezone": "Europe/London",
    "saveDataErrorExecution": "all",
    "saveDataSuccessExecution": "all",
    "saveManualExecutions": true
  },
  "versionId": "2d2d2d2d-0002-0000-0000-0000000000ff",
  "meta": {
    "templateCreatedBy": "ooligo",
    "instanceId": "ooligo-pilot"
  },
  "id": "competitive-intel-tracker",
  "tags": [
    { "name": "revops" },
    { "name": "competitive-intel" },
    { "name": "sales-enablement" }
  ]
}