# Competitive intel tracker — n8n bundle
## What this flow does
A daily cron pulls a list of tracked competitor pages from Postgres, fetches each one with a real user-agent and a 4-second throttle, normalizes the HTML by stripping volatile noise (script blocks, build IDs, server-rendered timestamps, current-year strings), hashes the result, and compares it to the previously stored hash. Pages whose hash and length-delta both clear a materiality threshold get diffed by Claude Sonnet against the prior snapshot; the model is instructed to return the literal string `NO_CHANGE` when the diff is cosmetic. Material summaries land in a `competitor_change_log` table. A second cron fires Mondays at 14:30 and aggregates the last seven days of material changes into one Slack Block Kit message per competitor — silent weeks stay silent. A third trigger (a Slack slash command webhook) lets sales reps query the same change log on demand for a single competitor over the last 90 days.
## Import
1. In n8n, open the workflow list and click **Import from File** in the top-right kebab menu.
2. Select `competitive-intel-tracker-n8n.json`.
3. Confirm the workflow opens with 20 nodes across three triggers (the daily crawler, the weekly digest, and the on-demand webhook). The graph should read left-to-right with the digest below the crawler and the webhook below that.
4. Open **Settings** on the workflow and confirm `executionOrder: v1` and a sensible `timezone` (the bundle ships `Europe/London` — change it to your team's working timezone before activating; Cron expressions are interpreted in this zone).
5. Do **not** activate yet. Wire credentials and create the database tables first (next two sections).
## Credentials
The flow references three credential placeholders by name. Each placeholder must be replaced with a real n8n credential of the matching type before the workflow will execute. The fourth integration below, the Slack slash command, needs only a public webhook URL and no credential.
### `PLACEHOLDER_POSTGRES_CRED_ID` — Postgres (read/write)
Used by five nodes (`Pull Tracked Pages`, `Persist Change + Update Snapshot`, `Touch Snapshot (No Material Change)`, `Aggregate Last 7 Days Of Material Changes`, `Fetch On-Demand History`). Create an n8n **Postgres** credential pointing at the database that holds your tracked pages and change log. The bundle assumes two tables — create them with:
```sql
CREATE TABLE competitor_tracked_pages (
  page_id bigserial PRIMARY KEY,
  competitor_name text NOT NULL,
  page_type text NOT NULL, -- 'pricing' | 'blog' | 'hiring' | 'reviews' | 'docs'
  url text NOT NULL UNIQUE,
  active boolean NOT NULL DEFAULT true,
  last_content_hash text,
  last_content_text text,
  last_seen_at timestamptz
);

CREATE TABLE competitor_change_log (
  id bigserial PRIMARY KEY,
  page_id bigint REFERENCES competitor_tracked_pages(page_id) ON DELETE CASCADE,
  competitor_name text NOT NULL,
  page_type text NOT NULL,
  url text NOT NULL,
  content_hash text NOT NULL,
  summary text NOT NULL,
  is_material boolean NOT NULL,
  detected_at timestamptz NOT NULL DEFAULT now()
);

CREATE INDEX ON competitor_change_log (competitor_name, detected_at DESC);
CREATE INDEX ON competitor_change_log (detected_at DESC) WHERE is_material;
```
Seed `competitor_tracked_pages` with twenty to thirty rows before the first run. The recommended starter set per competitor: pricing page, two recent blog posts, careers/jobs index, docs landing page. Skip JS-heavy review sites (G2, Capterra, TrustRadius) unless you have a rendering service — the raw HTML they ship is mostly empty.
### `PLACEHOLDER_ANTHROPIC_CRED_ID` — Anthropic API key
Used by `Claude — Diff + Summarize`. Create an n8n **Header Auth** credential with header name `x-api-key` and value set to your Anthropic API key (find it at console.anthropic.com → API Keys). The flow uses `claude-sonnet-4-6` — change the model in the JSON if your account routes elsewhere. Token budget per run: roughly `(pages × ~3000 input tokens) + (material pages × ~200 output tokens)` — see the cost-reality section in the page body for absolute numbers.
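The budget formula works out to a small per-run cost. A throwaway estimator, with the per-million-token prices as assumptions to verify against Anthropic's current pricing page (the 10% material-page rate is also a guess):

```javascript
// Rough per-run token/cost estimate for the Claude diff step, following the
// formula above: (pages x ~3000 input) + (material pages x ~200 output).
// usdPerMInput/usdPerMOutput are assumed prices -- check the pricing page.
function estimateRun({ pages, materialRate = 0.1, inPerPage = 3000,
                       outPerMaterial = 200, usdPerMInput = 3, usdPerMOutput = 15 }) {
  const materialPages = Math.ceil(pages * materialRate);
  const inputTokens = pages * inPerPage;
  const outputTokens = materialPages * outPerMaterial;
  const usd = (inputTokens / 1e6) * usdPerMInput + (outputTokens / 1e6) * usdPerMOutput;
  return { inputTokens, outputTokens, usd: Number(usd.toFixed(4)) };
}
```

At a 25-page starter set this stays well under a dollar a day even if every page were diffed, which is why the materiality gate matters more for noise than for cost.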
### `PLACEHOLDER_SLACK_CRED_ID` — Slack bot token
Used by `Slack — Post Weekly Digest`. Create a Slack app at api.slack.com/apps, add the bot scopes `chat:write` and `chat:write.public` (the latter so the bot can post to channels it has not been explicitly invited to), install the app, and copy the **Bot User OAuth Token** (starts with `xoxb-`). Create an n8n **Header Auth** credential with header name `Authorization` and value `Bearer xoxb-...`. Update the channel name in the `Slack — Post Weekly Digest` node from `#competitive-intel` to whatever channel your sales team actually reads.
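For reference, a minimal sketch of the per-competitor Block Kit payload the digest node assembles. The change-row field names (`page_type`, `url`, `summary`) follow the `competitor_change_log` schema above, but the exact block layout in the bundle may differ:

```javascript
// Build one chat.postMessage payload per competitor from the week's
// material changes. Shape is illustrative, not the bundle's exact layout.
function digestBlocks(competitor, changes) {
  const blocks = [
    { type: "header", text: { type: "plain_text", text: `Weekly changes: ${competitor}` } },
  ];
  for (const c of changes) {
    blocks.push({
      type: "section",
      text: { type: "mrkdwn", text: `*${c.page_type}* <${c.url}|${c.url}>\n${c.summary}` },
    });
  }
  // Top-level `text` is the notification fallback Slack shows in previews.
  return { channel: "#competitive-intel", blocks, text: `${changes.length} change(s) for ${competitor}` };
}
```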
### Slash command (optional, no credential — webhook URL only)
The `On-Demand Webhook` node exposes a path at `/webhook/intel-on-demand`. To wire a Slack slash command to it: in your Slack app config, add a slash command (e.g. `/whatsnew`), set the request URL to your n8n public URL plus that path, and grant the `commands` scope. No n8n credential is needed because Slack POSTs to the webhook directly. If your n8n is not internet-reachable, either expose it via a tunnel or skip this trigger and run the on-demand query manually from the n8n editor.
## First-run verification
Run these in order. Each step proves a different branch of the flow.
1. **Insert one tracked page that you know changes daily** (a competitor's blog index works well). Verify with `SELECT * FROM competitor_tracked_pages;` that the row exists with `last_content_hash IS NULL`.
2. **Manually execute the `Daily Cron — 5am UTC` trigger** from the n8n editor. The first run should: fetch the page, compute a hash, *fail* the `Material Change?` IF (because there is no prior snapshot to compare — the `had-prior-snapshot` condition is false), and route to `Touch Snapshot (No Material Change)` which writes the initial hash. Confirm `competitor_tracked_pages.last_content_hash` is now populated and `competitor_change_log` is still empty.
3. **Manually execute the trigger a second time, immediately.** The hash should match (page didn't change in two minutes), the IF fails, no Claude call. This proves the cheap path.
4. **Edit the row to force a diff.** Run `UPDATE competitor_tracked_pages SET last_content_text = 'lorem ipsum placeholder', last_content_hash = 'force-diff' WHERE page_id = <id>;` and re-execute the trigger. The IF should now pass, Claude should be called, and you should see a row appear in `competitor_change_log`. Open the row and read the summary — it should describe the page in two sentences. If it returned `NO_CHANGE` despite the forced diff, lower the materiality threshold or check the truncation in the prompt.
5. **Test the no-op materiality filter.** Insert a row pointing at a page that has trivial dynamic content (e.g. a homepage with rotating testimonials). After the first snapshot is captured, re-run the cron. The hash will likely differ but the length delta should be small — confirm it routes to the false branch and does not spend a Claude call.
6. **Test the weekly digest.** Manually execute `Weekly Digest Cron — Mon 14:30`. If `competitor_change_log` has at least one `is_material = true` row from the last 7 days, you should see a Slack message land in the configured channel. If the table is empty for the window, no message fires — that is correct behavior, not a bug.
7. **Test the on-demand webhook.** From a terminal, `curl -X POST https://<your-n8n>/webhook/intel-on-demand -d 'text=acme'` (or trigger your wired Slack slash command). Expect a JSON response with up to 10 of the most recent material changes for any competitor whose name contains `acme`. With an empty change log, expect the "No material changes recorded" fallback.
8. **Activate the workflow** only after all six checks above (steps 2-7) behaved as described.