claude-skill

ClayとClaudeで公開シグナルからICPフィットのアカウントリストを構築する

Difficulty

中級

Setup time

45min

For

revops · sdr-leader

RevOps

Stack

ほとんどのICPの演習は形容詞のスープにドリフトします — 「フィンテックのミッドマーケットSaaS、成長マインドセット、セキュリティ意識が高い」。その種のブリーフから構築されたリストは、フィルターが厳しすぎて全員がすでに持っている30件の明白なアカウントしか得られないか（フィルターが緩すぎて4,000件のロゴになりAEがファイルを無視する）のどちらかに外れます。このページが提供するバンドルはその逆を行います：ICPを説明する代わりに、10〜20件のクローズウォンアカウントを指定してClaudeに共通点を逆エンジニアリングさせ、次にClayにそのシグネチャーをフィルターとエンリッチメントに変換させます。

アーティファクトはClaude スキル — icp-list-builder — で、シードからリストへのループをエンドツーエンドで実行し、ランク付けされた草案をClayテーブルに書き込みます。レビュアーにアウトバウンドに直接プッシュするのではなく、Markdownレポートと並べてClayテーブルを渡すよう設計されています。

使う場面

認識可能な形状を共有する10〜20件のクローズウォンアカウントを指定でき、次の100〜500件の候補がそれらに似てほしい場合にこのスキルを使います。実際の最も一般的なトリガー：

四半期テリトリー更新 — AEはリージョンごとに、現在の公開シグナルに照らして新たにスコアリングされた草案リストが必要
新しいウェッジプロダクトまたは新しい価格ティアがリリースされ、「はいと言った人々」のシードが小さいが実在する
アウトバウンドプログラムが明白なICPを消化し、チームが創業者が最初に想像したICPではなく何がクローズしたかで情報化された第2波が必要

スキルはProプラン以上のClayアカウントを前提としています。Pro以下ではエンリッチメントサーフェスが狭すぎてルックアライクループが有用ではなく、スプレッドシートとLinkedIn検索が行うことと大体同じワークフローに支払うことになります。

使わない場面

Tier-1指名アカウントABM。 25〜50件の戦略的アカウントの手作りリストには、スキルがモデル化できない顧客サクセスとエグゼクティブの入力が必要です。Tier-2とTier-3のアウトバウンドにこれを使ってください；ルックアライクループの分散はTier-1選択には高すぎます。
アウトバウンドシーケンスへの自動ロード。 出力はランク付けされた草案です。スキルはAEまたはSDRが送信前に見なければならないようにClayテーブルに書き込み、Markdownレポートを意図的に生成します。シーケンストリガーに出力を配線した場合、これを間違って使っています。
CRMにすでにあるアカウントの再スコアリング。 そのためにはCRMネイティブのインテントツールを使ってください。このスキルは純新規候補者を書き込みます；既知のものを再ランク付けしません。
保護クラスプロキシーでのスコアリング。 創業者の性別、創業者の民族性、出身大学、名前の起源 — これらはルーブリックに属しません。参照ルーブリックファイルは許可されているディメンションを列挙します；他を追加しないでください。
8件未満のシードリスト。 スキルは有効なシードが8件未満の場合の続行を拒否します。シグネチャーの抽出はより小さな基数では信頼できないからです。5件の勝利しかない場合は手動でリストを構築し、より多くできた時に戻ってください。

セットアップ

バンドルは apps/web/public/artifacts/icp-account-list-builder-clay/ にあり、以下を含みます：

SKILL.md — ループをオーケストレーションするClaude スキル定義
references/1-icp-rubric-template.md — チーム向けに記入するファームグラフィックゲートとシグナル重み
references/2-signal-source-matrix.md — どの公開ソースがプライマリー対コロボレーティングとしてカウントされ、どれが明示的に不可かを示す
references/3-exclusion-criteria.md — 出力に決して表示されてはならない禁止ドメイン、親会社、ファームグラフィックパターン

セットアップは最初は約45分、その後の更新ごとに5分です。

スキルをインストールする。 SKILL.md をClaude スキルディレクトリにドロップします（またはClaude Codeで /skill load を使ってロードします）。references/1-icp-rubric-template.md を実際のファームグラフィックゲート、テクノグラフィックシグナル、シグナル重みで記入します。references/3-exclusion-criteria.md を顧客、アクティブオポチュニティ、過去180日のクローズドロストアカウントの新鮮なCRMエクスポートから記入します。
シードリストを準備する。 company_name、domain、why_we_won（2文）を含むCSV。複数のAE、セグメント、クローズ月をまたいでシードを引き出してください — スキルはシードの60%以上が単一のAE、バーティカル、またはクローズ月を共有している場合に警告します。なぜなら、それは1人の担当者のテリトリーのように見えるリストを生成するからです。
Clayを接続する。 スキルはAPI経由でClayワークスペースを読み取ります。ワークスペースIDとAPIキーをスキルのローカル設定に設定します（これらをバンドルにコミットしないでください）。
最初の実行。 スキルをシードCSVと target_list_size 100で呼び出します。最初の実行はファームグラフィックユニバースがフィルタリングされていないため遅くなります；保存されたClayビューに対する後続の実行は速くなります。
Markdownレポートと並べてClayテーブルをレビューする。 レポートはシードシグネチャー、シグナルタイプの内訳、除外フラグカウントを説明します。ClayテーブルはAEの作業サーフェスです。

スキルが実際に行うこと

6ステップ、順に実行。順序が重要です — ファームグラフィックフィルターの前にLLMスコアリングを実行すると、クレジットを無駄にし、明白なミスフィットを引き込みます。

入力のロードと検証。 why_we_won が欠落しているシードを削除し、有効なシードが8件未満の場合は続行を拒否します。
シードシグネチャーを抽出する。 シードとICPルーブリックをClaudeに送信し、構造化されたシグネチャーを返します：業界コード、従業員数バンド、収益バンド、地理、資金調達ステージ、テクノグラフィックマーカー、インテントマーカー。why_we_won ノートはClayの列ではないシグナルをエンコードします（「セキュリティとコンプライアンスページを持っていた」）；それが決定論的フィルターの前にLLMパスが必要な理由です。
ClayでファームグラフィックフィルターをDeterministicallyに適用する。 シグネチャーのハードゲートをClayフィルターに変換し、まず実行してユニバースを約500〜3,000件の候補に絞ります。この段階で除外リストのものをすべて削除します。スコアリングの前にこれを行うと、ほとんどの却下が明白なファームグラフィックミスフィットで推論が不要なため、LLMコストが約30〜100倍削減されます。
インテントシグナルをエンリッチして裏付ける。 残りの各候補について、Clayにテクスタック、採用デルタ、過去90日間のアナウンスメントをエンリッチするよう依頼し、次にClaudeに引用付きのシグナルごとのマッチスコアを依頼します。単一のインテントシグナルにはプライマリーな裏付けシグナルが必要です — LinkedInの仕事の変更とプレスリリースなど。単一ソースのインテント主張は「裏付けなし」という理由で0スコアとなり、推測されません。
ランク付け、重複排除、Clayにバッチ書き込み。 合計スコアでソートし、親会社列で重複を排除し、上位 target_list_size 行を1回のバッチで新しいClayテーブルに書き込みます。行ごとの書き込みはクレジットを消費し、中断時に不整合な状態を残します；バッチ書き込みはそうではありません。
出力レポートを生成する。 上部にシードシグネチャー、ランク付きの候補テーブル、シグナルタイプの内訳、除外フラグカウント、実行メタデータを含むMarkdownドキュメント。レビュアーはClayテーブルを操作する前にこれを読みます。

コストの実態

主要なコストレバーはClayエンリッチメントクレジットとClaudeトークンです。target_list_size 100、フィルタリングされたユニバース1,800〜2,200件の候補に対する実行あたりの概算予算：

Claudeトークン（シグネチャー抽出＋候補ごとのスコアリング）。 Claude Opusでの実行あたり約500K〜700Kの入力トークンと80K〜120Kの出力トークン。Opus 4.7の定価で実行あたり約9〜14ドル。Claude Sonnetでの同じループは実行あたり約1.50〜2.50ドルで、シグネチャー抽出ステップに測定可能な品質の低下があります（シードパターンの推論は大きなモデルの恩恵を受けます）。推奨：シグネチャーステップにOpus、候補ごとのスコアリングステップにSonnet。
Clayクレジット。 2,000件の候補がエンリッチメントステップに入ることを前提に、100行の出力に対して実行あたり約800〜1,000件のエンリッチメントクレジット。Clay Proの価格で実行あたり約24〜30ドルのクレジットコスト；Explorerティアではクレジットが少なく target_list_size を50に落とすかより厳しくプレフィルタリングするべきです。
スケールで。 リージョンごとに週次でこれを実行するチーム（例：4つのAEポッド）は月に約1,300〜2,000ドルに着地します（Claudeが150〜200ドル、残りはClayクレジット）。これは単一のZoomInfo SalesOSシートのコストをはるかに下回り、より新鮮なリストを生成しますが、ルーブリックと除外ファイルを最新の状態に保つ必要があります — 古い入力はコストが間違う場所です（ステップ1を通過すべきでないアカウントをエンリッチするためにクレジットを支払う）。

支配的なコスト爆発パターンは、ルーズなファームグラフィックゲートでスキルを繰り返し呼び出し、候補ユニバースが膨張するのを見ることです。ガードはステップ3にあります：Clayが5,000件以上の候補を返した場合、スキルはセット全体をエンリッチするのではなく、1つのバンドを絞って再実行します。

成功指標

監視する指標は、AEが手動の修正なしに草案リストを作業セットに受け入れる率です。目標：70%以上受け入れ（つまり100件のランク付きの候補のうち、少なくとも70件が削除または再ラベルなしに誰かのアウトバウンドキューに入る）。受け入れが50%未満の場合、ルーブリックが間違っているか、シードリストにバイアスがあるか、除外ファイルが古いかのどれかです — その順序で診断してください。

二次：ベースラインのアウトバウンドリストと比較した受け入れられたアカウントでのミーティング予約率。スキルはクレジットコストを稼いでいます、そのレートがベースラインと少なくとも同等の場合；価値の追加はリスト構築時間の削減であり、必ずしも即時のコンバージョン向上ではありません。

代替手段との比較

LinkedIn Sales Navigator＋手動フィルタリング対比。 Sales NavはTier-1の手作りリストと個人プロスペクティングの候補には適したツールです。週次で100件のランク付きルックアライクを生成するには間違ったツールです — 保存された検索フィルターはインテントシグナルをキャプチャせず、リストごとの手動フィルター時間はSDRの週の約3〜5時間です。このスキルはその3〜5時間をランク付きの草案の5分間のレビューに置き換えます。
ZoomInfo SalesOS Intent対比。 SalesOSは成熟しており、エンタープライズアカウントに良いインテントデータを持ち、エンタープライズモーションと年間35,000〜80,000ドルのシートの予算がある場合の正しい答えです。より小さなチームのミッドマーケットモーションには、このスキルとClay Proが信号の約80%をコストの5〜10%で提供します。ルーブリックと除外リストを所有し、ベンダーのスコアリングに依存しないというトレードオフがあります。
Apollo Living Data対比。 Apolloのルックアライク機能はこのスキルに最も近い形状で、1クリック対45分のセットアップです。Apolloのルックアライクスコアリングは不透明です（シグナルの重みを見ることもオーバーライドもできません）、出力はファームグラフィックの類似性に過剰インデックスする傾向があります。このスキルはルーブリックと重みを検査可能にし、シグナルごとの裏付けを強制します；コストはセットアップ時間と参照ファイルを最新の状態に保つ要件です。
何もしない（現状、AEがリストを構築）対比。 AEが構築したリストは既知のアカウントを網羅し、AEが聞いたことのないルックアライクに弱いです。このスキルはその逆です — 指名された戦略的アカウントには弱く、次の100件のルックアライクを表面化するのに優れています。正直なパターンは両方を並行して実行することです：AEが指名リストを所有し、スキルがルックアライクの草案を生成します。

注意点

公開ソースからのジャンクのファームグラフィックデータ。 アグリゲーターの従業員数と収益数は現実より6〜18ヶ月遅れており、成長段階の会社では30〜50%間違っています。ガード：スキルは単一のファームグラフィックソースを方向性として扱い、ハードゲートを適用する前に2つの独立したソース間の合意を要求します。競合は出力レポートで表面化されます（「LinkedInは120と言い、BuiltWithは380と言う — 手動レビューのためにフラグ」）。静かに解決されるのではなく。
インテントシグナルのノイズ。 LinkedInからのみスクレイピングされた「営業VP採用」シグナルは、昇進、コントラクターの役割、タイトルの誇張を純新規採用として誤分類します。ガード：ステップ4はいかなるインテントシグナルも0以上のスコアになる前にプライマリーな裏付けシグナル（プレスリリース、第2角度のLinkedIn証拠）を要求します；裏付けのない主張は記録された理由で0スコアになります。
古いデータベースからのリストの汚染。 一部のClayエンリッチメントソースはゾンビ企業を運ぶ — 買収、合併、または廃業したもの — がフィルターを通過しますが買えません。ガード：ホームページのチェックで4xx/5xxを返す候補、過去90日間にLinkedInのアクティビティがない候補、または親会社フィールドが除外ファイルの既知の買収者に解決する候補を削除します。削除カウントは実行メタデータに表示されるため、オペレーターはスパイクを発見できます（除外ファイルまたはアグリゲーターソースが劣化しているサイン）。
シードバイアス。 1人のAEの1つのバーティカルからの10件の勝利のシードリストは、そのAEのテリトリーのように見えるリストを生成します。ガード：シードの60%以上が同じAE、バーティカル、またはクローズ月を共有している場合にスキルが警告し、続行前にシードを広げるようオペレーターに依頼します。
フィルターの過適合。 14件のシードのみにマッチするほど厳しいシグネチャーは0〜30件の候補を生成し、精度があるように感じますが役に立ちません。ガード：ステップ3が200件未満の候補を返した場合、スキルは不十分なユニバースで進めるのではなく、従業員数と収益バンドを1段階緩めて再実行します。
古い除外ファイル。 顧客リストエクスポートが2ヶ月前のものであれば、顧客がすり抜けてアウトバウンドに入る可能性があります。ガード：スキルは除外ファイルの last_refreshed が14日以上前の場合に出力レポートで警告します。

スタック

Clay（Pro以上） — エンリッチメントサブストレート、ファームグラフィックフィルター、宛先テーブル。Proがルックアライクループの現実的なフロアです。
Claude（シグネチャー抽出にOpus 4.7、候補ごとのスコアリングにSonnet） — シード why_we_won ノートに対するシグネチャー推論と引用付きのシグナルごとの裏付けスコアリング。2つのステップにわたってモデル選択を分割することがコスト品質のトレードオフが最もうまく着地するところです。
CRM（何でも） — シードリスト、顧客リスト、オポチュニティリスト、除外ファイルをフィードするクローズドロストリストのソース。スキルはCRMを直接読まない；オペレーターがCSVをエクスポートします。
アウトバウンド宛先（Outreach、Salesloft、Apollo、カスタム） — AEの受け入れ後にレビュー済みリストが届く場所。スキルは設計上Clayテーブルで止まります。

GitHubでこのページを編集

Files in this artifact

Download all (.zip)

---
name: icp-list-builder
description: Build a ranked target-account list from public signals using a closed-won seed pattern. Input is a seed of 10-20 closed-won accounts plus an ICP rubric; output is a ranked list of 100-500 lookalike candidates with per-account evidence, ready to write to a Clay table for outbound or AE territory routing.
---

# ICP list builder (Clay + Claude)

## When to invoke

Invoke this skill when a RevOps or SDR leader needs a fresh target-account list grounded in what already worked — not in a generic "mid-market SaaS in fintech" description. The skill takes closed-won accounts as the ground truth, extracts the firmographic + technographic + intent signature shared across them, then proposes Clay filters that produce lookalikes plus a ranked candidate list with evidence.

Typical triggers:

- Quarterly territory refresh — AEs need a new draft list per region
- New product or new wedge launch — the seed list is small (10-20 wins) and you want the next 100 to look like them
- Outbound program needs more accounts after the obvious ICP has been worked

Do NOT invoke this skill for:

- Auto-loading the output into outbound sequences without rep review. The output is a ranked draft, not a send list. AE or SDR review is mandatory.
- Scoring on protected-class proxies (founder gender, ethnicity, school). Rubric weighting must be on firmographic and intent signals only.
- Account lists for ABM tier-1 named accounts — that work is hand-built and this skill's lookalike loop has too much variance for tier-1 selection.
- Buying-signal scoring inside an existing CRM record (use a CRM-native intent tool — this skill writes new candidates, it does not re-score known ones).

## Inputs

Required:

- `seed_accounts` — CSV with columns `company_name`, `domain`, `why_we_won` (two sentences). 10-20 rows. Rows with missing `why_we_won` are dropped with a warning rather than silently skipped.
- `icp_rubric` — path to `references/1-icp-rubric-template.md` filled in for your team. Defines hard firmographic gates and signal-weight ordering.
- `target_list_size` — integer, the number of ranked candidates to return. Default 100. Hard cap 500 (above that, Clay credits and signal noise both blow up).

Optional:

- `signal_sources` — path to `references/2-signal-source-matrix.md` to override which public sources Clay/Claude should query and which it should ignore. Defaults: company website, LinkedIn company page, BuiltWith, public hiring pages, last-90-day press/funding announcements.
- `exclusion_list` — path to `references/3-exclusion-criteria.md`. Domains, parent companies, or firmographic patterns that must never appear in the output (existing customers, active opportunities, do-not-contact, known losses inside last 6 months).
- `territory_filter` — geography or vertical filter applied after scoring, for splitting the output by AE territory.

## Reference files

Always read the following from `references/` before generating the list. They contain the team's actual ICP definition, source preferences, and exclusions. Without them, the list is generic and may write banned domains.

- `references/1-icp-rubric-template.md` — hard firmographic gates plus the signal-weight order used for scoring. Replace template contents with your real rubric before first run.
- `references/2-signal-source-matrix.md` — which public sources count as primary vs corroborating, and which are explicitly disallowed (low-quality scraped databases, stale aggregators). Replace with the team's source policy.
- `references/3-exclusion-criteria.md` — banned domains, parent companies, firmographic patterns to drop. Replace with the team's actual exclusion list before first run.

## Method

Run these six steps in order. The deterministic firmographic filter MUST run before any LLM scoring — running scoring across the full Clay universe wastes credits and pulls in obvious misfits.

### 1. Load and validate inputs

Read `seed_accounts`, `icp_rubric`, and `exclusion_list`. Drop seed rows with missing `why_we_won` and warn. Refuse to proceed if fewer than 8 valid seed rows remain — the signature extraction is unreliable below that floor.

### 2. Extract the seed signature

Send the validated seeds to Claude with the ICP rubric as context. Ask for a structured signature: industry codes, headcount band, revenue band (if known), geography, funding stage, technographic markers (specific tools), intent markers (hiring patterns, page additions, public announcements), and disqualifiers observed.

Why Claude here, not a SQL query: the `why_we_won` notes encode tacit signals ("they had a security and compliance page" — not a Clay column) that firmographic queries miss.

### 3. Apply deterministic firmographic filter in Clay

Translate the signature's industry, headcount, revenue, geography, and funding gates into Clay filters. Run them first to narrow the universe to ~500-3000 candidates before any LLM cost is spent. Drop anything in `exclusion_list` at this stage.

Why deterministic first, LLM second: a single LLM scoring pass at full Clay universe scale costs roughly 30-100x more than a filter pass, and 80-90% of the rejections are obvious firmographic misfits that need no reasoning.

### 4. Enrich and corroborate intent signals

For each remaining candidate, ask Clay to enrich tech stack, hiring page deltas, and last-90-day announcements. Pass each candidate to Claude with the seed signature and ask for a per-signal match score (0-3) plus a corroborating citation URL.

Constraint: any single intent signal (e.g. "hired a VP of Revenue") requires a primary corroborating signal (LinkedIn job change visible from a second angle, or a press release citing the hire). Single-source intent claims are scored 0 with reason "uncorroborated" rather than guessed. This is the guard against intent-signal noise.

### 5. Rank, dedupe, and batch-write to Clay

Sort by total signal-match score, descending. Dedupe by domain (parent companies via Clay's parent-company column where present — same parent counts once). Write the top `target_list_size` rows to a new Clay table, one batch write at the end rather than per-row.

Why batch + dedupe before write: Clay enrichment is metered per row, and duplicate parent writes burn credits without adding accounts. A single batch write also keeps the table consistent if the run is interrupted.

### 6. Produce the output report

Generate the markdown output (format below). Include a top section that explains the seed signature so the AE/SDR reviewer can sanity-check the shape of the list before working it.

## Output format

Output is a single markdown document, written to disk and surfaced to the caller. Literal example shape:

```markdown
# ICP list — {date}

## Seed signature

- Industry: B2B SaaS — DevTools and Observability (NAICS 5415)
- Headcount: 80-350
- Revenue (where known): $10M-$60M ARR
- Geo: US + Canada, EMEA-EN
- Funding: Series B and C
- Technographic markers: Stripe, Datadog OR New Relic, Notion (corroborator)
- Intent markers: hired VP of Revenue or Head of Sales in last 9 months, or
shipped a new public security/compliance page in last 6 months
- Disqualifiers observed: government contractor, parent in F500

## Ranked candidates (top 100 of 217 scored)

| Rank | Company | Domain | Score | Top signal | Exclusions flagged |
|---|---|---|---|---|---|
| 1 | Acme Observability | acme.io | 14/15 | Hired VP Rev (LinkedIn + Sept 2026 press release) | none |
| 2 | Beacon Logs | beaconlogs.com | 13/15 | Stripe + Datadog + Series B Aug 2026 | none |
| 3 | Ledger Trace | ledgertrace.dev | 12/15 | New SOC 2 page Oct 2026 | EU-only — territory split required |
| ... | ... | ... | ... | ... | ... |

## Signal-type breakdown across the top 100

- Hiring-signal-driven: 38
- Technographic-driven: 29
- Funding/announcement-driven: 22
- Multi-signal (3+): 11

## Exclusion-flag explanations

- 14 candidates flagged "EU-only — territory split required": these passed
ICP but fall outside US/CA territories and should route to the EMEA pod.
- 3 candidates flagged "parent in F500": these were dropped from the ranked
list per `exclusion_list`. Listed for audit only.
- 9 candidates flagged "uncorroborated intent": dropped per Step 4 guard.

## Run metadata

- Seeds used: 14 (2 dropped for missing why_we_won)
- Clay universe after firmographic filter: 1,847
- Candidates scored by Claude: 217
- Final ranked list: 100
- Clay credits consumed: ~870 (enrichment) + 217 (scoring lookups)
```

## Watch-outs

- **Junk firmographic data from public sources.** Aggregator headcount and revenue numbers lag reality by 6-18 months and are wrong by 30-50% on growth-stage companies. Guard: treat any single firmographic source as directional, require headcount or revenue agreement across two independent sources before applying a hard gate, and surface conflicts in the output ("LinkedIn says 120, BuiltWith says 380 — flagged for manual review").
- **Intent-signal noise.** A "hired a VP of Sales" signal scraped from LinkedIn alone misclassifies promotions, contract roles, and job-title inflation as net-new hires. Guard: Step 4 requires a primary corroborating signal (press release, second-angle LinkedIn evidence) before any intent signal scores above 0.
- **List poisoning from outdated databases.** Some Clay enrichment sources carry zombie companies (acquired, merged, defunct) that pass filters but cannot buy. Guard: drop any candidate whose website returns a 4xx/5xx, has no LinkedIn activity in last 90 days, or whose parent-company field resolves to a known acquirer. These are reported in the run metadata, not silently dropped.
- **Seed bias.** A seed list of 10 wins from one AE in one vertical produces a list that looks like that AE's territory, not the company's ICP. Guard: the skill warns if more than 60% of seeds share the same primary AE, vertical, or close-month, and asks the operator to broaden the seed before proceeding.
- **Filter over-fit.** A signature so tight it matches only the 14 seeds produces 0-30 candidates and feels precise but is useless. Guard: if the Clay firmographic-filter step returns fewer than 200 candidates, the skill loosens the headcount and revenue bands by one notch and re-runs rather than proceeding.
- **AE review is non-optional.** Skill output is a ranked draft. The output format is markdown (not a Clay-to-sequence webhook) deliberately to force a human review step before any send.

# ICP rubric — TEMPLATE

> Replace this template's contents with your team's actual ICP rubric
> before the first run of `icp-list-builder`. The skill reads this file to
> set hard firmographic gates and signal-weight ordering. Without your
> real values, the candidate list will be generic.

## Hard firmographic gates

These are AND-gates — a candidate failing any single gate is dropped before any LLM scoring runs. Keep this list short (5-8 dimensions max). Long gate lists shrink the candidate universe below the 200-row floor and force the skill to loosen filters anyway.

| Dimension | In-ICP | Stretch (allowed but downweighted) | Out (dropped pre-scoring) |
|---|---|---|---|
| Industry (NAICS or custom tag) | {list} | {list} | {list} |
| Headcount | {range, e.g. 80-500} | {range, e.g. 50-79 or 501-800} | {below/above} |
| Revenue (where known) | {range} | {range} | {range} |
| Geography | {regions, e.g. US/CA, EMEA-EN} | {regions, e.g. APAC-EN} | {regions, e.g. China, sanctioned countries} |
| Funding stage | {stages, e.g. Series B-D} | {stages, e.g. Series A late or Series E} | {stages, e.g. pre-seed, IPO+} |
| Business model | {e.g. B2B SaaS subscription} | {e.g. usage-based} | {e.g. consulting, services} |

## Technographic signals

Tools that signal a fit (we win when these are present). Each entry should name a specific product, not a category.

- {Tool 1 — e.g. "Stripe (billing) — strong"}
- {Tool 2 — e.g. "Datadog or New Relic — strong"}
- {Tool 3 — e.g. "Notion (corroborator only, not a primary signal)"}

Tools that signal misfit. The skill downweights candidates with these.

- {Tool A — e.g. "Salesforce + manual CPQ — competing internal build"}
- {Tool B — e.g. "Legacy ERP with no API surface"}

## Intent signals

The signals the skill looks for in Step 4. Each must specify the primary source AND the required corroborating source. Single-source intent counts as zero per the skill's guard.

| Signal | Primary source | Required corroborator |
|---|---|---|
| Hired VP Rev / Head of Sales (last 9 months) | LinkedIn job change | Press release, company blog, or 2nd-angle LinkedIn evidence (e.g. announcement post) |
| New compliance page (SOC 2, ISO 27001, HIPAA) | Company website diff | Trust center URL or vendor listing |
| Funding round (Series A+) last 6 months | Crunchbase / company press | TechCrunch, PitchBook, or filed 8-K |
| Active hiring 5+ GTM roles | Public careers page | LinkedIn jobs cross-reference |

## Signal weights

The skill ranks candidates on a 0-15 scale across five signal categories. Set the per-category weights here. Total must equal 15.

- Industry + business-model match: weight {N, e.g. 4}
- Headcount + revenue match: weight {N, e.g. 3}
- Technographic match: weight {N, e.g. 3}
- Intent signal (corroborated): weight {N, e.g. 4}
- Geography + funding stage match: weight {N, e.g. 1}

## Disqualifiers (skill flags prominently if found)

Single signals that drop a candidate regardless of other fit. Keep narrow.

- {Disqualifier 1 — e.g. "Parent company in Fortune 500 (procurement model wrong for our motion)"}
- {Disqualifier 2 — e.g. "Government contractor (compliance posture we cannot meet)"}
- {Disqualifier 3 — e.g. "Active acquisition in last 90 days (buying frozen)"}

## Last edited

{YYYY-MM-DD} — update on every material change so the skill can warn when the rubric is stale (more than 6 months old triggers a stale-rubric notice in the output report).

# Signal source matrix — TEMPLATE

> Replace this template's contents with your team's actual source policy.
> The `icp-list-builder` skill reads this file to decide which public sources
> count as primary vs corroborating, and which to skip entirely.

## Why this file exists

The skill's intent-signal scoring (method Step 4) requires a primary source plus a corroborating source for any signal to count above 0. This matrix is where you encode which source plays which role for your team. Without it the skill defaults to a generic ordering that under-weights signals you trust and over-weights ones you've found unreliable.

## Source roles

For each public source the skill may query, assign one role:

- **primary** — a signal originating here can anchor a score (still needs one corroborator)
- **corroborator** — only counts as the second source confirming a primary signal; cannot anchor on its own
- **skip** — never query this source; results are too noisy or stale to trust

## Source × signal-type matrix

| Source | Firmographic | Technographic | Hiring | Funding | Compliance/security | Notes |
|---|---|---|---|---|---|---|
| Company website (homepage, About, careers, trust center) | corroborator | corroborator | primary | corroborator | primary | Source of truth for own claims; queried first |
| LinkedIn company page | primary | skip | primary | corroborator | skip | Strongest for headcount + role changes; rate-limited |
| LinkedIn jobs | skip | skip | corroborator | skip | skip | Volume signal only; titles are noisy |
| BuiltWith | skip | primary | skip | skip | corroborator | Strong for tech stack, weak for everything else |
| Crunchbase | corroborator | skip | skip | primary | skip | Funding date + amount only; org charts are stale |
| Public press releases (last 90 days) | skip | skip | corroborator | corroborator | corroborator | Date-stamped; cite URL + date |
| TechCrunch / industry press | skip | skip | corroborator | corroborator | skip | Use for funding corroboration |
| G2 / Capterra reviews | skip | corroborator | skip | skip | skip | Reviewer-self-reported; treat as weak signal only |
| Wayback Machine | corroborator | corroborator | corroborator | skip | corroborator | Use to date page additions (compliance pages, jobs page changes) |
| Generic data aggregators (ZoomInfo, Apollo passive scrape) | skip | skip | skip | skip | skip | High noise; use a paid product directly if needed, do not let the skill scrape |

## Sources explicitly disallowed

The skill must never query these. Add domains here as you discover sources that have produced bad data for your team.

- {Domain 1 — e.g. "scraped-data-aggregator-x.example"}
- {Domain 2}

## Refresh windows

How fresh a signal must be to count. Anything older than the window is dropped to corroborator-only or to 0.

| Signal type | Max age to count as primary | Max age to count as corroborator |
|---|---|---|
| Hiring announcement | 9 months | 18 months |
| Funding round | 6 months | 12 months |
| Compliance page addition | 6 months | 24 months |
| Tech-stack signal | 6 months (BuiltWith last-seen) | 12 months |
| Headcount snapshot | 90 days | 12 months |

## Last edited

{YYYY-MM-DD}

# Exclusion criteria — TEMPLATE

> Replace this template's contents with your team's actual exclusion list
> before the first run of `icp-list-builder`. The skill reads this file in
> method Step 1 and Step 3, and refuses to write any matched domain to the
> output Clay table.

## Why this file matters

A list builder that writes existing customers, active opportunities, or known-loss accounts back into outbound is worse than no list — it burns trust with AEs, with the prospect (who gets contacted as a stranger), and with CS (who finds out when the customer escalates). This file is the backstop. The skill treats it as a hard filter, not a downweighting signal.

## Banned domains

Exact-match domains that must never appear in the output. The skill matches on root domain and known parent-company domains.

```
# Existing customers
{customer1.com}
{customer2.com}

# Active opportunities (export from CRM monthly and paste here)
{opp1.com}
{opp2.com}

# Closed-lost in last 6 months (do not re-engage from outbound; route to AE for hand-touch)
{lost1.com}
{lost2.com}

# Do-not-contact requests honored
{dnc1.com}
{dnc2.com}

# Internal / partners / investors (never outbound)
{partner1.com}
{investor1.com}
```

## Banned parent companies

If the skill's enrichment step resolves a candidate to a parent in this list, the candidate is dropped regardless of root-domain match.

- {Parent 1 — e.g. "BigCo Inc — all subsidiaries flagged"}
- {Parent 2}

## Firmographic exclusion patterns

Patterns broader than a single domain. The skill applies these in Step 3 after firmographic enrichment.

- {Pattern 1 — e.g. "Any company tagged as government contractor by NAICS prefix 5417"}
- {Pattern 2 — e.g. "Any company headquartered in {sanctioned-country-list}"}
- {Pattern 3 — e.g. "Any company with active M&A announcement in last 90 days"}

## Audit posture

Excluded candidates are not silently dropped. The skill's run metadata section reports counts per exclusion category. This lets RevOps verify the exclusion file is current and catches the case where the customer list export was forgotten and a customer slipped through.

## Refresh cadence

This file is regenerated from CRM exports on a fixed cadence. Stale exclusion files are the most common source of bad list output.

- Customers: refresh weekly (every Monday)
- Active opportunities: refresh weekly (every Monday)
- Closed-lost: refresh monthly (first of month)
- DNC: refresh on every request received
- Partners / investors: refresh quarterly

## Last refreshed

{YYYY-MM-DD} — exports as of this date. The skill warns in its output report if this date is more than 14 days old.