n8n-flow

Evidence collection orchestration for eDiscovery with n8n

Difficulty

Advanced

Setup time

180min

For

legal-ops · ediscovery-lead · in-house-counsel

Legal Ops

Stack

An n8n flow that orchestrates the collection phase of eDiscovery (the EDRM "Collection" stage). It pulls custodian list data from the company's HRIS, generates per-custodian collection requests against the company's data sources (Google Workspace, Microsoft 365, Slack, file shares, custom SaaS), tracks collection completion and chain of custody, and forwards the collected data to the Relativity workspace (or Everlaw / Logikcull) for processing. Every step writes to an immutable audit log that counsel uses to defend the adequacy of the collection. It replaces the legal-ops admin's manual, spreadsheet-and-screenshot collection with a deterministic flow.

When to use

Companies with regular eDiscovery, typically those with active litigation portfolios where collection happens several times a year.
The custodian count per matter is large enough that manual collection is operationally infeasible (typically >5 custodians per matter).
The company has IT engineering capacity to wire the connector layer (Google Workspace Vault, M365 eDiscovery, Slack Discovery API, and so on). The flow is the orchestration; the connectors are per system.
Counsel signs off on the collection scope per custodian; the flow executes against the approved scope.

When NOT to use

Single-custodian collections. Manual is fine; the flow's setup cost (180 minutes plus connector wiring) does not pay off.
As a replacement for chain-of-custody documentation expertise. The flow generates audit records; the eDiscovery lead validates that the records meet the jurisdiction's chain-of-custody standard. Requirements differ between jurisdictions.
To define the collection scope automatically. Counsel defines the scope per matter; the flow executes against the scope and does not author it.
The company's first matters, before a baseline collection procedure exists. The flow encodes a procedure; if there is no procedure to encode, define one first.

Setup

Import the flow. Import apps/web/public/artifacts/evidence-collection-ediscovery-n8n/evidence-collection-ediscovery-n8n.json into your n8n instance.
Wire credentials. Per source: Google Workspace (Vault API; service account with delegated authority), Microsoft 365 (Compliance Center API; per-tenant app registration), Slack (Discovery API, available only on Enterprise Grid), HRIS (custodian source). Plus Relativity / Everlaw / Logikcull (the eDiscovery platform) and Postgres (audit log).
Author the per-source collection scope template. For each data source, document which scopes are collectable (date range, search terms, custodian-specific filters), which per-source rate limits apply, and what output format to expect.
Configure the chain-of-custody template. Per matter and per custodian: who collected (service account name + human reviewer), when, what was collected, and the hash of the collection at completion. The template is in _README.md.
Set up the eDiscovery platform integration. Relativity Processing API, or the equivalent for Everlaw / Logikcull. The flow uploads into a per-matter workspace; the processing pipeline (deduplication, OCR, and so on) runs in the platform.
Dry-run on a closed matter. Repeat the collection for a matter that closed last quarter. Confirm that the collected volume matches what was originally produced and that the chain-of-custody records match what counsel certified.

What the flow does

Eight nodes. Per-custodian, per-source orchestration with chain of custody at every step.

Collection Request Trigger. Webhook from the legal-ops platform, fired when counsel marks the collection scope as approved.
Load Custodians + Scope. Pulls the custodian list and the per-custodian, per-source scope from the matter's collection plan.
Per-Source Dispatch. Fans out one branch per data source per custodian. This is the most complex part of the flow, because every source has its own API and its own rate-limit constraints.
Source: Google Workspace Vault. Vault matter created (or reused), hold issued, search run against the custodian's Gmail / Drive / Calendar within scope, results exported.
Source: M365 Compliance. Content search run against the custodian's mailbox / OneDrive / Teams within scope, results exported through the Compliance Center.
Source: Slack Discovery. Slack Enterprise Grid Discovery API; per-custodian, per-channel export within scope.
Hash + Chain-of-Custody Append. Every per-source export is hashed (SHA-256), and a chain-of-custody record is appended to the audit table: {matter_id, custodian_id, source, scope_summary, collected_at, collected_by_service_account, hash, file_count, byte_count}.
Upload to eDiscovery Platform. Pushes exports into the per-matter Relativity workspace, triggers the processing job, and records the platform-side load ID in the audit log for traceability.
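
The hash-and-append step described above could look roughly like the following Node.js sketch. It is an illustration under assumptions, not the bundled node: `sha256OfChunks` and `buildCustodyRecord` are hypothetical helper names, and the input shapes (`dispatch`, `exportInfo`) are assumed for the example. Only the record fields come from the list above.

```javascript
const crypto = require('node:crypto');

// Hash an export incrementally so a multi-GB export never has to sit in memory
// at once. `chunks` is any iterable of Buffers or strings (assumed input shape).
function sha256OfChunks(chunks) {
  const hash = crypto.createHash('sha256');
  let byteCount = 0;
  for (const chunk of chunks) {
    const buf = Buffer.isBuffer(chunk) ? chunk : Buffer.from(chunk, 'utf8');
    hash.update(buf);
    byteCount += buf.length;
  }
  return { hash: hash.digest('hex'), byteCount };
}

// Build a chain-of-custody record with the field set listed in the node description.
function buildCustodyRecord(dispatch, exportInfo, serviceAccount, now = new Date()) {
  return {
    matter_id: dispatch.matter_id,
    custodian_id: dispatch.custodian_id,
    source: dispatch.source,
    scope_summary: JSON.stringify(dispatch.scope).slice(0, 500),
    collected_at: now.toISOString(),
    collected_by_service_account: serviceAccount,
    hash: exportInfo.hash,
    file_count: exportInfo.fileCount,
    byte_count: exportInfo.byteCount,
  };
}
```

Hashing chunk by chunk gives the same digest as hashing the whole export at once, so the streamed value can be compared directly with a hash computed later over the stored file.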

Cost reality

Connector / source-platform costs. Google Vault, M365 E5 with Advanced eDiscovery, and Slack Enterprise Grid all carry per-seat costs. The flow does not reduce them; it makes sure they are used effectively.
n8n executions. Long-running (large exports take hours); use n8n's queue mode in production.
eDiscovery platform processing costs. Relativity / Everlaw / Logikcull all charge per GB processed; the flow does not change that math.
Legal-ops admin time. This is the win. Manually orchestrating a 10-custodian collection across 4 sources is days of work; the flow runs unattended in hours.
Setup time. 180 minutes for the flow itself, plus substantial per-source connector wiring (the connectors are the bulk of the actual setup).

Success metric

Time from counsel approval to collection completion. Should drop from days or weeks (manual) to hours (flow), subject to the source platform's export-job duration.
Chain-of-custody completeness. Should be 100% per matter. Any gap is a defensibility risk.
Volume drift. Volume collected by the flow vs. the scope counsel expected. Within 10% is normal (filter calibration); >25% triggers a re-scope review.
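
The volume-drift thresholds above can be expressed as a small check. This is a sketch: the function name and verdict labels are hypothetical, and the text does not say what happens between 10% and 25%, so the middle band ("investigate") is an assumption.

```javascript
// Thresholds from the metric above: within 10% is normal, above 25% triggers
// a re-scope review. The name of the middle band is an assumption.
function classifyVolumeDrift(collectedBytes, expectedBytes) {
  if (!(expectedBytes > 0)) {
    throw new RangeError('expectedBytes must be a positive number');
  }
  const drift = Math.abs(collectedBytes - expectedBytes) / expectedBytes;
  if (drift <= 0.10) return { drift, verdict: 'normal' };
  if (drift > 0.25) return { drift, verdict: 're-scope-review' };
  return { drift, verdict: 'investigate' };
}
```

Drift is measured relative to the expected volume, so under-collection and over-collection are treated symmetrically.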

Comparison with alternatives

vs. the eDiscovery platform's native collection modules (Relativity Collect, Everlaw Collections). Choose these if your team lives in the platform and the platform's connectors cover your sources. The flow is for custom-source matters, or matters that span more sources than a single platform covers natively.
vs. commercial collection-orchestration tools (Reveal Brainspace, OpenText EnCase, Cellebrite, Onna). Choose these for the highest-stakes matters with forensic requirements. The flow is the lightweight middle ground for routine corporate eDiscovery.
vs. manual collection. Manageable at small scale; does not scale to multi-custodian matters.

Watch-outs

Chain-of-custody integrity. Guard: every per-source export is hashed at collection time and again before upload to the eDiscovery platform. Hash mismatches stop the upload and alert the eDiscovery lead.
Scope creep in automated collection. Guard: the flow reads its scope from the counsel-approved collection plan; widening the scope mid-run requires a plan amendment, not a flow change. The audit log records the plan SHA per run.
Source-platform rate-limit exhaustion. Guard: per-source rate limiters in the flow's per-source nodes. The Slack Discovery API has particularly aggressive rate limits, and the flow paces itself accordingly.
Privilege exposure during collection. Guard: collection captures everything in scope; privilege review happens downstream in the eDiscovery platform (the privilege-review batch skill is the next stage). The flow does NOT pre-filter privileged content; that is a downstream decision.
Custodian privacy concerns. Guard: the flow operates against the systems the custodian uses for work; personal accounts (personal Gmail, personal Slack) are out of scope unless counsel has explicitly named them. The collection plan documents the boundary.
Cross-jurisdiction data localization. Guard: data of EU-resident custodians may be subject to GDPR data-localization considerations; the flow flags EU-resident custodians per scope for a data-handling review before any export to a non-EU eDiscovery workspace.
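
The first guard in this list (re-hash before upload, stop on mismatch) might be sketched as follows. `HashMismatchError` and `assertExportUnchanged` are hypothetical names; alerting the eDiscovery lead is left to whatever catches the error.

```javascript
const crypto = require('node:crypto');

// Raised when the bytes about to be uploaded no longer match the audit record.
class HashMismatchError extends Error {
  constructor(expected, actual) {
    super(`export hash mismatch: recorded ${expected}, recomputed ${actual}`);
    this.name = 'HashMismatchError';
    this.expected = expected;
    this.actual = actual;
  }
}

// Recompute SHA-256 over the export bytes and compare with the hash recorded
// at collection time. Throws instead of returning false so that an upload
// step placed after this call cannot run by accident.
function assertExportUnchanged(recordedHashHex, exportBytes) {
  const actual = crypto.createHash('sha256').update(exportBytes).digest('hex');
  if (actual !== recordedHashHex.toLowerCase()) {
    throw new HashMismatchError(recordedHashHex, actual);
  }
  return actual;
}
```

In an n8n Code node, an uncaught error fails the execution, which is one way to get the "stop the upload" behaviour; routing the failure to an alert is a separate wiring decision.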

Stack

The bundle lives under apps/web/public/artifacts/evidence-collection-ediscovery-n8n/:

evidence-collection-ediscovery-n8n.json: the flow export (skeleton; the actual per-source connectors are company-specific)
_README.md: credentials, audit table schema, per-source connector notes, chain-of-custody template

Tools: n8n, Relativity (or Everlaw / Logikcull), Slack (notification only). Source-platform connectors: Google Workspace Vault, Microsoft 365 Compliance, Slack Discovery, and custom SaaS according to the company's stack.

Related: eDiscovery, EDRM model, Matter Management, Privilege Review.


Files in this artifact


# Evidence collection for ediscovery — n8n flow (skeleton)

Orchestrates the EDRM "Collection" stage: per-custodian per-source dispatch against Google Workspace Vault, M365 Compliance, Slack Discovery, and custom SaaS sources. Hashes every export, writes chain-of-custody to an immutable audit table, uploads to the e-discovery platform.

**This is a skeleton flow.** The bundled n8n JSON shows the structure (request → load plan → dispatch per source → audit) and includes a working Google Vault saved-query node as an exemplar. Production deployment requires the firm's ediscovery engineer to:

1. Complete the per-source nodes (Google Vault has create-query → start-export → poll-export → fetch-blob; bundled flow shows only create-query).
2. Wire the M365 Compliance and Slack Discovery branches (skeleton has placeholders).
3. Replace the placeholder hash in `Hash + Chain-of-Custody` with actual export-bytes hashing.
4. Add the upload-to-Relativity / Everlaw / Logikcull node at the end.
5. Add per-source rate limiters.

The flow's value is in the structure (audit shape, dispatch pattern, chain-of-custody discipline) — the per-source connector code is firm-specific.

## Database tables

```sql
-- Counsel-approved collection plan. One row per (custodian, source) pair.
CREATE TABLE collection_plans (
    collection_plan_id   TEXT NOT NULL,
    plan_sha             TEXT NOT NULL,
    matter_id            TEXT NOT NULL,
    custodian_id         TEXT NOT NULL,
    source               TEXT NOT NULL,
    scope_json           JSONB NOT NULL,
    status               TEXT NOT NULL CHECK (status IN ('draft','approved','executed','superseded')),
    approved_by          TEXT,
    approved_at          TIMESTAMPTZ,
    PRIMARY KEY (collection_plan_id, custodian_id, source)
);

-- Chain-of-custody, append-only.
CREATE TABLE collection_audit (
    audit_id                          BIGSERIAL PRIMARY KEY,
    matter_id                         TEXT NOT NULL,
    collection_id                     TEXT NOT NULL,
    custodian_id                      TEXT NOT NULL,
    source                            TEXT NOT NULL,
    plan_sha                          TEXT NOT NULL,
    collected_at                      TIMESTAMPTZ NOT NULL,
    collected_by_service_account      TEXT NOT NULL,
    hash                              TEXT NOT NULL,
    file_count                        INTEGER NOT NULL,
    byte_count                        BIGINT NOT NULL,
    scope_summary                     TEXT,
    upload_load_id                    TEXT,  -- e-discovery platform load ID, written when upload completes
    upload_completed_at               TIMESTAMPTZ
);

CREATE INDEX collection_audit_matter_idx ON collection_audit (matter_id, collected_at);

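-- Immutability: the application role may only INSERT and SELECT.
-- REVOKE ... FROM PUBLIC does not touch the table owner or roles that hold
-- explicit grants; review those separately.
REVOKE UPDATE, DELETE, TRUNCATE ON collection_audit FROM PUBLIC;
GRANT INSERT, SELECT ON collection_audit TO <ediscovery_app_role>;
-- BIGSERIAL inserts call nextval(), which needs USAGE on the backing sequence.
GRANT USAGE ON SEQUENCE collection_audit_audit_id_seq TO <ediscovery_app_role>;
-- upload_load_id and upload_completed_at can be UPDATEd via a function that
-- enforces "only when previously NULL". Implement it as a stored procedure
-- if you need to record platform-side load IDs after collection.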
```

## Per-source connector notes

### Google Workspace Vault

API doc: https://developers.google.com/vault/

- Service account with delegated authority to access user data.
- Create-query → start-export → poll-export-status → fetch-blob sequence. Exports are async; polling can take minutes to hours.
- Vault matter must exist; the flow can create-or-reuse.
- Hold should be in place at the matter level before query (separate workflow — see [litigation hold orchestration](../litigation-hold-orchestration-n8n/)).
- Rate limits: per-project quotas. Vault tends to be export-job-bound rather than rate-limit-bound.
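
Because Vault exports are asynchronous, the poll step in the sequence above needs a bounded loop with a growing delay. The sketch below is one way to structure it. `getStatus` is a hypothetical caller-supplied function that fetches the export and returns its status string; the `'COMPLETED'` and `'FAILED'` values and the delay defaults are assumptions to check against the Vault API reference.

```javascript
// Exponential backoff with a cap: 5s, 10s, 20s, ... up to 5 minutes by default.
function pollDelayMs(attempt, baseMs = 5000, capMs = 300000) {
  return Math.min(baseMs * 2 ** attempt, capMs);
}

const sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// Poll until the export completes, fails, or the attempt budget runs out.
// `getStatus` is caller-supplied (hypothetical) and returns a status string.
async function pollExport(getStatus, options = {}) {
  const { maxAttempts = 60, delayFn = pollDelayMs, sleepFn = sleep } = options;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const status = await getStatus();
    if (status === 'COMPLETED') return { status, attempts: attempt + 1 };
    if (status === 'FAILED') throw new Error('export failed on the source platform');
    await sleepFn(delayFn(attempt));
  }
  throw new Error(`export still running after ${maxAttempts} polls`);
}
```

For exports that run for hours, holding an n8n execution open in a loop like this may be less robust than a scheduled re-check workflow; the backoff schedule applies either way.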

### Microsoft 365 Compliance

API doc: https://learn.microsoft.com/en-us/microsoft-365/compliance/

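- Per-tenant app registration with eDiscovery permissions (for example Microsoft Graph `eDiscovery.Read.All` / `eDiscovery.ReadWrite.All`; confirm the exact permission names for the API surface you use).
- Content search → run-search → start-export → download-export sequence.
- Advanced eDiscovery (now eDiscovery Premium) is licensed through Microsoft 365 E5 or an E5 compliance add-on; confirm tenant licensing.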
- Rate limits: per-tenant; varies by SKU.

### Slack Discovery

API doc: https://api.slack.com/enterprise/discovery (Enterprise Grid only)

- Discovery API only available on Slack Enterprise Grid.
- Per-channel and per-user export endpoints. The Discovery API is rate-limited aggressively (single-digit req/sec for most endpoints).
- Output is JSON-line message records; preserve files via separate file-export endpoint.
- Pagination is cursor-based; loop until empty.
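
The cursor loop mentioned in the last bullet can be sketched generically. `fetchPage` is a hypothetical caller-supplied function; the `{ items, nextCursor }` response shape is an assumption for illustration, not the Discovery API's actual field names.

```javascript
// Walk a cursor-paginated endpoint until the server stops returning a cursor.
// `fetchPage(cursor)` is caller-supplied (hypothetical) and resolves to
// { items: [...], nextCursor: string | null }.
async function* paginate(fetchPage, maxPages = 10000) {
  let cursor = null;
  for (let page = 0; page < maxPages; page++) {
    const { items, nextCursor } = await fetchPage(cursor);
    for (const item of items) yield item;
    if (!nextCursor) return; // no cursor means the last page was reached
    cursor = nextCursor;
  }
  // Guard against a server that keeps handing back cursors forever.
  throw new Error(`pagination exceeded ${maxPages} pages`);
}

// Convenience wrapper that gathers every item into one array.
async function collectAll(fetchPage) {
  const out = [];
  for await (const item of paginate(fetchPage)) out.push(item);
  return out;
}
```

The generator form lets a caller hash or write records as they arrive instead of buffering a whole channel in memory.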

### Custom SaaS

For internal tools or smaller SaaS that the team uses:

- Document the source's export shape and chain-of-custody implications.
- Build a connector node that writes to the same per-source pattern as the bundled examples.
- Hash the export at fetch time, append to audit table.

## Chain-of-custody record format

Each `collection_audit` row is the chain-of-custody record. Counsel demonstrates collection adequacy via these records:

```
Matter: M-2026-0042
Collection: coll-20260503-abc123
Custodian: jane-doe@firm.com
Source: google-vault
Collected at: 2026-05-03T14:00:00Z
Service account: ediscovery-bot@firm
Hash (SHA-256): a3f2b1c4...
File count: 1,247
Byte count: 4,231,789,022
Scope: { "email": "jane-doe@firm.com", "start_time": "2024-01-01", "end_time": "2026-04-30", "terms": "(\"Acme deal\" OR \"Project X\") AND -from:counsel@firm" }
Upload to e-discovery: load-2026-05-03-abc123 (Relativity workspace 'M-2026-0042')
```

For court submissions, the chain-of-custody records typically need to be produced in a more formal format — a paralegal exports the audit records and formats per jurisdictional requirements. The flow's records are the source data.
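
As a starting point for that export step, a `collection_audit` row can be rendered into the text layout shown above with a few lines of JavaScript. This is a sketch: `formatCustodyRecord` and `groupThousands` are hypothetical helpers, and the row is assumed to use the column names from the schema.

```javascript
// Insert thousands separators without relying on locale data.
// Accepts numbers or numeric strings (BIGINT columns often arrive as strings).
function groupThousands(n) {
  return String(n).replace(/\B(?=(\d{3})+(?!\d))/g, ',');
}

// Render one collection_audit row in the record layout shown above.
function formatCustodyRecord(row) {
  const lines = [
    `Matter: ${row.matter_id}`,
    `Collection: ${row.collection_id}`,
    `Custodian: ${row.custodian_id}`,
    `Source: ${row.source}`,
    `Collected at: ${row.collected_at}`,
    `Service account: ${row.collected_by_service_account}`,
    `Hash (SHA-256): ${row.hash}`,
    `File count: ${groupThousands(row.file_count)}`,
    `Byte count: ${groupThousands(row.byte_count)}`,
    `Scope: ${row.scope_summary ?? '(none recorded)'}`,
  ];
  // The upload line only exists once the platform-side load ID was written.
  if (row.upload_load_id) {
    lines.push(`Upload to e-discovery: ${row.upload_load_id}`);
  }
  return lines.join('\n');
}
```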

## Credentials

- `PLACEHOLDER_PLAN_DB_CRED_ID` — read access to `collection_plans`.
- `PLACEHOLDER_AUDIT_DB_CRED_ID` — write access to `collection_audit`.
- `PLACEHOLDER_GOOGLE_VAULT_CRED_ID` — service account with delegated authority.
- `PLACEHOLDER_M365_CRED_ID` — per-tenant app registration with Compliance Center scopes.
- `PLACEHOLDER_SLACK_DISCOVERY_CRED_ID` — Slack org-admin token with `discovery:read` scope.
- `PLACEHOLDER_RELATIVITY_CRED_ID` — Relativity REST API credentials (or Everlaw / Logikcull equivalent).

## Dry-run procedure

1. Provision tables on a non-production DB.
2. Wire credentials to staging endpoints (test Google project, test M365 tenant, test Slack workspace).
3. Replay a closed matter's collection plan against staging sources (with anonymized custodian data).
4. Verify chain-of-custody records and platform-side load IDs.
5. Switch to production credentials only after a full successful dry-run.

## Known limits / production-readiness gaps

This is a skeleton. Before production:

1. Per-source export polling — Google Vault and M365 Compliance exports are async; the flow needs a poll-and-resume pattern (not bundled).
2. Per-source export-blob fetching — once the export is ready, the flow needs to download the blob and hash it (skeleton uses placeholder hash).
3. M365 Compliance branch — entirely skeleton; needs Content Search + Search Result Export wiring.
4. Slack Discovery branch — entirely skeleton; needs cursor-based per-channel paging.
5. E-discovery platform upload — not bundled; per-platform Relativity / Everlaw / Logikcull connector required.
6. Per-source rate limiting — the per-source nodes need rate limiters in production.
7. Error recovery — failed-export retry / replay logic not bundled.
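
For item 6, a minimal per-source rate limiter could be a fixed-interval pacer like the sketch below. `MinIntervalLimiter` is a hypothetical class, and the 500 ms interval in the usage note is an illustrative value, not a documented limit for any of the sources.

```javascript
// Spaces requests at least `minIntervalMs` apart. Each call to reserve()
// claims the next free slot and reports how long the caller should wait.
class MinIntervalLimiter {
  constructor(minIntervalMs) {
    this.minIntervalMs = minIntervalMs;
    this.nextFreeAt = 0; // epoch ms at which the next slot opens
  }

  // `nowMs` is injectable so the pacing logic can be tested without timers.
  reserve(nowMs = Date.now()) {
    const startAt = Math.max(nowMs, this.nextFreeAt);
    this.nextFreeAt = startAt + this.minIntervalMs;
    return startAt - nowMs;
  }
}
```

A per-source node would create one limiter per source, for example `new MinIntervalLimiter(500)` for roughly two requests per second, and wait for `limiter.reserve()` milliseconds before each API call.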

This skeleton's value is in the orchestration shape and the audit / chain-of-custody discipline; the connector layer is the firm's ediscovery engineering work.

{
  "name": "Evidence collection ediscovery (skeleton)",
  "nodes": [
    {
      "parameters": {
        "httpMethod": "POST",
        "path": "collection-request",
        "responseMode": "lastNode",
        "options": { "rawBody": false }
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000001",
      "name": "Collection Request",
      "type": "n8n-nodes-base.webhook",
      "typeVersion": 2,
      "position": [240, 400],
      "webhookId": "collection-request",
      "notesInFlow": true,
      "notes": "Webhook from legal-ops platform: {matter_id, collection_plan_id}. The collection plan is the counsel-approved scope; this flow executes against it, doesn't author it."
    },
    {
      "parameters": {
        "operation": "executeQuery",
        "query": "WITH plan AS (\n  SELECT collection_plan_id, plan_sha, custodian_id, source, scope_json\n  FROM collection_plans\n  WHERE collection_plan_id = $1 AND status = 'approved'\n)\nSELECT * FROM plan;",
        "options": { "queryReplacement": "={{ $json.body.collection_plan_id }}" }
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000002",
      "name": "Load Collection Plan",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [460, 400],
      "credentials": {
        "postgres": { "id": "PLACEHOLDER_PLAN_DB_CRED_ID", "name": "Postgres — collection plans" }
      }
    },
    {
      "parameters": {
        "jsCode": "// For each (custodian, source) pair, prepare a per-source dispatch payload.\n// The flow's per-source nodes receive these payloads.\nconst rows = $input.all().map(r => r.json);\n// The Webhook node nests the POST payload under body; first() is used because\n// this node runs once for all items.\nconst trigger = $('Collection Request').first().json.body;\n\nif (rows.length === 0) {\n  return [{ json: { status: 'halted', reason: 'no_approved_plan_rows', collection_plan_id: trigger.collection_plan_id } }];\n}\n\nconst out = rows.map(row => ({\n  json: {\n    matter_id: trigger.matter_id,\n    collection_plan_id: trigger.collection_plan_id,\n    plan_sha: row.plan_sha,\n    custodian_id: row.custodian_id,\n    source: row.source,\n    scope: typeof row.scope_json === 'string' ? JSON.parse(row.scope_json) : row.scope_json,\n    requested_at: new Date().toISOString(),\n    collection_id: `coll-${Date.now()}-${Math.random().toString(36).slice(2, 8)}`,\n  }\n}));\n\nreturn out;"
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000003",
      "name": "Per-Source Dispatch",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [680, 400],
      "notesInFlow": true,
      "notes": "Fans out one item per (custodian, source) pair. The downstream switch routes by source."
    },
    {
      "parameters": {
        "rules": {
          "values": [
            {
              "conditions": {
                "options": { "caseSensitive": true },
                "conditions": [
                  { "leftValue": "={{ $json.source }}", "rightValue": "google-vault", "operator": { "type": "string", "operation": "equals" } }
                ],
                "combinator": "and"
              },
              "outputKey": "google"
            },
            {
              "conditions": {
                "options": { "caseSensitive": true },
                "conditions": [
                  { "leftValue": "={{ $json.source }}", "rightValue": "m365-compliance", "operator": { "type": "string", "operation": "equals" } }
                ],
                "combinator": "and"
              },
              "outputKey": "m365"
            },
            {
              "conditions": {
                "options": { "caseSensitive": true },
                "conditions": [
                  { "leftValue": "={{ $json.source }}", "rightValue": "slack-discovery", "operator": { "type": "string", "operation": "equals" } }
                ],
                "combinator": "and"
              },
              "outputKey": "slack"
            }
          ]
        },
        "options": { "fallbackOutput": "extra" }
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000004",
      "name": "Source Switch",
      "type": "n8n-nodes-base.switch",
      "typeVersion": 3,
      "position": [900, 400]
    },
    {
      "parameters": {
        "method": "POST",
        "url": "=https://vault.googleapis.com/v1/matters/{{ $env.GOOGLE_VAULT_MATTER_ID }}/savedQueries",
        "authentication": "predefinedCredentialType",
        "nodeCredentialType": "googleApi",
        "sendHeaders": true,
        "headerParameters": {
          "parameters": [
            { "name": "Content-Type", "value": "application/json" }
          ]
        },
        "sendBody": true,
        "specifyBody": "json",
        "jsonBody": "={{ JSON.stringify({\n  displayName: $json.collection_id,\n  query: {\n    corpus: 'MAIL',\n    dataScope: 'ALL_DATA',\n    searchMethod: 'ACCOUNT',\n    accountInfo: { emails: [$json.scope.email] },\n    mailOptions: { excludeDrafts: false },\n    startTime: $json.scope.start_time,\n    endTime: $json.scope.end_time,\n    terms: $json.scope.terms\n  }\n}) }}",
        "options": {
          "response": { "response": { "responseFormat": "json", "neverError": false } },
          "timeout": 60000
        }
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000005",
      "name": "Google Vault: Saved Query",
      "type": "n8n-nodes-base.httpRequest",
      "typeVersion": 4.2,
      "position": [1120, 280],
      "credentials": {
        "googleApi": { "id": "PLACEHOLDER_GOOGLE_VAULT_CRED_ID", "name": "Google Vault service account" }
      },
      "notesInFlow": true,
      "notes": "Creates a saved query in the matter; an export job is the next step (separate API call). Real production flow needs the full create-query → poll-export sequence; skeleton shown."
    },
    {
      "parameters": {
        "jsCode": "// Compute SHA-256 of the export, append chain-of-custody record.\n// Skeleton: production flow includes the actual export-fetch step.\n// Self-hosted n8n may need NODE_FUNCTION_ALLOW_BUILTIN=crypto for this require.\nconst crypto = require('crypto');\n\n// Emit one audit record per incoming item so no (custodian, source) pair is dropped.\nreturn $input.all().map((item, i) => {\n  const input = item.json;\n  // Resolve the dispatch payload that produced this item via paired-item data.\n  const dispatch = $('Per-Source Dispatch').itemMatching(i).json;\n\n  // In production: fetch the actual export bytes here, hash them.\n  // Skeleton uses a deterministic placeholder so the audit record shape is correct.\n  const placeholderHash = crypto.createHash('sha256').update(`${dispatch.collection_id}-${dispatch.source}`).digest('hex');\n\n  return {\n    json: {\n      matter_id: dispatch.matter_id,\n      collection_id: dispatch.collection_id,\n      custodian_id: dispatch.custodian_id,\n      source: dispatch.source,\n      plan_sha: dispatch.plan_sha,\n      collected_at: new Date().toISOString(),\n      collected_by_service_account: $env.COLLECTION_SERVICE_ACCOUNT || 'ediscovery-bot@firm',\n      hash: placeholderHash,\n      file_count: input.fileCount || 0,\n      byte_count: input.byteCount || 0,\n      scope_summary: JSON.stringify(dispatch.scope).slice(0, 500),\n      skeleton_warning: 'This skeleton flow does not fetch and hash actual export bytes. Production: replace with fetch + bytewise hash.',\n    }\n  };\n});"
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000006",
      "name": "Hash + Chain-of-Custody",
      "type": "n8n-nodes-base.code",
      "typeVersion": 2,
      "position": [1340, 400]
    },
    {
      "parameters": {
        "operation": "insert",
        "schema": "public",
        "table": "collection_audit",
        "columns": "matter_id, collection_id, custodian_id, source, plan_sha, collected_at, collected_by_service_account, hash, file_count, byte_count, scope_summary",
        "additionalFields": {}
      },
      "id": "7a7a7a7a-0001-0000-0000-000000000007",
      "name": "Audit: Collection Complete",
      "type": "n8n-nodes-base.postgres",
      "typeVersion": 2.4,
      "position": [1560, 400],
      "credentials": {
        "postgres": {
          "id": "PLACEHOLDER_AUDIT_DB_CRED_ID",
          "name": "Postgres — chain-of-custody (append-only)"
        }
      }
    }
  ],
  "connections": {
    "Collection Request": { "main": [[{ "node": "Load Collection Plan", "type": "main", "index": 0 }]] },
    "Load Collection Plan": { "main": [[{ "node": "Per-Source Dispatch", "type": "main", "index": 0 }]] },
    "Per-Source Dispatch": { "main": [[{ "node": "Source Switch", "type": "main", "index": 0 }]] },
    "Source Switch": {
      "main": [
        [{ "node": "Google Vault: Saved Query", "type": "main", "index": 0 }],
        [],
        []
      ]
    },
    "Google Vault: Saved Query": { "main": [[{ "node": "Hash + Chain-of-Custody", "type": "main", "index": 0 }]] },
    "Hash + Chain-of-Custody": { "main": [[{ "node": "Audit: Collection Complete", "type": "main", "index": 0 }]] }
  },
  "settings": {
    "executionOrder": "v1",
    "timezone": "America/New_York",
    "saveExecutionProgress": true,
    "saveManualExecutions": true
  },
  "active": false,
  "versionId": "1"
}