Document-Identity Intelligence

The document knows where it went.

An invisible identifier, woven into the text itself, that survives copy-paste, PDF export, OCR, and ingestion into any AI tool. When your confidential file lands in ChatGPT — we know which one, classified how, by whom, at what second.

stego-encoder.py
// STEP 01: PLAINTEXT INGESTION

Project Atlas comprises the strategic review and evaluation of expansion opportunities in West Africa. This report outlines initial valuation models, local regulatory requirements, and key risk assumptions regarding Solvency II capital targets.

// STEP 02: DSID INSERTION (ZW HIDDEN)Stego Mode
ProjectAtlascomprisesthestrategicreviewandevaluationofexpansionopportunitiesinWestAfrica.Thisreportoutlinesinitialvaluationmodels,localregulatoryrequirements,andkeyriskassumptionsregardingSolvencyIIcapitaltargets.
DSID Payload: dsid:0x3f2a9b8c7e91... Ready to Traced

The Generative AI Exposure Problem

Traditional Data Loss Prevention (DLP) tools are blind to the modern workflow. Once sensitive content is copied into a browser tab, it vanishes from corporate sight.

80%+

GenAI Weekly Use

Over four-fifths of knowledge workers actively leverage ChatGPT, Gemini, or Claude to accelerate reporting.

01
~2 in 5

Sensitive Data Leaks

40% of standard AI tool prompts contain corporate IP, financial summaries, or customer PII.

02
223

Violations / Month

Average enterprise experiences hundreds of policy infractions per month as data slips through network firewalls.

03

Source: Enterprise Risk Telemetry Assessments, 2025

Why Legacy Security Fails

Comparing modern information security protocols against direct copy-paste exfiltration channels.

Solution ClassWhat it DoesWhere it FailsDocuSentinel Advantage
Prevention (Purview / Zscaler)Blocks full file uploads and restricts unsafe web domains.Bypassed entirely by copying text snippets directly out of Microsoft Word.Secures the plaintext itself. Even single paragraphs contain the DSID.
Pattern DLP (Nightfall)Scans outbound network packets for standard Regex patterns (SSNs, cards).Fails on custom IP, financial models, strategic memos, or code.Identity is unique. Traces specific files back to individual authors.
File Beacons (Canary)Embeds macro scripts or tracking assets inside document files.Script assets are stripped instantly on copy-paste or text conversion.Steganography survives conversion, OCR, text extraction, and exports.
DocuSentinel PlatformWeaves 3 layers of stego identifiers into the document body.Instant verification (500ms), indestructible chain-of-custody.

The Three-Step Lifecycle

From creation to audit, DocuSentinel operates transparently across your existing software stack.

01

01. Instrument

Office 365 Add-ins, Google Workspace extensions, and PDF plugins inject the 256-bit DSID the moment a document is saved or classified.

02

02. Detect

The endpoint agent, clipboard monitor, and browser extensions scan outbound streams to AI platforms, checking for stego sequences.

03

03. Audit & Comply

Detections fire alerts in under 500ms, storing a Merkle-proofed record in the dashboard and piping warnings directly into your corporate SIEM.

The Four-Layer Security Architecture

A comprehensive stack connecting endpoint instrumentation directly to governance dashboards.

L4: Operations & Dashboard
Command Center / Forensic Audits / SIEM
L3: Enrichment & Analytics
Risk Scoring / Registry Matching / Kafka Stream
L2: Detection Mesh
Endpoint Agents / Extensions / ICAP Proxy
L1: Instrumentation & Encoding
WASM Stego Engine / Office Add-ins / SDK
See interactive architecture flow

Indestructible identity.
The mark survives.

Traditional tags are easily lost when documents are converted or edited. Our steganography weaves the DSID into the grammar, whitespace, and letters, rendering it resistant to deliberate tampering.

  • Survives copy and paste
  • Survives PDF export & conversion
  • Survives text extraction & formatting strips
  • Survives Screenshot → OCR workflows
  • Survives paraphrasing & AI summaries (Layer C)

Resiliency Simulation

Layer A: Zero-width space insertion100% Intact
Layer B: Statistical spacing variation100% Intact
Layer C: Semantic micro-synonym substitutions88% Intact (Survives Paraphrase)
Compliance & Standards
GDPR READYPOPIA COMPLIANTGHANA DPAPCI-DSS V4SOC 2 TYPE II

Flexible, Transparent Pricing

Under 40% of standard Microsoft 365 licensing costs. High-fidelity corporate leak protection.

Essentials
$5/user/month

+$250/month platform fee

  • Layer A stego encoding
  • Browser extension mesh
  • Standard event console
Get Started
Recommended
Professional
$8/user/month

+$500/month platform fee

  • Layer A & B stego encoding
  • Endpoint agent (clipboard)
  • SIEM webhook pipelines
  • Automated reporting
Start Trial
Enterprise
$10-12/user/month

Custom platform pricing

  • Layer A, B & C encoding
  • Full Proxy & ICAP integration
  • BYOK Encryption keys
  • 24/7 Forensic SLA support
Contact Sales