AI-Augmented Investigative Research

Intelligence for missing-persons cases the world hasn't given up on.

MissingIntel turns decades of fragmented public investigation records — police dossiers, court documents, journalistic archives — into a searchable, interconnected knowledge graph. Starting with one of the most documented cases of our time.

50,000+
Document pages
40,000+
Entities indexed
11
Languages
17
PJ Volumes
How it works

Three capabilities, one investigative surface.

Built on the same retrieval and reasoning techniques that power frontier AI systems — adapted for the messy reality of historical case files: scanned PDFs, multilingual transcripts, decades-old witness statements.

🔍

Semantic Search

Ask questions in natural language. Retrieval-augmented generation surfaces relevant passages from across the entire corpus, with citations back to the original page in the original source.

🕸️

Knowledge Graph

Persons, locations, vehicles, and timelines are extracted and linked. See at a glance who met whom, who was where on which night, and which threads have been buried in the noise.

💡

Hypothesis Engine

The system surfaces statistical similarities between persons, places, and patterns — and flags them as hypotheses, never conclusions. The reader judges. The platform merely makes the connections findable.

Featured Case

The Madeleine McCann dossier.

Few missing-persons cases have generated as much public documentation. The complete Portuguese Polícia Judiciária file — 17 volumes, tens of thousands of pages, made public in 2008 — forms the spine of our pilot dossier, supplemented by court records, media archives, and witness timelines from the past 19 years.

⚖️ Unsolved · May 3, 2007

From scanned PJ files to a queryable case

Every page of the original Portuguese investigation has been OCR'd, translated to English and Dutch, indexed semantically, and structurally linked into a knowledge graph.

Search any name, any night, any vehicle license plate. Read the original Portuguese alongside the translation. Follow a witness through every appearance across the corpus.

Explore the case →

17
PJ Volumes
50k+
Pages OCR'd
200+
Persons of interest
3
Languages (PT/EN/NL)
Methodology

Transparent retrieval, sourced answers.

Every answer the platform produces is grounded in retrieved source documents and accompanied by citations. No black-box claims. No invented facts. If we can't find it in a public record, we don't claim it.

01 · INDEXING

OCR & embedding

Scanned pages are processed with high-accuracy OCR, translated where needed, and embedded into a multilingual vector store. Persons, locations, dates, and vehicles are extracted into a graph.

02 · RETRIEVAL

Semantic + graph search

Queries trigger both semantic similarity search and structured graph traversal. Top passages are ranked, deduplicated, and assembled into context — always with traceability back to source.

03 · REASONING

Sourced synthesis

A frontier reasoning model synthesizes the retrieved evidence into an answer. Citations are inline. Speculation is marked. Statistical similarities are presented as hypotheses, never as proof.

Scope & Responsibility

An assistant for research, not a replacement for justice.

✅ What the platform does

  • Searches public investigation records and presents findings with citations.
  • Reveals connections between persons, locations, events, and timelines.
  • Structures information that would otherwise be scattered across hundreds of documents.
  • Generates hypotheses, clearly marked as such, based on statistical overlaps.
  • Helps researchers, journalists, and the public find sources faster.
  • Maintains full transparency on where every claim originated.

❌ What the platform does not do

  • Replace police, prosecutorial, or licensed investigative work.
  • Provide legal advice or judgment on guilt or innocence.
  • Accuse any person of a crime — every individual mentioned is presumed innocent.
  • Process confidential or non-public information.
  • Treat AI-generated patterns as evidence — only as questions worth asking.
  • Substitute for the journalistic, legal, or investigative judgment of a qualified human.
What's next

One case today. Many cases tomorrow.

The Madeleine McCann dossier is the first public case on MissingIntel. The platform is designed to host additional cold cases, historical disappearances, and unresolved investigations over time — under the same standards of transparency, sourcing, and respect for due process.

Launch Platform →