Primary source for all released Epstein files. 12 datasets, 1.38M+ files. All direct dataset URLs return 404 as of Feb 2026. Use Internet Archive mirror.
Compliance tracking and disclosure documentation. URL returning 404. See archive.org mirror.
Pre-release FOIA documents and FBI investigation files.
Congressional investigations, subpoenas, and depositions. Active Epstein probe with scheduled depositions through March 2026.
SDNY Maxwell trial records, Florida case files.
Rep. Khanna's floor speech naming six individuals, H.R. 4405 Epstein Files Transparency Act legislative record.
Mirror of DOJ datasets 9-11, January 2026 release. Preserved against removal.
AI-searchable database of Epstein documents. Journalist tool with full-text search.
Aircraft ownership verification. Key registrations: N908JE, N212JE, N120JE.
Flight map, network graph, DOJ monitoring. Base data for this platform.
86,000+ entities extracted, significance ratings applied per record.
OCR processed documents with searchable full text extraction.
React-based document explorer with AI analysis features and entity linking.
Extracted and structured text message data from released files.
Complete archive index with all download links. Bulk access preservation.
Searchable directory of 1,971 names from Ghislaine Maxwell's contact book.
| # | Description | DOJ Files | Indexed | Coverage | Status | Archive |
|---|---|---|---|---|---|---|
| 01 | FBI 302s & Police Reports | 3,158 | 3,142 | 99.5% | OFFLINE | archive.org → |
| 02 | FBI Investigative Reports | 574 | 577 | 100% | OFFLINE | archive.org → |
| 03 | Prosecution Files | 67 | 69 | 100% | OFFLINE | archive.org → |
| 04 | Court Records & Motions | 152 | 154 | 100% | OFFLINE | archive.org → |
| 05 | Grand Jury Materials | 120 | 122 | 100% | OFFLINE | archive.org → |
| 06 | Supplemental Records | 13 | 15 | 100% | OFFLINE | archive.org → |
| 07 | Supplemental Files | 17 | 19 | 100% | OFFLINE | archive.org → |
| 08 | FBI Evidence Collection | 10,595 | 11,036 | 100% | OFFLINE | archive.org → |
| 09 | Emails & DOJ Internal | 531,285 | 531,285 | 100% | OFFLINE | archive.org → |
| 10 | Property Photos & Video | 503,154 | 503,154 | 100% | OFFLINE | archive.org → |
| 11 | Financial Records & Flights | 331,659 | 331,659 | 100% | OFFLINE | archive.org → |
| 12 | Supplemental Release | 152 | 154 | 100% | OFFLINE | archive.org → |
| TOTAL | 1,380,946 | 1,380,234 | 100.0% |
[i] All 1.38M files are browseable via the document viewer. DS 1-9, 11, 12 served from local archive; DS 10 via community mirrors.
[!] DOJ removed ALL direct file links ~Feb 11, 2026. All datasets archived via Internet Archive mirrors and community torrents. [GeekenDev mirror offline as of Feb 22]
[+] 877,756 PDFs OCR-processed via Tesseract 5.5.1 — 1,063 entities matched across 81,006 verified document-entity links.
The DOJ datasets total over 1.38 million files. Community-maintained URL manifests allow researchers to mirror the full corpus using standard download utilities. Use the commands below with the manifest files linked in this section.
# Install: apt install aria2 / brew install aria2 # Download manifest, then run: aria2c \ --input-file=dataset-1-urls.txt \ --max-concurrent-downloads=16 \ --split=4 \ --min-split-size=1M \ --continue=true \ --dir=./epstein-dataset-1
# Download files listed in a manifest: wget \ --input-file=dataset-1-urls.txt \ --continue \ --tries=3 \ --wait=1 \ --directory-prefix=./epstein-dataset-1 # Resume interrupted downloads with -c flag
# Loop over a manifest file:
mkdir -p ./epstein-dataset-1
while IFS= read -r url; do
filename=$(basename "$url")
curl -L -C - \
--retry 3 \
--output "./epstein-dataset-1/$filename" \
"$url"
done < dataset-1-urls.txt
Ringmast4r maintains per-dataset URL lists covering all 12 DOJ releases. Feed these directly into aria2c or wget.
Datasets 9-11 (January 2026 release) are mirrored on the Internet Archive. Use as a fallback if DOJ links degrade or files are removed.