Discover the story in the sources.

Open Console

Sourcebase searches troves of millions of documents and transcripts no team could read in a lifetime , answers any question with a citation down to the exact page, paragraph, or timestamp , reads documents, audio, video, and structured data together in one collection , and turns your sources into a fully cited first draft with Foundry .

Click any citation to learn more.

1:1

Ask a whole collection your question. Every answer links to the exact page, paragraph, or timestamp, so you can read the source yourself.

Used by newsrooms including

Miami Herald NOTUS TIME The Post & Courier Sveriges Radio CBS News / 60 Minutes

Latest updates

New and updated collections as they go live.

Built for the work of reporting

A two-minute deadline check and a six-month investigation run on the same source base.

Newsroom investigations

Dig through large collections for the details others miss. Track findings across every document with exact citations.

Daily reporting

Ask a quick question on deadline and get a cited answer in seconds, sourced from the material you trust.

Fact-checking and verification

Trace any claim back to the document, page, or timestamp behind it, and confirm what the source actually says.

Research and backgrounding

Build the full picture on a person, place, or topic from everything in the collection before you write.

“I’ve also been using this program called Sourcebase.ai. You could say, look, I want to know everything about this prison guard, for example, and they’ll find everything in a way that I might not have been able to find it.”
Julie K. Brown Miami Herald · Pulitzer Prize Winner

The tools inside Sourcebase

Three ways to work the same sources, from a quick cited answer to a finished draft.

Chat

Ask a question to a collection and get an answer cited to the exact page, paragraph, or timestamp.

See Chat

Deep Research

Hybrid search that surfaces details buried across thousands of documents.

See Deep Research

Build and manage

Bring your own archive of text, audio, and data into one or many searchable collections.

See Build and manage

From sources to a first draft

Foundry runs a guided pipeline over your sources. Stop at any step and take what it has produced, or let it run all the way through.

Plan
Research
Draft
Review
Fact Check
Copy Edit
Present

Not just documents

Spreadsheets, documents, and the open web sit in one collection. A single question searches across all of them.

U.S. Treasury
6M+ rows

Federal spending data

Six million rows across 300 columns of raw U.S. Treasury spending, queried alongside the reporting that explains it.

FIFA World Cup
100K+ rows

Match data and coverage

More than 100,000 rows of World Cup results sitting beside 17,000 articles, cross-referenced in one query.

U.S. Immigration
1,300 reports

Spreadsheets and filings

125 spreadsheets and 1,300 immigration reports, analyzed together into answers you can cite.

Collections in action

Our expertly curated public troves that are ready to search for free.

The Epstein Files
Legal & Government

The Epstein Files

Over 3.5 million pages of court documents and emails, spanning the late 1990s through 2024, every answer cited to the page.

Ask Trump
Politics & Investigations

Ask Trump

Query transcripts from every speech and statement since 2007 with timestamp citations.

Put your sources to work

Open the console to search a public collection, or bring your own documents and start reporting.

Open Console

Tools built for discovery

Sourcebase reads through collections too large to comb by hand. You point it at the question, check what it finds, and bring the editorial judgment.

Chat in plain language

Journalists and readers ask questions and get answers grounded in original sources, with citations that link to the exact page, paragraph, or timestamp.

Follow up, dig deeper, ask for clarification. Sourcebase keeps context across the conversation so you can explore a document set thoroughly.

  • Answers always grounded in original sources
  • Citations link directly to the source material
  • Fast, Thinking, and Pro modes for any depth
A Sourcebase chat answer with inline citations
A citation opened to the exact deposition page, with the cited passage highlighted
Tap a citation to open the exact source

Find the details that matter

Surface newsworthy information in any large trove of documents. Hybrid search combines semantic and keyword matching, with freshness ranking and query rewrite.

Match against individual passages with precision, then summarize entire collections of effectively unlimited size.

  • Hybrid search across semantic and keyword
  • Passage-level matching for precise citations
  • Summarize collections at any scale
  • Query structured datasets and tables, not just text
Sourcebase Deep Research

Build and manage your own collection

Our architecture scales on demand, holding video, audio, text, and images in one collection with full context. Tested past 20,000 files.

Curate your own collections, or have our team build custom ones for your organization. Enterprise ready with SSO, security, and governance.

  • One collection for audio and text
  • White-glove collection curation available
  • Enterprise SSO, security, and governance
Sourcebase collection management

Foundry builds the story

When you are ready to go further, Foundry takes your sources from a story idea to a fully cited first draft.

Build a story with Foundry

Foundry works your sources into story ideas, a research guide, and a finished first draft, and you decide how far it goes. Here is the whole flow, step by step.

1

Choose your sources

Foundry only writes from sources you choose. Combine our public collections, your own interviews, notes, and recordings, and any documents you add just for this story.

Your source base

Use one, two, or all three

Public collections

Our curated public collections, ready to search on day one.

Your interviews, notes, and recordings

The original material you bring to the story.

Documents for this story

Files you add for now. Temporary and private, gone when you finish.

Whatever you include works together as one fully cited set of sources.

Web search runs throughout. Confirm a fact on the open web at any point while you write.

2

Start the story

Choose how much effort Foundry should spend on the story, then start.

Low effortFast. A quick pass over the sources.
Medium effortBalanced. Solid depth without the wait.
High effortThorough. Digs deep across every source.
3

Stop wherever you want

Foundry moves through a pipeline, but you are never locked in. Take the output at any point and stop, or keep going. Each step builds on the one before it.

Plan
Research
Draft
Review
Fact Check
Copy Edit
Present
What each step does
Plan.Surveys the available sources and proposes an angle, structure, voice, and length for you to approve before anything else happens.
Research.Searches across the collection and the web, pulling sourced facts and quotes into a research notebook, each tied to its passage. It loops until the evidence is sufficient, up to eight rounds.
Draft.Writes the article from the research notebook, citing every claim back to the source it came from, then strips out AI-isms, hollow phrasing, and filler summaries.
Review.Scores the draft on impact, substance, originality, completeness, and style match, then revises the weak spots.
Fact Check.Re-checks every claim against its cited source and flags anything that is not fully supported. A separate check then verifies every fix is itself sourced, and escalates to a human when it cannot resolve something on its own.
Copy Edit.Tightens the prose for clarity, grammar, and house style without touching the facts.
Present.Lays out the final story so it is ready to read, share, or export.

Story ideas

A clear angle with proposed structure, voice, and target length. Take it and write the piece yourself, or keep going.

Your story angle, ready to write
You can stop here

Research guide

A research notebook of sourced facts and quotes, each citing the exact page, paragraph, or timestamp.

Research Notebook
1:1
2:1
4:1
You can stop here

Custom draft

A first draft in the voice you chose, with every claim cited to its source. Ready for you to edit and publish.

Story Draft
3:2
Take it all the way
4

Let it run, then pick it up

A full run, all the way to a first draft, can take an hour or more. You do not have to sit and watch it work.

Leave and come back whenever you like. Foundry keeps working in the background, and emails you the moment your story is ready.

When it is ready, you can
Read the full storyOpen the finished piece right inside Foundry.
Open any sourceClick a citation to read the exact passage it came from.
Share within your orgSend it to teammates with a link.
Download itExport to multiple formats for editing or publishing.

Built to stay accurate

Every stage is checked and reversible, and the whole trail is yours to inspect. You stay in control from the first plan to the final draft.

  • Every citation is verified against its source.
  • Every action and output is checked by at least one other AI agent.
  • Escalates to a journalist whenever judgment is needed.
  • Full audit trail, with every version and edit preserved.
  • Customizable to your organization’s workflow.

Start building with Foundry

Open the console and turn your sources into a fully cited first draft. Stop at the idea, the research, or the draft — it is always your call, and every claim links back to where it came from.

Open Console
12Specialized agents
7Pipeline stages
6Quality gates
3Feedback loops
5Human checkpoints

Our curated collections

Expertly curated public collections for reporters and researchers. Connect one and start working immediately, or build your own.

Source-based and journalist led.

A team of media and technology veterans rebuilding how information is sourced, analyzed, and published.

Every output traces back to verified source material, with a human in charge at every step.

AI is going to reshape how information gets made and moved. Most tools today approximate knowledge and blur their sources together; some present fiction as fact. For anyone whose work depends on getting it right, that is not an inconvenience. It is a liability.

We rebuilt the workflow around that problem, so a human stays in charge at every step, even when the system is running on its own.

We design for the AI capabilities of the next few years, and the system can be adopted incrementally as practice evolves. Sourcebase ingests and analyzes content at scale, source by source.

The future of media will be built on its sources, with people still making the calls. We are building it.

Media and technology veterans

Ron Suskind

Ron Suskind

CEO

Pulitzer winner, award-winning author & film producer, 5 tech patents.

John Nguyen

John Nguyen

CTO

Co-founded Vlingo, acquired by Samsung.

Winston Chen

Winston Chen

COO

Founder & CEO of Voice Dream, Apple Design Award winner.

Michael Caruso

Michael Caruso

CBO

Former CEO and Editor of Smithsonian and The New Republic.

Glenn Kessler

Glenn Kessler

CCO

Editor and chief writer of The Washington Post Fact Checker.

What's new

New and updated collections as they go live.

Plans that scale with the work

Start free to prove the value on public collections. Step up to Pro for serious research and Foundry, or talk to us about a private deployment for your whole organization.

Free
For journalists getting started and proving the value.
$0forever
Start free
  • Select public collections
  • 40 chat turns (lifetime)
  • 40 searches (lifetime)
  • 20MB uploads
  • 1 Foundry story
Pro
For working reporters and researchers who live in the archive.
$200per month
Start with Pro
  • All Sourcebase collections
  • 1,000 chat turns / month
  • 1,000 searches / month
  • 1GB uploads / month
  • 5 Foundry stories / month
Enterprise
For newsrooms, law firms, corporations, and government.
Customcontact for pricing
  • Full platform access
  • Custom knowledge bases
  • Private data ingestion
  • White-label options
  • Dedicated support & SLA

Bring Sourcebase to your newsroom

Private collections, your own archive at any scale, and security built for organizations. Tell us what you need.