Your Copilot Is Only as Smart as Your SharePoint: Why Stale Files Are Hallucinations Waiting to Happen

Three categories of trouble

Stale policies. The 2019 expense policy capped lunch reimbursements at $25; the 2024 update raised it to $40. If both PDFs are in the same SharePoint site and the 2019 one has a more “official-sounding” filename, Copilot may quote $25 to a new hire who then submits a $30 lunch and gets denied.

Duplicate-with-drift. Sales has version 1.0 of the price sheet in their site. Marketing has version 1.0 in theirs. Six months later, Sales updated theirs to 1.1 but Marketing didn’t. A prospect asks Copilot for current pricing through their account manager, who’s in Marketing. They get 1.0 prices. The deal closes at the wrong number.

Orphaned answers. The 2018 acquisition’s product wiki is still in the tenant — nobody owns it, nobody maintains it. A support agent asks Copilot about a feature, Copilot finds the answer in the orphaned wiki, the feature was deprecated in 2020. Customer is told it still exists.

In every case, the LLM is doing its job. The data is doing the lying.

Why this gets worse with Copilot, not better

Before Copilot, stale content was passively bad — it sat there until someone happened to read it.

Most of the time nobody did. Copilot changes the consumption model: every question pulls in candidate content, ranks it, and serves it.

Stale content goes from passively harmful to actively retrieved. The footprint of “files nobody looks at anymore” shrinks to zero — Copilot looks at everything.

The fix is operational, not technical

Microsoft’s response to this problem is “use sensitivity labels and information protection.”

That’s correct but slow — it’s a multi-year program and most organisations don’t finish it.

There’s a faster intervention available: identify the content that’s most likely to mislead Copilot (old, duplicate, unowned) and either archive, delete, or update it.

A scan that produces this inventory takes under an hour for most tenants. The cleanup itself can be staged over a quarter — archive stale sites, dedupe price sheets, assign owners to orphans.

The result isn’t perfect Copilot accuracy, but it removes the most common categories of corporate hallucination.

A storage scan and a Copilot-readiness scan are essentially the same scan. Storage savings are the financial argument; Copilot accuracy is the operational one. Most tenants we work with start the project for the financial reason and discover the operational benefit was worth more.

The 2019 Org Chart Strikes Back: A Copilot Hallucination Story

An anonymised account of what happens when Copilot grounds in a stale tenant — and what the post-mortem revealed about the cleanup we should have done first.