When the Expert Says No, but You Buy Anyway
Federal experts called Microsoft's cloud a 'pile of shit' — then approved it. The gap between technical judgment and institutional decisions is your problem too.
The federal government's cloud procurement story has a detail that should stop anyone who manages platform decisions cold: technical experts at CISA privately called Microsoft's cloud infrastructure a "pile of shit" — and then signed off on it anyway.
That's not a leak from a disgruntled employee. According to Ars Technica's reporting, it's documented in internal reviews surfaced through congressional scrutiny following the 2023 Microsoft Exchange breach that exposed senior U.S. government email. The experts knew. They approved it anyway.
Strip away the government context and this is one of the most common stories in technology procurement: the people who understand the technical risk are overridden by the institutional forces that make changing course too expensive to contemplate.
The Gap Between What Experts Know and What Organizations Do
Product managers live in this gap constantly. A senior engineer flags that the chosen infrastructure vendor has persistent reliability concerns. A researcher surfaces data showing the proposed solution doesn't match how users actually behave. A platform architect warns that the current stack won't survive 10x growth.
And then the decision goes the other way anyway — because switching costs are already baked in, because procurement has an existing relationship, because the alternative means retraining half the organization, because the vendor's relationship with leadership is stronger than the internal dissent.
The Microsoft case makes this dynamic unusually visible because it played out at national scale with documented consequences. The Exchange breach — attributed to Chinese state hackers — compromised accounts across the State Department and other sensitive agencies. The breach vector was Microsoft's cloud infrastructure. The experts had concerns on record. The contracts continued.
At QCon London this week, distributed systems researcher Martin Kleppmann made the argument for local-first software as a direct response to exactly this pattern. Europe's growing discomfort with dependency on U.S. cloud providers isn't abstract geopolitics — it's the realization that once you're architecturally embedded, the exit ramp is extraordinarily expensive. His framing: cloud dependency isn't just a technical risk, it's a sovereignty risk.
The Changelog's editors flagged this week that "the tech monoculture is finally breaking" — pointing to a broader pattern of organizations starting to question whether the convenience of a single dominant platform vendor is worth the lock-in it creates.
Why Institutional Momentum Beats Expert Opinion
Expert opinion is only one input into platform decisions, and often not the most powerful one. Three forces tend to win instead.
Switching costs compound invisibly. Every integration built on a vendor's APIs, every workflow shaped around a vendor's interface, every hire made because of familiarity with a vendor's tooling — each one raises the cost of leaving. By the time the expert opinion arrives, the organization is already three years deep and the exit math is brutal.
Accountability is diffuse. When a committee approves a platform decision, no single person owns the outcome. The expert who raised the flag gets noted in the minutes. The committee that overrode them faces no personal consequence when the flag turns out to be right. This is textbook diffusion of responsibility, and it's why "we documented the risk" becomes a substitute for actually mitigating it.
Vendor relationships operate on a different timeline than technical risk. A sales relationship built over years, with executive sponsorship and favorable contract terms, is a present and tangible asset. A technical risk is probabilistic and future. Humans — and organizations — systematically discount future risks against present benefits.
The federal government case isn't an anomaly. It's the archetype made visible.
What This Means for Platform Decisions on Your Roadmap
If you manage roadmap decisions that involve platform dependencies — and in 2026, almost every product roadmap does — the Microsoft story offers a few uncomfortable prompts worth taking seriously before you're too deep to course-correct.
Treat "switching cost" as a metric, not a vibe. When evaluating any platform dependency, actually model the exit scenario. What would it cost in engineering hours to migrate off this vendor in 18 months? In 36 months? Most organizations never run this exercise until it's urgent — at which point the answer is always worse than expected. Kleppmann's local-first argument is partly a technical one, but it's also an argument for keeping exit costs quantified and low.
Separate the technical assessment from the vendor relationship. This sounds obvious. It almost never happens. The people doing technical due diligence are often the same people who have worked with a vendor's representatives for years and have genuine goodwill toward them. Build a process where the technical risk assessment is documented independently of the procurement conversation, and where someone with no relationship stake has sign-off authority on the risk section.
Document dissent formally, not just in meeting notes. If a technical expert flags a platform risk and the organization proceeds anyway, that should generate a formal risk acceptance document — not just a bullet in someone's notes. This isn't about covering your back. It's about making the decision legible: we knew about this risk, we weighed it against these factors, and we accepted it for these reasons. That documentation forces the tradeoff to be explicit, and it creates accountability when the risk materializes.
Revisit platform decisions on a cadence. Most organizations make a platform decision, implement it, and then treat it as settled. But the risk profile of a vendor changes over time. Concentrating critical infrastructure with a single provider that has documented reliability concerns isn't a decision you make once — it's a decision you should be remaking every 12-18 months with fresh information.
The Approval That Shouldn't Have Happened
The federal government's Microsoft situation isn't a cautionary tale about cloud infrastructure specifically. It's a cautionary tale about what happens when the distance between "the experts know" and "the institution acts" gets too wide.
That distance exists in every organization that's large enough to have separated the people who understand the technical risk from the people who sign the contracts. Ars Technica's reporting makes it stark: the people with the technical judgment called it a "pile of shit." The institutional machinery approved it anyway.
The question worth sitting with isn't "how did the government get here?" It's "where is my organization already here, and don't know it yet?"
The most dangerous platform risks on your roadmap aren't the ones your organization is arguing about. They're the ones everyone privately agrees are a problem but that haven't been formally raised — because raising them creates friction, and the contract is already signed, and switching would be too expensive.
Until it isn't.