The Stack Overflow Collapse: Where Engineers Actually Get Answers in 2026
Stack Overflow's questions collapsed in 2025. The new taxonomy of where engineers actually get answers, and what teams should do about docs and onboarding
TL;DR
Reading the post…
The site that taught two generations of programmers how to code is now answering fewer questions per month than it did in its launch year. Where the questions went, and what we lose if we don’t think carefully about it, is more interesting than the decline itself.
In December 2025, Stack Overflow received roughly three thousand eight hundred questions across its entire platform. The number is staggering for what it represents: a seventy-eight percent year-on-year decline, a return to volumes the site has not seen since 2009, and the bottom of a curve that has been falling since 2014 and falling sharply since November 2022. A January 2026 data visualization by developer Sam Rose, drawn from Stack Overflow’s own public Data Explorer, made the collapse undeniable. Whatever your opinion of the site, the question-answering function it served for almost two decades has, statistically speaking, ceased to function.
This is not really a story about Stack Overflow. It is a story about where the question-and-answer behavior went, what kind of knowledge that flow used to create, and what teams should do about it. As Gergely Orosz documented in his “Stack Overflow is almost dead” piece, the trajectory has been visible for years. The conclusion has not been universally absorbed: most engineering organizations still operate as if a site that has lost ninety-eight percent of its question volume is a load-bearing piece of their developer experience. It isn’t. Building documentation strategy, onboarding plans, and open-source project policies around the old assumption is now actively hazardous.
What collapsed and what didn’t
Several different things were happening to Stack Overflow over the past decade, and conflating them produces bad analysis. Question volume started declining after the 2014 peak, well before ChatGPT existed, primarily because of aggressive moderation, duplicate-question closures, and a community culture many users found hostile. The site optimized for a small number of canonical, high-quality answers; the byproduct was that new users had a poor first-question experience. The site was already losing the long tail before AI arrived.
What ChatGPT did in November 2022 was remove the alternative. Before late 2022, even a developer who disliked Stack Overflow’s moderation often had nowhere else to go for an immediate answer. After late 2022, they did. The decline went from gradual to vertical. By May 2025, monthly questions were at 2009 levels. By December 2025, the platform was processing less than one fiftieth of its peak volume. Prosus, who bought the site for one and eight-tenths billion dollars in mid-2021, has pivoted the business model toward selling the site’s archived data to AI labs through API partnerships. Stack Overflow as a Q&A community is over; Stack Overflow as a training-data corpus is the going concern.
Several things did not collapse. The existing archive — fifteen years of canonical answers — remains valuable and is the substrate the AI tools answer from. Documentation sites for major frameworks (MDN, the Python docs, the Rust book) have held up much better; they serve a different function, providing authoritative reference rather than discussion. GitHub issues for active OSS projects have grown, partly because questions that used to be filed on Stack Overflow now get filed against the library’s issue tracker instead. Discord servers and niche subreddits for specific frameworks have absorbed some discussion-style questions. And a meaningful share of question-answering has moved into closed AI tools — Claude conversations, GitHub Copilot Chat, Cursor’s panel, ChatGPT side windows — which means the questions exist but the answers are not public, searchable, or available to train the next generation of either humans or models.
A taxonomy of question types and where they go now
The cleanest way to think about this is by question type. Different kinds of questions used to land on Stack Overflow for different reasons; the replacements differ by reason.
| Question type | Old default | Where it goes in 2026 | Why |
|---|---|---|---|
| ”How do I do X” (syntax, idiom) | Stack Overflow | LLM in IDE | Fast, in-context with code |
| ”Why is my code broken” (debugging) | Stack Overflow | LLM with code paste | LLM can see the actual code |
| ”What’s the canonical way to…” | Stack Overflow | Official docs + LLM | Need authoritative source |
| Architecture / design tradeoffs | Stack Overflow + blogs | Blogs, Twitter, internal Slack | Requires human judgment |
| Library-specific bugs | GitHub issues | GitHub issues | Maintainer presence required |
| Niche framework questions | Stack Overflow | Discord, GitHub Discussions | Closer to maintainers |
| ”Has anyone done X with Y” | Stack Overflow | Reddit, niche subreddits | Pattern-match real experience |
| Internal codebase questions | Internal wiki | Internal RAG over docs | Private context |
The pattern across the table is that each question type went to a different replacement, and most of those replacements are private or non-indexed. Syntax and idiom questions migrated to LLMs in the IDE, where they get answered in seconds, in-context, but the conversation is invisible to anyone except the person who asked. Architecture-style questions moved to blogs, social media, and conference talks, where the discussion is public but unstructured and not searchable the way Stack Overflow was. Niche framework questions moved into Discord servers, which are simultaneously closer to the maintainers and effectively unsearchable. Library bug questions moved into GitHub issues, which is good for the projects involved but means a former Stack Overflow user with a “is this a bug or am I holding it wrong” question now files a low-quality issue against a maintainer who is already drowning.
The economic effect is that the question-answering load got redistributed in ways that look fine in aggregate and are not fine in detail. The maintainers absorbing the GitHub-issues firehose are getting more low-quality submissions. The Discord moderators are doing volunteer triage with no archive. The LLM users are getting confidently wrong answers some fraction of the time — research has put the error rate on programming questions in the high tens of percent in some studies — without the social signal of a downvote to flag it.
The documentation strategy implications
For any team that maintains software others depend on, the strategic implication is sharp: official documentation is now the most important thing you publish. It is what LLMs train on. It is what your users will be quoting back to you in support tickets. It is what determines whether the AI tool gives users a correct answer or a hallucinated one. The documentation strategy that worked when Stack Overflow filled in the gaps no longer works. The gaps now produce wrong answers, not no answers.
Three practical shifts follow.
First, write for the LLM as a primary reader. The LLM is now the most frequent reader of your documentation, and the way it parses and weights content matters. Structured headings, explicit examples, clear parameter-by-parameter reference, and code samples that compile and run are dramatically more valuable than prose-heavy “introduction” pages. Conversely, marketing-language documentation is worse than useless — the LLM picks it up and quotes it as factual claims about the product.
Second, publish the FAQ as documentation. The kinds of questions that used to land on Stack Overflow are now landing in your support channels and in your users’ LLM conversations. Anticipate them. Write them down. Publish them as part of the documentation. The question “why does my JSON parsing throw on this input” should have an answer in your official docs, not in a Stack Overflow post nobody is writing anymore.
Third, invest in indexable, structured content. Discord transcripts, Slack archives, and YouTube videos are bad substrates for LLM training, which means they are bad substrates for whether your users get a correct answer from their AI tool. Markdown, official docs, blog posts on canonical domains, and GitHub README files are good substrates. The shift from “you’ll find a Stack Overflow post about it” to “the LLM will answer based on what’s in the docs” requires producing more, better-structured public content than most teams currently do.
The junior engineer onboarding problem
The harder, less-discussed effect is on junior engineer development. Stack Overflow was, for fifteen years, the primary way junior engineers learned how things worked. Not just “how do I do this,” but “here are five ways to do this, here’s why three of them are wrong, here’s the social signal of which one is preferred, here’s the discussion of edge cases.” The combination of seeing a question, seeing multiple answers, seeing the comments, and seeing the votes was a structured curriculum embedded in the workflow.
The LLM replacement is faster and more convenient but lossy in a specific way. The LLM gives a single answer. It is usually reasonable. It does not show the alternatives. It does not show the social signals about which alternative is preferred. It does not show the discussion of why an obvious-looking approach is actually wrong. The junior engineer gets a working solution and learns nothing about why the other solutions don’t work, because they never see them. The judgment-building exposure that came from reading the comments thread is gone.
This is a real problem and engineering leaders should treat it as one. Three responses.
Code review becomes more important as a teaching surface. The discussion that used to happen on Stack Overflow now needs to happen in PRs, and the senior engineers reviewing those PRs need to deliberately surface the “here’s why I would do it differently and what the tradeoffs are” reasoning that the LLM did not provide. Pair programming, mentoring, and structured curriculum for juniors all matter more than they did when the curriculum was implicit in the search results.
Internal documentation needs to compensate for what was previously externalized. The “here’s how we do things and why” knowledge that used to be implicit in “go read these Stack Overflow threads” is no longer implicit. It needs to be explicit, in internal docs, accessible through whatever internal RAG or AI search the team uses. The teams that have done this well are seeing junior engineers ramp faster than they did in the Stack Overflow era; the teams that have not are seeing the opposite.
Junior engineers need explicit instruction on how to use AI tools well — how to ask follow-up questions, how to verify answers, when to distrust a confident response, how to recognize a hallucinated API. The first generation of engineers who grew up with Stack Overflow learned skepticism implicitly because they saw downvoted answers and warring comment threads. The current generation does not have that exposure by default.
What OSS projects should do
Open-source projects are absorbing both the question load that used to go to Stack Overflow and the wider AI-slop problem. The practical patterns: enable GitHub Discussions and route question-style issues there explicitly; publish comprehensive FAQ documentation that LLMs can train on (the LLM is going to answer questions about your project either way; you might as well control what it has to work from); invest in maintainer-facing triage tools that filter AI-generated noise from real questions; and treat the documentation site as the primary canonical surface for everything the project wants AI tools to know.
For projects with a community Discord or Slack, the structural advice is to mirror the most valuable threads into searchable, indexable form. The convenience of real-time chat is real; the cost is that the answers never become part of the public record. The projects that have done this well — typically by maintaining a curated FAQ that captures the best of the Discord discussion — are getting both benefits. The ones that haven’t are watching their institutional knowledge accumulate in chat history nobody can search.
The Stack Overflow collapse is not, in itself, a tragedy. The site became less welcoming over fifteen years, and the technology that replaced it is genuinely better for many of the questions it used to answer. But the collapse did not produce a better version of public, searchable, peer-reviewed programming knowledge. It produced a privatized, faster, less-reliable substitute, with no equivalent of the social signals that used to teach beginners how to tell a good answer from a bad one. The teams that figure out how to fill in the gaps — through better documentation, more deliberate junior engineer development, and more conscious investment in indexable public content — will end up with better engineers and healthier projects than the teams that assume the LLM has it covered. The LLM does not have it covered. It has the easy half of it covered, very well, in private. The hard half — judgment, alternatives, tradeoffs, why-not — is the part that is now nobody’s job by default. Make it somebody’s, or watch the engineering depth of the next generation quietly thin out.