Open dataset · 2018–2026
Which industries face the hardest path to Wikipedia notability — from real WikiBusines assessments, 2018–2026.
Headline finding
The gap between the hardest and easiest industries is not marginal — it is structural, driven by how Wikipedia's reliable-source policies interact with each industry's media ecosystem.
Hardest
98% / 96%
Crypto / Web3 and Gambling face low notability confidence in nearly all cases — crypto-native and gambling media largely fails WP:RS; mainstream coverage rarely reaches individual subjects.
Mid-range
51–75%
Healthcare, IT, Education, Legal and Manufacturing sit in the middle: qualifying coverage exists in principle but is uneven — the outcome depends heavily on the individual subject's media footprint.
Most viable
75–81%
Finance, Media, PR and Politics reach medium-or-high confidence in the majority of cases — dense independent coverage from Bloomberg, FT, trade press, and public records creates a qualifying source base.
The data
Approximately half of all WikiBusines engagements were unclassified ("Other") and are excluded from this view. Only industries with a sufficient sample are shown, so no figure can be traced to an individual client (see methodology).
#1
low
#2
low
#3
low
#4
low
#5
low
#6
low
#7
low
#8
low
#9
low
#10
low
#11
low
#12
low
#13
low
Why the gaps exist
Notability difficulty is not random. It is determined almost entirely by one question: does an independent, reliable media ecosystem exist that covers subjects in this industry in depth? The answer varies enormously by vertical.
Crypto is the clearest case of a structural mismatch between industry size and Wikipedia eligibility. The primary journalism ecosystem — CoinDesk, CoinTelegraph, Decrypt, The Block, Cointribune — is treated by Wikipedia editors as industry-partisan press that fails WP:RS. Coverage in these outlets does not contribute to notability under Wikipedia policy, regardless of article depth or editorial quality. Mainstream outlets (Reuters, Bloomberg, the Financial Times) do cover crypto, but almost exclusively at the asset-class or exchange level: Bitcoin price movements, Binance regulatory actions, FTX collapse. Individual crypto projects, founders, DAOs and tokens below the very top tier receive no mainstream coverage at all — and the coverage that does exist is often generated by press releases, white papers (primary sources) or sponsored promotional content.
The result: 98% of the 107 crypto / Web3 subjects assessed by WikiBusines between 2018 and 2026 lack the qualifying independent coverage Wikipedia requires. This is not a problem that marketing spend or a better PR agency fixes — it is a category-level structural problem that requires mainstream editorial coverage, earned one story at a time, before a Wikipedia article becomes defensible.
Gambling faces a double headwind. Reputation concerns cause many mainstream publishers to minimise promotional coverage of gambling brands, and the specialist media covering the sector — Gambling Insider, iGaming Business, EGR — occupies a similar position to crypto-native media: industry trade press that Wikipedia editors routinely question for independence and reliability. Most iGaming operators also generate coverage primarily through affiliate networks and sponsored content, which is automatically excluded under WP:ORGIND. The rare exceptions are publicly listed operators (Entain, Flutter, MGM) that attract genuine financial journalism — but the vast majority of the iGaming landscape operates below that visibility threshold.
B2B industries operate largely outside the consumer press. A manufacturer of industrial components may be a market leader in its segment, employ thousands, and have decades of operating history — yet generate almost no coverage in outlets Wikipedia recognises as reliable, because trade press (industry journals, procurement publications) is often treated as specialist rather than mainstream and company-generated case studies are primary sources. Energy companies at national scale are an exception: their regulatory filings, government contracts and infrastructure projects attract business-press coverage. Logistics and supply-chain firms benefited from unusual mainstream attention during and after 2020, but most remain below the Wikipedia visibility threshold.
Charitable organisations face a counterintuitive challenge: the organisations themselves often generate their own coverage through press releases, impact reports and donor communications — all primary sources that fail WP:RS. Genuine editorial coverage by independent journalists is sparse unless the NGO is large enough to attract scrutiny (Amnesty International, MSF) or involved in a significant news event. Smaller national or local charities, professional associations and industry bodies rarely accumulate the three or more in-depth independent pieces that Wikipedia's General Notability Guideline requires.
Tech companies sit in the middle of the difficulty distribution, and the reason is variance. The tech media ecosystem (TechCrunch, Wired, The Verge, Ars Technica) is large and Wikipedia-accepted, but only a fraction of the tens of thousands of SaaS companies ever receive editorial coverage there. Most receive coverage only on funding announcements — and Wikipedia policy explicitly excludes routine press releases about investment rounds as notability evidence. AI and cybersecurity companies have benefited from increased mainstream interest since 2022–2023, but the supply of qualifying coverage still does not match the scale of the industry. This is the largest vertical in our dataset and shows exactly the bifurcation one would expect: well-funded, category-defining companies at the top of the distribution, and the long tail of series-A-and-below startups that have no qualifying press footprint.
Law firms operate under client-confidentiality constraints that suppress the natural source of independent coverage: case outcomes. High-profile litigation that reaches a verdict becomes public record and attracts mainstream coverage; everyday commercial work does not. The leading global firms (Kirkland, Latham, Linklaters, Freshfields) are covered regularly in the legal press (The American Lawyer, Legal Business) and national business sections, but national and regional firms outside the top tier rarely accumulate qualifying coverage. Law is also a profession where individual partners are the primary brand — but individual lawyers face Wikipedia's person-notability bar, which is generally harder to meet than company notability.
Healthcare sits close to the midpoint of the distribution, but for an unusual reason: a double standard. Wikipedia's WP:MEDRS policy applies a stricter reliability threshold to medical claims — peer-reviewed literature and established medical bodies are preferred, while mainstream press is often insufficient to source medical facts. Large pharmaceutical companies (Pfizer, Roche, AstraZeneca) and major hospital networks have abundant qualifying coverage and meet notability easily; mid-tier pharma, wellness brands and medical device companies face a tougher path. Wellness in particular suffers from a reputation problem: products with marginal evidence bases attract Wikipedia editors who decline to create or maintain articles that cannot be sourced to reliable medical evidence.
Politicians and public officials benefit from structural notability advantages: government activity generates mandatory public records, and political coverage is among the oldest and most voluminous categories in mainstream journalism. National-level politicians — members of parliament, ministers, mayors of significant cities — are almost automatically covered by news outlets that meet WP:RS. The 38% low-confidence figure in this dataset reflects subjects below that threshold: local councillors, advisors, deputy ministers without personal media coverage, campaign managers, and political consultants whose activity is documented in primary sources (government websites, official records) but not independently reported.
Finance is structurally the easiest major industry for Wikipedia notability, and the reason is straightforward: the financial system is one of the most extensively documented sectors in the world. Bloomberg, the Financial Times, Reuters, the Wall Street Journal, Euromoney and hundreds of specialist publications cover financial institutions in depth and independence. Regulatory filings, stock exchange disclosures and central bank records are primary sources that support the secondary coverage. Fintech companies at growth stage benefit from the same coverage ecosystem that tech companies have access to, with the added legitimacy signal of FCA, SEC or equivalent regulatory registration. The 4% high-confidence figure and 77% medium-confidence figure confirm that the majority of Finance subjects in WikiBusines' pipeline arrive with usable qualifying coverage — the challenge shifts from "do sources exist?" to "are they strong enough for the specific claims the article makes?"
The pattern in one sentence
Wikipedia notability difficulty tracks almost perfectly with one variable: whether the industry has a dense, independent, editorially rigorous journalism ecosystem that covers individual subjects — not just the industry as a category. Where that ecosystem exists (Finance, Media, Politics), notability is achievable for most subjects. Where it is absent or structurally hostile (Crypto, Gambling), no amount of effort substitutes for what is not there.
Methodology
Transparency about scope and bias is the methodology moat. A clearly labelled practitioner dataset is more credible than an anonymous survey.
Notability confidence is assessed per engagement using WikiBusines' internal source-scoring pipeline: every candidate source is evaluated against the six tests Wikipedia editors apply at Articles for Deletion — significant coverage, independent author, independent content, editorial reliability, secondary (not primary), and sufficient reach. The aggregate confidence rating (low / medium / high) reflects how clearly the subject's existing source base meets Wikipedia's General Notability Guideline (WP:NCORP for companies) at the time of assessment.
No client names, company names, article titles, prices, revenue, or identifiable deal details are published. Every industry cell in this dataset has N ≥ 15 (k-anonymity threshold), meaning the smallest group published contains at least 15 engagements — making re-identification of individual clients from the published data statistically infeasible. Industries below N = 15 are suppressed entirely, not rounded.
This dataset is WikiBusines' own practitioner pipeline — not a random sample of the global Wikipedia consultancy market. The classified engagements shown represent roughly half of total WikiBusines engagements 2018–2026; the remaining ~50% were unclassified ("Other") and are excluded from this view. To protect client confidentiality we publish only generalised percentage distributions — no per-industry counts, prices or volumes. Findings are specific to WikiBusines' client base and assessment methodology; they should not be extrapolated to the market as a whole.
Open data
Free to use, share and cite under CC BY 4.0. Please attribute WikiBusines and link to this page.
License & citation
License: CC BY 4.0 — free to use, share and adapt with attribution. Cite WikiBusines and link to https://www.wikibusines.net/notability-index.
WikiBusines (2026). Wikipedia Notability Index, v2026-v1. Retrieved from https://www.wikibusines.net/notability-index. CC BY 4.0.
Frequently asked questions
Next step
The index shows industry-level patterns. A notability audit maps your specific source base against the same criteria — and tells you whether you have a viable path, or what it would take to build one.
The notability audit maps your source base, scores Wikipedia readiness, and gives you a clear route — or an honest 'not yet' and what to do about it.