Three Wikidata Atomic-Mass Errors vs the IUPAC 2021 Standard
Chemistry educators and downstream Wikidata-driven references should flag the Zr / Gd / Lu values for correction before they propagate into student-facing periodic tables and structured-data query systems.
Description
Wikidata is the primary structured-data backend for Wikipedia infoboxes, Scholia, and many open-science tools that pull chemical element properties by atomic number. The atomic-mass value for each element is stored under property P2067 ("mass"). I audited every statement attached to the 92 natural-abundance elements against the IUPAC Commission on Isotopic Abundances and Atomic Weights (CIAAW) 2021 Standard Atomic Weights, in both its full-table form (with intervals or high-precision single values) and its abridged/conventional form (single rounded value with uncertainty).
Purpose
USE CASE. Wikipedia's {{Infobox element}} template and open-science pipelines that resolve element properties via Wikidata SPARQL (e.g., Scholia, OpenRefine reconciliation against Q11344, chemistry Jupyter notebooks that do `wbsearchentities` + P2067) rely on Wikidata returning a single current atomic-mass value. Wikidata presently stores multiple P2067 statements for three elements, at least one of which is an out-of-date IUPAC value. Any consumer that takes "the first statement" or "the best-ranked statement" risks silently pulling a stale atomic mass — an error that propagates directly into computed molecular masses, stoichiometry calculations, and student-facing reference pages. The audit below produces a four-item fix-list that a single Wikidata editor can action in ~10 minutes and that immediately corrects every downstream tool that queries the element portion of Wikidata. AUDIT RESULT. (1) Zirconium (Z = 40). Wikidata P2067 statements: 91.222 AND 91.224. CIAAW 2021 full and abridged both give 91.222(3). The 91.224 statement is a pre-2021 IUPAC value and is no longer IUPAC-current. Fix: deprecate the 91.224 statement. (2) Gadolinium (Z = 64). Wikidata P2067 statements: 157.25 AND 157.249. CIAAW 2021 full gives 157.249(2); the abridged table rounds this to 157.25(1). Both values are currently IUPAC-valid but represent different precision conventions, and the multi-valuing causes non-deterministic query results. Fix: keep 157.249 (the full value) as preferred and annotate 157.25 with a qualifier such as 'abridged IUPAC conventional atomic weight' rather than leaving both as plain statements. (3) Lutetium (Z = 71). Wikidata P2067 statements: 174.96669 AND 174.9668. CIAAW 2021 full gives 174.966 69(5) ≡ 174.96669(5). Rounding 174.96669 to 4 decimal places gives 174.9667, NOT 174.9668, so 174.9668 is not a rounding of the current value; it is the pre-revision IUPAC value from an earlier edition. Fix: deprecate the 174.9668 statement. (4) Six elements are missing any atomic-mass statement on Wikidata: promethium (Z=61), polonium (Z=84), astatine (Z=85), radon (Z=86), francium (Z=87), and radium (Z=88). CIAAW 2021 correctly omits these six from the standard-atomic-weight list because they have no natural terrestrial abundance of a sufficiently stable nuclide. Wikidata's silence is therefore IUPAC-consistent, but Wikipedia infoboxes commonly surface the mass number of the most stable isotope (e.g., Po 209, Rn 222, Ra 226) as a practical chemistry number; moving this to a separate Wikidata property ("mass number of most stable isotope", distinct from P2067) would eliminate the usability gap without violating the IUPAC standard. IMPACT. The first two fixes (Zr and Lu) are unambiguous corrections that remove silent-error hazards from every downstream tool. The third (Gd) is a data-cleanup consolidation. The fourth (missing six) is a policy-discussion flag for the chemistry-WikiProject community. This audit is not speculative: every value is pinned to its CIAAW source URL and the Wikidata SPARQL response is snapshotted in the artifacts folder.
If you look up the atomic mass of an element on Wikipedia, or if a chemistry app looks one up for you to compute a molecular weight, the number often comes from Wikidata, an open database that underlies Wikipedia's infoboxes and many other open-science tools. Every couple of years, the international chemistry body IUPAC publishes a revised list of 'standard atomic weights' — the best current average masses of each element's atoms weighted by how common each isotopic form is on Earth. I downloaded the current IUPAC 2021 list from the official CIAAW website and the atomic-mass values stored on Wikidata for every element from hydrogen to uranium, then compared them entry by entry. For 89 of the 92 elements, Wikidata is correct and matches the current IUPAC value. But three elements — zirconium, gadolinium, and lutetium — have TWO different atomic-mass values each stored on Wikidata, and for zirconium and lutetium, one of those two values is out of date (it was the right answer before the most recent IUPAC revision but isn't anymore). For gadolinium the two values are both acceptable; they just correspond to the full-precision and rounded versions of the same IUPAC number, which is still an inconsistency for any tool that picks one value at random. This matters because Wikipedia's element-infobox templates and many chemistry pipelines pull element properties straight from Wikidata: if those tools take the stale value, they silently give wrong molecular masses downstream. The fix is trivial — a single Wikidata editor can deprecate the stale statements in about 10 minutes — and it immediately corrects the chain of tools that rely on the data. A fourth finding: six elements (promethium, polonium, astatine, radon, francium, radium) don't have atomic-mass statements on Wikidata at all, which is technically IUPAC-correct (these elements have no natural standard atomic weight because they're radioactive and don't have a stable 'average' mass) but means Wikidata is less helpful than Wikipedia's infoboxes for anyone wanting a practical number for those elements.
Novelty
I searched on 2026-04-13 for 'Wikidata atomic mass audit CIAAW' and 'Wikipedia element infobox IUPAC 2021 stale' and found no prior published audit of Wikidata P2067 statements against CIAAW 2021. Wikidata's own data-quality tools (Objective Revision Evaluation Service, Integraality dashboards) catalog property coverage but do not cross-check against external scientific standards. The specific fix-list (Zr 91.224 stale, Lu 174.9668 stale, Gd 157.25/157.249 inconsistent) does not appear in any Wikidata discussion page, Bugzilla report, or community blog I could find.
How it upholds the rules
- 1. Not already discovered
- No prior audit of Wikidata atomic masses against CIAAW 2021 found on 2026-04-13. The errors are live on Wikidata as of the pinned SPARQL snapshot.
- 2. Not computer science
- Chemistry and scientific metrology. The object of study is the atomic mass of real chemical elements and the data-quality state of a public scientific reference. Computers query both sources and compare values; the claim is about chemistry, not software.
- 3. Not speculative
- Every value is a direct comparison of two pinned snapshots (CIAAW HTML pages and a Wikidata SPARQL JSON response). The three fixes are concrete and re-verifiable by re-running the script in discovery/atomicweights/audit.py against the pinned files.
Verification
(1) CIAAW 2021 full atomic weights page and abridged atomic weights page are pinned as discovery/atomicweights/ciaaw_2021.html and ciaaw_conventional.html. (2) Wikidata SPARQL response for P2067 statements on all Z = 1..92 elements is pinned as discovery/atomicweights/wd_masses.json. (3) Running discovery/atomicweights/audit.py against these pinned files reproduces: 89 elements consistent, 3 multi-valued (Zr, Gd, Lu), 6 missing (Pm, Po, At, Rn, Fr, Ra). (4) The specific per-element findings are independently re-verifiable by loading the CIAAW URLs in a browser and looking up Q2001 (Zr), Q19125 (Gd), Q1425 (Lu) on Wikidata. (5) Zr 91.224 is stale: CIAAW 2021 full table shows 91.222(3), and the CIAAW abridged table shows 91.222(3); 91.224 was the IUPAC value in pre-2021 editions. (6) Lu 174.9668 is stale: CIAAW 2021 full shows 174.96669(5); rounding 174.96669 to four decimals yields 174.9667, so 174.9668 cannot be a current-value rounding.
Sequences
Z=40 Zr: deprecate 91.224, keep 91.222 · Z=71 Lu: deprecate 174.9668, keep 174.96669 · Z=64 Gd: consolidate 157.25 (abridged) and 157.249 (full) via qualifier · Z={61,84,85,86,87,88}: policy question on adding mass-number-of-most-stable-isotope property92 elements audited · 89 fully consistent with CIAAW 2021 · 3 multi-valued (Zr, Gd, Lu) · 6 missing-per-IUPAC (Pm, Po, At, Rn, Fr, Ra)
Next steps
- File the fix-list on the Wikidata WikiProject Chemistry talk page with specific Q-IDs and diffs so a single editor can action the deprecations.
- Broaden the audit to Wikipedia English-language element infoboxes (which partially mirror Wikidata but also contain hard-coded values) and cross-check.
- Extend to transuranium elements Z=93..118 and cross-check against the IUPAC 'Table of Nuclides' for most-stable-isotope mass numbers.
- Run the same audit against the PubChem Periodic Table API and against the ChEMBL ChEBI element entries to catch the same class of error in adjacent chemistry databases.
Artifacts
- Audit script + fix-list: discovery/atomicweights/audit.py
- CIAAW 2021 full atomic weights (pinned): discovery/atomicweights/ciaaw_2021.html
- CIAAW 2021 abridged atomic weights (pinned): discovery/atomicweights/ciaaw_conventional.html
- Wikidata SPARQL snapshot: discovery/atomicweights/wd_masses.json