How are parton distribution functions determined from experimental data?

Parton distribution functions are extracted by comparing theoretical predictions from quantum chromodynamics to a wide range of experimental measurements. Experiments such as H1 and ZEUS at DESY and ATLAS and CMS at CERN supply the raw observables—deep inelastic scattering cross sections, Drell–Yan lepton-pair production, jet rates and electroweak boson yields—that probe quark and gluon momentum fractions inside hadrons. Reviews by Stefano Forte, University of Milan, and Gavin Watt, University of Glasgow describe how these disparate data are combined in a global fit to determine PDFs. Theoretical control over scale dependence comes from the DGLAP evolution equations, which allow PDFs measured at one momentum scale to be evolved to another.

Data sources and theoretical framework

The practical determination begins with a flexible parameterization of the PDFs at a chosen initial scale and perturbative calculations of cross sections. Modern fits use next-to-leading or next-to-next-to-leading order QCD matrix elements so that the same PDFs can be used across processes. The NNPDF collaboration pioneered a machine-learning approach to reduce parameterization bias; methodology is detailed by Richard D. Ball, University of Edinburgh. Experimental inputs are international and territorial: fixed-target experiments, electron–proton collider data at DESY in Germany, and proton–proton collisions at CERN in the Geneva region together constrain different momentum fractions and flavors.

Fitting methodology and uncertainty quantification

A statistical procedure adjusts the PDF parameters to minimize a global goodness-of-fit measure while accounting for correlated experimental systematic errors and theoretical uncertainties from missing higher orders or heavy-quark treatments. Different groups use alternative philosophies—Hessian error propagation, Monte Carlo replicas, or neural-network ensembles—leading to multiple consensus sets. Choice of functional form, treatment of nuclear corrections, and kinematic cuts can shift central values and uncertainties, so inter-group comparisons and combined recommendations are important for robust predictions.

The relevance of accurate PDFs extends beyond particle physics: they determine cross sections that underpin searches for new particles, affect precision measurements of fundamental constants, and guide large international investments in accelerators. The collaborative nature of PDF determination—thousands of researchers across laboratories such as DESY and CERN—adds cultural and institutional dimensions that influence data sharing, methodology adoption, and the speed at which improved fits appear. Consequences of underestimated uncertainties can be misinterpretation of experimental anomalies, while overconservative errors may obscure discovery potential, so transparent methodology and continual reuse of global data remain essential.