The number of publicly available proteomics datasets is growing rapidly, but a standardized approach for describing the associated metadata is lacking. Here, the authors propose a format and a software pipeline to present and validate metadata, and integrate them into ProteomeXchange repositories.