Implementation of the Clinical Data Interchange Standards Consortium (CDISC)'s Standard for Exchange of Nonclinical Data (SEND) by the United States Food and Drug Administration Center for Drug Evaluation and Research (US FDA CDER) has created large quantities of SEND data sets and a tremendous opportunity to apply large-scale data analytic approaches. To fully realize this opportunity, differences in SEND implementation that impair the ability to conduct cross-study analysis must be addressed. In this manuscript, a prototypical question regarding historical control data (see Table of Contents graphic) was used to identify areas for SEND harmonization and to develop algorithmic strategies for nonclinical cross-study analysis within a variety of databases. FDA CDER's repository of >1800 sponsor-submitted studies in SEND format was queried using the statistical programming language R to gain insight into how the CDISC SEND Implementation Guides are being applied across the industry. For each component needed to answer the question (defined as "query block"), the frequency of data population was determined and ranged from 6 to 99%. For fields populated