Toxicology as a discipline is moving toward increased standardization of methods for testing and reporting end points in model organisms for improved, cross-laboratory data integration. Labs engaged in high-throughput analyses have established handbooks to internally standardize results, but these standards are rarely applied across laboratories. This poses a problem for integrating and comparing data across labs, which could support more robust corroboration in the confidence of results from toxicological assessments. To set a baseline, an informal survey was created asking participants to score 24 lateral images of zebrafish larvae for chemical-induced malformations (i.e., abnormalities) according to their laboratory practices and standards. Participants were allowed to answer using free text. Eighteen researchers from fourteen different labs participated. The free text results included 1748 terms which were mapped to 48 traits from the Zebrafish Phenotype Ontology (ZP) for comparison. Both the reported endpoints and the terms used to describe them were very heterogeneous and had little agreement - even the most straightforward of endpoints, normal and dead embryos, were not consistently identified. Abnormalities of the heart and yolk were the most consistently identified (67% and 63% agreement, respectively) while abnormalities of the pectoral fin and the gut were least consistently identified (6% and 11% agreement, respectively). Abnormalities of the gut, pectoral fin, and otic vesicles were the most heterogeneously described while abnormalities of the trunk, snout, and heart were the most homogeneously described. Participants were able to score every image, all images were lateral views at stage 96 hours post fertilization (hpf) which were taken with the VAST (Vertebrate Automates Screening Technology)-System. It is clear from these results that the community requires improved standards for defining and scoring toxicological endpoints. Future work will include conducting a study that includes a controlled vocabulary with textual definitions for scoring embryo images, to determine if this improves consistency.