Abstract: Techniques of the present disclosure relate to validating data for a composition design. A method comprises applying a machine learning model to at least two inputs comprising parameters of a cement composition and experimental conditions such that the machine learning model outputs at least one predicted property of the cement composition; performing a laboratory experiment to determine at least one experimental property of the cement composition; calculating an error between the at least one predicted property and the at least one experimental property; and recording the experimental data in a cement property database if the error is within an error range or repeating the performing the laboratory experiment if the error is outside the error range.