What are the common data preprocessing steps for materials datasets before applying machine learning algorithms? How do researchers deal with missing data or outliers?
For your second question nowadays it is very common that researchers using forecasting methods ( mostly Machine Learning methods ) to produce new data same as missed data .