01 January 1970 13 3K Report

Data Quality impacts the accuracy and meanings of machine learning for industrial applications. Many industry practitioners do not know how to find/use the right data. There are two levels of data: visible vs. invisible. Most the visible data are from problem areas or based on our experiences. General questions for the visible data are: First, how to find the useful data? Second, how to evaluate which data is usable? Third, which data is most critical? As for the invisible data, vert often people use either an ad-hoc or trial-and-error approach to find and seek but often the work can not be reproduced by others. We need a systematic approach to address the data quality issues in AI for industrial applications. We welcome people to share their research experiences and thoughts.

More Jay Lee's questions See All
Similar questions and discussions