Hi

I would like to use Machine Learning methodology for automatic image-based classification of precious stones (like diamonds, sapphire, ruby, etc).. . So I have more then 100 000 images of these stones . So far my understanding CNNs are suitable for handling spatial data (images). But according to this paper Bona Hiu Yan Chow, ‘Automatic Gemstone Classification Using Computer Vision’ Article Automatic Gemstone Classification Using Computer Vision

the Random Forest algorithm provide the best accuracy and outperformed the CNN ( ResNet-18 and ResNet-50 ) which I dont really understand . Any explanation why is the case? Is the state of the art YOLO v8. or v7. more accurate as it works better for smaller objects such as stones?

So for my case when have a bid data set (over 100 000 images) should I stick to CNN or still use Random Forest? Any help, as I have to choose the right model with best accuracy

Thanks

Similar questions and discussions