Hi

I am working on an algorithm that classifies objects into size-based classes such as big, small, tiny, etc.

For example, in dense captioning I want to say:

"It's a big elephant."

I thought of taking each object's width and height and comparing them to the image width and height, but there are many challenges, for example:

Do I need to compare my object to the other objects in the image? For example, how can I say "this is a big elephant" when I want to produce a caption only for a selected elephant, in an image that also contains a cat that is closer to the camera and therefore appears larger than the elephant?

Challenge:

A small cat is very close to the camera, and a big elephant is very far from the camera.
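
To make the problem concrete, here is a minimal sketch of the naive idea (comparing the bounding box to the image size). It assumes the bounding boxes are already available as (x, y, w, h) in pixels, the same layout that cv2.boundingRect returns. The function names, the thresholds and the example boxes are all made up for illustration.

```python
def relative_area(box, image_size):
    """Fraction of the image area covered by a bounding box.

    box is (x, y, w, h) in pixels; image_size is (width, height) in pixels.
    """
    _, _, w, h = box
    img_w, img_h = image_size
    return (w * h) / (img_w * img_h)


def naive_size_class(box, image_size, big=0.25, small=0.05):
    """Label a box by its share of the image area.

    The thresholds are arbitrary guesses, not tuned values.
    """
    ratio = relative_area(box, image_size)
    if ratio >= big:
        return "big"
    if ratio >= small:
        return "small"
    return "tiny"


image_size = (640, 480)
cat_box = (50, 150, 400, 300)       # hypothetical: cat close to the camera
elephant_box = (500, 100, 100, 80)  # hypothetical: elephant far away

print("cat:", naive_size_class(cat_box, image_size))
print("elephant:", naive_size_class(elephant_box, image_size))
```

With these made-up boxes the cat is labelled "big" and the elephant "tiny", which is the opposite of what the caption should say. The pixel size alone does not account for the distance to the camera or for the typical size of each object class.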

Any ideas, and any Python code using OpenCV, would be appreciated.

Thanks,

Alireza Mirhabibi.
