I assume your algorithm detects objects in images (e.g., faces, ...).
You can count the true positives (TP), false positives (FP) and false negatives (FN).
Here, TP is the number of correct detections, i.e., those that coincide with a ground-truth object; FP is the number of false detections (your method thinks there is an object where there is none); and FN is the number of ground-truth objects that your method has missed.
You can consider an automated detection a TP if it has high overlap with the ground-truth object (you can use the Dice ratio as the overlap measure).
You can simply report the TP/FP/FN counts, or plot a ROC-style curve of true positives against false positives.
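To make the counting concrete, here is a minimal sketch in Python. The box format, the greedy matching, and the 0.5 threshold are illustrative assumptions, not a fixed standard:

```python
# A minimal sketch: boxes are (x1, y1, x2, y2) tuples; the 0.5
# threshold and the greedy matching strategy are assumptions.

def dice_overlap(a, b):
    """Dice ratio between two axis-aligned boxes: 2*|A∩B| / (|A| + |B|)."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return 2.0 * inter / (area_a + area_b) if (area_a + area_b) > 0 else 0.0

def count_tp_fp_fn(detections, ground_truth, threshold=0.5):
    """Greedily match each detection to an unused ground-truth box."""
    matched = set()
    tp = fp = 0
    for det in detections:
        best, best_idx = 0.0, None
        for i, gt in enumerate(ground_truth):
            if i in matched:
                continue
            d = dice_overlap(det, gt)
            if d > best:
                best, best_idx = d, i
        if best >= threshold:
            tp += 1
            matched.add(best_idx)  # each ground-truth box counts only once
        else:
            fp += 1                # no sufficiently overlapping object
    fn = len(ground_truth) - len(matched)
    return tp, fp, fn
```

Sweeping the threshold (or the detector's confidence cutoff) over a range of values is what produces the curve mentioned above.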
The most appropriate way is to compare the detections of your algorithm with a ground truth. The ground truth is usually the delineation or detection done manually by experts (observers). So if you have a number of images, you may ask some experts to delineate them; then you can compare the manual tracings with your automated segmentation tracings. There are a number of metrics that can be used towards this direction, among them the true positives, true negatives, false positives and false negatives, as Gerard said before. There are also other metrics that can be used, such as the overlap, the Williams index, the mean square error, and many more. Please have a look at a recent publication of ours where we compare manual and automated tracings in order to evaluate the performance of a video segmentation algorithm:
C.P. Loizou, S. Petroudi, C.S. Pattichis, M. Pantziaris, A.N. Nicolaides, “An integrated system for the segmentation of atherosclerotic carotid plaque in ultrasound video,” IEEE Trans. Ultrason. Ferroelectr. Freq. Control, vol. 61, no. 1, pp. 86-101, 2014.
You may also have a look at another publication of ours where a similar problem was addressed:
C.P. Loizou, C.S. Pattichis, M. Pantziaris, A. Nicolaides, “An integrated system for the segmentation of atherosclerotic carotid plaque,” IEEE Trans. Inf. Technol. Biomed., vol. 11, no. 6, pp. 661-667, 2007.
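As a rough illustration of the pixel-level comparison described above, here is a minimal sketch in Python; the array-based mask format is an assumption, and the Williams index and the other metrics used in the papers are not reproduced here:

```python
import numpy as np

def mask_metrics(auto_mask, manual_mask):
    """Pixel-wise agreement between an automated and a manual binary mask."""
    a = auto_mask.astype(bool)
    m = manual_mask.astype(bool)
    tp = np.sum(a & m)    # pixels both tracings mark as object
    tn = np.sum(~a & ~m)  # pixels both tracings mark as background
    fp = np.sum(a & ~m)   # automated-only pixels
    fn = np.sum(~a & m)   # manual-only pixels
    dice = 2.0 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 1.0
    overlap = tp / (tp + fp + fn) if (tp + fp + fn) else 1.0  # Jaccard
    return {"TP": int(tp), "TN": int(tn), "FP": int(fp), "FN": int(fn),
            "Dice": dice, "Overlap": overlap}
```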
The guidelines are practically an industry standard: they describe good practice for the manual labelling of data for use in an evaluation benchmark, and they also outline the evaluation protocol. Pay careful attention to the details, such as: if two or more predictions overlap a ground-truth object by more than 50%, only the one with the highest score counts as a true positive and the rest are false positives; and you may tag some of your ground truth with meta-labels such as "difficult" or "truncated", allowing you to evaluate on different subsets and get a better picture of where your approach succeeds and fails. A sketch of this matching rule follows below.
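This is a sketch of the matching rule just described, in Python rather than the benchmark's own tooling: predictions are processed in descending score order, each ground-truth box may be matched at most once, and boxes flagged "difficult" neither count as true positives nor penalize as false positives. The field names and the IoU helper are illustrative assumptions:

```python
def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def voc_style_match(predictions, ground_truth, min_overlap=0.5):
    """predictions: list of (score, box); ground_truth: list of (box, difficult)."""
    used = set()
    tp = fp = 0
    # Highest-scoring predictions get first claim on each ground-truth box.
    for score, box in sorted(predictions, key=lambda p: p[0], reverse=True):
        best, best_idx = 0.0, None
        for i, (gt_box, _) in enumerate(ground_truth):
            o = iou(box, gt_box)
            if o > best:
                best, best_idx = o, i
        if best >= min_overlap:
            if ground_truth[best_idx][1]:
                continue       # "difficult" object: ignored entirely
            if best_idx in used:
                fp += 1        # duplicate detection of an already-matched object
            else:
                used.add(best_idx)
                tp += 1
        else:
            fp += 1            # no sufficiently overlapping object
    return tp, fp
```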
If you have Matlab, then you can try their evaluation toolkit:
The workshops centered around the challenges have stopped running since the passing of Mark Everingham (R.I.P.), one of the key people behind the benchmark.