I am training a Mask-RCNN model with multiple classes. For evaluation purposes I need to know how to correctly calculate the mAP (mean Average Precision), mAR (mean Average Recall), and F1 score with k-fold cross-validation. I have noticed different code segments addressing this in the issues section of the official repository, but there are mainly two approaches to calculating the F1 score, and the discussion about which one is correct is still ongoing. The source code below is extracted from the issues section of the Mask-RCNN repository (link: https://github.com/matterport/Mask_RCNN/issues/2474). Whichever approach turns out to be correct, to my knowledge F1 is defined as follows.
PR (Precision)
RC (Recall)
F1 score = (2 x PR x RC) / (PR + RC), multiplied by 100 if expressed as a percentage
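Written out as plain Python (independent of the Mask-RCNN code), this is how I read that definition; f1_score here is just my own helper, not something from mrcnn:

def f1_score(pr, rc):
    # Harmonic mean of precision and recall; guard against division by zero
    if pr + rc == 0:
        return 0.0
    return (2 * pr * rc) / (pr + rc)

# e.g. PR = 0.8, RC = 0.6 -> F1 ~= 0.686 (or 68.6 as a percentage)
print(f1_score(0.8, 0.6))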
So I need to know:
1) Does PR = mAP and RC = mAR?
2) If yes, does calculating PR for a model mean calculating the mAP, and does calculating RC for a model mean calculating the mAR? Is my argument correct?
3) What do the precisions and recalls arrays contain?
4) What's the correct way to calculate the mAP, mAR, and F1 metrics?
5) If I am using k-fold cross-validation, should I calculate each of these values at the end of each fold and then take the average (roughly as in the sketch right after this list)?
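To make question 5 concrete, this is roughly the loop I have in mind. It is only a sketch: fold_datasets, train_model, and build_inference_model are placeholders for my own data splitting and training code, and evaluate_model is the function shown below under Method 1.

from numpy import mean

fold_mAPs, fold_mARs, fold_F1s = [], [], []
for dataset_train, dataset_val in fold_datasets:    # one (train, val) split per fold
    train_model(dataset_train)                       # placeholder: train on this fold's training split
    model = build_inference_model()                  # placeholder: reload weights in inference mode
    mAP, mAR, F1_scores = evaluate_model(dataset_val, model, inference_config)
    fold_mAPs.append(mAP)
    fold_mARs.append(mAR)
    fold_F1s.append(mean(F1_scores))                 # per-fold mean of the per-image F1 values

# Is averaging across folds like this the right way to report the final numbers?
print("cross-validated mAP: %.3f" % mean(fold_mAPs))
print("cross-validated mAR: %.3f" % mean(fold_mARs))
print("cross-validated F1:  %.3f" % mean(fold_F1s))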
Method 1
from mrcnn.model import load_image_gt
from mrcnn.model import mold_image
from mrcnn.utils import compute_ap, compute_recall
from numpy import expand_dims
from numpy import mean
from mrcnn import utils

def evaluate_model(dataset, model, cfg):
    APs = list()
    ARs = list()
    F1_scores = list()
    for image_id in dataset.image_ids:
        # Load the ground-truth boxes, class ids, and masks for this image
        # image, image_meta, gt_class_id, gt_bbox, gt_mask = load_image_gt(dataset, cfg, image_id, use_mini_mask=False)
        image, image_meta, gt_class_id, gt_bbox, gt_mask = load_image_gt(dataset, cfg, image_id)
        # Mold the image into the format the model expects and add a batch dimension
        scaled_image = mold_image(image, cfg)
        sample = expand_dims(scaled_image, 0)
        # Run detection and take the results for the first (only) image in the batch
        yhat = model.detect(sample, verbose=0)
        r = yhat[0]
        # Average precision for this image plus its precision/recall arrays
        AP, precisions, recalls, overlaps = utils.compute_ap(gt_bbox, gt_class_id, gt_mask,
                                                             r["rois"], r["class_ids"], r["scores"], r['masks'])
        # Recall of the predicted boxes against the ground truth at the given IoU threshold
        AR, positive_ids = compute_recall(r["rois"], gt_bbox, iou=0.2)
        ARs.append(AR)
        F1_scores.append((2 * (mean(precisions) * mean(recalls))) / (mean(precisions) + mean(recalls)))  # Method 1
        APs.append(AP)
    mAP = mean(APs)
    mAR = mean(ARs)
    return mAP, mAR, F1_scores
Method 2
mAP, mAR, F1_scores = evaluate_model(dataset_val, model, inference_config)
print("mAP: %.3f" % mAP)
print("mAR: %.3f" % mAR)
# Method 1: one F1 value per image, returned as a list
print("first way to calculate f1-score: ", F1_scores)
# Method 2: a single F1 computed from the dataset-level mAP and mAR
F1_score_2 = (2 * mAP * mAR) / (mAP + mAR)
print("second way to calculate f1-score: ", F1_score_2)
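To show why I think the two methods can give different answers, here is a toy example with made-up precision/recall values for two imaginary images. The mean of the per-image F1 values (Method 1) is not the same as the F1 computed from the averaged precision and recall (Method 2):

from numpy import mean

# Made-up (precision, recall) means for two imaginary images
per_image = [(0.9, 0.3), (0.4, 0.8)]

# Method 1 style: F1 per image, then averaged
f1_per_image = [(2 * p * r) / (p + r) for p, r in per_image]
print("mean of per-image F1:", mean(f1_per_image))          # ~0.492

# Method 2 style: average precision and recall first, then one F1
mean_p = mean([p for p, r in per_image])
mean_r = mean([r for p, r in per_image])
print("F1 of the averages:", (2 * mean_p * mean_r) / (mean_p + mean_r))  # ~0.596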