In a binary classification problem using MLP outputs, I am interested in finding optimal threshold from the ROC curve. While keeping in mind the fact that TPs are only 400 and FPs are 180000. In the attached images, one can clearly see that finding optimal value can reduce the FPs by huge number while not effecting the number of TPs. Ay help, Please?