Hello! I am working on a proposal for creating an electronic health record phenotyping classification algorithm (mental health focus). I am having a hard time finding solid guidance re:cohort identification. Specifically, is there a gold standard ratio of patients with the identified phenotype:healthy controls that should be gathered? I would be very appreciative of any guidance toward gold-standard studies or systematic reviews on this topic. Thanks in advance for taking the time to answer this question.