Skill in critically evaluating evidence leads researchers to fruitful research strategies and risk assessors to rigorous, defensible assessments. How does one develop a scoring rationale (i.e., a recipe for skilled evidence evaluation) that produces consistent and valid grading across multiple explorers?