I have measured the RMSE for both groups across different dentures, but I am unsure which statistical test to use. Can I use a paired t-test? If not, which test should I use to combine the RMSE values of all dentures in each group? And how do I compare two models for statistical significance?