I want to compare the different learning to rank techniques statistically. For that, I am using the P-Values and ANOVA. What are other methods? How to define this statistical confidence for better model
It is a little difficult to understand exactly what you want, but it may be that you need to use something like the AIC which is used to compare statistical models.
(ps- I recommend avoiding the term "statistical significance" as it can be misleading)
The test of hypothesis is a branch of inference statistics can be used to compare many groups statistically, and there are many statistical tests and each one is suitable for a specific hypothesis which depends on the data type and its distribution.