In threshold comparison method, the subject has to decide when he stopped hearing. This may not be accurate many of the times or it may become difficult to execute in old, too young or less co-operative subjects.
On the other hand, loudness comparison is more straight forward in comparing which one is louder; thats all.
So, I think, this may be the reason why loudness comparison method is better than threshold comparison method.