In training machine learning models it is believed that a k-fold cross-validation technique, usually offer better model performance in small dataset. Also, computationally inexpensive compare to other training techniques. However, why do most studies use 10-fold cross-validation?