For this I am using phyton and already generated the model has an accuracy of 92%, but use a standard of 70% training and 30% of the dataset. I understand that this is a convention but some other percentage could be used. Taking into account that the data set does not exceed 5000 records.