The bubble sort is better than merge sort in practice for small set of data, but as size of input data increases, the performance of bubble sort suddenly drop down and the exact opposite behavior I found with merge sort. I know it is all about the complexity of the algorithm structure but my point is that how can I illustrate it for students?