Usually the most important advantage is that calculating the inverse of a diagonal matrix is trivial and therefore reduces the numerical time needed to solve the problem. To obtain a diagonal mass matrix using Lagrange Elements it is also necessary to have hexahedral elements (as far as I know from seismic wave problems). Therefore you loose geometric flexibility when creating a mesh.