Reed, D. A., & Dongarra, J. (2015). Exascale computing and big data. Communications of the ACM, 58(7), 56-68.
Zou, H., Yu, Y., Tang, W., & Chen, H. W. M. (2014). FlexAnalytics: a flexible data analytics framework for big data applications with I/O performance improvement. Big Data Research, 1, 4-13.
Fox, G., Qiu, J., Jha, S., Ekanayake, S., & Kamburugamuve, S. (2015). Big data, simulations and HPC convergence. In Big Data Benchmarking (pp. 3-17). Springer, Cham.
Cox, M., & Ellsworth, D. (1997, August). Managing big data for scientific visualization. In ACM SIGGRAPH (Vol. 97, pp. 21-38).
I would look at the research of Lurong Pan. She and some of her colleagues were working on flow control, so it is worth checking whether they have branched into ML.
Specifically for PIC simulations, there is the openPMD file markup scheme (https://www.openpmd.org , https://github.com/openPMD), which is compatible with widely used open high-performance output and visualization tools; some custom tools are also available on GitHub. Several notable large-scale PIC codes support output in this scheme (https://github.com/openPMD/openPMD-projects).
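To give a rough feel for what such a markup scheme standardizes, here is a minimal sketch of how field and particle records could be addressed inside a file. It assumes the conventional openPMD layout values (`/data/%T/` as the base path, `meshes/` and `particles/` as sub-paths); the helper names `mesh_path` and `particle_path` are hypothetical and not part of any openPMD library, so check the standard itself before relying on this.

```python
# Illustrative sketch only: builds in-file paths following an openPMD-style
# hierarchy. The helper functions are hypothetical, not a library API.

BASE_PATH = "/data/%T/"        # %T stands for the iteration number
MESHES_PATH = "meshes/"        # assumed conventional value for field data
PARTICLES_PATH = "particles/"  # assumed conventional value for particle data


def mesh_path(iteration, record, component=None):
    """Path of a field record (e.g. E) or one of its components (e.g. x)."""
    path = BASE_PATH.replace("%T", str(iteration)) + MESHES_PATH + record
    return path if component is None else f"{path}/{component}"


def particle_path(iteration, species, record, component=None):
    """Path of a per-species particle record (e.g. electrons/position/x)."""
    path = (BASE_PATH.replace("%T", str(iteration))
            + PARTICLES_PATH + f"{species}/{record}")
    return path if component is None else f"{path}/{component}"


if __name__ == "__main__":
    print(mesh_path(100, "E", "x"))
    print(particle_path(100, "electrons", "position", "x"))
```

Because every code that follows the scheme uses the same naming, a reader or visualization tool can locate, say, the x component of the electric field at a given iteration without knowing which PIC code wrote the file.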