Hello everyone,
I am currently working on a research project that aims to integrate machine learning techniques into an open source SIEM tool to automate the creation of security use cases from collected logs.
My goal is to develop a system capable of analyzing security data in real time, detecting potential threats and automatically generating relevant use cases to respond to them.
To do this, I plan to use machine learning algorithms to:
Normalize and correlate logs from different sources
Identify abnormal behaviors and suspicious activities
Predict attacks and generate alerts in real time
Dynamically create suitable security use cases
I plan to rely on an open source SIEM like Security Onion or ELK stack for data collection and storage. Then I would like to integrate Python libraries like scikit-learn or TensorFlow to implement the machine learning algorithms.
However, I am still in the research and design phase. I need advice and guidance to carry out this project. If any of you have already worked on similar topics, I would be happy to exchange with you and benefit from your feedback.
What are the best practices to follow? What are the technical challenges to anticipate? Which machine learning algorithms would be the most suitable? Do you have any resources or references to recommend to me?
Any help and suggestions will be greatly appreciated. Do not hesitate to contact me in comments or by private message.
Thank you in advance for your support!