Roadmap - Documentation #3

rothoma2 · 2024-05-27T22:19:40Z

This issue documents the vision for the roadmap and how the pieces fit together.

Foundational Datasets

Looking at the top-level pieces, at the very base is a set of datasets collected from different sources. These datasets of fresh, regularly updated samples of different types of data (malware, phishing emails, malicious domains, malicious URLs, etc.) are used to train, continuously evaluate, and retrain ML models. Automation scripts to harvest and collect these datasets need to be written and maintained. The language of the tooling is Python.
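As a rough illustration of what one collection step might look like, the sketch below turns raw payloads into dataset records and skips duplicates by content hash. The `Sample` record, the `ingest` function, and the field names are hypothetical and only meant to show the shape of such a script. Fetching from real sources is deliberately left out.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Sample:
    """Hypothetical record for one collected sample."""
    sha256: str        # content hash, used as the deduplication key
    kind: str          # e.g. "malware", "phishing_email", "malicious_url"
    source: str        # assumed name of the source the sample came from
    collected_at: str  # ISO 8601 timestamp in UTC


def ingest(raw_items, kind, source, seen=None):
    """Convert raw byte payloads into Sample records, skipping duplicates.

    `seen` is a set of hashes already in the dataset; it is updated in place.
    """
    seen = set() if seen is None else seen
    now = datetime.now(timezone.utc).isoformat()
    samples = []
    for payload in raw_items:
        digest = hashlib.sha256(payload).hexdigest()
        if digest in seen:
            continue  # already collected, possibly from another source
        seen.add(digest)
        samples.append(Sample(digest, kind, source, now))
    return samples
```

Keeping the set of seen hashes between runs would let a scheduled job add only new samples on each pass.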

Foundational Models

A set of base ML models that are used in the industry. There is a lot of research on the features and success of these models. We will maintain the scripts to train the models and publish both the training tooling and the trained models. Over time we could approach researchers to see if they want to do research with us or improve one of our models. If a model's performance drops below a certain threshold, one idea would be to create an open Kaggle competition to get data scientists to compete and bring the model back above an acceptable threshold.
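The threshold idea could be checked automatically each time a model is re-evaluated on fresh samples. The following is a minimal sketch only: the choice of plain accuracy as the metric, the default value of 0.95, and the function names are all assumptions, since the roadmap does not specify a metric or a threshold.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def needs_attention(y_true, y_pred, threshold=0.95):
    """Return True when the model falls below the acceptable threshold.

    The 0.95 default is a placeholder, not a value taken from the roadmap.
    """
    return accuracy(y_true, y_pred) < threshold
```

A `True` result could then be the trigger for retraining or for opening the model up to outside contributors.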

Tools

A set of good security tools, either stripped of unneeded functionality or adding functionality that aids additional prevention or detection. The language of choice is Python.

Threat Feeds

Another valuable service is an aggregated IOC (indicator of compromise) feed for cybersecurity, which can be added to other platforms such as DNS or firewalls.
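To make the aggregation idea concrete, here is a small sketch that merges several feeds into one deduplicated set and exports a per-type blocklist. The feed names, the `(ioc_type, value)` entry shape, and both function names are hypothetical. The normalization is intentionally simplistic: whitespace is trimmed, and only domains are lowercased.

```python
def aggregate_iocs(feeds):
    """Merge feeds into {(ioc_type, value): set of feed names}.

    `feeds` maps a feed name to an iterable of (ioc_type, value) pairs.
    """
    merged = {}
    for feed_name, entries in feeds.items():
        for ioc_type, value in entries:
            value = value.strip()
            if ioc_type == "domain":
                value = value.lower()  # domain names are case-insensitive
            merged.setdefault((ioc_type, value), set()).add(feed_name)
    return merged


def to_blocklist(merged, ioc_type):
    """Return the sorted, deduplicated values of one IOC type."""
    return sorted(value for (kind, value) in merged if kind == ioc_type)
```

Tracking which feeds reported each indicator makes it possible to weight or filter entries later, for example by exporting only indicators seen in more than one feed.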

OSS Modules

Once we have foundational datasets and models that are well maintained and frequently published, we can approach certain open source projects and present to the maintainers our intention to write plugins/modules that expand their functionality by leveraging inference from the ML models. These modules will likely require some work and will have to be written in C. At this point, hundreds or thousands of open source users could enable these modules in their systems and benefit from the work of the alliance.

rothoma2 added this to Collaboration Request May 27, 2024

rothoma2 self-assigned this Jun 14, 2024

rothoma2 added the top-level-task label Jun 14, 2024

rothoma2 moved this to In Progress in Collaboration Request Jun 14, 2024
