You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The following is to document the Vision for the Roadmap and how pieces fit together.
Foundational Datasets:
When we look at the top level pieces, we have at the very base a set of Dataset that are collected from different sources. This dataset of fresh updated samples of different types of data (malware, phishing emails, malicious domains, malicious urls etc) is used to train, and constantly evaluate and retrain ML models. Its necessary to write and maintain automation scripts to harvest and collect this datasets. The language of tools is Python.
Foundational Models
Some base ML models, that are used in the industry. There's a lot of research on the features and success of this models. We will maintain the scripts to train the models and publish both the tooling to train and the model. Overtime we could approach Researchers to see if they want to do research with us, or improve one of our models. If a model drops below a certain threshold an idea would be to Create an open Kaggle Competition to get Data Scientist to compete and improve the model back into a given acceptable threshold.
Tools
Some good Security Tools, to remove unneeded functionality, or functionality that aids on additional prevention or detection. The language of choice is Python.
Thread Feeds
Another valuable service, is an aggregated IOC Feed, for CyberSecurity that are valuable to be added to other platforms such as DNS, or Firewalls.
OSS Modules
Once we have foundational datasets and models that are well maintain and frequently published, we can approach certain Opensource Projects and present to the maintainers our intentions to write plugins/modules to expand their functionality by leveraging inference of the ML models. This modules will likely require some work, and have to be written in C. At this point, hundreds or thousands of OpenSource users, could enable this modules in their systems and benefit of the work from the alliance.
The text was updated successfully, but these errors were encountered:
The following is to document the Vision for the Roadmap and how pieces fit together.
Foundational Datasets:
When we look at the top level pieces, we have at the very base a set of Dataset that are collected from different sources. This dataset of fresh updated samples of different types of data (malware, phishing emails, malicious domains, malicious urls etc) is used to train, and constantly evaluate and retrain ML models. Its necessary to write and maintain automation scripts to harvest and collect this datasets. The language of tools is Python.
Foundational Models
Some base ML models, that are used in the industry. There's a lot of research on the features and success of this models. We will maintain the scripts to train the models and publish both the tooling to train and the model. Overtime we could approach Researchers to see if they want to do research with us, or improve one of our models. If a model drops below a certain threshold an idea would be to Create an open Kaggle Competition to get Data Scientist to compete and improve the model back into a given acceptable threshold.
Tools
Some good Security Tools, to remove unneeded functionality, or functionality that aids on additional prevention or detection. The language of choice is Python.
Thread Feeds
Another valuable service, is an aggregated IOC Feed, for CyberSecurity that are valuable to be added to other platforms such as DNS, or Firewalls.
OSS Modules
Once we have foundational datasets and models that are well maintain and frequently published, we can approach certain Opensource Projects and present to the maintainers our intentions to write plugins/modules to expand their functionality by leveraging inference of the ML models. This modules will likely require some work, and have to be written in C. At this point, hundreds or thousands of OpenSource users, could enable this modules in their systems and benefit of the work from the alliance.
The text was updated successfully, but these errors were encountered: