Cloudtrail add actor and target #12685

romulets · 2025-02-10T12:25:04Z

This is a re-open of #11245, that had to be closed because a full commit history rewrite in the integration main.

Background

Elastic Cloud Security Team has been focusing, this past year, on Cloud Detection and Response (CDR). One of the first steps towards the CDR vision is to enhance investigation workflows for the Cloud Security use-case in SIEM.

As part of enhancing investigation workflows it's necessary to be able to correlate events and entities. Meaning, if an alert is triggered on the ec2 instance i-000000000, it is of great value to easily be able to search all the events related to that entity, across multiple indices, with one query. Therefore we are working on extracting entities and enabling them to be correlated.

What is an entity?

An "entity" in our context refers to any discrete component within an IT environment that can be uniquely identified and monitored. This broad term encompasses both managed and unmanaged elements.

The term "entity" is broader than the current set of available fields under related. Although ip, user and hosts can be identities, there is a lack of space to represent messaging queues, load balancers, storage systems, databases and others. Therefore the proposal to add a new field.

The proposed structure

There are two fields being added on this PR:

actor.entity.id captures entities that started the event, the actors
target.entity.id captures entities that were affected by the event. Being that created, updated, listed. We try to do as much as possible with the data present in the event.

Decisions made on the Painless Script

Structure

The painless script turned very large. There are essentially three parts to it:

Definition of helper functions. They are meant to facilitate the handling of the collections (related, actor and target).
Definition of enriching functions per AWS service. Even though there is no defined structure to requestParameters and repsonseElements, there is, usually a somewhat coherent structure per AWS service. I believe such separation brings better reading, creates a better headspace once working in a specific service and also breaks down the huge if else chain present in the previous state of the code
Calling functions and setting fields.

Why TreeSet as datastructure to hold `related`, `actor`, `target`.

There are two properties that this script must have:

Values must be unique
Values must be sorted (for testability and consistency on production)

Previously I had ensured both properties on "post processing", at the end of the script. Now it's ensured by the data structure itself.

I have not performance tested myself, but the usage of TreeSet should improve the time complexity of the algorithm, since we sort data on add, and previously we had to sort afterwards. I couldn't find a reliable source for time complexity of TreeSet.add vs Collections.sort - and honestly, the size of the list is so small that might not even matter.

Amount of tests

The testing was essential to me to validate what I was doing, to verify each output. And I would like to keep the tests for future reference and ensuring we are not changing anything by mistake. But the tests are starting to get slow. Specially if you compare with other integrations, such as okta.

elastic-vault-github-plugin-prod · 2025-02-28T16:02:37Z

🚀 Benchmarks report

To see the full report comment with /test benchmark fullreport

elasticmachine · 2025-02-28T16:03:15Z

💚 Build Succeeded

Buildkite Build
Commit: 869b298

History

💔 Build #22840 failed d73751c
💔 Build #22797 failed 7d7e7169ed22d5d907af7f1b5b4653165cb8c6f1
💔 Build #22041 failed a4e062e

elastic-sonarqube · 2025-02-28T16:03:19Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube

efd6

Approving change in packages/aws/docs/cloudtrail.md as codeowner.

Add Cloudtrail Actor and Target

a4e062e

romulets requested a review from a team as a code owner February 10, 2025 12:25

romulets requested review from a team as code owners February 27, 2025 07:44

romulets force-pushed the cloudtrail-add-origin-and-target branch from 11f8282 to 7d7e716 Compare February 27, 2025 07:46

Update toggle description

d73751c

romulets force-pushed the cloudtrail-add-origin-and-target branch from 7d7e716 to d73751c Compare February 28, 2025 10:37

Fix accountId typo

869b298

efd6 approved these changes Mar 2, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cloudtrail add actor and target #12685

Cloudtrail add actor and target #12685

romulets commented Feb 10, 2025

elastic-vault-github-plugin-prod bot commented Feb 28, 2025

elasticmachine commented Feb 28, 2025

elastic-sonarqube bot commented Feb 28, 2025

efd6 left a comment

Cloudtrail add actor and target #12685

Are you sure you want to change the base?

Cloudtrail add actor and target #12685

Conversation

romulets commented Feb 10, 2025

Background

What is an entity?

The proposed structure

Decisions made on the Painless Script

Structure

Why TreeSet as datastructure to hold related, actor, target.

Amount of tests

elastic-vault-github-plugin-prod bot commented Feb 28, 2025

🚀 Benchmarks report

elasticmachine commented Feb 28, 2025

💚 Build Succeeded

History

elastic-sonarqube bot commented Feb 28, 2025

Quality Gate passed

efd6 left a comment

Choose a reason for hiding this comment

Why TreeSet as datastructure to hold `related`, `actor`, `target`.