Centralize malicious #132

Robin5605 · 2023-07-25T02:49:34Z

Blocked by #131
Closes #95

Add a new constant in constants.py that allows tweaking the score threshold
New endpoint: `GET /scans`, which returns a list of all packages scanned since `since` and a list of malicious packages.
- A malicious package is any package with a score greater than or equal to the threshold defined in the constants.

Add an inspector URL to the records in `first_safe_second_malicious.json` that have a score greater than 0 (since packages that match rules *must* have an inspector URL). This was done because of the conditional in L34-35 of `src/endpoints/scans.py`:

```py
if scan.inspector_url is None:
    continue
```

The previous conditional, which checks the score, should ensure that `inspector_url` is never actually null in a production environment, where we get real data instead of our own test data. However, since we use one big table for everything, most of our columns are nullable and "runtime checked." This means that even though `inspector_url` is never null when the score is greater than 0, the type checker doesn't know this. So this is mostly a check to appease the type checker.
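To illustrate the narrowing described above, here is a minimal sketch using only the standard library. The `Scan` dataclass and the `malicious_urls` helper are hypothetical stand-ins, not the actual model or endpoint code in this PR; only the `inspector_url is None` check comes from the description.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Scan:
    # Hypothetical stand-in for a row of the real table: columns are nullable.
    name: str
    version: str
    score: Optional[int]
    inspector_url: Optional[str]


def malicious_urls(scans: list[Scan], threshold: int) -> list[str]:
    """Collect inspector URLs of scans whose score meets the threshold."""
    urls: list[str] = []
    for scan in scans:
        # Score check: skip unscored rows and rows below the threshold.
        if scan.score is None or scan.score < threshold:
            continue
        # Narrowing check: real data should never reach this branch, but it
        # turns Optional[str] into str for the type checker and skips test
        # records that lack a URL.
        if scan.inspector_url is None:
            continue
        urls.append(scan.inspector_url)
    return urls
```

Without the second check, a type checker would reject appending `scan.inspector_url` to a `list[str]`, which is why the test fixture needs an inspector URL on every scored record.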

Add a `score_threshold` field to the `MainframeSettings` configuration class in `constants.py`. It sets the minimum score required for a scan to show up in the `malicious_packages` field of the `GET /scans` response, and so determines which packages are considered "malicious."
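A rough sketch of what such a setting could look like, using only the standard library. Everything besides the names `MainframeSettings` and `score_threshold` is an assumption: the real class may be built on a settings library, and the `SCORE_THRESHOLD` environment variable, the default of 5, and the `is_malicious` helper are placeholders for illustration.

```python
import os
from dataclasses import dataclass, field


def _threshold_from_env() -> int:
    # Assumed environment variable name and default value (placeholders).
    return int(os.environ.get("SCORE_THRESHOLD", "5"))


@dataclass(frozen=True)
class MainframeSettings:
    # Minimum score for a scan to appear in `malicious_packages`.
    score_threshold: int = field(default_factory=_threshold_from_env)


def is_malicious(score: int, settings: MainframeSettings) -> bool:
    """Hypothetical helper: a score at or above the threshold is malicious."""
    return score >= settings.score_threshold
```

Keeping the threshold in one settings object means the endpoint and the tests read the same value, so tuning it does not require touching the query logic.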

Add a new `GET /scans` endpoint under `src/mainframe/endpoints/scans.py`. This endpoint takes one query string parameter, `since`, which is a UNIX epoch timestamp. It returns two fields: `all_scans` and `malicious_packages`. `all_scans` contains the package name and version of every package scanned since `since`, while `malicious_packages` lists the packages whose score is at or above the configured `score_threshold`.
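The response shape described above could be assembled roughly as follows. This is a framework-free sketch, not the endpoint code from this PR: `ScanRow`, its `finished_at` column, `build_scans_response`, and the keys inside each list entry are assumptions. Only `since`, `all_scans`, `malicious_packages`, and `score_threshold` come from the description.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass
class ScanRow:
    # Hypothetical row shape; the real table has more (nullable) columns.
    name: str
    version: str
    score: Optional[int]
    inspector_url: Optional[str]
    finished_at: datetime  # assumed name for the "scanned at" column


def build_scans_response(
    rows: list[ScanRow], since: int, score_threshold: int
) -> dict:
    """Build the two-field body for a `GET /scans?since=<epoch>` request."""
    # `since` is a UNIX epoch timestamp; treating it as inclusive is an assumption.
    cutoff = datetime.fromtimestamp(since, tz=timezone.utc)
    recent = [r for r in rows if r.finished_at >= cutoff]

    all_scans = [{"name": r.name, "version": r.version} for r in recent]
    malicious_packages = [
        {"name": r.name, "version": r.version, "inspector_url": r.inspector_url}
        for r in recent
        if r.score is not None
        and r.score >= score_threshold
        and r.inspector_url is not None
    ]
    return {"all_scans": all_scans, "malicious_packages": malicious_packages}
```

A client would then poll with the timestamp of its previous request as `since`, which fits the stateless HTTP approach discussed in the comments.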

import-pandas-as-numpy · 2023-07-26T09:34:27Z

I thought we had a /scans since endpoint already? I feel like I've seen it in the logs. Unless you moved things around.

Robin5605 · 2023-07-26T18:41:31Z

Yeah, that's what this PR originally added. I wanted to see if a websocket would be feasible, but I don't think it is anymore. We'd have to handle things like CD redeployments, disconnects, etc. Stateless HTTP might just be better.

…into centralize-malicious

Robin5605 added 4 commits July 24, 2023 21:47

Add tests

6fe5427

Robin5605 force-pushed the centralize-malicious branch from 94022c0 to 6fe5427 Compare July 26, 2023 18:43

Robin5605 and others added 5 commits July 26, 2023 13:46

Merge branch 'main' of https://github.com/vipyrsec/dragonfly-mainframe …

2064556

…into centralize-malicious

Make scans endpoint synchronous

4a2007a

Fix tests

b2c796c

Merge branch 'main' into centralize-malicious

0958ddf

Merge branch 'main' of https://github.com/vipyrsec/dragonfly-mainframe …

be55332

…into centralize-malicious

Robin5605 requested review from a team as code owners April 5, 2024 14:43

shenanigansd marked this pull request as draft May 30, 2024 01:09
