chore: tool to aggregate allocator tracking logs #4439

kostasrim · 2025-01-10T11:21:56Z

Similar to parse_total_allocator_tracking_logs.py but outputs the total bytes allocated and deallcoated

add tool to aggregate allocator tracking logs

kostasrim · 2025-01-10T11:24:17Z

tools/aggregate_allocator_tracking_logs.py

+        allocating = False
+        deallocating = False
+
+        for word in line.rstrip().split():


Could be a little bit more pythonic but oh well 🤷

Yes this is not very quality code.
Its a tool and its not top priority but this kind buggy as we might print to dragonfly logs the word Allocating/Deallocating in other places than memory tracker.
See how parse_allocator_tracking_logs is using regex to parse this lines.
I dont see how this total allocation is valuable but if you think it is and you wish to merge this this should be written better.
I would actually suggest to add this functionality to the parse_allocator_tracking_logs script to sum the allocations and not have a separate script with duplicated code.
If you dont think its worth the effort of fixing this and checking the correctness it might not worth the merge

ts a tool and its not top priority but this kind buggy as we might print to dragonfly logs the word Allocating/Deallocating

Could you plz explain ? I thought the tracking allocator only prints the same formatted lines ? Allocating/Deallocating bytes ?

See how parse_allocator_tracking_logs is using regex to parse this lines.

My golden rule is stay away from regex, is slow and complicated for no real value. I would write it in a more pythonic away but I would not use a regex. In fact I would argue if my previous question is true (that the output of lines always start with Allocating/Deallocating) that a regex is completely redundant in that case.

I would actually suggest to add this functionality to the parse_allocator_tracking_logs script to sum the allocations and not have a separate script with duplicated code.

+1 on that

If you dont think its worth the effort of fixing this and checking the correctness it might not worth the merge

Will push something but it's not priority

Could you plz explain ? I thought the tracking allocator only prints the same formatted lines ? Allocating/Deallocating bytes ?

Yes allocator prints only this format. But what if we have other places in dragonfly where will will have the word Allocating/Deallocating printed to log. you would try to parse this lines. This is why using a regex here with form
re.compile(r"Allocating (\d+) bytes ((0x[0-9a-f]+))") is safer to catch only relevant lines

makes sense, cheers

chore: tool to aggregate allocator tracking logs

1183b0f

kostasrim force-pushed the kpr3 branch from cd28e76 to 1183b0f Compare January 10, 2025 11:23

kostasrim commented Jan 10, 2025

View reviewed changes

kostasrim self-assigned this Jan 13, 2025

kostasrim requested review from adiholden and romange January 13, 2025 09:17

kostasrim closed this Jan 27, 2025

kostasrim deleted the kpr3 branch January 27, 2025 09:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: tool to aggregate allocator tracking logs #4439

chore: tool to aggregate allocator tracking logs #4439

kostasrim commented Jan 10, 2025

kostasrim Jan 10, 2025

adiholden Jan 14, 2025

kostasrim Jan 14, 2025

adiholden Jan 14, 2025

kostasrim Jan 14, 2025

chore: tool to aggregate allocator tracking logs #4439

chore: tool to aggregate allocator tracking logs #4439

Conversation

kostasrim commented Jan 10, 2025

kostasrim Jan 10, 2025

Choose a reason for hiding this comment

adiholden Jan 14, 2025

Choose a reason for hiding this comment

kostasrim Jan 14, 2025

Choose a reason for hiding this comment

adiholden Jan 14, 2025

Choose a reason for hiding this comment

kostasrim Jan 14, 2025

Choose a reason for hiding this comment