direct parsing fuzzing test #1052

tyler92 · 2024-10-08T07:06:27Z

Two issues were found by the new fuzzing target: #1047 and #1048

I think detecting memory issues in case of new changes or refactoring might be useful.
I'm not sure about code formatting for the project, I will appreciate your feedback.

cppalliance-bot · 2024-10-08T07:12:28Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

cppalliance-bot · 2024-10-08T07:27:27Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

codecov · 2024-10-08T08:09:26Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 93.41%. Comparing base (502ac79) to head (3f88a33).
Report is 2 commits behind head on develop.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #1052   +/-   ##
========================================
  Coverage    93.41%   93.41%           
========================================
  Files           91       91           
  Lines         8667     8667           
========================================
  Hits          8096     8096           
  Misses         571      571

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 502ac79...3f88a33. Read the comment docs.

cppalliance-bot · 2024-10-08T08:29:52Z

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1052/pullrequest.html

cppalliance-bot · 2024-10-08T10:37:23Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

cppalliance-bot · 2024-10-08T11:39:53Z

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1052/pullrequest.html

grisumbras · 2024-10-09T13:06:59Z

While the idea is interesting, I am sceptical fuzzing with random strings can be effective when the parser expects a very specific structure of the input. It will probably always fail on the second (on_object_begin), the third (on_key), or the fourth (on_bool) event.

tyler92 · 2024-10-09T13:18:54Z

According to coverage reports, it works well enough to find valid inputs, these two issues were found quite quickly. We can provide a dictionary for the fuzzer to help it a bit if we want to improve the performance.

sdarwin · 2024-10-09T15:32:01Z

It may be a coincidence. This pull request mentions "detecting memory issues". The drone agent on IBM s390 crashed with out-of-memory errors. I have rebooted the server.

grisumbras · 2024-10-09T15:38:52Z

It's also possible that linking the new fuzzer requires too much memory (variant parsers do do that).

tyler92 · 2024-10-09T19:05:24Z

I am sceptical fuzzing with random strings can be effective when the parser expects a very specific structure of the input. It will probably always fail on the second (on_object_begin), the third (on_key), or the fourth (on_bool) event.

Just for an experiment: I reverted the fix for #1047 and measured the time required to find that bug. Without a dictionary and with an empty corpus (the worst scenario) it takes about 10 seconds. The reason is the fuzzer doesn't have to find a valid input with all required fields. It's enough to generate a JSON with some fields to verify parser for a specific type only.

Adding corpus doesn't improve speed much. But I've added two valid JSONs to seed files, and it gives the fuzzer the ability to reach the "success" point. With this change, the fuzzer finds #1047 in less than one second. Also, I changed C++ version to C++14 to avoid potential OOM.

Please let me know if it makes sense

cppalliance-bot · 2024-10-09T19:12:25Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

cppalliance-bot · 2024-10-09T20:14:55Z

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1052/pullrequest.html

grisumbras · 2024-10-10T08:05:36Z

fuzzing/fuzz_direct_parse.cpp

+    std::tuple<std::vector<std::string>, std::vector<double>> t3;
+
+#ifndef BOOST_NO_CXX17_HDR_VARIANT
+    std::variant<bool, std::uint64_t, std::int64_t, double, std::string> v;


Please, replace with boost::variant2::variant. It will reduce the amount of macro switching necessary.

grisumbras · 2024-10-10T08:11:04Z

.github/workflows/run_fuzzer.yml

@@ -60,6 +60,7 @@ jobs:
        buildtype: 'boost'
        path: 'head'
        toolset: clang-18
+        cxxstd: 14


Why have you reduced this from 17 to 14?

I wanted to make sure this MR has a chance to be approved and avoid potential OOMs on the drone agent before (although I'm not sure if it's related to the current MR). I reverted to 17; please let me know if there are any issues with it.

cppalliance-bot · 2024-10-10T19:22:24Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

cppalliance-bot · 2024-10-10T20:24:53Z

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1052/pullrequest.html

grisumbras

Everything looks good. Can you please squsah and rebase on the current develop (if it's not already). I can also do it manually myself, but then the PR won't be considered merged by GitHub.

cppalliance-bot · 2024-10-13T13:27:24Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

cppalliance-bot · 2024-10-13T13:32:39Z

An automated preview of the documentation is available at https://1052.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

tyler92 · 2024-10-13T14:01:57Z

Can you please squsah and rebase on the current develop (if it's not already). I can also do it manually myself, but then the PR won't be considered merged by GitHub.

Done

cppalliance-bot · 2024-10-13T14:30:01Z

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1052/pullrequest.html

grisumbras · 2024-10-14T16:17:38Z

Thank you for your contribution.

tyler92 force-pushed the direct-parse-fuzz branch from 94e3781 to ccb3339 Compare October 8, 2024 07:19

grisumbras reviewed Oct 10, 2024

View reviewed changes

tyler92 requested a review from grisumbras October 12, 2024 16:50

grisumbras approved these changes Oct 13, 2024

View reviewed changes

tyler92 force-pushed the direct-parse-fuzz branch from c24fd02 to 2fe5f18 Compare October 13, 2024 13:21

direct parsing fuzzing test

3f88a33

tyler92 force-pushed the direct-parse-fuzz branch from 2fe5f18 to 3f88a33 Compare October 13, 2024 13:22

tyler92 requested a review from grisumbras October 14, 2024 07:56

grisumbras merged commit 3f88a33 into boostorg:develop Oct 14, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

direct parsing fuzzing test #1052

direct parsing fuzzing test #1052

tyler92 commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

codecov bot commented Oct 8, 2024 •

edited

Loading

cppalliance-bot commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

grisumbras commented Oct 9, 2024

tyler92 commented Oct 9, 2024

sdarwin commented Oct 9, 2024

grisumbras commented Oct 9, 2024

tyler92 commented Oct 9, 2024

cppalliance-bot commented Oct 9, 2024

cppalliance-bot commented Oct 9, 2024

grisumbras Oct 10, 2024

tyler92 Oct 10, 2024

grisumbras Oct 10, 2024

tyler92 Oct 10, 2024

cppalliance-bot commented Oct 10, 2024

cppalliance-bot commented Oct 10, 2024

grisumbras left a comment

cppalliance-bot commented Oct 13, 2024

cppalliance-bot commented Oct 13, 2024

tyler92 commented Oct 13, 2024

cppalliance-bot commented Oct 13, 2024

grisumbras commented Oct 14, 2024

direct parsing fuzzing test #1052

direct parsing fuzzing test #1052

Conversation

tyler92 commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

codecov bot commented Oct 8, 2024 • edited Loading

Codecov Report

cppalliance-bot commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

cppalliance-bot commented Oct 8, 2024

grisumbras commented Oct 9, 2024

tyler92 commented Oct 9, 2024

sdarwin commented Oct 9, 2024

grisumbras commented Oct 9, 2024

tyler92 commented Oct 9, 2024

cppalliance-bot commented Oct 9, 2024

cppalliance-bot commented Oct 9, 2024

grisumbras Oct 10, 2024

Choose a reason for hiding this comment

tyler92 Oct 10, 2024

Choose a reason for hiding this comment

grisumbras Oct 10, 2024

Choose a reason for hiding this comment

tyler92 Oct 10, 2024

Choose a reason for hiding this comment

cppalliance-bot commented Oct 10, 2024

cppalliance-bot commented Oct 10, 2024

grisumbras left a comment

Choose a reason for hiding this comment

cppalliance-bot commented Oct 13, 2024

cppalliance-bot commented Oct 13, 2024

tyler92 commented Oct 13, 2024

cppalliance-bot commented Oct 13, 2024

grisumbras commented Oct 14, 2024

codecov bot commented Oct 8, 2024 •

edited

Loading