bootstrap.sh improvements #10490

Open

nirbosl wants to merge 1 commit into hiero-ledger:main from nirbosl:bootstrap-script-improvements

Contributor

nirbosl commented Feb 25, 2025 •

edited by steven-sheehy

Description:

- Simplified process tracking by removing the processing_files associative array and relying on the pids array for job tracking.
- Fixed an issue where every child process printed the same PID, which was in fact the script's parent process group ID. Each process now prints its own job PID, so PGAPPNAME also reflects the correct PID in the DB COPY jobs.
- Extended the validate_special_files function with additional validation logic.
  - Fixes an error thrown during special files validation when re-running the script.
- Improved robustness of file validation checks.
- Enhanced the import_file function with additional error handling and verification steps.
- Improved import reliability.
- Improved code organization.
- Maintained the same overall script structure and core functionality.
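
The PID behavior described above can be illustrated with a minimal sketch. It assumes the original symptom came from reading `$$` inside background jobs, which the description does not confirm; `worker`, `parent_pid`, and the part names are hypothetical and not taken from bootstrap.sh.

```shell
#!/usr/bin/env bash
# Hypothetical worker: reports both PID variables so the difference is visible.
worker() {
  local name="$1"
  # $$ stays fixed at the PID of the top-level script, even in a background subshell.
  # $BASHPID is the PID of the process actually running this code.
  echo "$name parent=$$ self=$BASHPID"
}

parent_pid=$$
pids=()
tmpdir=$(mktemp -d)

for name in part1 part2; do
  worker "$name" > "$tmpdir/$name.out" &
  pids+=("$!")   # $! is the PID of the background job just started
done

# Track jobs through the pids array only; no separate lookup table is needed.
failed=0
for pid in "${pids[@]}"; do
  wait "$pid" || failed=1
done

results=()
for name in part1 part2; do
  results+=("$(cat "$tmpdir/$name.out")")
done
rm -rf "$tmpdir"
printf '%s\n' "${results[@]}"
```

Each output line should show the same `parent` value but a distinct `self` value that matches the corresponding entry in `pids`.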

Notes for reviewer:
These changes were prompted by an issue a customer ran into, which I was not able to reproduce: a valid file is imported, but the post-import row count returns an incorrect value, then the count runs again and the value keeps growing by the number of rows counted, over and over.

I focused on the weaknesses I identified that could lead to such edge cases, and refactored the affected code to be more robust and safer in order to avoid such issues in the future.

I have repeatedly tested the script using a dataset that included two file parts of each large table, plus all small tables, in the following two scenarios:

- Single-run import with no interruptions (good path)
- Started an import, let it complete several files successfully, then stopped the script and re-ran it (to validate the resumption logic and proper skipping of already-imported files)

All repeat tests in both scenarios finished successfully.

I then started a local mirror-importer against the bootstrapped DB; it started cleanly and resumed sync from September 18th timestamps, which is the date this data export was taken (0.113.2).

Checklist

Documented (Code comments)
Tested (Actual runs of the script, described above in the Notes section.)


          bootstrap.sh improvements

bec44c9

Signed-off-by: Nir Ben-Or <[email protected]>

nirbosl requested a review from a team as a code owner

February 25, 2025 18:02

codecov bot commented Feb 25, 2025 •

Codecov Report

All modified and coverable lines are covered by tests ✅

steven-sheehy assigned nirbosl

steven-sheehy added bug database enhancement and removed bug labels

steven-sheehy added this to the 0.125.0 milestone

steven-sheehy reviewed

hedera-mirror-importer/src/main/resources/db/scripts/bootstrap.sh

Comment on lines +5 to +6

		# Start a new process group and detach from terminal
		set -m

Contributor

steven-sheehy Feb 25, 2025

Revert this, it causes problems with spotless.

hedera-mirror-importer/src/main/resources/db/scripts/bootstrap.sh

    
@@ -394,7 +398,7 @@ process_manifest() {
                   if [[ -f "$file_path" ]]; then
                     # Skip validation if file is already imported successfully
-                    if grep -q "^$file_path IMPORTED" "$TRACKING_FILE" 2>/dev/null; then
+                    if [[ "$(read_tracking_status "$(basename "$file_path")")" == "IMPORTED" ]]; then

Contributor

steven-sheehy Feb 26, 2025

read_tracking_status internally calls basename so this can be omitted, I believe.
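
A minimal sketch of why the outer `basename` is redundant if the helper already normalizes its argument. The tracking file format (`<file name> <status>` per line), the body of `read_tracking_status`, and the sample paths are assumptions for illustration and are not taken from bootstrap.sh.

```shell
#!/usr/bin/env bash
# Hypothetical tracking file with "<file name> <status>" per line (format assumed).
TRACKING_FILE=$(mktemp)
printf '%s\n' "account.csv.gz IMPORTED" "token.csv.gz IN_PROGRESS" > "$TRACKING_FILE"

# Assumed shape of the helper: normalize the argument with basename, then look up the status.
read_tracking_status() {
  local file_name
  file_name=$(basename "$1")
  awk -v f="$file_name" '$1 == f { print $2 }' "$TRACKING_FILE"
}

# Passing the full path or the pre-stripped name gives the same result,
# because basename applied to a bare file name returns it unchanged.
status_from_path=$(read_tracking_status "/data/export/account.csv.gz")
status_from_name=$(read_tracking_status "$(basename "/data/export/account.csv.gz")")
echo "$status_from_path $status_from_name"
rm -f "$TRACKING_FILE"
```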

hedera-mirror-importer/src/main/resources/db/scripts/bootstrap.sh

Comment on lines +497 to +500

+                    current_status=$(read_tracking_status "$file")
+                    if [[ "$current_status" != "IMPORTED" ]]; then
+                      write_tracking_file "$file" "IMPORTED"
+                    fi

Contributor

steven-sheehy Feb 26, 2025

Does it really matter whether you check the status before writing? There's no harm in always setting the status to IMPORTED like before.
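
A minimal sketch of the point, assuming the write helper replaces any existing entry for the file rather than appending a duplicate. The body of `write_tracking_file`, the tracking file format, and the sample path are hypothetical; the real helper may differ (for example, it may take a lock).

```shell
#!/usr/bin/env bash
TRACKING_FILE=$(mktemp)

# Assumed behavior: drop any existing entry for the file, then append the new status.
write_tracking_file() {
  local file_name status tmp
  file_name=$(basename "$1")
  status="$2"
  tmp=$(mktemp)
  awk -v f="$file_name" '$1 != f' "$TRACKING_FILE" > "$tmp"
  echo "$file_name $status" >> "$tmp"
  mv "$tmp" "$TRACKING_FILE"
}

# Writing the same status twice leaves a single entry, so no pre-check is required.
write_tracking_file "/data/export/account.csv.gz" "IMPORTED"
write_tracking_file "/data/export/account.csv.gz" "IMPORTED"
line_count=$(wc -l < "$TRACKING_FILE" | tr -d ' ')
contents=$(cat "$TRACKING_FILE")
echo "$line_count entry: $contents"
rm -f "$TRACKING_FILE"
```

If the write is idempotent in this way, the unconditional call and the guarded call leave the tracking file in the same state.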

hedera-mirror-importer/src/main/resources/db/scripts/bootstrap.sh

                   fi
+                done
+                # Check the exit status of the count query

Contributor

steven-sheehy Feb 26, 2025 •

If the count query fails 3 times, we should fall back to returning true after logging an error. In theory, the import should have returned a failure if any rows were missed, so this is just a safety check. This change alone would have unblocked the customer.
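
The suggested fallback could look roughly like the sketch below. `run_count_query` is a stand-in for the psql `COUNT(*)` call, and `verify_row_count` and `MAX_COUNT_ATTEMPTS` are hypothetical names; none of them come from bootstrap.sh.

```shell
#!/usr/bin/env bash
MAX_COUNT_ATTEMPTS=3   # assumed retry limit, taken from the "3 times" in the comment

# Stand-in for the psql COUNT(*) call; it always fails here to exercise the fallback.
run_count_query() {
  return 1
}

# Returns 0 when the count matches or could not be obtained; 1 only on a confirmed mismatch.
verify_row_count() {
  local expected="$1" actual attempt
  for ((attempt = 1; attempt <= MAX_COUNT_ATTEMPTS; attempt++)); do
    if actual=$(run_count_query); then
      [[ "$actual" == "$expected" ]] && return 0
      echo "ERROR: expected $expected rows, found $actual" >&2
      return 1
    fi
    echo "WARN: count query failed (attempt $attempt of $MAX_COUNT_ATTEMPTS)" >&2
  done
  # The query itself never succeeded, so rely on the import's own exit status.
  echo "ERROR: count query failed $MAX_COUNT_ATTEMPTS times; trusting the import exit status" >&2
  return 0
}

verify_row_count 100
fallback_status=$?
echo "fallback_status=$fallback_status"
```

A confirmed mismatch still fails; only the case where the check cannot run at all is downgraded to a logged error.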

hedera-mirror-importer/src/main/resources/db/scripts/bootstrap.sh

+                      actual_count=$(psql -v ON_ERROR_STOP=1 -q -Atc "SELECT COUNT(*) FROM ${table};")
+                      psql_status=$?
+                    else
+                      actual_count=$(psql -v ON_ERROR_STOP=1 -q -Atc "SELECT COUNT(*) FROM ${table} WHERE consensus_timestamp BETWEEN '$start_ts' AND '$end_ts';")

Contributor

steven-sheehy Feb 26, 2025

We could also fall back to a quicker query, like where consensus_timestamp in ($start_ts, $end_ts), and verify that the first and last rows are present.
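
One way to sketch that boundary check, assuming `COUNT(DISTINCT ...)` is used so that several rows sharing a timestamp do not inflate the result. The function names, the table name, and the timestamp values are made up for illustration.

```shell
#!/usr/bin/env bash
# Builds a cheap check: are rows present at both the first and last timestamp of the file?
build_boundary_query() {
  local table="$1" start_ts="$2" end_ts="$3"
  echo "SELECT COUNT(DISTINCT consensus_timestamp) FROM ${table} WHERE consensus_timestamp IN ('${start_ts}', '${end_ts}');"
}

# Number of distinct boundary timestamps that must be found
# (1 when the file covers a single timestamp, otherwise 2).
expected_boundaries() {
  if [[ "$1" == "$2" ]]; then echo 1; else echo 2; fi
}

query=$(build_boundary_query "transaction" "1726617600000000000" "1726703999999999999")
expected=$(expected_boundaries "1726617600000000000" "1726703999999999999")
echo "$query"
# With a database available, the check would be along these lines:
#   found=$(psql -v ON_ERROR_STOP=1 -q -Atc "$query")
#   [[ "$found" == "$expected" ]]
```

This only confirms that the first and last rows landed; it does not detect rows missing from the middle of the range, so it is weaker than the full count.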

Labels

database enhancement