Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document data docker schema update mode #527

Open
wants to merge 4 commits into
base: master
Choose a base branch
from

Conversation

hqpho
Copy link
Contributor

@hqpho hqpho commented Oct 22, 2024

This mode is added by datacommonsorg/website#4686. A subsequent PR to mixer will link the new docsite page directly from the schema check error message: datacommonsorg/mixer#1440

@hqpho hqpho changed the title Document the data docker schema update mode Document data docker schema update mode Oct 22, 2024
@hqpho hqpho requested review from keyurva and kmoscoe October 22, 2024 20:59
Copy link

@keyurva keyurva left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM modulo the decision on the env var.


While starting Data Commons services, you may see an error that starts with `SQL schema check failed`. This means your database schema must be updated for compatibility with the latest Data Commons services.

You can update your database by running a data management job with the environment variable `SCHEMA_UPDATE_ONLY` set to `true`. This will alter your database without modifying already-imported data.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

May need to be updated based on what we decide in https://github.com/datacommonsorg/website/pull/4686/files#r1815646141

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated.

@hqpho
Copy link
Contributor Author

hqpho commented Oct 25, 2024

Will wait to commit this until the mode is available in the stable data container image.

hqpho added a commit to datacommonsorg/website that referenced this pull request Oct 25, 2024
Use the `DATA_RUN_MODE` environment variable to decide what mode to pass
to run_stats.sh and whether to build embeddings. The mode `schemaupdate`
for run_stats.sh is added by
datacommonsorg/import#344, which this PR updates
the import submodule to include.

A docsite page will describe how to pass in this environment variable:
datacommonsorg/docsite#527
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants