Skip to content

mila-iqia/diffusion_for_multi_scale_molecular_dynamics

Repository files navigation

Diffusion for Multiscale Molecular Dynamics

This project implements diffusion-based generative models for periodic atomistic systems (i.e., crystals). The aim of this project is to be able to train such a model and use it as part of an active learning framework, where a Machine Learning Interatomic Potential (MLIP) is continually fine-tuned on labels obtained from a costly oracle such as Density Functional Theory (DFT). The generative model is used to create few-atom configurations that are computationally tractable for the costly oracle by inpainting around problematic atomic configurations.

Instructions to set up the project

Creating a Virtual Environment

The project dependencies are stated in the pyproject.toml file. They must be installed in a virtual environment.

uv

The recommended way of creating a virtual environment is to use the tool uv. Once uv is installed locally, the virtual environment can be created with the command

uv sync

which will install the exact environment described in file uv.lock. The environment can then be activated with the command

source .venv/bin/activate

pip

Alternatively, pip can be used to create the virtual environment. Assuming python and pip are already available on the system, create a virtual env folder in the root directory with the command

python -m venv ./.venv/

The environment must then be activated with the command

source .venv/bin/activate

and the environment should be created in editable mode so that the source code can be modified directly,

pip install -e .

Testing the Installation

The test suite should be executed to make sure that the environment is properly installed. After activating the environment, the tests can be executed with the command

pytest [--quick] [-n auto]

The argument --quick is optional; a few tests are a bit slow and will be skipped if this flag is present. The argument -n auto is optional; if toggled, the tests will run in parallel and go a little faster.

Setting up the Development Tools

Various automated tools are used in order to maintain a high quality code base. These must be set up to start developing. We use

  • flake8 to insure the coding style is enforced.
  • isort to insure that the imports are properly ordered.
  • black to format the code.

Setup pre-commit hooks

The folder ./hooks/ contain "pre-commit" scripts that automate various checks at every git commit. These hooks will

  • validate flake8 before any commit;
  • check that jupyter notebook outputs have been stripped.

There are two pre-commit scripts, pre-commit and pre-commit_staged. Both scripts perform the same checks; pre-commit is used within the continuous integration (CI), while pre-commit_staged only validates files that are staged in git, making it more developer-friendly.

To activate the pre-commit hook,

cd .git/hooks/ && ln -s ../../hooks/pre-commit .

Alternatively, to only lint files that have been staged in git, use

cd .git/hooks/ && ln -s ../../hooks/pre-commit_staged pre-commit

Setup Continuous Integration

GitHub Actions is used for running continuous integration (CI) checks. The cI workflow is described in .github/workflows/ci.yml.

CI will run the following:

  • check the code syntax with flake8
  • execute the unit tests in ./tests/.
  • Checks on documentation presence and format (using sphinx).

Since the various tests are relatively costly, the CI actions will only be executed for pull requests to the main branch.

Instructions to run an example experiment

To use Comet as an experiment logger, an account must be available and a global configuration file must be created at $HOME/.comet.config with content of the form

[comet]
api_key=YOUR_API_KEY

A simple experiment is described in the configuration file

examples/config_files/diffusion/config_diffusion_mlp.yaml

To run the experiment described in this file, a dataset must first be created by executing the script

data/si_diffusion_1x1x1/create_data.sh

Then, the experiment itself can be executed by running the script

examples/local/diffusion/run_diffusion.sh

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages