This repository contains experimental image processing pipelines for Phenocam images from COSMOS-UK sensors. It was created using the UKCEH Python project template and repurposes some of the pipeline code in plankton_ml. It is intended for rapid prototyping and use case refinement.
We're using the `thingsvision` package to simplify extracting features from different computer vision models. It currently requires Python <3.11 and NumPy <2. If the approach stays useful, it makes sense to remove `thingsvision` in favour of model-specific code.
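For reference, feature extraction with `thingsvision` typically follows the pattern below. This is a sketch only: the model name, source, layer, and paths are illustrative choices, not necessarily the ones used in this pipeline.

```python
from thingsvision import get_extractor
from thingsvision.utils.data import ImageDataset, DataLoader

# Illustrative choices; the pipeline may use a different model and layer.
extractor = get_extractor(
    model_name="resnet50", source="torchvision", device="cpu", pretrained=True
)
dataset = ImageDataset(
    root="data/images",                # directory of input images (assumed path)
    out_path="data/vectors",           # where thingsvision writes its file-order metadata
    backend=extractor.get_backend(),
    transforms=extractor.get_transformations(),
)
batches = DataLoader(dataset=dataset, batch_size=32, backend=extractor.get_backend())

# Extract activations from a named layer and flatten them into one vector per image.
features = extractor.extract_features(
    batches=batches, module_name="avgpool", flatten_acts=True
)
```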
See the installation instructions for uv.
```bash
uv python install 3.10
uv sync
source .venv/bin/activate
```
This should handle installing the dependencies into a virtual environment.
From the root directory of the repo, run:
```bash
git config --local core.hooksPath .githooks/
```
This will set the repo up to use the git hooks in the `.githooks/` directory. The hook runs `ruff format --check` and `ruff check` to prevent commits that are not formatted correctly or contain lint errors. The hook intentionally does not alter the files; it informs the user which command to run instead.
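As a rough sketch of the idea (the actual script in `.githooks/` may differ), a pre-commit hook along those lines could look like this:

```python
#!/usr/bin/env python
"""Sketch of a pre-commit hook that runs ruff checks without modifying files."""
import subprocess
import sys

CHECKS = {
    "ruff format": ["ruff", "format", "--check", "."],
    "ruff check": ["ruff", "check", "."],
}

failed = False
for fix_cmd, check_cmd in CHECKS.items():
    # Run the check; a non-zero exit code means the commit should be blocked.
    if subprocess.run(check_cmd).returncode != 0:
        print(f"Commit blocked: please run `{fix_cmd}` and fix any reported issues.")
        failed = True

sys.exit(1 if failed else 0)
```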
The docs, tests, and linter packages can be installed together with:
```bash
pip install -e .[dev]
```
and the test suite run with:
```bash
python -m pytest
```
This includes a Luigi pipeline which does the following work:

- Splits a set of dual-hemisphere images into left and right halves, uses `defisheye` to flatten the perspective, and saves the results at 600x600px
- Extracts and stores image embeddings using a model from `thingsvision`
- Stores the embedding vectors, and metadata derived from the filename, in a SQLite database
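As an illustration of the structure (not the actual tasks in `src/phenocam/pipeline_luigi.py`), a Luigi task for the image-splitting step might look roughly like this:

```python
import luigi
from pathlib import Path
from PIL import Image

class SplitHemispheres(luigi.Task):
    """Hypothetical task: split one dual-hemisphere image into left/right halves."""
    image_path = luigi.Parameter()
    out_dir = luigi.Parameter(default="data/images")

    def output(self):
        stem = Path(str(self.image_path)).stem
        return {
            side: luigi.LocalTarget(f"{self.out_dir}/{stem}_{side}.jpg")
            for side in ("L", "R")
        }

    def run(self):
        Path(str(self.out_dir)).mkdir(parents=True, exist_ok=True)
        img = Image.open(str(self.image_path))
        w, h = img.size
        halves = {"L": img.crop((0, 0, w // 2, h)), "R": img.crop((w // 2, 0, w, h))}
        for side, half in halves.items():
            # Write atomically via Luigi's temporary path helper.
            with self.output()[side].temporary_path() as tmp:
                half.save(tmp, format="JPEG")
```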
Run with test data and dummy output locations:
```bash
python src/phenocam/pipeline_luigi.py
```
This should create a small image set inside `data/images/`, with feature embeddings and the SQLite database inside `data/vectors/`.
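To peek at what the pipeline wrote, you can open the database with the standard library. This is a sketch that does not assume a particular database filename or schema; use whatever `.db` file appears under `data/vectors/`:

```python
import sqlite3
from pathlib import Path

# Pick up whichever SQLite file the pipeline produced (filename not assumed).
db_path = next(Path("data/vectors").glob("*.db"))
con = sqlite3.connect(db_path)

# List the tables without assuming a particular schema.
tables = con.execute("SELECT name FROM sqlite_master WHERE type='table'").fetchall()
print(tables)
```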
This exposes an API around the vector search abstraction. It currently supports a single query: "find all the URLs whose embeddings are closest to this one".
```bash
fastapi run src/phenocam/data/api.py
```
Visit http://localhost:8000/docs
Test the query with input like this:
```json
{
  "url": "WADDN_20140101_0902_ID405_L.jpg",
  "n_results": 5
}
```
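For example, assuming the search route is exposed at `/search` (check http://localhost:8000/docs for the actual path and response shape), a query from Python could look like:

```python
import requests

payload = {"url": "WADDN_20140101_0902_ID405_L.jpg", "n_results": 5}

# The /search path is an assumption; confirm the route in the OpenAPI docs.
response = requests.post("http://localhost:8000/search", json=payload)
response.raise_for_status()
print(response.json())  # URLs of the images whose embeddings are closest to the input
```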
There's a simple, self-contained visualisation of the N closest images, written in p5.js. It's a static HTML file with a JavaScript file that calls the API above. It could do a lot more, depending on what questions we now want to ask!
```bash
cd src/app
python -m http.server 8082
```
Then visit http://localhost:8082
The documentation is driven by Sphinx, an industry standard for documentation with a healthy userbase and lots of add-ons. It uses `sphinx-apidoc` to generate API documentation for the codebase from Python docstrings.

To run `sphinx-apidoc`, run:
```bash
# Install your package with optional dependencies for docs
pip install -e .[docs]
cd docs
make apidoc
```
This will populate `./docs/sources/...` with `*.rst` files for each Python module, which may be included in the documentation. Documentation can then be built locally by running `make html`, or found on the GitHub Deployment.
To run the tests, run:
```bash
# Install package with optional dependencies for testing
pip install -e .[test]
pytest
```
This codebase is set up using autosemver, a tool that uses git commit history to calculate the package version. Each time you make a commit, the version is incremented according to the commit message:

- Normal commit: use for bugfixes and small updates. Increments the patch version (x.x.5 -> x.x.6).
- Commit starting with `* NEW:`: use for new features. Increments the minor version (x.1.x -> x.2.x).
- Commit starting with `* INCOMPATIBLE:`: use for API-breaking changes. Increments the major version (2.x.x -> 3.x.x).