Skip to content

Latest commit

 

History

History
55 lines (36 loc) · 1.64 KB

README.md

File metadata and controls

55 lines (36 loc) · 1.64 KB

djanco-python2

A crate containing Djanco queries.

This crate contains queries gathering contents of Python 2 files. It finds commits older than December 2008 (date of introduction of Python 3) and retrieves the contents of changed Python files (identified by extension). There are also auxiliary queries gathering general information about Python projects in the queried dataset.

Djanco

Djanco is a query system for querying GitHub datasets downloaded by Parasite as part of the CodeDJ project.

Running the queries

To generate a harness for executing the queries in this crate install cargo-djanco:

cargo install --git https://github.com/PRL-PRG/cargo-djanco

Then, generate the harness:

cargo djanco

This generates s source file src/bin/djanco.rs, which you can run to execute all your queries:

cargo run --bin djanco --release -- --dataset-path DATASET_LIVES_HERE --output-path WRITE_RESULTS_HERE 

Template

The template file for the python-2 crate comes from here.

To create a new crate from the template, first install cargo-generate:

cargo install cargo-generate

Then, use the generate command in cargo to create a new crate from the template:

cargo generate --git https://github.com/PRL-PRG/djanco-query-template --name my-query-crate

Then you can add your query functions. There's an example function in src/lib.rs to get you started.