Creating an implementation of the `knapsack` problem from scratch

This documents creating an implementation of a program using genetic algorithms to evolve solutions to the knapsack problem using the unhindered-ec library.

Create the project and add dependencies

Create the project using cargo init knapsack.

Then add the (course-specific, and therefore temporary) registry so we can add dependencies. We need to add the following to .cargo/config.toml in the project NOT Cargo.toml. This allows to project to access this registry.

[registries.ec-course]
index = "https://github.com/UMM-CSci-4553-S25/registry.git"

to .cargo/config.toml in the project so it has access to the registry.

Now we can add ec-core, ec-linear, and course-helpers as dependencies:

cargo add --registry ec-course ec-core ec-linear course-helpers

Since we want to keep the error handling as simple as possible, we're adding the anyhow crate, which allows us to return out any occurring errors with Rust's ? operator.

cargo add anyhow

We also added the test_case library which simplifies certain testing patterns.

cargo add test_case

We need the rand crate since evolutionary computation generates a lot of random numbers. For complex reasons not really relevant, we're currently a specific beta version of the rand crate, so we have to explicitly specify that version when we add the crate. (Hopefully v0.9 will be released shortly, at which point we can drop the specification of the particular beta version.)

cargo add [email protected]

Structure of the implementation

Now that we have a basic file structure, there are two major pieces that need to be implemented:

The project itself (in our case the knapsack problem)
The main() function which will construct and run an instance of the evolutionary system, outputting some sort of result

Implement `Knapsack`

For whatever problem you're trying to solve, you'll have to implement a model of that problem. In this case, that is the type Knapsack and its helper type Item. In this example, those types are fairly straightforward data containers, but the parsing from a file is a little more complex, especially if you're new to Rust.

The details of this will be problem dependent. You should definitely ask for help if you're stuck while trying to implement a new problem.

Add documentation to Knapsack and Item.

Implement `main`

Have `main` return `anyhow::Result`

To simplify the error handling, we want main to return anyhow::Result<()>, which essentially says that main can return either the unit type () if successful, or any error type using the ? operator.

fn main() -> anyhow::Result<()> {
    // The returns the unit type `()` wrapped in the `Ok` variant of
    // `Result`. The lack of a semicolon (`;`) at the end of the line
    // makes this the last value in the function, which is what Rust
    // will return in the absence of an explicit `return` statement.
    Ok(())
}

Initial decisions

In general when setting up an evolutionary computation system, We have to decide:

What is our representation for the problem?
- We typically want to implement some problem-specific type, like Knapsack in this example, that encapsulates the details of a problem instance.
- It's also useful to be able to create instances of the problem, often from files. Here Knapsack::from_file_path() creates an instance of Knapsack based on data in the given file.
What is our representation for solutions?
- In this case it will be fixed length Bitstring from ec-linear.
We have to implement some kind of scoring, which will typically be problem specific.
- We're using CliffScorer in this example.
We need to have some kind of selection.
- To keep things simple, we'll just use Tournament selection here.
- We'll start with binary tournaments, but you might want to increase the tournament size, especially if you have large populations.
- If your problem naturally generates a collection of values (scores or errors), then you might consider using Lexicase selection.
We need to have a mutator and crossover; presumably something from ec-linear will do.
- We'll use WithOneOverLength for mutation and
- UniformXo for crossover.
We also need to choose simple values like population size and max number of generations.
- We'll use 1,000 for both values here, but those are quite arbitrary choices.
What do we need to record and/or collect as each generation proceeds?
- In this example we'll print out the best individual in each generation, and save the best individual from across the entire run.

We also need a specific instance of the problem that we want to try to solve, and might need to create a file with the appropriate representation of that problem instance.

Creating a run

Assuming we have an instance of the knapsack problem, knapsack, we have to build the run. Our Run uses the builder pattern which allows us to specify the various values and properties a run must have and then assemble the final complete Run. In our example this looks like:

    let run = Run::builder()
        // The number of bits should equal the number of items.
        .bit_length(knapsack.num_items())
        // The maximum number of generations to run; this is somewhat arbitrary
        .max_generations(1_000)
        // The population size, which is also somewhat arbitrary, but larger is better
        // until it's so big that memory management becomes a problem.
        .population_size(1_000)
        // How do we want to select parent individuals? This takes two individuals at
        // random from the population, and then chooses the better of the two from this
        // tournament. You can change this to larger tournaments by changing `2` to your
        // desired tournament size.
        .selector(Tournament::of_size::<2>())
        // How do we want to mutate individual knapsack solutions? This flips
        // on average one bit, thereby adding or removing one item from the solution.
        .mutator(WithOneOverLength)
        // How do we want to recombine parent solutions? This randomly chooses for
        // each bit whether to take it from the first or the second parent, giving
        // use a "shuffled" set of choices from both parents.
        .recombinator(UniformXo)
        // Do we want to use parallel evaluation? If this is `true`, the run will use
        // all the available cores to evaluate the population in parallel. This can speed
        // up the process considerably, at the cost of heating up your CPU.
        .parallel_evaluation(true)
        // How do we want to score different knapsack "solutions"? This is the only
        // problem dependent part of building the run. We'll start with a simple scorer
        // that returns a `CliffScore`. This is an `enum` with two variants: `Score(v)`
        // where `v` is the value of the items if they fit in the knapsack
        // and `Overloaded` otherwise.  This is implemented so that `Overloaded` is
        // always worse than any `Score(v)` value.
        .scorer(CliffScorer::new(knapsack))
        // Add an inspector. This is a function that is called after each generation
        // and can be used to collect and/or print out information about the run. We'll use this to
        // print out the best score in each generation, and to keep track of the best score in the run.
        .inspector(|generation_number, population| {
            report_on_generation(generation_number, population, &mut best_in_run, &mut rng);
        })
        // Now that we've specified all the elements, we can build the run.
        .build();

STUFF STILL TO-DO

Add rand crate
How to implement scoring
How to choose/implement different mutators, recombinators, selectors

Possible improvements

Add command-line argument parsing via clap to allow for specification of things like problem instance file, population size, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.cargo		.cargo
.vscode		.vscode
knapsacks		knapsacks
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Creating an implementation of the `knapsack` problem from scratch

Create the project and add dependencies

Structure of the implementation

Implement `Knapsack`

Implement `main`

Have `main` return `anyhow::Result`

Initial decisions

Creating a run

STUFF STILL TO-DO

Possible improvements

About

Releases

Packages

Languages

UMM-CSci-4553-S25/knapsack-ga

Folders and files

Latest commit

History

Repository files navigation

Creating an implementation of the knapsack problem from scratch

Create the project and add dependencies

Structure of the implementation

Implement Knapsack

Implement main

Have main return anyhow::Result

Initial decisions

Creating a run

STUFF STILL TO-DO

Possible improvements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Creating an implementation of the `knapsack` problem from scratch

Implement `Knapsack`

Implement `main`

Have `main` return `anyhow::Result`

Packages