JAX backend of TreeMHN #30

pawel-czyz · 2023-11-24T10:16:13Z

Premise

The $Q$ rate matrix is built out of rates $Q_{ij} = \lambda_{\pi_{ij}}(\theta)$, where $\pi$ is a lineage, and $\theta$ is the (log-) mutual hazard network.

We can construct the necessary lineages $\pi_{ij}$ in the preprocessing step, so that for a given $\theta$ we can construct the $Q$ matrix using only JAX operations. This allows us to use fast numerical algebra operations as well as to obtain gradient of the loglikelihood automatically.
Additionally, by implementing custom forward substitution algorithm, we can work with the $\log$-values, which helps with numerical stability.

Limitations

As the shapes of the structures built for different trees are different, JIT has to recompile the function for each of the trees. The compilation for 100 trees took about ~45 second on my computer.
We use $O(N)$ memory, where $O(N)$ is the number of subtrees and $O(N^2)$ time.
We use padding, which can induce some additional cost, but makes it JAX-compatible.
To make the exit rates dependent on present mutations one has to construct the paths for exit rates. Currently we use placeholder values, which assume empty paths. (I.e., the rates do not depend on the mutations in this form, although the fix is easy once one decides how the paths should be constructed.)

laurenzkeller · 2023-11-24T15:11:15Z

I'm not sure if this version will be faster. Creating the paths matrix is indeed a good idea in terms of speed (we only need to create it once for each tree), however the preprocessing step will probably consume even more memory this way (I tried something similar). Regarding forward substitution: If you want to have as many calculations as you have non-zero entries, then you would need to iterate through the columns of the V matrix, not through the rows (because we solve the system V_transposed * x = b). However, one of the benefits of iterating through the rows first would be lost: When we determine a diagonal entry we can simultaneously find the off-diagonal entries in that same row (we don't need to recalculate the lambdas on the off-diagonal). If we iterate through the columns on the other hand, then we cannot use the diagonal entry to calculate the off-diagonal entries in the same column. Maybe it is still possible to calculate each distinct lambda exactly once, but you would be jumping around in the paths matrix all the time (so when you calculate an off-diagonal entry you would add the value to the corresponding diagonal entry).

pawel-czyz added 5 commits November 24, 2023 11:15

Construct data structures for conversion from MHN to rate matrix

9cf918f

Add loglikelihood calculation

fbdd298

Utilities for creating path matrices

af95b0a

WIP: slightly refactor the code

9efe79d

WIP: slightly refactor the code

2c65f43

pawel-czyz added 18 commits December 6, 2023 15:17

Refactoring the code.

c3ba998

WIP: Refactoring the code.

b92d8e3

WIP: Refactoring the code.

f5647f4

WIP: Annotate functions as untested

d5d4f7c

WIP: Add test for COOMatrix

e898b7d

Rename _log_rates.py to rates.py

d687817

Add tests for _construct_log_transition_rate

afad10c

Test _construct_log_exit_rate

ac6b007

Test _construct_log_Q_offdiag

feb935f

Test segment_logsumexp

5bb86c2

Test segment_logsumexp

7d5590b

Test _log_neg_Q_to_log_V

ab1d95b

Test _construct_log_U

ae3fdd8

Add smoke test

6368980

Add retrieving a single row

60b3f24

Add a test for solver.

1010105

Add the loglikelihood function

3e9f9ff

Add creating paths.

4f2532f

pawel-czyz marked this pull request as ready for review December 7, 2023 00:16

pawel-czyz added 🚂 enhancement New feature or request 👕 effort M labels Dec 7, 2023

pawel-czyz added 2 commits December 7, 2023 13:16

Add example

f9ca9ba

Merge branch 'main' into treemhn-jax

3aa9023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JAX backend of TreeMHN #30

JAX backend of TreeMHN #30

pawel-czyz commented Nov 24, 2023 •

edited

Loading

laurenzkeller commented Nov 24, 2023 •

edited

Loading

JAX backend of TreeMHN #30

Are you sure you want to change the base?

JAX backend of TreeMHN #30

Conversation

pawel-czyz commented Nov 24, 2023 • edited Loading

Premise

Limitations

laurenzkeller commented Nov 24, 2023 • edited Loading

pawel-czyz commented Nov 24, 2023 •

edited

Loading

laurenzkeller commented Nov 24, 2023 •

edited

Loading