Jacobi #779

Open
wants to merge 6 commits into
base: main


@yodada (Collaborator) commented Mar 29, 2022

This PR merges the Jacobi device code, which can be found at examples/cuda/test_jacobi/.

Jacobi 3D takes an input of size Nx * Ny * Nz. This implementation is unrolled along Nx and distributes Ny and Nz along tileX and tileY, respectively, so the minimal valid input is 64 * 18 * 10. Another valid input is 126 * 18 * 10. Note that along Nx each step reads 64 inputs and generates 62 outputs, so consecutive steps overlap by two inputs (126 = 64 + 62).
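To make the Nx constraint concrete, here is a minimal sketch of a size check. It assumes that valid Nx values follow the pattern implied by the two examples above (64, then 64 + 62 = 126, and so on); the helper names `is_valid_nx` and `steps_along_nx` are hypothetical and not part of this PR, and the actual kernel may impose further constraints on Ny and Nz that are not modeled here.

```cpp
#include <cassert>
#include <cstddef>

// Assumed constants, taken from the PR description: each step along Nx
// reads 64 inputs and produces 62 outputs.
constexpr std::size_t kReadsPerStep = 64;
constexpr std::size_t kOutputsPerStep = 62;

// Hypothetical helper: true if nx can be covered by whole steps,
// i.e. nx == 64 + 62 * k for some integer k >= 0.
constexpr bool is_valid_nx(std::size_t nx) {
    return nx >= kReadsPerStep &&
           (nx - kReadsPerStep) % kOutputsPerStep == 0;
}

// Hypothetical helper: number of steps along Nx.
// Only meaningful when is_valid_nx(nx) is true.
constexpr std::size_t steps_along_nx(std::size_t nx) {
    return (nx - kReadsPerStep) / kOutputsPerStep + 1;
}

// Compile-time checks against the two sizes named in the description.
static_assert(is_valid_nx(64), "64 is the minimal valid Nx");
static_assert(is_valid_nx(126), "126 = 64 + 62 is valid");
static_assert(!is_valid_nx(128), "128 is not 64 + 62 * k");
static_assert(steps_along_nx(126) == 2, "126 takes two overlapping steps");
```

Under this assumption, the next valid size after 126 would be 188, and a host-side test could call `is_valid_nx` before launching the kernel to reject unsupported sizes early.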

@drichmond (Collaborator)

Merged kernel code from manycore
