Added cuda kernel for measuring cosimulation performance #363

Open
drichmond wants to merge 1 commit into master from ci_cuda_perf_test

Conversation

drichmond
Contributor

@drichmond drichmond commented Jul 22, 2020

This kernel does a simple vector add. The source is padded with nops so that the binary is 1K instructions, which means the entire icache has to be initialized for each tile.

(I couldn't get unrolling and nops to cooperate, so I copy/pasted.)
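For readers who want to picture the padding idea, here is a minimal sketch in plain C. It is not the code from this PR. It assumes a GCC- or Clang-style compiler that accepts `__asm__ volatile("nop")`, and the names `vec_add_padded`, `NOP`, and the `REP*` macros are hypothetical. It uses preprocessor repetition as one possible alternative to copy/pasting, and the pad count of 256 is only illustrative, since the number of nops needed to reach a given binary size depends on the target and on the rest of the kernel.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical repetition macros: each level expands its argument 4x. */
#define REP4(x)   x x x x
#define REP16(x)  REP4(REP4(x))
#define REP64(x)  REP4(REP16(x))
#define REP256(x) REP4(REP64(x))

/* One no-op instruction. `volatile` keeps the compiler from deleting it.
 * Assumes the target assembler accepts the `nop` mnemonic. */
#define NOP __asm__ volatile("nop");

/* Element-wise vector add, followed by straight-line nop padding.
 * The padding changes code size only; it does not affect the result. */
void vec_add_padded(const int *a, const int *b, int *c, size_t n)
{
    assert(a != NULL && b != NULL && c != NULL);

    for (size_t i = 0; i < n; ++i) {
        c[i] = a[i] + b[i];
    }

    /* 256 nops emitted inline (illustrative count, not the PR's). */
    REP256(NOP)
}
```

Because the nops come from macro expansion rather than a loop, the compiler has no loop to unroll or collapse, so the emitted instruction count stays fixed regardless of optimization level.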

@drichmond drichmond requested a review from bornaehsani July 22, 2020 23:08
@drichmond drichmond force-pushed the ci_cuda_perf_test branch from 2b5473a to f4cca05 Compare July 29, 2020 18:39
Contributor

@bornaehsani bornaehsani left a comment

Very creative way to fill the icache :P

2 participants