Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Execution of Inference Workloads on Hikey970 with layer splitting #882

Closed
Shraddhaa1 opened this issue Mar 15, 2021 · 17 comments
Closed

Execution of Inference Workloads on Hikey970 with layer splitting #882

Shraddhaa1 opened this issue Mar 15, 2021 · 17 comments
Assignees
Labels
Milestone

Comments

@Shraddhaa1
Copy link

Hello,

I am currently working on executing inference workloads on Hikey970. I am trying to split the layers of a network amongst CPU and GPU, and run the workloads to reduce inference latency. I am following the repo attached below to run the models with CPU and GPU utilization.

https://github.com/adityagupta1089/ComputeLibrary.git

Could you guys help me understand how I can split the layers of the network and assign them to CPU and GPU?

Is there any API specific for CPU and GPU in ARM-CL?

Thanks.

@morgolock morgolock self-assigned this Mar 16, 2021
@morgolock morgolock added this to the v21.05 milestone Mar 16, 2021
@morgolock
Copy link

Hi @Shraddhaa1

The graph api in ACL is experimental and does not support that level of granularity to specify the backend for each individual layer.

You could experiment with the functions interface which lets you mix GPU and CPU kernels, please see the example: https://github.com/ARM-software/ComputeLibrary/blob/master/examples/neoncl_scale_median_gaussian.cpp

Hope this helps.

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Apr 9, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jun 14, 2021 via email

@developer-compute
Copy link

developer-compute commented Jun 17, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jun 21, 2021 via email

@developer-compute
Copy link

developer-compute commented Jun 22, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jun 22, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jun 23, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jul 1, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jul 9, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Jul 12, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Aug 24, 2021 via email

@HungYangChang
Copy link

@Shraddhaa1 I am also working on using ARM CL with HiKey 970. Would you like to discuss this?

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Sep 11, 2021 via email

@HungYangChang
Copy link

Hello @Shraddhaa1

I indeed went through the GitHub you share above, but for my work I will only focus on using CPU.
Here is the reference github: https://github.com/Ehsan-aghapour/ARMCL-pipe-all.

Btw now I have moved to ARM NN, because ARM NN is built on top of ARM CL. you can check ARM NN for more info.

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Sep 14, 2021 via email

@Shraddhaa1
Copy link
Author

Shraddhaa1 commented Sep 25, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants