[FORK][FEATURE] DQ IP: performance enhansments #272

Open
wants to merge 1 commit into base: v3.6_for_ie_master

dmitry-gorokhov (Collaborator) commented Jan 21, 2025

  • allocate auxiliary accumulator registers on the stack
  • precompute grouped source sums
  • optimize pointer arithmetic
  • reduce the number of auxiliary vector registers required by the microkernel
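
The description does not show the kernel code, so the following is only a rough sketch of what precomputing grouped source sums can look like. It assumes the sums are used for weight zero-point compensation in a grouped, dequantized inner product, via the identity sum_k (w[k] - zp[g]) * s[k] = sum_k w[k] * s[k] - zp[g] * sum_k s[k]. The function names `precompute_group_sums` and `dot_with_zero_points`, the data types, and the memory layout are all hypothetical and are not taken from this PR.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: one sum of quantized source values per group.
// Computed once per source row, outside the hot loop.
std::vector<int32_t> precompute_group_sums(const std::vector<int8_t>& src,
                                           std::size_t group_size) {
    std::vector<int32_t> sums((src.size() + group_size - 1) / group_size, 0);
    for (std::size_t k = 0; k < src.size(); ++k)
        sums[k / group_size] += src[k];
    return sums;
}

// Hypothetical dot product with one weight zero point per group.
// The zero-point term is applied once per group using the precomputed
// sums, so the inner loop is a plain integer multiply-accumulate.
int32_t dot_with_zero_points(const std::vector<uint8_t>& wei,
                             const std::vector<int8_t>& src,
                             const std::vector<int32_t>& zero_points,
                             const std::vector<int32_t>& group_sums) {
    int32_t acc = 0;
    for (std::size_t k = 0; k < src.size(); ++k)
        acc += static_cast<int32_t>(wei[k]) * static_cast<int32_t>(src[k]);
    for (std::size_t g = 0; g < group_sums.size(); ++g)
        acc -= zero_points[g] * group_sums[g];
    return acc;
}
```

Under these assumptions the group sums depend only on the source row, so they can be reused across every output channel instead of being recomputed inside the microkernel.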

dmitry-gorokhov force-pushed the feature/dq_ip_accum_stack_allocation branch 3 times, most recently from 9960b67 to d421730, January 24, 2025 08:54
dmitry-gorokhov force-pushed the feature/dq_ip_accum_stack_allocation branch from bc4e68a to b145489, February 3, 2025 06:46
dmitry-gorokhov changed the title from "[FORK][FIX] DQ IP: allocate aux accums via stack" to "FORK][FEATURE] DQ IP: performance enhansments", Feb 3, 2025
dmitry-gorokhov changed the title from "FORK][FEATURE] DQ IP: performance enhansments" to "[FORK][FEATURE] DQ IP: performance enhansments", Feb 3, 2025