feature: enabling oneDPL and sort primitive refactoring #3046

Alexandr-Solovev · 2025-01-16T15:58:06Z

Description:

Feature: enabling oneDPL and sort primitive refactoring

Summary:

This PR introduces oneDPL enabling and radix sort replacement.

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.
I have provided justification why quality metrics have changed or why changes are not expected.
I have extended benchmarking suite and provided corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

WORKSPACE

david-cortes-intel · 2025-01-20T07:52:04Z

Before merging, please remember to add this new dependency to the installation instructions in INSTALL.md, along with instructions for setting necessary env. variables when using conda:
https://github.com/uxlfoundation/oneDAL/blob/main/INSTALL.md

Alexandr-Solovev · 2025-01-22T20:28:28Z

/intelci: run

.ci/pipeline/ci.yml

Alexandr-Solovev · 2025-01-23T10:19:04Z

/azp run CI

azure-pipelines · 2025-01-23T10:19:12Z

Azure Pipelines failed to run 1 pipeline(s).

david-cortes-intel · 2025-01-23T15:09:19Z

INSTALL.md

@@ -23,6 +23,7 @@ Required Software:
 * BLAS and LAPACK libraries - both provided by oneMKL
 * Python version 3.9 or higher
 * TBB library (repository contains script to download it)
+* oneDPL library


@Alexandr-Solovev Please remember to update also the conda instructions that appear towards the end of this file.

The package name for conda should be onedpl-devel, and it needs to update the list of environment variables to add DPL_ROOT.

icfaust · 2025-01-27T00:11:41Z

/intelci: run

Alexandr-Solovev · 2025-01-27T09:27:00Z

@icfaust currently it works only with custom ci branch:
http://intel-ci.intel.com/efd9a7d1-b105-f1dd-a5e6-a4bf010d0e2d

david-cortes-intel · 2025-01-27T15:09:35Z

INSTALL.md

@@ -113,9 +114,25 @@ is available as an alternative to the manual setup.

            ./dev/download_tbb.sh

-6. Download and install Python (version 3.9 or higher).
+6. Set up Intel(R) Threading Building Blocks (Intel(R) TBB):


Suggested change

6. Set up Intel(R) Threading Building Blocks (Intel(R) TBB):

6. Set up Intel(R) OneDPL

Vika-F · 2025-02-12T12:31:59Z

cpp/oneapi/dal/algo/decision_forest/backend/gpu/train_kernel_hist_impl_dpc.cpp

+                sycl::buffer<std::int64_t, 1> num_buf{
+                    &sum_result,
+                    sycl::range<1>(1)
+                }; // Create buffer with a single element
+
+                const sycl::nd_range<1> nd_range =
+                    bk::make_multiple_nd_range_1d(ctx.selected_row_total_count_, 1);
+
+                queue_
+                    .submit([&](sycl::handler& h) {
+                        // Create an accessor for the buffer
+                        sycl::accessor<std::int64_t,
+                                       1,
+                                       sycl::access::mode::read_write,
+                                       sycl::access::target::device>
+                            acc(num_buf, h);


Why are buffer-accessor APIs used here instead of USM?

Vika-F · 2025-02-12T12:44:56Z

cpp/oneapi/dal/backend/primitives/sort/sort_dpc.cpp

+    auto event = oneapi::dpl::experimental::kt::gpu::esimd::radix_sort_by_key<true, 8>(
+        queue,
+        val_in.get_mutable_data(),
+        val_in.get_mutable_data() + val_in.get_count(),
+        ind_in.get_mutable_data(),
+        dpl::experimental::kt::kernel_param<256, 32>{});


Magic value 8, 256, 32 should be at least explained here.

Also, it might be beneficial not to hardcode them, but let the algorithmic kernel define them. I think different values can be chosen for different algorithms, or for different hardware platforms for better performance.

Vika-F · 2025-02-12T12:50:06Z

cpp/oneapi/dal/backend/primitives/sort/sort_dpc.cpp

+    const auto col_count = val_in.get_dimension(1);
+    sycl::event radix_sort_event;
+
+    for (std::int64_t row = 0; row < row_count; ++row) {


Looks like the previous implementation can be more performant for the matrices with large number of rows and small number of columns.

I would not delete it, but provide some performance considerations when it is preferred to use the old implementation, and when - the new one.

init adding dpl

65d9322

david-cortes-intel reviewed Jan 20, 2025

View reviewed changes

WORKSPACE Show resolved Hide resolved

Alexandr-Solovev added 11 commits January 20, 2025 06:51

fixes for dpl

f8028b7

minor fix

0b553e8

minor fix

2a91928

minor fix for dpl from toolkit

ab367c0

minor fix for script

e053cdf

minor fixes

3f1a6fe

minor fix

700cd10

minor fix

809760f

minor fix for dpl

d01ea31

fix correct link

064bb12

minor fixes

6e3587d

Alexandr-Solovev added dpc++ Issue/PR related to DPC++ functionality dependencies Pull requests that update a dependency file labels Jan 22, 2025

Alexandr-Solovev changed the title ~~init adding dpl~~ feature: enabling oneDPL and sorting primitive refactoring Jan 22, 2025

Alexandr-Solovev marked this pull request as ready for review January 22, 2025 20:28

Alexandr-Solovev requested review from Alexsandruss, samir-nasibli, napetrov, homksei, ahuber21 and ethanglaser as code owners January 22, 2025 20:28

Alexandr-Solovev changed the title ~~feature: enabling oneDPL and sorting primitive refactoring~~ feature: enabling oneDPL and sort primitive refactoring Jan 22, 2025

napetrov reviewed Jan 22, 2025

View reviewed changes

.ci/pipeline/ci.yml Outdated Show resolved Hide resolved

Alexandr-Solovev added 2 commits January 23, 2025 09:24

Merge branch 'uxlfoundation:main' into dev/asolovev_radix_sort_opt

0d9edd6

minor fix

a60eb07

Alexandr-Solovev requested a review from Vika-F as a code owner January 23, 2025 09:18

Alexandr-Solovev requested a review from maria-Petrova as a code owner January 23, 2025 09:18

minor fix

16c8f6c

david-cortes-intel reviewed Jan 23, 2025

View reviewed changes

david-cortes-intel reviewed Jan 27, 2025

View reviewed changes

Alexandr-Solovev added 8 commits February 5, 2025 08:45

Merge branch 'uxlfoundation:main' into dev/asolovev_radix_sort_opt

91410dc

fixes

8fd11bb

fixes for memory

bf9d31f

reduce memory usage

4575642

optimizations

16b1ca5

fixes for tree_order

09995cf

initial internal dispatcher

0ecfd35

fixes

94b260b

Vika-F reviewed Feb 12, 2025

View reviewed changes

minor fixes

5bdd54f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: enabling oneDPL and sort primitive refactoring #3046

feature: enabling oneDPL and sort primitive refactoring #3046

Alexandr-Solovev commented Jan 16, 2025 •

edited

Loading

david-cortes-intel commented Jan 20, 2025

Alexandr-Solovev commented Jan 22, 2025

Alexandr-Solovev commented Jan 23, 2025

azure-pipelines bot commented Jan 23, 2025

david-cortes-intel Jan 23, 2025

icfaust commented Jan 27, 2025

Alexandr-Solovev commented Jan 27, 2025

david-cortes-intel Jan 27, 2025

Vika-F Feb 12, 2025

Vika-F Feb 12, 2025

Vika-F Feb 12, 2025

	6. Set up Intel(R) Threading Building Blocks (Intel(R) TBB):
	6. Set up Intel(R) OneDPL

feature: enabling oneDPL and sort primitive refactoring #3046

Are you sure you want to change the base?

feature: enabling oneDPL and sort primitive refactoring #3046

Conversation

Alexandr-Solovev commented Jan 16, 2025 • edited Loading

Description:

Summary:

david-cortes-intel commented Jan 20, 2025

Alexandr-Solovev commented Jan 22, 2025

Alexandr-Solovev commented Jan 23, 2025

azure-pipelines bot commented Jan 23, 2025

david-cortes-intel Jan 23, 2025

Choose a reason for hiding this comment

icfaust commented Jan 27, 2025

Alexandr-Solovev commented Jan 27, 2025

david-cortes-intel Jan 27, 2025

Choose a reason for hiding this comment

Vika-F Feb 12, 2025

Choose a reason for hiding this comment

Vika-F Feb 12, 2025

Choose a reason for hiding this comment

Vika-F Feb 12, 2025

Choose a reason for hiding this comment

Alexandr-Solovev commented Jan 16, 2025 •

edited

Loading