
#0: [tt-train] DRAFT add new ttml ops #17814

Draft · wants to merge 1 commit into base: main
Conversation

@jaykru-tt (Contributor) commented on Feb 11, 2025

Placeholder PR to save some work that spilled out of another PR:

  • Adds matmul and sqrt with backward support (the sqrt backward rule is sketched below)
  • An attempt at adding a generic backward with broadcasting for ttml::mul; IIRC this is incomplete and ought to be revisited.
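For reference, the backward rule for sqrt is the usual chain-rule expression: if y = sqrt(x), then dL/dx = dL/dy / (2·sqrt(x)). Below is a minimal standalone sketch of that rule on plain std::vector data; it is illustrative only, not the actual ttml/ttnn tensor code.

```cpp
#include <cmath>
#include <vector>

// Elementwise sqrt backward: grad_x[i] = grad_out[i] / (2 * sqrt(x[i])).
std::vector<float> sqrt_backward(const std::vector<float>& x, const std::vector<float>& grad_out) {
    std::vector<float> grad_x(x.size());
    for (size_t i = 0; i < x.size(); ++i) {
        grad_x[i] = grad_out[i] / (2.0F * std::sqrt(x[i]));
    }
    return grad_x;
}
```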

Ticket

Link to Github Issue

Problem description

Provide context for the problem.

What's changed

Describe the approach used to solve the problem.
Summarize the changes made and their impact.

Checklist

@jaykru-tt changed the title from "#0: add new utils and ttml ops" to "#0: [tt-train] DRAFT add new ttml ops" on Feb 11, 2025
@@ -326,4 +326,43 @@ template tt::tt_metal::Tensor from_xtensor<uint32_t, DataType::UINT32>(
const XTensorToMeshVariant<uint32_t>& composer,
Layout layout);

ttnn::Tensor unsqueeze_to_rank(const ttnn::Tensor& t, size_t rank) {
auto logical_shape = t.get_logical_shape();
Contributor:
@sminakov-tt could you take a look at this function please?

ttnn::Tensor unsqueeze_to_rank(const ttnn::Tensor& t, size_t rank) {
Contributor:
please throw if rank > 4 or 8?

auto logical_shape = t.get_logical_shape();
auto physical_shape = t.get_padded_shape();
auto t_rank = logical_shape.rank();
TT_FATAL(t_rank >= rank, "Cannot squeeze to rank {} from rank {}", rank, t_rank);
Contributor:
please don't use TT_FATAL in tt-train. Just check and throw an exception.
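Taken together, the two suggestions above might look roughly like the sketch below, reusing the ttnn::Tensor type and shape accessors from the diff: validate the target rank up front and replace TT_FATAL with a plain check-and-throw (the condition and message are carried over verbatim from the diff; the reshape logic itself is elided).

```cpp
#include <stdexcept>
#include <string>

ttnn::Tensor unsqueeze_to_rank(const ttnn::Tensor& t, size_t rank) {
    // Suggested guard: reject target ranks beyond what the tensor layer supports (4, or 8 if enabled).
    if (rank > 4U) {
        throw std::invalid_argument("unsqueeze_to_rank: unsupported target rank " + std::to_string(rank));
    }
    auto logical_shape = t.get_logical_shape();
    auto t_rank = logical_shape.rank();
    // Same condition TT_FATAL guarded, expressed as a plain check-and-throw.
    if (!(t_rank >= rank)) {
        throw std::invalid_argument(
            "Cannot squeeze to rank " + std::to_string(rank) + " from rank " + std::to_string(t_rank));
    }
    // ... reshape logic from the diff ...
    return t;
}
```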

@@ -4,6 +4,7 @@

#include "binary_ops.hpp"

#include <core/compute_kernel_config.hpp>
Contributor:
please use "" (i.e., #include "core/compute_kernel_config.hpp" rather than the angle-bracket form)


// Overload supporting generic sum over multiple dimensions
tt::tt_metal::Tensor sum_moreh(
const tt::tt_metal::Tensor& t, std::optional<ttnn::SmallVector<int64_t>> dims, bool keep_dim) {
Contributor:
please don't use optional.

Contributor (Author):
I am not super familiar with best practice for std::optional in C++, so I checked the Google C++ style guide. It says to use optional for by-value parameters that are optional. I want to give callers an easy way to sum over all dims without having to construct the SmallVector themselves.

In this case I have, I think, 4 options:

  1. Use std::optional. nullopt passed -> sum over all dims.
  2. Use a const * and treat nullptr as nothing passed -> sum over all dims. This is suitable for this specific case because we should avoid passing the vector by value anyway.
  3. Special-case the empty vector as the all-dims case and use that as the default value for the parameter. This seems arbitrary to me and isn't generally applicable to all types.
  4. Overload without the optional parameter.

Is a nullable const * okay in this case? And in general, what should we do for pass-by-value parameters where there isn't an obvious value to signal the nothing-passed case? (A sketch comparing options 1 and 4 is below.)

Thanks in advance for your guidance 😁
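For comparison, here is a minimal sketch of options 1 and 4 side by side (declarations only, reusing the types from the diff; not the final API):

```cpp
// Option 1: std::optional, where std::nullopt means "sum over all dims".
tt::tt_metal::Tensor sum_moreh(
    const tt::tt_metal::Tensor& t,
    std::optional<ttnn::SmallVector<int64_t>> dims = std::nullopt,
    bool keep_dim = false);

// Option 4: overloads, no optional at the call site.
tt::tt_metal::Tensor sum_moreh(const tt::tt_metal::Tensor& t, bool keep_dim = false);  // all dims
tt::tt_metal::Tensor sum_moreh(
    const tt::tt_metal::Tensor& t, const ttnn::SmallVector<int64_t>& dims, bool keep_dim = false);
```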

@@ -102,6 +103,42 @@ autograd::TensorPtr operator*(const autograd::TensorPtr& a, const autograd::Tens
auto a_grad = ttnn::multiply(out->get_grad(), b->get_value());
auto b_grad = ttnn::multiply(out->get_grad(), a->get_value());

auto clamp_to_rank = [](const ttnn::Tensor& tensor, size_t rank) {
Contributor:
@jaykru-tt if you decide to add broadcasting, it should work for all binary ops and be implemented differently: as functions that are independent of the exact op. If we add this code to each op, it will look pretty bad.
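One op-independent way to do that (a hedged sketch in plain C++, not the ttml/ttnn API): from the input's shape and the output-gradient's shape, compute which axes were broadcast and therefore need to be summed in the backward pass. Every binary op's backward could then call this single helper on its gradient before reducing and reshaping, instead of each op carrying its own clamp/unsqueeze code.

```cpp
#include <cstdint>
#include <stdexcept>
#include <vector>

// Given an op input's shape and the output gradient's (broadcast) shape, return the gradient
// axes that must be summed so the reduced gradient matches the input shape again.
std::vector<int64_t> broadcast_reduction_axes(
    const std::vector<int64_t>& input_shape, const std::vector<int64_t>& grad_shape) {
    if (grad_shape.size() < input_shape.size()) {
        throw std::invalid_argument("gradient rank must be >= input rank");
    }
    std::vector<int64_t> axes;
    const size_t rank_diff = grad_shape.size() - input_shape.size();
    for (size_t axis = 0; axis < grad_shape.size(); ++axis) {
        if (axis < rank_diff) {
            // Leading axes the input never had: always reduced.
            axes.push_back(static_cast<int64_t>(axis));
        } else if (input_shape[axis - rank_diff] == 1 && grad_shape[axis] != 1) {
            // Axes where the input was broadcast from size 1: reduced with keep_dim.
            axes.push_back(static_cast<int64_t>(axis));
        }
    }
    return axes;
}
```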

/* program_config */ std::nullopt,
/* activation */ std::nullopt,
/* compute_kernel_config */ core::ComputeKernelConfig::matmul(),
/* core_grid */ std::nullopt, // NOTE: I believe matmul will use the
Contributor:
I already left a comment about this in the other PR. Please use our core grid. If we decide to rely on the default parameter, it should be used everywhere.

Contributor (Author):
Definitely will change this; I just haven't yet since it won't make it into the other PR.

autograd::TensorPtr div(const autograd::TensorPtr& a, const autograd::TensorPtr& b);
autograd::TensorPtr div(const autograd::TensorPtr& a, float b);
Contributor:
If we have mul(scalar, tensor), we should have a div counterpart too.
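For symmetry, the missing overload would presumably be declared alongside the existing ones from the diff (a sketch, not the final signature):

```cpp
// Existing declarations from the diff:
autograd::TensorPtr div(const autograd::TensorPtr& a, const autograd::TensorPtr& b);
autograd::TensorPtr div(const autograd::TensorPtr& a, float b);

// Suggested counterpart to mul(scalar, tensor): scalar numerator divided elementwise by the tensor.
autograd::TensorPtr div(float a, const autograd::TensorPtr& b);
```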

}

autograd::TensorPtr sum(const autograd::TensorPtr& tensor) {
auto out = autograd::create_tensor();
Contributor:
I don't like a sum op without a dims parameter.
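A possible shape for that (a hedged sketch reusing the dims representation from sum_moreh above; not the final ttml API):

```cpp
// Sum over an explicit set of dimensions; a separate overload (or a default) can keep the
// existing "sum everything" behaviour used for loss reductions.
autograd::TensorPtr sum(
    const autograd::TensorPtr& tensor, const ttnn::SmallVector<int64_t>& dims, bool keep_dim = false);
```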

@@ -45,6 +46,43 @@ TEST_F(UnaryOpsTest, GlobalMean) {
}
}

TEST_F(UnaryOpsTest, Sum) {
Contributor:
You must add tests for all your new ops (a hedged example test follows this list):

  1. matmul
  2. all new overloads of mul, div, etc.
  3. broadcasting (but I'd make a separate PR for broadcasting, as it is a pretty complex feature)
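For example, a forward-pass matmul test in the style of the existing fixtures might look like the sketch below. The fixture name, the make_tensor/to_vector helpers, and the ops::matmul entry point are assumptions standing in for tt-train's real test utilities, not the actual API.

```cpp
TEST_F(BinaryOpsTest, MatmulForward) {
    // 2x2 @ 2x2 with a hand-computed expectation: [[1,2],[3,4]] x [[5,6],[7,8]] = [[19,22],[43,50]].
    auto a = make_tensor({1.F, 2.F, 3.F, 4.F}, /*shape=*/{1, 1, 2, 2});  // hypothetical helper
    auto b = make_tensor({5.F, 6.F, 7.F, 8.F}, /*shape=*/{1, 1, 2, 2});

    auto out = ops::matmul(a, b);  // assumed entry point for the new op

    auto result = to_vector(out);  // hypothetical helper reading the tensor back to host
    std::vector<float> expected{19.F, 22.F, 43.F, 50.F};
    ASSERT_EQ(result.size(), expected.size());
    for (size_t i = 0; i < expected.size(); ++i) {
        EXPECT_NEAR(result[i], expected[i], 1e-2F);
    }
}
```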
