Enable optimizations on Microsoft.ML.OnnxRuntime with RelWithDebInfo config #23463

Merged: 1 commit into microsoft:main on Jan 24, 2025

Conversation

verdie-g (Contributor)

Microsoft.ML.OnnxRuntime is not built with the Release configuration but with RelWithDebInfo, which is not recognized by the MSBuild SDK. Consequently, optimizations are not enabled. A fix would be to simply force the configuration to Release when building the .NET code, even if RelWithDebInfo was set in the command-line arguments, but I could not find an easy way to do that. Instead, I mimic the behavior of the Release configuration by setting the Optimize property.
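
As an illustration only (the exact project file, condition, and placement are assumptions, not the actual diff), the idea is to turn on IL optimizations explicitly when the configuration is RelWithDebInfo:

<!-- Hypothetical sketch: the MSBuild SDK only enables optimizations for the
     Release configuration, so set the Optimize property explicitly when the
     native build configuration RelWithDebInfo is passed through. -->
<PropertyGroup Condition="'$(Configuration)' == 'RelWithDebInfo'">
  <Optimize>true</Optimize>
</PropertyGroup>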

I can see a 15% performance improvement using this simple model summing up the 3 inputs:

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Configs;
using BenchmarkDotNet.Running;
using Microsoft.ML.OnnxRuntime;

var config = DefaultConfig.Instance; //.WithOptions(ConfigOptions.DisableOptimizationsValidator);
BenchmarkSwitcher.FromAssembly(typeof(Program).Assembly).Run(args, config);

public class OnnxBench
{
    private const int Iterations = 100_000;
    private const int BatchSize = 50;
    
    private InferenceSession _session = default!;
    private string[] _inputNames = default!;
    private OrtValue[] _inputValues = default!;
    private RunOptions _runOptions = default!;

    [GlobalSetup]
    public void GlobalSetup()
    {
        using SessionOptions sessionOptions = new();
        sessionOptions.InterOpNumThreads = 1;
        sessionOptions.IntraOpNumThreads = 1;
        sessionOptions.GraphOptimizationLevel = GraphOptimizationLevel.ORT_ENABLE_ALL;
        sessionOptions.ExecutionMode = ExecutionMode.ORT_SEQUENTIAL;

        // The Base64 string below is a minimal ONNX model that adds the three
        // float inputs A, B, and C (shape [BatchSize, 1] here) and returns their sum.
        _session = new InferenceSession(
            Convert.FromBase64String("CAo6cAoOCgFBCgFCEgFEIgNBZGQKDgoBQwoBRBIBWCIDQWRkEgJscloRCgFBEgwKCggBEgYKAAoCCAFaEQoBQhIMCgoIARIGCgAKAggBWhEKAUMSDAoKCAESBgoACgIIAWIRCgFYEgwKCggBEgYKAAoCCAFCBAoAEBU="),
            sessionOptions);
        _inputNames = ["A", "B", "C"];
        _inputValues =
        [
            OrtValue.CreateTensorValueFromMemory(new float[BatchSize], [BatchSize, 1]),
            OrtValue.CreateTensorValueFromMemory(new float[BatchSize], [BatchSize, 1]),
            OrtValue.CreateTensorValueFromMemory(new float[BatchSize], [BatchSize, 1]),
        ];
        _runOptions = new RunOptions();
    }

    [Benchmark(OperationsPerInvoke = Iterations)]
    public float Run()
    {
        var inputValues0Span = _inputValues[0].GetTensorMutableDataAsSpan<float>();
        var inputValues1Span = _inputValues[1].GetTensorMutableDataAsSpan<float>();
        var inputValues2Span = _inputValues[2].GetTensorMutableDataAsSpan<float>();
        for (int i = 0; i < BatchSize; i += 1)
        {
            inputValues0Span[i] = Random.Shared.NextSingle();
            inputValues1Span[i] = Random.Shared.NextSingle();
            inputValues2Span[i] = Random.Shared.NextSingle();
        }
        
        float sum = 0f;
        for (int i = 0; i < Iterations; i += 1)
        {
            using var output = _session.Run(_runOptions, _inputNames, _inputValues, _session.OutputNames);
            ReadOnlySpan<float> outputData = output[0].GetTensorDataAsSpan<float>();
            for (int j = 0; j < outputData.Length; j += 1)
            {
                sum += outputData[j];
            }
        }
        
        return sum;
    }
}
| Method | Mean     | Error     | StdDev    |
|------- |---------:|----------:|----------:|
| Before | 5.003 us | 0.0318 us | 0.0297 us |
| After  | 4.325 us | 0.0568 us | 0.0503 us |
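
To reproduce, the benchmark above can be run from a console project that references BenchmarkDotNet and the Microsoft.ML.OnnxRuntime package under test (project setup assumed, not part of this PR):

dotnet run -c Release -- --filter '*OnnxBench*'

When the referenced package was built without the Optimize property, BenchmarkDotNet's optimizations validator flags the non-optimized assembly; the commented-out ConfigOptions.DisableOptimizationsValidator option in the setup above can be used to bypass that check.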

@snnn (Member) commented Jan 22, 2025

It was for reducing binary size.

@snnn added the api:CSharp label (issues related to the C# API) on Jan 22, 2025
@verdie-g (Contributor, Author)

Note that I'm not sure it's exactly equivalent to building in Release, but it's a good first step.

@snnn (Member) commented Jan 23, 2025

/azp run Big Models, Linux Android Emulator QNN CI Pipeline, Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline

@snnn (Member) commented Jan 23, 2025

/azp run Linux OpenVINO CI Pipeline, Linux QNN CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows CPU CI Pipeline,

@snnn (Member) commented Jan 23, 2025

/azp run Windows GPU CUDA CI Pipeline, Windows GPU DML CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows x64 QNN CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline,

Azure Pipelines successfully started running 6 pipeline(s). (1 similar comment)

Azure Pipelines successfully started running 7 pipeline(s).

@snnn merged commit c89a798 into microsoft:main on Jan 24, 2025. 82 checks passed.
@snnn (Member) commented Jan 24, 2025

Thanks. I merged it.
