From a9c8b990f2116f123a36c56928458789540bc267 Mon Sep 17 00:00:00 2001 From: Sebastian Golebiewski Date: Thu, 25 Jul 2024 18:27:27 +0200 Subject: [PATCH] [DOCS] Updating NPU device article for master (#25734) Porting: https://github.com/openvinotoolkit/openvino/pull/25727 --- .../npu-device.rst | 36 +++++++++++++------ 1 file changed, 26 insertions(+), 10 deletions(-) diff --git a/docs/articles_en/openvino-workflow/running-inference/inference-devices-and-modes/npu-device.rst b/docs/articles_en/openvino-workflow/running-inference/inference-devices-and-modes/npu-device.rst index 4c262b49f6f704..f701774d19e42e 100644 --- a/docs/articles_en/openvino-workflow/running-inference/inference-devices-and-modes/npu-device.rst +++ b/docs/articles_en/openvino-workflow/running-inference/inference-devices-and-modes/npu-device.rst @@ -164,8 +164,8 @@ offer a limited set of supported OpenVINO features. **ov::intel_npu::compilation_mode_params** -``ov::intel_npu::compilation_mode_params`` is an NPU-specific property that allows to -control model compilation for NPU. +``ov::intel_npu::compilation_mode_params`` is an NPU-specific property that allows +control of model compilation for NPU. .. note:: @@ -176,7 +176,7 @@ Following configuration options are supported: **optimization-level** -Defines a preset of optimization passes to be applied during compilation. +Defines an optimization effort hint to the compiler. .. list-table:: :widths: 10 200 @@ -185,7 +185,7 @@ Defines a preset of optimization passes to be applied during compilation. * - **Value** - **Description** * - 0 - - Reduced subset of optimization passes. Smaller compile time. + - Reduced subset of optimization passes. May result in smaller compile time. * - 1 - **Default.** Balanced performance/compile time. * - 2 @@ -193,7 +193,7 @@ Defines a preset of optimization passes to be applied during compilation. **performance-hint-override** -An extension for LATENCY mode being specified using ``ov::hint::performance_mode`` +The LATENCY mode can be overridden by specifying ``ov::hint::performance_mode`` Has no effect for other ``ov::hint::PerformanceMode`` hints. .. list-table:: @@ -207,15 +207,31 @@ Has no effect for other ``ov::hint::PerformanceMode`` hints. * - latency - Prioritize performance over power efficiency. -.. tab-set:: +Usage example: - .. tab-item:: Usage example +.. code-block:: - .. code-block:: + map config = {ov::intel_npu::compilation_mode_params.name(), ov::Any("optimization-level=1 performance-hint-override=latency")}; + + compile_model(model, config); + +**npu_turbo** + +The turbo mode, where available, provides a hint to the system to maintain the +maximum NPU frequency and memory throughput within the platform TDP limits. +The turbo mode is not recommended for sustainable workloads due to higher power +consumption and potential impact on other compute resources. + +.. code-block:: + + core.set_property("NPU", ov::intel_npu::turbo(true)); + +or + +.. code-block:: - map config = {ov::intel_npu::compilation_mode_params.name(), ov::Any("optimization-level=1 performance-hint-override=latency")}; + core.compile_model(ov_model, "NPU", {ov::intel_npu::turbo(true)}); - compile_model(model, config); Limitations #############################