> For the complete documentation index, see [llms.txt](https://docs.perfectscale.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.perfectscale.io/visibility-and-optimization/gpu-optimization.md).

# GPU optimization

{% hint style="info" %}
PerfectScale now only supports NVIDIA Data Center GPU Manager (DCGM).
{% endhint %}

{% hint style="info" %}
**GPU** support is available starting with the **exporter version 1.0.55**.\
**GPU memory** support is available starting with the **exporter version 1.1.11**.
{% endhint %}

PerfectScale delivers exceptional **GPU** and **GPU memory** utilization visibility to monitor and optimize GPU resources within your Kubernetes clusters. This feature helps teams identify optimization opportunities, reduce resource waste, and improve overall K8s efficiency.

{% hint style="warning" %}
To enable GPU visibility support in PerfectScale, the **NVIDIA DCGM exporter** should be installed. Additionally, specific configuration parameters should be set when deploying or upgrading the PerfectScale agent. Learn more [here](/getting-started/how-to-onboard-a-cluster.md#gpu-support).
{% endhint %}

When PerfectScale detects active GPU resources within a cluster, it automatically enables GPU and GPU memory widgets and utilization insights in the UI.&#x20;

<figure><img src="/files/D0eDhv79Lg1rDmu4J3ge" alt=""><figcaption><p>GPU widgets</p></figcaption></figure>

This view provides detailed GPU usage and allocation efficiency metrics, helping you identify savings opportunities and drive further data-driven Kubernetes optimization.

## Podfit GPU visibility

<figure><img src="/files/u0Le4WV3xBvVkPUDNWqy" alt=""><figcaption><p>GPU view</p></figcaption></figure>

To quickly identify GPU-allocated workloads in **PodFit**, switch to the **GPU view** by clicking the GPU tab from the view selector, as shown below. Once there, you will see the GPU and GPU memory utilization metrics per container, as well as the GPU scheduler type, for example, KAI, and the sharing type, for example, TimeSlicing. Sort the table by GPU usage to bring all GPU-consuming workloads to the top.

Clicking on a specific workload opens the detailed workload view - Zoom-in window. This panel provides in-depth information about the workload’s current state and behavior, along with historical data on resource allocation and utilization over time. It includes GPU and GPU memory utilization metrics as well as other key workload metrics. Learn more about zoom-in capabilities [here](/visibility-and-optimization/podfit-or-vertical-pod-right-sizing.md#detailed-workload-analysis).

<figure><img src="/files/PDMmvUf1KGtbXGJ0Wfw5" alt=""><figcaption><p>Workload GPU utilization widgets</p></figcaption></figure>

## Infrafit GPU visibility

To see detailed GPU usage across your infrastructure, go to **InfraFit**. The GPU chart shows how much of your GPUs are being used versus how much was requested, making it easy to spot inefficiencies and find ways to optimize.

<figure><img src="/files/RnJAP2nvUpzfXj410KhT" alt=""><figcaption><p>Node group GPU utilization</p></figcaption></figure>

This view helps you quickly evaluate the difference between requested GPU and GPU memory resources and actual usage, making it easy to pinpoint underutilized or idle GPU capacity across your clusters.&#x20;

By clicking on the specific node group, you will get a granular breakdown of individual instances within that group, along with key metrics for each one.

<figure><img src="/files/Zfkpo1rkOHSm60mTbbon" alt=""><figcaption><p>Node type GPU utilization</p></figcaption></figure>

Clicking on a specific instance will display a list of workloads running on that machine, allowing for deeper investigation and analysis.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.perfectscale.io/visibility-and-optimization/gpu-optimization.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
