> For the complete documentation index, see [llms.txt](https://docs.perfectscale.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.perfectscale.io/2.0-self-hosted-or-perfectscale-documentation/visibility-and-optimization/gpu-optimization.md).

# GPU optimization

{% hint style="info" %}
PerfectScale now only supports NVIDIA Data Center GPU Manager (DCGM). GPU support is available starting with the exporter version 1.0.55.
{% endhint %}

PerfectScale delivers exceptional GPU utilization visibility to monitor and optimize GPU resources within your Kubernetes clusters. This feature helps teams identify optimization opportunities, reduce resource waste, and improve overall K8s efficiency.

{% hint style="warning" %}
To enable GPU visibility support in PerfectScale, the **NVIDIA DCGM exporter** should be installed. Additionally, specific configuration parameters should be set when deploying or upgrading the PerfectScale agent. Learn more [here](/2.0-self-hosted-or-perfectscale-documentation/getting-started/how-to-onboard-a-cluster.md#gpu-support).
{% endhint %}

When PerfectScale detects active GPU resources within a cluster, it automatically enables GPU-specific widgets and utilization insights in the UI. These components provide detailed metrics on GPU usage, allocation efficiency, and workload distribution, enabling data-driven K8s optimization.

## Podfit GPU visibility

To quickly identify GPU-allocated workloads in **PodFit**, switch to the **GPU view** by clicking the GPU tab from the view selector, as shown below, then sort the table by GPU usage. This will bring all GPU-consuming workloads to the top.

<figure><img src="/files/ldr4l3o6jGOJ9DoRuUSJ" alt=""><figcaption><p>GPU view - Podfit</p></figcaption></figure>

Click on a workload to open the detailed Zoom-in view. This panel provides in-depth information about the workload’s current state and behavior, along with historical data on resource allocation and utilization over time. It includes GPU utilization metrics, as well as other resource usage, and detected performance risks. To learn more about zoom-in capabilities, [go here](/2.0-self-hosted-or-perfectscale-documentation/visibility-and-optimization/podfit-or-vertical-pod-right-sizing.md#detailed-workload-analysis).

<figure><img src="/files/bWZjisRSKqtYwKwjgGKZ" alt=""><figcaption><p>Workload details - GPU widget</p></figcaption></figure>

## Infrafit GPU visibility

To see detailed GPU usage across your infrastructure, go to **InfraFit**. The GPU chart shows how much of your GPUs are being used versus how much was requested, making it easy to spot inefficiencies and find ways to optimize.

<figure><img src="/files/nUjiEr2LfTIO81fBYg5s" alt=""><figcaption><p>GPU view - Infrafit</p></figcaption></figure>

This view helps you quickly evaluate the difference between requested GPU resources and actual usage, making it easy to pinpoint underutilized or idle GPU capacity across your clusters.&#x20;

By clicking on the specific node group, you will get a granular breakdown of individual instances within that group, along with key metrics for each one.

<figure><img src="/files/MPe8Vj1j8lzOBg7QlEhV" alt=""><figcaption><p>GPU utilization by instance</p></figcaption></figure>

Clicking on a specific instance will display a list of workloads running on that machine, allowing for deeper investigation and analysis.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.perfectscale.io/2.0-self-hosted-or-perfectscale-documentation/visibility-and-optimization/gpu-optimization.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.