Description
It would be useful to have the ability in WebGPU to query the available/recommended maximum GPU memory, e.g., to determine whether (and how much of) a local LLM can be offloaded to the GPU. Apologies if this has been discussed before; I couldn't find any existing issues. I also realize there are security considerations around this (https://www.w3.org/TR/webgpu/#security-memory-resources), but I think the use cases are important enough that it's worth figuring out whether it's possible. For example, WebGPU could follow the convention of JavaScript's Device Memory API and only expose very rough thresholds, which would still be useful for applications.
Right now, the value I am using is maxBufferSize, but I don't think it is always representative of the memory actually available. I'd also appreciate pointers to another method if there is an accepted one.
There are ways to expose this information on the existing backends:
Metal
- recommendedMaxWorkingSetSize for the total memory.
- optionally, currentAllocatedSize to avoid multiple applications clobbering each other (sketch below).
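
To illustrate, here is a minimal sketch of those two queries. I'm using Apple's metal-cpp C++ bindings purely for illustration (the Objective-C properties have the same names), and assuming a macOS-class device where recommendedMaxWorkingSetSize is available:

```cpp
// Minimal sketch using metal-cpp. In exactly one translation unit, define
// NS_PRIVATE_IMPLEMENTATION and MTL_PRIVATE_IMPLEMENTATION before this include.
#include <Metal/Metal.hpp>
#include <cstdint>
#include <cstdio>

int main() {
    MTL::Device* device = MTL::CreateSystemDefaultDevice();
    if (!device) return 1;

    // Memory budget Metal recommends staying under for this device, in bytes.
    uint64_t recommended = device->recommendedMaxWorkingSetSize();
    // Memory this device object has currently allocated for its resources, in bytes.
    uint64_t allocated = device->currentAllocatedSize();

    std::printf("recommended working set: %llu bytes, currently allocated: %llu bytes\n",
                (unsigned long long)recommended, (unsigned long long)allocated);

    device->release();
    return 0;
}
```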
Vulkan
- vkGetPhysicalDeviceMemoryProperties and examining the returned heaps for the total memory.
- optionally, using VkPhysicalDeviceMemoryBudgetPropertiesEXT to take into account existing allocations.
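
On Vulkan, the two bullets above combine into a single vkGetPhysicalDeviceMemoryProperties2 call with the budget struct chained in. A rough sketch, assuming the driver reports VK_EXT_memory_budget (drop the pNext chain otherwise):

```cpp
#include <vulkan/vulkan.h>
#include <cstdio>

// Assumes `physicalDevice` was already selected and that the driver supports
// VK_EXT_memory_budget (check vkEnumerateDeviceExtensionProperties first).
void printHeapBudgets(VkPhysicalDevice physicalDevice) {
    VkPhysicalDeviceMemoryBudgetPropertiesEXT budget{};
    budget.sType = VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES_EXT;

    VkPhysicalDeviceMemoryProperties2 props{};
    props.sType = VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_MEMORY_PROPERTIES_2;
    props.pNext = &budget;

    vkGetPhysicalDeviceMemoryProperties2(physicalDevice, &props);

    for (uint32_t i = 0; i < props.memoryProperties.memoryHeapCount; ++i) {
        const VkMemoryHeap& heap = props.memoryProperties.memoryHeaps[i];
        // heap.size is the total size; heapBudget/heapUsage reflect current conditions.
        std::printf("heap %u: size=%llu budget=%llu usage=%llu device-local=%d\n", i,
                    (unsigned long long)heap.size,
                    (unsigned long long)budget.heapBudget[i],
                    (unsigned long long)budget.heapUsage[i],
                    (heap.flags & VK_MEMORY_HEAP_DEVICE_LOCAL_BIT) ? 1 : 0);
    }
}
```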
DirectX
I'm not as familiar with DirectX, but I think something like DXGI_ADAPTER_DESC3 would expose enough information to get a rough idea.
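
For completeness, a rough sketch of the DXGI side: DXGI_ADAPTER_DESC3 has the static sizes, and IDXGIAdapter3::QueryVideoMemoryInfo additionally reports the OS-managed budget and current usage, which would cover the "existing allocations" case. This assumes Windows 10 1803+ for IDXGIFactory6:

```cpp
#include <dxgi1_6.h>
#include <wrl/client.h>
#include <cstdio>

#pragma comment(lib, "dxgi.lib")

using Microsoft::WRL::ComPtr;

int main() {
    ComPtr<IDXGIFactory6> factory;
    if (FAILED(CreateDXGIFactory1(IID_PPV_ARGS(&factory)))) return 1;

    ComPtr<IDXGIAdapter4> adapter;
    if (FAILED(factory->EnumAdapterByGpuPreference(
            0, DXGI_GPU_PREFERENCE_HIGH_PERFORMANCE, IID_PPV_ARGS(&adapter))))
        return 1;

    // Static adapter memory sizes (dedicated VRAM vs. shared system memory).
    DXGI_ADAPTER_DESC3 desc{};
    adapter->GetDesc3(&desc);
    std::printf("dedicated VRAM: %llu bytes, shared system memory: %llu bytes\n",
                (unsigned long long)desc.DedicatedVideoMemory,
                (unsigned long long)desc.SharedSystemMemory);

    // OS-provided budget/usage for the local (device-attached) segment,
    // roughly analogous to VK_EXT_memory_budget.
    DXGI_QUERY_VIDEO_MEMORY_INFO info{};
    if (SUCCEEDED(adapter->QueryVideoMemoryInfo(0, DXGI_MEMORY_SEGMENT_GROUP_LOCAL, &info)))
        std::printf("budget: %llu bytes, current usage: %llu bytes\n",
                    (unsigned long long)info.Budget,
                    (unsigned long long)info.CurrentUsage);
    return 0;
}
```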