API to get performance status/information about GPU/CPU of instance #9658

Open
trollkarlen opened this issue Mar 11, 2025 · 2 comments
Labels
feature request (New feature or request)

Comments

@trollkarlen

This is a feature request, but it may help mitigate instances that break due to current and future bugs.

I use a proxy in front of my ollama instances to support multiple users/requests.
But sometimes the ollama server loses the connection with the GPU, and then performance drops a lot.

This sometimes happens due to the cgroup issue that can be mitigated in docker's daemon.json:

"exec-opts": [
        "native.cgroupdriver=cgroupfs"
    ]
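
For completeness, a daemon.json containing only this option would look like the following (assuming no other daemon settings are in use; the file typically lives at /etc/docker/daemon.json, and the Docker daemon needs a restart, e.g. sudo systemctl restart docker, for the change to take effect):

{
    "exec-opts": [
        "native.cgroupdriver=cgroupfs"
    ]
}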

And sometimes due to other reasons (GPU hangs, etc.).

So it would be nice to be able to query the instance through the API to get the CPU/GPU performance of the node. This way the proxy can measure the performance of the instances and detect when performance declines. This data could be used by a proxy and/or a client.
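
As a rough sketch of what I have in mind (the endpoint and field names below are purely hypothetical, nothing like this exists in the current API), a response along these lines would already let a proxy spot a node that has fallen back to CPU:

GET /api/status   (hypothetical)

{
    "gpu_count": 2,
    "gpus_available": 2,
    "gpu_vram_total_mb": 49152,
    "gpu_vram_free_mb": 20480,
    "cpu_load": 0.35
}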

This feature request (#2004) is on the same theme and maybe could be combined with this one, to have one place to get information about the node.

trollkarlen added the feature request label Mar 11, 2025
@rick-github
Collaborator

#3144

WRT the GPU hanging, it might be that the model has lost coherence and is "rambling". This can be limited by setting num_predict.
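
For example, num_predict can be set per request via the options field of /api/generate (model name and value here are just illustrative), or persistently in a Modelfile with PARAMETER num_predict:

POST /api/generate

{
    "model": "llama3",
    "prompt": "Why is the sky blue?",
    "options": {
        "num_predict": 256
    }
}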

@trollkarlen
Author

#3144

Looks like the number of GPUs and GPUs available is missing from the metrics, but I guess it can be added.
Also it looks like that feature request is stalled :/

WRT the GPU hanging, it might be that the model has lost coherence and is "rambling". This can be limited by setting num_predict.

Thanks, will look into num_predict and see if it helps mitigate the issue with offline GPUs.
