Function
filter
v1.0.0
Llama.cpp Server cache_prompt
Function ID
llama_server_cache_prompt
Creator
@drunnells
Downloads
101+
Adds the cache_prompt option to llama-server completion requests, letting the server reuse its KV cache for the shared prompt prefix when supported, which speeds up completions in long conversations against a llama.cpp server. See: https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md#post-completion-given-a-prompt-it-returns-the-predicted-completion
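A minimal sketch of how a filter like this might work, assuming the standard Open WebUI filter shape (a `Filter` class whose `inlet` method receives the outgoing request body as a dict); the actual implementation by @drunnells may differ:

```python
class Filter:
    """Sketch: inject cache_prompt into requests bound for llama-server."""

    def inlet(self, body: dict) -> dict:
        # Setting cache_prompt asks llama-server to keep the KV cache
        # from the previous request and reuse it for the part of the
        # prompt that has not changed (the shared prefix).
        body["cache_prompt"] = True
        return body


if __name__ == "__main__":
    # Usage: Open WebUI calls inlet() on each request body before sending it.
    request = {"prompt": "Hello", "n_predict": 32}
    patched = Filter().inlet(request)
    print(patched["cache_prompt"])
```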
README
No README available