Function
filter
v0.1.0
Gemini Vision for Text LLM
Last Updated
5 days ago
Created
5 days ago
Function ID
gemini_vision_for_text_llm
Creator
@mmie
Downloads
40+
Description
an Open WebUI inlet filter that converts images in user messages into structured, lossless text descriptions using Google Gemini Vision (exact transcriptions, detailed visual elements, and brief technical interpretation; LaTeX for detected formulas). Configurable via valves (API key, model, max images, caching). If no API key is set, images are removed with a clear warning.
README