Configure Danswer to use a Custom Model Server via requests
Danswer can also make requests to an arbitrary model server via REST requests. Optionally an access token can be passed in. To customize the request format and handling of the response, it may be necessary to update/rebuild the Danswer containers.
There’s a very minimal interface to be implemented which can support any arbitrary LLM Model Server. Simply update the code here and rebuild.
The default implementation is compatible with the blog demo shown below.
Llama-2-13B-chat-GGML
using a custom FastAPI Server.