Configure Danswer to use GPT4All models in memory
This requires the line
gpt4all==2.0.2
to be active (not commented out) in
danswer/backend/requirements/default.txt, and then rebuilding so that GPT4All is installed.
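Before rebuilding, it can help to confirm that the requirement is actually active in the file. The following is a minimal sketch, not part of Danswer: the helper name find_active_pin is hypothetical, and the script assumes it is run from the directory that contains the danswer/ checkout.

```python
import re
from pathlib import Path


def find_active_pin(requirements_text, package="gpt4all"):
    """Return the first uncommented requirement line for `package`, or None.

    Hypothetical helper for illustration; it is not part of Danswer or GPT4All.
    """
    for raw in requirements_text.splitlines():
        # Drop anything after '#', so commented-out lines become empty.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        # The package name is everything before the first version specifier,
        # extras bracket, environment marker, or space.
        name = re.split(r"[=<>!~\[; ]", line, maxsplit=1)[0]
        if name.lower() == package.lower():
            return line
    return None


if __name__ == "__main__":
    # Path taken from the text above; assumed to be relative to the current directory.
    path = Path("danswer/backend/requirements/default.txt")
    if path.exists():
        pin = find_active_pin(path.read_text())
        print(pin if pin else "gpt4all is not active in default.txt")
    else:
        print(f"{path} not found; run this from the directory containing danswer/")
```

If the script reports that gpt4all is not active, edit the requirements file first and then rebuild.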
Note: Although GPT4All offers quantized models, running them is still significantly slower than running models fully hosted on GPUs.
If you’re running the models purely on CPU, there may be a significant delay in processing the context documents and in
generating answers.