# vLLM

## Nginx

Make sure your proxies all have a long timeout:

```nginx
proxy_read_timeout 300;
proxy_connect_timeout 300;
proxy_send_timeout 300;
```

The LLM middleware has a request timeout of 95 seconds, so these longer proxy timeouts leave headroom and keep the proxy from dropping requests first.
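
For context, a full proxy block with these timeouts might look like the sketch below. The upstream address (`127.0.0.1:8000`) and the hostname are assumptions for illustration, not part of the original config; adjust them to wherever your vLLM server is actually listening.

```nginx
# Minimal reverse-proxy sketch. 127.0.0.1:8000 is an assumed
# vLLM listen address -- change it to match your deployment.
server {
    listen 80;
    server_name llm.example.com;  # hypothetical hostname

    location / {
        proxy_pass http://127.0.0.1:8000;
        proxy_set_header Host $host;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;

        # Long timeouts so slow or streaming completions are not cut off
        proxy_read_timeout 300;
        proxy_connect_timeout 300;
        proxy_send_timeout 300;
    }
}
```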

## Model Preparation

Make sure the max length in your model's `tokenizer_config.json` (e.g. `model_max_length: 4096`) is set equal to or greater than your token limit.
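
For example, with a 4096-token limit the relevant entry in `tokenizer_config.json` looks like this (other fields omitted):

```json
{
  "model_max_length": 4096
}
```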