r/LocalLLaMA 16h
Qwen3.6 27B and llama.cpp appreciation post
To preface, here's my config: llama-server \ --host 0.0.0.0 \ --port 1235 \ --models-preset %h/Software/models.ini \ --models-max 1 \ --sleep-idle-seconds 3600 \ --timeout 3600 \ -
Hey folks, just a heads-up for anyone running Qwen3.6 through `llama-server`. I ran into an issue where the `preserve_thinking` parameter wasn't working as expected, even though I had it explicitly enabled in my `models.ini` config. After some digging, I found that **extra spaces in the JSON string are breaking the parser** for this specific parameter in my build. ❌ **Does NOT work:** `chat-te
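
If you build that JSON value from a script rather than typing it by hand, it may be easier to emit it with no whitespace at all. Here is a minimal sketch in Python, assuming the value is a JSON object carrying a `preserve_thinking` boolean. The dict contents are an illustration only, and whether a given `llama-server` build is sensitive to the spaces is something to check on your own setup.

```python
import json

# Hypothetical kwargs object: only the `preserve_thinking` name comes from
# the post; the True value is an assumption for illustration.
kwargs = {"preserve_thinking": True}

# json.dumps defaults to ", " and ": " as separators, so the result has spaces.
spaced = json.dumps(kwargs)

# Passing compact separators removes all whitespace between tokens.
compact = json.dumps(kwargs, separators=(",", ":"))

print(spaced)
print(compact)
```

Both strings parse to the same object, so the compact form is a safe drop-in wherever the spaced one was used.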