brucethemoose, 5 hours ago

llama.cpp, the underlying engine, doesn’t support extended RoPE yet. In practice this means long context doesn’t work, and short context could be messed up too.

I am also hearing rumblings of a messed-up chat template?

With any LLM in any UI that uses a GGUF, you have to watch carefully for bugs you wouldn’t get in the Hugging Face-based backends. A lot of models run without errors, but their output is not quite right.
