Qwen2.5-Coder-7B

@[email protected] · 2 months ago

Qwen2.5-Coder-7B

@[email protected] · 2 months ago

I have found the problem with the cut off, by default aider only sends 2048 tokens to ollama, this is why i have not noticed it anywhere else except for coding.

When running /tokens in aider:

$ 0.0000   16,836 tokens total
           15,932 tokens remaining in context window
           32,768 tokens max context window size

Even though it will only send 2048 tokens to ollama.

To fix it i needed to add a file .aider.model.settings.yml to the repository:

- name: aider/extra_params
  extra_params:
    num_ctx: 32768

@brucethemoose · 2 months ago

That’s because ollama’s default max ctx is 2048, as far as I know.