Skip to content

Claude Code 32000 Output Token Error #36

@binihui134

Description

@binihui134

Unless I specifically use nemotron-3-super:cloud (and probably other cloud models), offline models just do not work. First off I tried Gemma 4, it said 4 things to me

  1. I
  2. Starting
  3. To
  4. Under

(and other times i tried it gave me more random words)
then it just throws a

API Error: Claude's response exceeded the 32000 output token maximum. To configure this behavior, set the
     CLAUDE_CODE_MAX_OUTPUT_TOKENS environment variable.
``` at me. Right now I am testing qwen3.5:9b

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions