Anthropic Claude

KhunQuant connects to Anthropic via the official anthropic-sdk-go. The anthropic/ provider implements the LLMProvider interface and additionally implements ThinkingCapable for extended thinking support.

Configuration

Add to ~/.khunquant/config.json:

{
  "model_list": [
    {
      "model_name": "claude",
      "model": "claude-sonnet-4-6",
      "api_key": "ANTHROPIC_API_KEY"
    }
  ]
}

Add the API key to ~/.khunquant/.security.yml:

model_list:
  - model_name: claude
    api_key: "sk-ant-..."

Extended thinking

Claude models support extended thinking, which lets the model reason step-by-step before producing a final response. Configure the thinking level per model:

`thinking_level`	Description
`"none"`	No extended thinking (default)
`"low"`	Brief thinking budget
`"high"`	Maximum thinking budget

{
  "model_list": [
    {
      "model_name": "claude-think",
      "model": "claude-opus-4-7",
      "api_key": "ANTHROPIC_API_KEY",
      "thinking_level": "high"
    }
  ]
}

Fallback chain example

Configure a primary model with a cheaper fallback:

{
  "agents": {
    "defaults": {
      "model_name": "claude",
      "model_fallbacks": ["claude-haiku"]
    }
  },
  "model_list": [
    { "model_name": "claude",       "model": "claude-sonnet-4-6", "api_key": "ANTHROPIC_API_KEY" },
    { "model_name": "claude-haiku", "model": "claude-haiku-4-5-20251001",  "api_key": "ANTHROPIC_API_KEY" }
  ]
}

The fallback chain automatically retries on rate limits, overload, or context overflow errors.

Configuration​

Extended thinking​

Fallback chain example​

Configuration

Extended thinking

Fallback chain example