Multi-Model Support
Hebb Mind supports multiple LLM providers through LiteLLM. This enables memory consolidation with any major language model.
Supported Providers
| Provider | Model Example | Configuration |
|---|---|---|
| OpenAI | openai/gpt-4o-mini | llm_api_key |
| OpenAI | openai/gpt-4o | llm_api_key |
| Anthropic | anthropic/claude-3-haiku-20240307 | llm_api_key |
| Anthropic | anthropic/claude-3-5-sonnet-20241022 | llm_api_key |
| Qwen (Alibaba) | openai/qwen-plus | llm_api_key + llm_base_url |
| GLM (Zhipu) | openai/glm-4 | llm_api_key + llm_base_url |
| Kimi (Moonshot) | openai/moonshot-v1-8k | llm_api_key + llm_base_url |
Configuration
OpenAI
hebb config set llm_model openai/gpt-4o-mini
hebb config set llm_api_key sk-your-openai-keyAnthropic
hebb config set llm_model anthropic/claude-3-haiku-20240307
hebb config set llm_api_key sk-ant-your-anthropic-keyQwen (Alibaba Cloud)
hebb config set llm_model openai/qwen-plus
hebb config set llm_api_key sk-your-qwen-key
hebb config set llm_base_url https://dashscope.aliyuncs.com/compatible-mode/v1GLM (Zhipu AI)
hebb config set llm_model openai/glm-4
hebb config set llm_api_key your-zhipu-key
hebb config set llm_base_url https://open.bigmodel.cn/api/paas/v4Kimi (Moonshot AI)
hebb config set llm_model openai/moonshot-v1-8k
hebb config set llm_api_key sk-your-moonshot-key
hebb config set llm_base_url https://api.moonshot.cn/v1How It Works
For Chinese model providers (Qwen, GLM, Kimi), the openai/ prefix tells LiteLLM to use the OpenAI-compatible API format. The llm_base_url points to the provider's endpoint. This works because these providers implement the OpenAI chat completion API specification.
Embedding Model
The embedding model runs locally via sentence-transformers. No external API calls are needed for generating embeddings. hebb setup selects the default model by content language:
- English:
BAAI/bge-large-en-v1.5 - Chinese or multilingual:
BAAI/bge-m3
Download region is independent from language. Use hebb setup --language en --region cn for English content on a China network, or hebb setup --language zh --region global for Chinese content on a global network.
To swap to a different model after first install:
hebb config set embedding_model "paraphrase-multilingual-MiniLM-L12-v2"
hebb config set embedding_dim 384
hebb service restart
hebb memory reembed # required if the dimension changedTIP
Changing the embedding dimension invalidates all stored vectors — the vector table is auto-reset on next startup. Run hebb memory reembed afterwards to repopulate. See Switch the Embedding Model for the full walkthrough (CLI, Web Console, and re-embed details).
This means:
- Embedding is free -- no API costs
- Low latency -- no network round-trip
- Privacy -- your text never leaves the machine for embedding
- Offline capable -- works without internet after initial download
The embedding model is separate from the LLM model. You can use any LLM provider while keeping the local embedding model.
Testing Your Configuration
After configuring a model, test the connection:
curl -X POST http://localhost:8321/api/v1/config/test-llmThis sends a simple test request to verify that the API key and endpoint are working correctly.
Choosing a Model
For memory consolidation, smaller and faster models work well since the task involves classification and summarization rather than creative generation. Recommended starting points:
- Budget-conscious:
openai/gpt-4o-minioropenai/qwen-plus - Higher quality:
openai/gpt-4ooranthropic/claude-3-5-sonnet-20241022 - Chinese-language memories:
openai/qwen-plusoropenai/glm-4