Skip to content

feat: switch from qwen-coder to GLM-4.7-Flash MLX, remove Ollama entirely#332

Open
funkymonkeymonk wants to merge 14 commits into
mainfrom
feat/glm47-flash-mlx
Open

feat: switch from qwen-coder to GLM-4.7-Flash MLX, remove Ollama entirely#332
funkymonkeymonk wants to merge 14 commits into
mainfrom
feat/glm47-flash-mlx

Conversation

@funkymonkeymonk

Copy link
Copy Markdown
Owner

Summary

Replaces all qwen-coder MLX models on MegamanX with GLM-4.7-Flash (all 3 quants: 4bit, 6bit, 8bit). Removes Ollama service, llm-host role, and all ollama configurations across all targets.

Changes

GLM-4.7-Flash MLX switch

  • MegamanX: Added glm47-flash-4bit (fast), glm47-flash-6bit (balanced), glm47-flash-8bit (quality), kept qwen-embed
  • OpenCode: Default model → higgs/glm47-flash-4bit, provider models updated
  • Vane: Chat model list updated to all 3 GLM variants
  • Pi: Model reference updated to glm47-flash-4bit

Ollama removed (all targets)

  • Role: Deleted llm-host role (only enabled Ollama)
  • Service: Deleted modules/services/ollama/ (darwin, nixos, common)
  • Options: Removed myConfig.ollama and myConfig.roles.llm-host
  • Targets: MegamanX, wweaver, darwin-server — all ollama config removed
  • Profiles: monkey, wweaver — ollama providers and llm-host role removed
  • Tests: All ollama/llm-host tests removed
  • Flake: All ollama service imports removed across 8 configs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant