ollama run frob/glm-5.1:744b-a40b-ud-q8_0
Updated 9 hours ago
9 hours ago
080ea071a15e · 801GB ·
Imported from hf.co/unsloth/GLM-5.1-GGUF.
Note that ollama does not yet support this model. To run it, ollama needs to be patched with #14864.
$ git clone https://github.com/ollama/ollama.git .
$ git checkout v0.20.0
$ curl -L https://github.com/ollama/ollama/pull/14864.diff | patch -p1
$ docker build -t ollama/ollama:0.20.0-14864 .
This model wants to use the tool format it was trained with and resists using the simpler JSON format that is easily encoded in an ollama template. For this reason, this model is not a good tool user. Better tool use will be enabled with an ollama PARSER.
$ ollama run frob/glm-5.1
>>> Why don't scientists trust atoms?
Thinking...
1. **Analyze the Request:** The user is asking a classic riddle/joke: "Why don't
scientists trust atoms?"
2. **Identify the Intent:** The intent is humor/wordplay based on a well-known
scientific joke.
3. **Retrieve Knowledge:** Access the punchline for this specific joke. The
standard punchline is "Because they make up everything!"
4. **Formulate Response:** Deliver the punchline clearly and concisely, perhaps
with a slight playful tone since it's a joke.
...done thinking.
Because they make up everything!
$ ollama run frob/glm-5.1 hello. --think=false
Hello! How can I help you today? 😊