Ollama release candidate v0.30.9-rc1 updates the bundled llama.cpp library to tag b9637. The change was contributed by @jmorganca in PR #16609. Relative to v0.30.8, the release is a minor dependency bump.
The b9660 release of llama.cpp includes a targeted fix for a double-escaping bug in LFM2 tool-call parsing (issue #24667). Escape test cases were added to verify the correction. The release provides pre-built binaries across numerous platforms including macOS Apple Silicon, Linux, Windows, and Android variants, with various GPU backends. No other functional changes are noted.
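To make the class of bug concrete, here is a minimal Python sketch of double-escaping and of the kind of round-trip escape test that catches it. It is a generic illustration only, not the LFM2 parser code from llama.cpp; the names `encode_argument`, `encode_argument_twice`, `ESCAPE_CASES`, and `round_trips` are hypothetical.

```python
import json


def encode_argument(value: str) -> str:
    """Escape a string argument exactly once, as a JSON string literal."""
    return json.dumps(value)


def encode_argument_twice(value: str) -> str:
    """Buggy variant (assumed for illustration): escapes an already-escaped literal again."""
    return json.dumps(json.dumps(value))


# Hypothetical escape cases: quotes, newlines, backslashes, tabs.
ESCAPE_CASES = ['say "hi"', "line1\nline2", "back\\slash", "tab\there"]


def round_trips(encode, value: str) -> bool:
    """A single decode must recover the original value."""
    return json.loads(encode(value)) == value


if __name__ == "__main__":
    for case in ESCAPE_CASES:
        print(repr(case), round_trips(encode_argument, case), round_trips(encode_argument_twice, case))
```

With single escaping, one decode recovers the original string. With double escaping, one decode yields the escaped literal instead, so the round-trip check fails for every case.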
The llama.cpp project released tag b9659, which fixes a bug in the mtmd component that miscounted n_tokens (PR #24656). The release also provides pre-built binaries for a wide range of platforms: macOS (ARM64, Intel), Linux (x64, ARM64, and s390x, with Vulkan, ROCm, OpenVINO, and SYCL backends), Android (ARM64), and Windows (x64 and ARM64, with CUDA 12/13, Vulkan, SYCL, and HIP backends). The KleidiAI-enabled macOS Apple Silicon build is marked as disabled; the iOS XCFramework artifact is available.
The llama.cpp project released build b9658. A key change improves chat debugging: on parse errors, the debug output now includes the full unparsed prompt. The release also provides pre-built binaries for many platforms, including macOS (Apple Silicon, Intel), Linux (CPU, Vulkan, ROCm, OpenVINO, SYCL), Android (arm64 CPU), and Windows (CPU, CUDA, Vulkan, SYCL, HIP). The KleidiAI-enabled macOS Apple Silicon build is currently disabled in this release.
llama.cpp release b9656 hardens PEG-native tool-call parsing. The parser now accepts an optional leading "type":"function" field to accommodate OpenAI-style tool-call serialization; this lenient handling is gated behind an analysis flag. On a final parse failure, the parser returns a clean error and logs the unparsed fragment instead of throwing raw internal state. When the arguments string is not valid JSON, it is preserved as a raw string so that prompt rendering does not abort. Parse failures now surface with clearer error messages, eliminating silent empty assistant turns.
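The behaviors described for b9656 can be sketched in a few lines of Python. This is an illustration under stated assumptions, not llama.cpp's implementation (which is a C++ PEG parser): the flat JSON shape of the tool call and the names `parse_tool_call`, `ToolCallParseError`, and `lenient_type_field` are all hypothetical.

```python
import json


class ToolCallParseError(ValueError):
    """Readable error that carries the unparsed fragment (hypothetical type)."""


def parse_tool_call(text: str, lenient_type_field: bool = True) -> dict:
    """Parse one tool call from an assumed flat JSON object with 'name' and 'arguments'."""
    try:
        obj = json.loads(text)
    except json.JSONDecodeError as exc:
        # Clean error with the offending text, rather than leaking parser internals.
        raise ToolCallParseError(
            f"tool call is not valid JSON; unparsed fragment: {text!r}"
        ) from exc

    if not isinstance(obj, dict) or "name" not in obj:
        raise ToolCallParseError(
            f"tool call has no 'name'; unparsed fragment: {text!r}"
        )

    # Optional "type":"function" field, accepted only when the (assumed) flag is on.
    if "type" in obj and not (lenient_type_field and obj["type"] == "function"):
        raise ToolCallParseError(
            f"unexpected 'type' field; unparsed fragment: {text!r}"
        )

    args = obj.get("arguments", {})
    if isinstance(args, str):
        try:
            args = json.loads(args)
        except json.JSONDecodeError:
            pass  # Not valid JSON: keep the raw string instead of aborting.

    return {"name": obj["name"], "arguments": args}
```

A caller can catch `ToolCallParseError` and report the message, so a failed parse never turns into a silent empty assistant turn. Keeping the raw arguments string lets downstream rendering continue even when the model emitted malformed arguments.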
The llama.cpp project released tag b9655, which fixes an 'oldie but goodie' grammar generator bug in the chat feature that surfaced during recent changes (PR #24653). Additionally, an erroneous case in the PEG parser test was updated. The release provides pre-built binaries for a wide range of platforms including macOS (Apple Silicon, Intel, KleidiAI), Linux (x64, arm64, s390x, Vulkan, ROCm, OpenVINO, SYCL), Android (arm64), and Windows (x64, arm64, CUDA 12/13, Vulkan, SYCL, HIP). openEuler builds and UI components are also included.