Thinkgap feed

AI signal, minus the noise.

Curated items are read from the processed items table and served as a bilingual feed.

80 items

GITHUBJun 15, 2026

Ollama v0.30.9-rc1: llama.cpp Updated to Commit b9637

Ollama release candidate v0.30.9-rc1 updates the underlying llama.cpp library to commit b9637. This change was contributed by @jmorganca in PR #16609. The release is a minor dependency bump from the previous v0.30.8 version.

GITHUBJun 15, 2026

llama.cpp b9660: Fix LFM2 Tool-Call Parsing Double-Escaping

The b9660 release of llama.cpp includes a targeted fix for a double-escaping bug in LFM2 tool-call parsing (issue #24667). Escape test cases were added to verify the correction. The release provides pre-built binaries across numerous platforms including macOS Apple Silicon, Linux, Windows, and Android variants, with various GPU backends. No other functional changes are noted.

GITHUBJun 15, 2026

llama.cpp Release b9659 Fixes Miscounting of n_tokens in mtmd (#24656)

The llama.cpp project released tag b9659, which includes a bug fix for the mtmd component that was miscounting n_tokens (PR #24656). This release also provides pre-built binaries for a wide range of platforms, including macOS (ARM64, Intel), Linux (x64, ARM64, s390x with Vulkan, ROCm, OpenVINO, SYCL), Android (ARM64), and Windows (x64, ARM64 with CUDA 12/13, Vulkan, SYCL, HIP). Notably, the macOS Apple Silicon build with KleidiAI enabled is marked as disabled, while the iOS XCFramework artifact is available.

GITHUBJun 15, 2026

llama.cpp b9658 Now Includes Full Unparsed Prompt in Debug on Chat Parse Errors

The llama.cpp project released build b9658. A key change improves chat debugging: on parse errors, the debug output now includes the full unparsed prompt. The release also provides pre-built binaries for many platforms, including macOS (Apple Silicon, Intel), Linux (CPU, Vulkan, ROCm, OpenVINO, SYCL), Android (arm64 CPU), and Windows (CPU, CUDA, Vulkan, SYCL, HIP). The KleidiAI-enabled macOS Apple Silicon build is currently disabled in this release.

GITHUBJun 15, 2026

llama.cpp Release b9656 Hardens PEG Tool Call Parsing and Error Handling

llama.cpp release b9656 hardens the PEG-native tool call parsing. It now accepts an optional leading "type":"function" field to accommodate OpenAI-style tool call serialization. On a final parse failure, the parser returns a clean error and logs the unparsed fragment instead of throwing raw internal state. The raw arguments string is preserved when it is not valid JSON, preventing an abort of the prompt rendering. Parse failures are surfaced with clearer error messages, eliminating silent empty assistant turns. The lenient handling of the "type":"function" field is gated behind an analysis flag.

GITHUBJun 15, 2026

llama.cpp Release b9655 Fixes Long-standing Grammar Generator Bug in Chat

The llama.cpp project released tag b9655, which fixes an 'oldie but goodie' grammar generator bug in the chat feature that surfaced during recent changes (PR #24653). Additionally, an erroneous case in the PEG parser test was updated. The release provides pre-built binaries for a wide range of platforms including macOS (Apple Silicon, Intel, KleidiAI), Linux (x64, arm64, s390x, Vulkan, ROCm, OpenVINO, SYCL), Android (arm64), and Windows (x64, arm64, CUDA 12/13, Vulkan, SYCL, HIP). openEuler builds and UI components are also included.