ReposSource: GITHUBJune 16, 2026Importance: 2/5

llama.cpp b9658 Now Includes Full Unparsed Prompt in Debug on Chat Parse Errors

English summary

The llama.cpp project released build b9658. A key change improves chat debugging: on parse errors, the debug output now includes the full unparsed prompt. The release also provides pre-built binaries for many platforms, including macOS (Apple Silicon, Intel), Linux (CPU, Vulkan, ROCm, OpenVINO, SYCL), Android (arm64 CPU), and Windows (CPU, CUDA, Vulkan, SYCL, HIP). The KleidiAI-enabled macOS Apple Silicon build is currently disabled in this release.

Chinese summary

llama.cpp 项目发布了 build b9658。一项重要改进是增强了聊天调试功能：当发生解析错误时，调试信息现在会包含完整的未解析提示。该版本仍提供面向多个平台的预编译二进制文件，包括 macOS（Apple Silicon 和 Intel）、Linux（CPU、Vulkan、ROCm、OpenVINO、SYCL）、Android（arm64 CPU）以及 Windows（CPU、CUDA、Vulkan、SYCL、HIP）。目前，启用 KleidiAI 的 macOS Apple Silicon 版本被暂时禁用。

Key points

Chat debug now includes the full unparsed prompt on parse errors (PR #24650).
聊天解析错误时，调试输出现在包含完整的未解析提示（PR #24650）。
Pre-built binaries released for macOS, Linux, Android, and Windows across many backends (CPU, CUDA, Vulkan, ROCm, OpenVINO, SYCL, HIP).
发布了面向 macOS、Linux、Android 和 Windows 多个后端（CPU、CUDA、Vulkan、ROCm、OpenVINO、SYCL、HIP）的预编译二进制文件。
The KleidiAI-enabled macOS Apple Silicon package is disabled in this build.
本次构建中，启用 KleidiAI 的 macOS Apple Silicon 包被禁用。

Open original