WeaveBench is introduced as a comprehensive benchmark for evaluating computer-use agents (CUAs) operating across hybrid interfaces, requiring both GUI and CLI/code operations. It encompasses 114 long-horizon tasks spanning 8 real-world work domains, all evaluated on a real Ubuntu desktop. The benchmark includes a trajectory-aware judge that inspects agent deliverables and detects shortcut behaviors, addressing limitations of traditional evaluation methods. The PassRate across tested model-runtime pairings is only 41.2%, highlighting a significant performance gap in long-horizon task orchestration.
Slashy is a new AI-native email client and assistant that drafts email replies in the user's own writing style. It automatically triages incoming messages to highlight what matters and ensures no follow-up tasks are forgotten. The tool connects with the user's email, calendar, and CRM services. It was launched on Product Hunt to help professionals reduce time spent in their inbox.
Athenic AI released version 2.0 of its agentic data analyst platform, now available on Product Hunt. The tool lets teams connect business apps, SQL databases, or uploaded files and then query data using plain English. It supports workflow automation and building interactive charts and dashboards to monitor key metrics. The update positions Athenic as a no-code, AI-powered analytics assistant for non-technical users.
SocialSource: XImportance: 3/5
Replit announced a new capability that enables users to run multiple agents in parallel, moving beyond building a single item at a time. Users can now ship a website, mobile app, video, and other artifacts concurrently within the platform. The feature is showcased in a newly released video, though specific technical details are not provided. This represents a shift toward multi-agent workflows for faster, more complex project delivery on Replit.
TutorialsSource: MARKTECHPOSTImportance: 4/5
Databricks released Omnigent, an Apache 2.0-licensed open-source meta-harness that standardizes the interface across terminal coding agents (Claude Code, Codex, Pi) and agent SDKs, turning them into interchangeable components. It adds a shared layer for composition (switching agents with one-line changes), contextual control (e.g., pausing at cost limits, requiring human approval for sensitive git pushes), and collaboration (sharing live agent sessions via URL). The architecture consists of a sandboxed runner with a uniform API and a policy server, and sessions sync across terminal, web UI, and mobile. An OS sandbox (Omnibox) secures credentials by injecting tokens only in approved proxy requests. Two example agents—Polly (a multi-agent coding orchestrator) and Debby (a two-headed brainstorming partner)—illustrate its patterns, and an interactive concept demo shows parallel agent delegation and policy enforcement.
SocialSource: XImportance: 2/5
In June 2026, Tsinghua University's K1 humanoid robots were demonstrated at a shopping mall in Hong Kong, performing Michael Jackson-inspired dance moves and subsequently playing football with children. The showcase highlighted the robots' agility, balance, and ability to interact naturally in a public environment. The event drew public attention to advances in humanoid robotics and human-robot interaction.