LangChain Improves Deep Agents with Harness Engineering
LangChain has demonstrated a significant leap in coding agent performance, moving from Top 30 to Top 5 on Terminal Bench 2.0 through "harness engineer...
LangChain has demonstrated a significant leap in coding agent performance, moving from Top 30 to Top 5 on Terminal Bench 2.0 through "harness engineer...
Alibaba's Qwen has launched the first two models in its Qwen 3.5 series, featuring native multimodal capabilities with vision input. Among them is the...
OpenAI has released GABRIEL, an open-source toolkit designed to assist social scientists. This tool leverages GPT models to convert qualitative data f...
GitHub has launched Agentic Workflows in technical preview, enabling developers to automate various repository tasks using coding agents within GitHub...
Hugging Face is exploring the creation of custom CUDA kernels, leveraging the capabilities of models like Codex and Claude. This initiative aims to en...
OpenAI's GPT-5.2 has reportedly derived a novel formula for a gluon amplitude, a significant development in theoretical physics. This new result was s...
Details in article.
Z.ai has released GLM-5, a substantial new MIT-licensed model featuring 754 billion parameters and 1.51TB of data on Hugging Face, making it twice the...
LangChain has published an analysis of two key patterns for how AI agents connect with sandboxed environments. This exploration is vital for agents th...
A new paper by Damon McMillan delves into 'Structured Context Engineering for File-Native Agentic Systems,' addressing challenging LLM context tasks. ...
Hugging Face has made a preview of Transformers.js v4 available on NPM, signaling an update to their popular library for running transformer models in...
Anthropic has released Claude Opus 4.6, an upgrade to its most advanced AI model, showcasing industry-leading performance across various domains. This...