Friday — July 18, 2025

Mistral updates Le Chat with a Deep Research mode, researchers find that annotators who frequently use LLMs can detect AI-generated text, and a new framework called Mixture-of-Recursions reports state-of-the-art results for Recursive Transformers, while RunAgent introduces a universal AI agent platform for multi-framework deployment.

News

Mistral Releases Deep Research, Voice, Projects in Le Chat

Le Chat has been updated with new features, including Deep Research mode, Voice mode, multilingual reasoning, and organized conversation folders, to help users research more thoroughly and express themselves more naturally. Deep Research mode lets users ask complex questions and receive structured, reference-backed reports. The announcement demonstrates it with a query about upcoming NYSE listings, which returns a comprehensive, organized overview.

All AI models might be the same

Project CETI aims to decode whale speech using AI models, which could potentially allow humans to communicate with whales if a universal language is learned. The article explores the idea that intelligence is equivalent to compression: as AI models get better at compressing data, they also gain a better understanding of the world, and different models may converge on a shared representation of reality.

My favorite use-case for AI is writing logs

The author praises JetBrains' Full Line Code Completion feature in PyCharm, which uses AI to auto-complete log statements, making debugging more efficient and reducing cognitive overhead. The feature is powered by a locally-run model that is small, fast, and optimized for Python, allowing it to provide accurate and concise log completions within a 384-character context window.

Code execution through email: How I used Claude to hack itself

The article "Code Execution Through Email: How I Used Claude to Hack Itself" describes an experiment in which the author used Claude, a large language model (LLM), to hack itself through a well-crafted email. By exploiting the composition of multiple secure components, including a Gmail server and Claude's own capabilities, the author achieved code execution even though no individual component had a vulnerability. The experiment highlights the dangers of compositional risk and the importance of considering context in AI-era security.

The AI bubble today is bigger than the IT bubble in the 1990s

The current AI bubble is larger than the IT bubble of the 1990s, with the top 10 companies in the S&P 500 more overvalued now than they were back then. The claim is based on data as of July 2025 from Bloomberg and Apollo's Chief Economist; it is subject to change and is not intended as investment advice.

Research

People who frequently use ChatGPT for writing tasks can detect AI-generated text

Annotators who frequently use large language models (LLMs) for writing tasks can accurately detect AI-generated text, with a majority vote among five "expert" annotators misclassifying only one out of 300 articles. These expert annotators rely on a combination of specific lexical clues and more complex phenomena, such as formality and originality, to make their determinations, outperforming most commercial and open-source detectors.
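The majority-vote aggregation mentioned above can be sketched in a few lines. This is only an illustration of the general idea, not the study's code; the article names, labels, and votes below are hypothetical.

```python
from collections import Counter


def majority_vote(labels):
    """Return the label chosen by most annotators.

    With an odd-sized panel and two possible labels, a tie cannot occur.
    """
    return Counter(labels).most_common(1)[0][0]


# Hypothetical votes from a panel of five annotators (not data from the study).
votes = {
    "article_a": ["ai", "ai", "human", "ai", "ai"],
    "article_b": ["human", "human", "human", "ai", "human"],
    "article_c": ["ai", "human", "ai", "human", "ai"],
}

decisions = {name: majority_vote(panel) for name, panel in votes.items()}
for name, label in decisions.items():
    print(name, label)
```

Aggregating this way means a single annotator's mistake does not change the final label as long as at least three of the five agree.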

Measuring the Impact of Early-2025 AI on Experienced Developer Productivity

A study of 16 experienced open-source developers found that using AI tools, such as Cursor Pro and Claude, actually increased completion time by 19%, contradicting the developers' own estimates of a 20% reduction and expert predictions of a 38-39% reduction. The study's results suggest that AI tooling may slow down developers, and while the cause is unclear, the effect appears to be robust across various analyses, rather than a result of experimental design flaws.

Which Economic Tasks Are Performed with AI? Evidence from Claude Conversations

Researchers analyzed over four million conversations to study AI usage patterns across the economy, finding that AI is primarily used in software development and writing tasks, but also extends to about 36% of occupations. The analysis reveals that AI is used to both augment human capabilities (57% of usage) and automate tasks (43% of usage), providing insights into AI's evolving role in the economy.

The impact of file position on code review

The presentation order of files in code reviews, typically alphabetical, can bias the outcome, with files shown earlier receiving more comments than those shown later. An experiment with 106 participants also found that the position of a file can impact defect detection, with a 64% lower chance of identifying a defect when its file is last, suggesting that tool design and code review processes should consider the potential effects of file presentation order.

Mixture-of-Recursions: Learning Adaptive Token-Level Computation

The Mixture-of-Recursions (MoR) framework combines parameter sharing and adaptive computation to improve the efficiency of Recursive Transformers, reducing computational and memory demands. MoR achieves state-of-the-art results across various model sizes, offering better performance, lower latency, and higher throughput than existing models, making it a promising approach to achieving large-model quality without incurring large-model costs.
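As a rough illustration of the two ideas named above, parameter sharing and adaptive token-level computation, the toy sketch below reuses one block at every recursion step and lets each token take a different number of steps. It is not the paper's implementation: the `shared_block` and `route_depth` functions, the fixed weights, and the magnitude-based routing rule are all assumptions made for the example, whereas a real router would be learned.

```python
import math

MAX_RECURSIONS = 3  # hypothetical cap on recursion depth


def shared_block(vec, weight=0.5, bias=0.1):
    # One set of parameters reused at every recursion step (parameter sharing).
    return [math.tanh(weight * v + bias) for v in vec]


def route_depth(vec, max_depth=MAX_RECURSIONS):
    # Hypothetical stand-in for a learned router:
    # tokens with larger average magnitude get more recursion steps.
    score = sum(abs(v) for v in vec) / len(vec)
    return max(1, min(max_depth, 1 + int(score)))


def mixture_of_recursions(tokens):
    outputs, depths = [], []
    for vec in tokens:
        depth = route_depth(vec)
        for _ in range(depth):
            vec = shared_block(vec)  # same block applied repeatedly
        outputs.append(vec)
        depths.append(depth)
    return outputs, depths


# Toy "token embeddings" (made-up values).
tokens = [[0.1, -0.2], [1.5, 1.2], [3.0, -2.5]]
outputs, depths = mixture_of_recursions(tokens)
print(depths)
```

In this toy setup, tokens the router scores as "easy" exit after one pass while others loop through the same block again, which is where the compute savings in such a design would come from.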

Code

Show HN: RunAgent; Multi-Framework Agent Deployment and Rust,Go,JS SDKs(+others)

RunAgent is a universal AI agent platform that lets developers write AI agents in Python and access them natively from any language, eliminating REST API complexity and language barriers. The platform provides a comprehensive toolset for deploying agents from any framework and consuming those agents in applications, with features such as native-feeling SDKs, universal deployment, native streaming support, and a framework-agnostic design.

Show HN: Claude‑CMD – A CLI for managing Claude Code commands and workflows

Claude-CMD is a lightweight CLI tool for managing Claude commands, configurations, and workflows, offering features such as an interactive CLI, command discovery, security management, and MCP integration. The tool can be installed globally using npm and provides various command-line options for managing commands, configurations, and projects, with a hierarchical configuration system and support for local and remote command sources.

Show HN: Cursor Autopilot – Control your Cursor chat via Telegram and more

Cursor Autopilot is an extension that allows remote control of Cursor AI coding sessions via Telegram, Gmail, and Feishu, enabling users to receive notifications and inject replies to continue or stop the coding session. The extension can be installed from the Extensions Marketplace or manually, and its configuration involves setting up adapter settings in a .autopilot.json file to enable remote communication with the Cursor AI editor.

Why Your AI Coding Assistant Keeps Suggesting Dead Code (and How We Fixed It)

CoachNTT.ai is an AI development assistant that prioritizes safety through a mandatory abstraction framework, which is meant to prevent the leakage of sensitive project details and to validate content at every layer. According to the project, this approach enables enterprise AI adoption, regulatory compliance, and safe knowledge sharing, so that organizations can use AI assistance while keeping project information secure.

Second Me: an open-source prototype where you craft your own AI self

Second Me is an open-source, AI-native memory prototype that allows users to create and train their own AI self, preserving their identity, context, and interests, with the goal of amplifying human individuality rather than erasing it. The platform enables users to train, scale, and connect their AI self, while maintaining 100% privacy and control, and is continuously evolving with new features and improvements, including version control, continuous training pipelines, and cloud solutions.