Friday — June 20, 2025

Andrej Karpathy predicts LLMs will revolutionize software, MIT finds AI chatbot use reduces brain activity, and ProtoReasoning framework enhances reasoning in Large Reasoning Models.


News

Andrej Karpathy: Software in the era of AI [video]

Andrej Karpathy discusses how software is undergoing a fundamental change with the emergence of Large Language Models (LLMs), which can be programmed in English and have properties of utilities, fabrication labs, and operating systems. He believes LLMs are a new kind of computer that will revolutionize the way we interact with software, making it more accessible and enabling the creation of partially autonomous products.

From LLM to AI Agent: What's the Real Journey Behind AI System Development?

Not every AI system needs to be an autonomous agent: simpler, more cost-effective solutions are sufficient for many real-world use cases. LLM-based systems range from pure LLMs through RAG, tool use, and AI workflows to full AI agents, each with its own strengths, and the right architecture depends on the specific requirements of the task.

Brain activity much lower when using AI chatbots, MIT boffins find

Researchers at MIT found that using AI chatbots, such as ChatGPT, to complete tasks like writing essays results in significantly lower brain activity compared to completing the same tasks without assistance. The study, which used EEG headsets to measure brain activity, also found that participants who relied on AI chatbots performed worse on recall tests and had poorer fact retention, suggesting that over-reliance on AI may hinder learning and cognitive development.

Contra Ptacek's Terrible Article on AI

The author strongly disagrees with an article titled "My AI Skeptic Friends Are All Nuts" by Thomas Ptacek, calling it poorly reasoned and full of "half-baked ideas" and "trash-tier ethics". The author criticizes Ptacek's piece for dismissing skepticism of AI as outdated and for failing to demonstrate the benefits of AI, instead relying on marketing hype and breathless enthusiasm from its proponents.

Show HN: On-Demand H200 GPU Cloud from $0.79/HR

HPC-AI.COM offers on-demand access to NVIDIA H200 and H100 GPUs starting at $1.89/GPU/hour, and advertises strong storage and network performance, including InfiniBand support, positioning itself as a cost-effective option for high-performance computing. Researchers and AI startups can receive vouchers ranging from $100 to $1,500 when topping up their accounts with $1,000 to $10,000, valid for 3 to 5 months.

Research

The Impact of Generative AI on Social Media: An Experimental Study

A controlled experiment studied the impact of generative AI tools on user behavior and experience in a social media context. It found that while AI tools can increase engagement and content volume, they can also decrease the perceived quality and authenticity of discussions. Based on these findings, the authors propose four design principles to guide the ethical and effective integration of AI into social media: transparent disclosure, user-focused personalization, context-sensitivity, and intuitive user interfaces.

Who is using AI to code? Global diffusion and impact of generative AI

Adoption of generative AI coding tools is increasing: an estimated 30.1% of Python functions from US contributors were written by AI by December 2024. This adoption is leading to productivity gains, with a 2.4% increase in quarterly commits. The annual value of AI-assisted coding in the US is estimated at $9.6-$14.4 billion, potentially rising to $64-$96 billion with higher productivity effects, and the tools are also driving learning and innovation among programmers.

Wanting to Be Understood Explains the Meta-Problem of Consciousness

We create external representations like language and art to convey our inner states and make our experiences accessible to others, but these representations can never fully capture the richness of our raw experiences. The drive to be understood and the impossibility of fully conveying the feel of experience fuel the "hard problem of consciousness," but they also motivate us to continually create new ways to communicate and think about our experiences.

Sekai: A Video Dataset Towards World Exploration

The Sekai dataset is a large, high-quality collection of first-person view videos from over 100 countries, featuring over 5,000 hours of footage with rich annotations for world exploration. The dataset is used to train YUME, an interactive video world exploration model, and is expected to benefit research in video generation and world exploration.

ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

Recent advances in Large Reasoning Models (LRMs) have shown impressive cross-domain generalization capabilities, which researchers believe may be due to shared abstract reasoning prototypes that capture fundamental problem-solving patterns. The proposed ProtoReasoning framework, which leverages these prototypes, has been shown to improve reasoning abilities in various tasks, including logical reasoning, planning, and mathematics, with experiments demonstrating significant improvements over baseline models.

Code

Show HN: EnrichMCP – A Python ORM for Agents

EnrichMCP is a Python framework that enables AI agents to understand and navigate data by adding a semantic layer to the data model, allowing for typed, discoverable tools and automatic schema discovery. It provides features such as relationship navigation, type safety and validation, mutability and CRUD operations, pagination, and context and authentication, making it a powerful tool for building AI-navigable data layers.

Show HN: Tool to Automatically Create Organized Commits for PRs

Git Smart Squash is a tool that uses AI to transform messy commit histories into clean, logical commits by analyzing the entire diff and grouping related changes together. It can be installed and set up in minutes, and provides options for local or cloud-based AI processing, with features such as dry runs, backup branches, and easy recovery to ensure safe and controlled use.

Claudemacs: AI Pair Programming with Claude Code in Emacs

Claudemacs is an Emacs package that enables pair programming with Claude Code, providing features such as project-based sessions, system notifications, and terminal fixes. The package can be installed through various methods, including Doom Emacs, use-package, and straight.el, and requires the Claude Code CLI and eat package to be installed, with setup and customization options available for keybindings, fonts, and system notifications.

Show HN: Run your favorite open-source AI coding agent in the browser

Newrev is a web-based interface for terminal-based AI coding agents that lets users run open-source agents in the browser, with features such as chat modes, basic IDE functionality, and live preview. The project is experimental and in active development, focuses on being lightweight, portable, and customizable, and welcomes contributions, bug reports, and feature requests from the community.

opencode - AI coding agent built for the terminal

OpenCode is a terminal-based AI assistant for developers that provides coding assistance directly in the terminal through a Terminal User Interface (TUI) that interacts with various AI models. The project is in early development and not yet ready for production use; its features include an interactive TUI, support for multiple AI providers, session management, and tool integration.