Wednesday May 21, 2025

Google's new Gemma 3n AI model aims for fast on-device experiences, "is-even-ai" humorously checks even numbers with GPT-3.5, and Robin discovers a novel treatment for a leading cause of blindness.

News

Veo 3 and Imagen 4, and a new tool for filmmaking called Flow

Google has introduced new generative media models and tools: Veo 3 for video, Imagen 4 for images, and Flow, a filmmaking tool that gives users advanced control over cinematic output. Google says the aim is to help artists and other creators bring their vision to life.

Deep Learning Is Applied Topology

Topology, loosely the study of shapes and surfaces under continuous deformation, offers a useful lens on deep learning: a network bends and stretches the space its data lives in until the classes can be separated. The article describes neural networks as "topology generators" that learn to represent data in high-dimensional spaces, building surfaces on which similar data points sit close together. It goes on to argue that, in high enough dimensions, this process amounts to a form of reasoning that is indistinguishable from human reasoning.
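A small worked example may make the separation idea concrete. The sketch below is an illustration chosen for this digest, not code from the article: two concentric rings of points cannot be split by a straight line in the plane, but adding the squared radius as a third coordinate lets a flat plane separate them. The names ring, lift, and classify, and the threshold of 5.0, are assumptions made for the example.

```python
import math

def ring(radius, n=8):
    """Return n points evenly spaced on a circle of the given radius."""
    return [(radius * math.cos(2 * math.pi * k / n),
             radius * math.sin(2 * math.pi * k / n)) for k in range(n)]

def lift(point):
    """Map (x, y) to (x, y, x^2 + y^2): the squared radius becomes a third coordinate."""
    x, y = point
    return (x, y, x * x + y * y)

def classify(point, threshold=5.0):
    """Label a point by which side of the plane z = threshold its lifted image falls on."""
    return 1 if lift(point)[2] > threshold else 0

inner = ring(1.0)  # class 0: lifted height is about 1
outer = ring(3.0)  # class 1: lifted height is about 9

print([classify(p) for p in inner])
print([classify(p) for p in outer])
```

Here the lift is written by hand; the article's point is that a trained network learns a deformation of this kind on its own.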

AI in my plasma physics research didn’t go the way I expected

Physicist Nick McGreivy, who once believed AI could accelerate physics research, became disillusioned after applying AI techniques to real physics problems: his own results were disappointing, and he found that published claims were often exaggerated. His experience, along with similar issues identified by others, has led him to question whether AI will revolutionize science, suggesting that its impact may be more incremental than is often claimed.

Gemma 3n preview: Mobile-first AI

Google has introduced Gemma 3n, a new open AI model built to run fast, low-footprint AI experiences locally on phones, tablets, and laptops. It offers optimized on-device performance, multimodal understanding, and improved multilingual support. Gemma 3n is available in preview today through Google AI Studio and Google AI Edge, so developers can start building and experimenting now; it is due on major platforms such as Android and Chrome later this year.

Is-even-ai – Check if a number is even using the power of AI

The "is-even-ai" npm package uses OpenAI's GPT-3.5-turbo model to determine if a number is even, and also provides functions to check for odd numbers, equality, inequality, and comparison of numbers. The package can be installed via npm and requires an OpenAI API key to function, with options to customize the model and temperature used for the AI calculations.
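The joke is easier to appreciate with the mechanism spelled out. What follows is a minimal sketch of the idea in Python, not the package's actual API (which is JavaScript and is not reproduced here): the parity question is turned into a prompt, and the model's reply is parsed back into a boolean. The names make_is_even, ask_model, and fake_model are hypothetical, and fake_model is a local stand-in so that the example runs without an API key.

```python
from typing import Callable

def make_is_even(ask_model: Callable[[str], str]) -> Callable[[int], bool]:
    """Build an is_even function that delegates the decision to a text model."""
    def is_even(n: int) -> bool:
        prompt = f"Is {n} an even number? Answer with exactly 'true' or 'false'."
        answer = ask_model(prompt).strip().lower()
        if answer not in ("true", "false"):
            # A real model may ramble, so refuse to guess.
            raise ValueError(f"unexpected model reply: {answer!r}")
        return answer == "true"
    return is_even

def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call: reads the number back out of the prompt."""
    n = int(prompt.split()[1])
    return "true" if n % 2 == 0 else "false"

is_even = make_is_even(fake_model)
print(is_even(4), is_even(7))
```

Swapping fake_model for a function that calls a hosted model would reproduce the package's premise, along with its latency, cost, and occasional wrong answers.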

Research

The Dangers of Browsing AI Agents

Autonomous browsing agents powered by large language models are vulnerable to a broad range of attacks because they rely on dynamic web content and user-provided data, which exposes weaknesses across multiple architectural layers. This paper presents a comprehensive security evaluation and proposes a defense-in-depth strategy, including input sanitization and session safeguards, and demonstrates the effectiveness of these measures through a white-box analysis of a popular open-source project.

Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking

Large Language Models (LLMs) are vulnerable to jailbreak attacks due to a weakness known as Defense Threshold Decay (DTD), where the model's attention shifts from input to prior output as it generates benign content, making it more susceptible to attacks. A novel jailbreak method called Sugar-Coated Poison (SCP) exploits this vulnerability, but a defense strategy called POSD can effectively mitigate such attacks while preserving the model's capabilities.

Agent Name Service (ANS): A Directory for AI Agent Discovery and Interoperability

The Agent Name Service (ANS) is a novel architecture that provides a public agent discovery framework, utilizing DNS and Public Key Infrastructure (PKI) certificates to enable secure and verifiable agent identity and trust. The ANS architecture features a range of innovations, including a formalized registration mechanism and modular protocol support, to create a foundational directory service for secure discovery and interaction in multi-agent systems.

Questioning Representational Optimism in Deep Learning

Researchers compared neural networks trained using conventional methods to those evolved through an open-ended search process, finding that despite similar output, the internal representations differed significantly, with conventionally trained networks exhibiting "fractured entangled representation" (FER) and evolved networks approaching a more unified representation. Understanding and mitigating FER could be crucial for improving core model capacities such as generalization, creativity, and learning in large AI models.

Robin: A multi-agent system for automating scientific discovery

The Robin system is a multi-agent artificial intelligence platform that automates the key steps of the scientific process, including hypothesis generation, experimentation, and data analysis. By using Robin, researchers were able to identify a novel treatment for dry age-related macular degeneration, a leading cause of blindness, and propose a therapeutic candidate, ripasudil, which was validated through follow-up experiments.

Code

Questioning Representational Optimism in Deep Learning

Researchers compared neural networks trained using stochastic gradient descent (SGD) to those evolved through an open-ended search process, finding that while both produced the same output, their internal representations differed significantly, with SGD-trained networks exhibiting a "fractured entangled representation" that may degrade core model capacities. The study suggests that understanding and mitigating this phenomenon could be crucial for the future of representation learning.

Show HN: Rocketship – Open-source E2E testing that's self-hostable

Rocketship is an open-source testing engine that allows users to verify complex, API-driven scenarios, offering features such as durable execution, a plugin ecosystem, and declarative YAML specs. It can be run locally or in a cloud environment, and its core features include a CLI, deploy-ready images, and support for long-running and highly scalable tests.

LLM-D: Kubernetes-Native Distributed Inference at Scale


Show HN: I Spent Years Building a FOSS Unified Zero Trust Secure Access Platform

Octelium is a free and open-source, self-hosted platform that provides a unified solution for zero-trust resource access, offering a modern alternative to corporate VPNs and remote access tools. It features a scalable zero-trust architecture, dynamic secret-less access, and context-aware, identity-based access control, making it suitable for various use cases, including ZTNA/BeyondCorp architecture, remote access VPN, secure tunnels, and API gateway.

Show HN: I built a programming language, inspired by Golang

The Nature programming language is a general-purpose, open-source language designed to provide an elegant and concise development experience, enabling developers to build secure and reliable cross-platform software. It features a simple syntax, comprehensive type system, high-performance garbage collection, and coroutine implementation, making it suitable for various applications, including game development, scientific computing, and web development.