Monday June 2, 2025

Alarm as AI malware targets users, Google AI Edge unveils its cross-platform model showcase app, and researchers propose a triadic neuronal modulation for faster machine learning.

News

Google AI Edge – On-device cross-platform AI deployment

Google AI Edge is a platform that enables developers to deploy AI models across mobile, web, and embedded applications, with features such as on-device hardware acceleration, cross-platform compatibility, and multi-framework support. The platform offers a range of tools and frameworks, including MediaPipe, LiteRT, and Model Explorer, to help developers build and deploy custom AI models, as well as pre-built solutions for common AI tasks such as generative AI, vision, text, and audio processing.

The Rise of Judgement over Technical Skill

Musician Brian Eno's 1995 observation that computer sequencers shift the focus from technical skill to judgement is now relevant in the AI-powered world, where tools democratize creative and professional tasks, making technical ability less important than the ability to make meaningful choices and exercise good judgement. As AI continues to evolve, professionals who can ask the right questions, frame problems effectively, and make sound decisions will become the most valuable, with their ability to exercise good judgement being their most valuable asset.

AI Malware Is Here: New Report Shows How Fake AI Tools Are Spreading Ransomware

Cisco Talos has discovered new malware threats, including CyberLock and Lucky_Gh0$t ransomware, as well as a destructive malware called Numero, which masquerade as legitimate AI tool installers to deceive unsuspecting businesses and individuals. These threats, often distributed through fake websites and social media, can compromise sensitive data, financial assets, and undermine trust in legitimate AI solutions, highlighting the need for extreme caution and verification of sources when downloading AI tools.

How can AI researchers save energy? By going backward

Reversible computing, which involves running programs backward as easily as forward without deleting data, may soon power AI and other technologies as traditional computing hits physical limitations. Researchers like Michael Frank and Christof Teuscher are exploring this approach, which could save energy by avoiding the heat loss associated with deleting data, and are making progress in developing reversible computers that could keep computational progress going.

Elevenlabs Conversational AI 2.0

Conversational AI 2.0 is a significant evolution of the ElevenLabs platform, introducing custom models for smoother and more intuitive interactions, as well as features like natural turn-taking, multilingual communication, and integrated knowledge access. This new version is designed to enable the creation of sophisticated, capable, and trustworthy voice agents, with a focus on enterprise readiness, trust, security, and scalability, making it suitable for a wide range of applications, from customer service to interactive content and enterprise knowledge management.

Research

Beyond Attention: Toward Machines with Intrinsic Higher Mental States

This work proposes a new approach to determining relevance in machine learning models, inspired by cellular neurobiological evidence, which enables models to pre-select relevant information before applying attention. The approach, which involves triadic neuronal-level modulation loops, leads to significantly faster learning and reduced computational demand, with results demonstrated in various applications including reinforcement learning, computer vision, and natural language question answering.

TPDE: A Fast Adaptable Compiler Back-End Framework

TPDE is a compiler back-end framework that generates machine code quickly, adapting to existing code representations and performing compilation in a single pass, making it suitable for fast start-up just-in-time compilation. The framework demonstrates its effectiveness by compiling LLVM-IR 8-24x faster than LLVM while maintaining similar run-time performance, and also shows benefits in domain-specific contexts such as WebAssembly and database query compilation.

LLMs replacing human participants harmfully misportray, flatten identity groups

Large language models (LLMs) are limited in their ability to capture the influence of social identities, such as gender and race, due to inherent limitations in their training, which can lead to misportrayal and flattening of demographic groups. As a result, using LLMs as replacements for human participants can be harmful, particularly for marginalized groups, and caution is urged, although techniques can be used to reduce these harms in certain cases.

Dissolving the Fermi Paradox

The Fermi paradox arises from the discrepancy between the expected high probability of intelligent life existing elsewhere in the universe and the lack of observed evidence, which is often attributed to models like the Drake equation. However, when uncertainties in parameters such as the origin of life are taken into account, the probability of there being no other intelligent life in the observable universe becomes substantial, thereby resolving the Fermi paradox.

Breaking the Sorting Barrier for Directed Single-Source Shortest Paths [pdf]

A new algorithm for single-source shortest paths on directed graphs achieves a time complexity of $O(m\log^{2/3}n)$, outperforming Dijkstra's algorithm on sparse graphs. This result breaks the long-standing $O(m+n\log n)$ time bound of Dijkstra's algorithm, demonstrating that it is not optimal for solving the single-source shortest paths problem.

Code

Google AI Edge Gallery

The Google AI Edge Gallery is an experimental app that allows users to explore and experience the capabilities of cutting-edge Generative AI models directly on their Android and iOS devices, without needing an internet connection. The app features various tools, including image analysis, prompt lab, AI chat, and performance insights, and allows users to choose from different models and even bring their own models to test.

Show HN: Deep Research – Open-Source Customizable Reasoning Framework for Devs

Deep Research is an open-source library that conducts deep, multi-hop research with reasoning capabilities, providing comprehensive and evidence-backed answers to complex questions through focused web searches and recursive exploration. It features advanced multi-hop reasoning, real-time web search, automatic subquery generation, and evidence-based report generation, among other capabilities, and can be customized and scaled for various research needs.

Show HN: Tracking Merged PRs by OpenAI's Codex and GitHub's Copilot

The text provides statistics on pull requests (PRs) for projects Copilot, Codex, and Devin, including total PRs, merged PRs, and merge rates, with Codex having the highest merge rate at 83.28%. The data is sourced from GitHub search queries and can be viewed in more detail through an interactive dashboard.

Show HN: I built an AI Agent that uses the iPhone

PhoneAgent is an iPhone app that utilizes OpenAI models to perform tasks across multiple apps, similar to a human user, by interacting with the accessibility features of iOS apps. The app can be controlled through text or voice commands, and can perform actions such as sending messages, opening apps, and enabling system features, but has some limitations and is considered experimental software.

Show HN: Gemini-Engineer Tool

Gemini Engineer is an interactive, AI-driven terminal application that assists with software engineering tasks, leveraging Google's Gemini API to provide intelligent coding assistance and perform file system operations. The application features a user-friendly terminal interface, AI-powered coding suggestions, and safety features such as path validation and binary file detection to prevent potential security issues.