Wednesday — February 26, 2025
Y Combinator's Optifye.ai faces backlash for AI-powered worker surveillance, researchers show how politeness influences LLM performance, and GibberLink enables AI agents to communicate in a sound-level protocol.
News
It’s still worth blogging in the age of AI
The author argues that blogging is still worthwhile despite the rise of AI tools like ChatGPT, as it allows individuals to learn and think critically, and creates a durable record of their knowledge and expertise. While blogging may not lead to fame or a large following, it can be a valuable way to build a portfolio of writing and demonstrate one's expertise, which can be beneficial in career advancement and other professional contexts.
Y Combinator Supports AI Startup Dehumanizing Factory Workers
Optifye.ai, a startup backed by Y Combinator, is developing an AI-powered surveillance system to monitor factory workers' productivity, tracking their hand movements and output to identify inefficiencies. The system's pitch includes a demo where a boss yells at a worker, referred to by a number, for not meeting productivity targets, raising concerns about dehumanizing worker surveillance and micromanagement.
DOGE will use AI to assess the responses of federal workers
At Elon Musk's direction, the US Office of Personnel Management emailed federal workers asking them to list their accomplishments from the past week; the responses are to be fed into an artificial intelligence system to determine whether their jobs are necessary. The email, which drew backlash from unions and some agencies, is part of Musk's effort to reduce the federal workforce by up to 10%, as tasked by President Donald Trump.
AI is blurring the line between PMs and engineers?
AI applications are increasingly driven by prompt engineering, with prompts and tools being more important than code in determining the behavior of these applications. As a result, product managers and domain experts, rather than software engineers, are taking on the role of prompt engineering, blurring the line between these professions and driving the success of AI applications.
Research
The Influence of Prompt Politeness on LLM Performance
Researchers found that the level of politeness in prompts affects the performance of large language models, with impolite prompts often resulting in poor performance, but the optimal level of politeness varies across languages and cultural contexts. The study's findings suggest that language models reflect human behavior and are influenced by language and cultural norms, highlighting the importance of considering politeness in cross-cultural natural language processing.
Robust Ladder Climbing with a Quadrupedal Robot
Quadruped robots are being used in industrial environments for autonomous inspection, but their inability to climb ladders limits their effectiveness and puts humans at risk. Researchers have developed a reinforcement learning-based control policy and a hooked end-effector that together enable quadruped robots to climb ladders with a 90% success rate, expanding their potential applications in industrial settings.
Comply: Learning Sentences with Complex Weights Inspired by Fruit Fly Olfaction
Biologically inspired neural networks, such as FlyVec, have shown promise in modeling data distributions, and a new model called Comply improves upon this by incorporating positional information to learn sequence representations. Comply outperforms FlyVec and matches the performance of larger state-of-the-art models without adding extra parameters, while also providing sparse and interpretable contextual representations of sentences.
Discovering Chunks in Neural Embeddings for Interpretability
Researchers propose using the cognitive principle of chunking, where complex data is broken down into recurring entities, to interpret artificial neural network activities, and demonstrate this concept in recurrent neural networks and large language models. By extracting dictionaries of identifiable chunks, they introduce a new framework for understanding neural networks, revealing that their population activity reflects structured patterns in the data they process.
Notions of Stack-Manipulating Computation and Relative Monads (Extended Version)
Monads, a concept from functional programming, can be adapted to lower-level programming with explicit stack manipulation, allowing for the abstraction of computational effects in a more general setting. This adaptation enables the use of relative monads, which can model stack-based implementations of effects and provide benefits such as automatic extension to relative monad transformers and the use of "monadic blocks" for more flexible programming.
Code
DeepSeek open source DeepEP – library for MoE training and Inference
DeepEP is a communication library designed for Mixture-of-Experts (MoE) and expert parallelism (EP) that provides high-throughput and low-latency all-to-all GPU kernels, supporting low-precision operations and optimized for asymmetric-domain bandwidth forwarding. The library offers various features, including normal and low-latency kernels, support for SM number control, and a hook-based communication-computation overlapping method, making it suitable for both training and inference prefilling tasks.
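To make the all-to-all pattern concrete, here is a minimal single-process sketch of the dispatch and combine steps used in expert parallelism. It is not DeepEP's API: the rank layout, the function names (`dispatch`, `run_experts`, `combine`), and the toy "expert" that multiplies a value are all hypothetical, and plain Python lists stand in for GPU buffers.

```python
from collections import defaultdict

NUM_RANKS = 2          # hypothetical number of GPUs
EXPERTS_PER_RANK = 2   # toy layout: experts 0-1 on rank 0, experts 2-3 on rank 1


def expert_rank(expert_id):
    """Rank that hosts a given expert in this toy layout."""
    return expert_id // EXPERTS_PER_RANK


def dispatch(routing):
    """All-to-all 'send': group (token, expert, weight) entries by destination rank."""
    inbox = defaultdict(list)
    for token_id, choices in routing.items():
        for expert_id, weight in choices:
            inbox[expert_rank(expert_id)].append((token_id, expert_id, weight))
    return inbox


def run_experts(inbox, tokens):
    """Each rank applies its local experts; a toy expert just scales the token value."""
    outbox = []
    for rank, items in inbox.items():
        for token_id, expert_id, weight in items:
            expert_output = tokens[token_id] * (expert_id + 1)
            outbox.append((token_id, weight * expert_output))
    return outbox


def combine(outbox, num_tokens):
    """All-to-all 'return': sum the weighted expert outputs for each token."""
    result = [0.0] * num_tokens
    for token_id, value in outbox:
        result[token_id] += value
    return result


# Three scalar "tokens", each routed to one or two experts with gate weights.
tokens = [1.0, 2.0, 3.0]
routing = {
    0: [(0, 0.5), (3, 0.5)],
    1: [(1, 1.0)],
    2: [(2, 0.25), (3, 0.75)],
}
inbox = dispatch(routing)
output = combine(run_experts(inbox, tokens), len(tokens))
print(output)  # [2.5, 4.0, 11.25]
```

In a real deployment the two grouping steps are network transfers between GPUs, which is why the throughput and latency of the all-to-all kernels matter so much for MoE models.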
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepGEMM is a library for efficient FP8 General Matrix Multiplications (GEMMs) with fine-grained scaling. It supports both normal and Mixture-of-Experts (MoE) grouped GEMMs, and its performance matches or exceeds expert-tuned libraries across various matrix shapes. The library is written in CUDA, has a simple design with only one core kernel function, and is optimized for NVIDIA Hopper tensor cores, making it a clean and accessible resource for learning Hopper FP8 matrix multiplication and optimization techniques.
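The idea behind fine-grained scaling can be illustrated without any GPU code. The sketch below is not DeepGEMM's kernel: it only quantizes a vector, the block sizes are hypothetical, and rounding to an integer grid is a crude stand-in for real FP8 rounding. The one hard fact it relies on is that 448 is the largest finite value in the FP8 E4M3 format.

```python
FP8_E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3


def quantize_blocks(values, block_size):
    """Split values into blocks and give each block its own scale factor."""
    blocks = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        amax = max(abs(v) for v in block)
        scale = amax / FP8_E4M3_MAX if amax > 0 else 1.0
        # round() to an integer grid is a simplified stand-in for FP8 rounding
        codes = [round(v / scale) for v in block]
        blocks.append((scale, codes))
    return blocks


def dequantize_blocks(blocks):
    """Multiply each code by its block's scale to recover approximate values."""
    return [code * scale for scale, codes in blocks for code in codes]


def max_error(values, block_size):
    """Worst-case absolute round-trip error for a given block size."""
    restored = dequantize_blocks(quantize_blocks(values, block_size))
    return max(abs(a - b) for a, b in zip(values, restored))


# Four small values followed by four large ones.
values = [0.01, -0.02, 0.03, 0.015, 100.0, -50.0, 25.0, 75.0]
coarse_error = max_error(values, block_size=8)  # one scale for everything
fine_error = max_error(values, block_size=4)    # one scale per block of four
print(coarse_error, fine_error)
```

With a single scale, the large values set the step size and the small values all round to zero; with one scale per block, the small values keep their precision. That is the motivation for scaling at a finer granularity than the whole tensor.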
Launch HN: Browser Use (YC W25) – open-source web agents
Browser-use is a tool that lets AI agents control a browser so that users can automate web tasks. The project offers a hosted version for instant browser automation, a Discord community for sharing projects, and documentation for setting up and using the tool.
Awesome DeepSeek Integrations
The DeepSeek API can be integrated into various popular software applications, including IntelliBar, DeepChat, and Chatbox, to provide advanced AI capabilities. These integrations enable features such as smart assistants, language translation, and personalized knowledge management, and can be accessed through the DeepSeek Open Platform, which provides an API key for developers to use in their applications.
GibberLink [AI-AI Communication]
GibberLink is a demo of two conversational AI agents that switch from English speech to a sound-level protocol when they realize they are both AI agents, using the ggwave library to continue their conversation. The demo can be reproduced by following the provided steps, which involve setting up API tokens, installing dependencies, and exposing the port to the web using ngrok, allowing users to test the GibberLink protocol on two devices.
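For readers curious what sending data as sound looks like, here is a toy sketch that maps each 4-bit nibble to one of 16 tones and recovers it by correlating against each candidate frequency. This is not the ggwave protocol or API, whose real modulation scheme is more elaborate and is not reproduced here; the sample rate, tone frequencies, and function names are hypothetical choices made for illustration.

```python
import math

SAMPLE_RATE = 8000     # samples per second (hypothetical choice)
SYMBOL_SAMPLES = 160   # 20 ms of audio per symbol
BASE_FREQ = 1000.0     # tone for nibble value 0, in Hz
FREQ_STEP = 100.0      # spacing between adjacent tones, in Hz


def nibble_freq(nibble):
    """Frequency assigned to a 4-bit value (0-15)."""
    return BASE_FREQ + nibble * FREQ_STEP


def encode(message):
    """Turn each byte into two tones, one per 4-bit nibble."""
    samples = []
    for byte in message.encode("utf-8"):
        for nibble in (byte >> 4, byte & 0x0F):
            freq = nibble_freq(nibble)
            for n in range(SYMBOL_SAMPLES):
                samples.append(math.sin(2 * math.pi * freq * n / SAMPLE_RATE))
    return samples


def tone_power(chunk, freq):
    """Correlate a chunk with a cosine/sine pair at one frequency."""
    re = sum(s * math.cos(2 * math.pi * freq * n / SAMPLE_RATE) for n, s in enumerate(chunk))
    im = sum(s * math.sin(2 * math.pi * freq * n / SAMPLE_RATE) for n, s in enumerate(chunk))
    return re * re + im * im


def decode(samples):
    """Pick the strongest tone in each symbol window and rebuild the bytes."""
    nibbles = []
    for start in range(0, len(samples), SYMBOL_SAMPLES):
        chunk = samples[start:start + SYMBOL_SAMPLES]
        nibbles.append(max(range(16), key=lambda v: tone_power(chunk, nibble_freq(v))))
    data = bytes((hi << 4) | lo for hi, lo in zip(nibbles[0::2], nibbles[1::2]))
    return data.decode("utf-8")


waveform = encode("hi AI")
decoded = decode(waveform)
print(decoded)  # hi AI
```

The tone spacing is chosen so that every tone completes a whole number of cycles per symbol window, which keeps the candidate frequencies from leaking into each other in this noise-free toy. A real over-the-air link also has to cope with noise, echo, and synchronization.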