noos fri 22 may · · 01:02
may 2026
links / week 106
2 today · -1 vs avg
one paragraph note on what claude got wrong
noos.app — saved — 237 messages

saved

today 2
github trending 4m

multica-ai/multica — The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.

github trending 4m

trimstray/the-book-of-secret-knowledge — A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

yesterday 30
krebs on security 1h

Alleged Kimwolf Botmaster ‘Dort’ Arrested, Charged in U.S. and Canada

Canadian authorities on Wednesday arrested a 23-year-old Ottawa man on suspicion of building and operating Kimwolf, a fast spreading Internet-of-Things botnet that enslaved million

dark reading 1h

How CISOs Should Prep for Agentic-Ready AI BOMs

Finding ways to document both component and execution attributes for AI bill of materials (AI BOM).

claude-code releases 2h

Claude Code v2.1.147

What's changed: Added the Workflow tool for deterministic multi-agent orchestration.

dark reading 2h

Google API Keys Remain Active After Deletion

A security researcher discovered the API keys can still be used for 23 minutes after deletion, even though the cloud provider claims deletion is immediate.

openai — youtube 3h

Run long tasks in Codex using goals

Goal mode (/goal) has graduated from an experiment—for tasks big and small, Codex gets your work done.

simon willison 3h

Datasette Agent

We just announced the first release of Datasette Agent, a new extensible AI assistant for Datasette.

openai — youtube 3h

Share Codex plugins with your team

Teams can now distribute custom plugins, reuse internal tools, and manage what’s available across their workspace.

simon willison 4h

datasette-agent-sprites 0.1a0

github blog 5h

Beyond the engine: 10 open source projects shaping how games actually get made

Check out these 10 open source tools that help game developers create art, animation, levels, audio, dialogue, debug UIs, and engine-ready assets.

astro releases 6h

Astro astro@6.3.7

Patch Changes: #16821 (9c76b12), thanks @astrobot-houston! Fixes request body handling in the Node adapter when req.body is a Buffer, Uint8Array, or ArrayBuffer.

schneier on security 6h

macOS Kernel Memory Corruption Exploit

A group used Anthropic’s Mythos AI model to help find a kernel memory corruption vulnerability and exploit on Apple’s M5.

the hacker news 8h

Showboat Linux Malware Hits Middle East Telecom with SOCKS5 Proxy Backdoor

Cybersecurity researchers have disclosed details of a new Linux malware dubbed Showboat that has been put to use in a campaign targeting a telecommunications provider in the Middle

rapid7 10h

Q1 2026 Threat Landscape Report: Zero-clicks, geopolitical tensions, and some wins for law enforcement

Q1 of 2026 reinforced that attackers are moving faster, operating with greater coordination, and exploiting weaknesses before most organizations can respond effectively.

cisa advisories 11h

ABB B&R Automation Runtime

Summary: An update is available that resolves a vulnerability identified by B&R's internal security analysis in the product versions listed as affected in this advisory.

openai 11h

AdventHealth advances whole-person care with OpenAI

AdventHealth is using ChatGPT for Healthcare to streamline workflows, reduce administrative burden, and return more time to patient care.

the hacker news 11h

ThreatsDay Bulletin: Linux Rootkits, Router 0-Day, AI Intrusions, Scam Kits and 25 New Stories

github trending 13h

HKUDS/ViMax — "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

github trending 13h

FareedKhan-dev/train-llm-from-scratch — A straightforward method for training your LLM, from downloading data to generating text.

vercel 14h

Pull anomaly alert details using the Vercel CLI

Pull anomaly alerts and their details using the vercel alerts command in the Vercel CLI.

r/SideProject 14h

I spent 1 hour on a side project for my neighbor’s flower shop. It generated 18k in repeat sales!

r/SideProject 15h

Pricing dilemma for my side project. Would you ask for a credit card before the trial?

Been building Bulkmark on the side and I'm stuck on a pricing decision.

the hacker news 15h

9-Year-Old Linux Kernel Flaw Enables Root Command Execution on Major Distros

Cybersecurity researchers have disclosed details of a vulnerability in the Linux kernel that remained undetected for nine years.

r/MachineLearning 16h

High E2E latency on fine-tuned Gemma 4 26B despite low TTFT [R]

Recently fine-tuned a Gemma 4 26B model, and I’m seeing surprisingly high end-to-end latency despite the effective inference footprint being much smaller (~4B-ish behavior during

r/LocalLLaMA 16h

Qwen3.6 27B and llama.cpp appreciation post

To preface, here's my config: llama-server \ --host 0.0.0.0 \ --port 1235 \ --models-preset %h/Software/models.ini \ --models-max 1 \ --sleep-idle-seconds 3600 \ --timeout 3600 \ -

hugging face — youtube 17h

On the slow death of Scaling (birth of Adaption Labs) | Sara Hooker | HF ML Club India EP2

the hacker news 18h

GitHub Internal Repositories Breached via Malicious Nx Console VS Code Extension

GitHub on Wednesday officially confirmed that the breach of its internal repositories was the result of a compromise of an employee device involving a poisoned version of the Nx Co

the hacker news 19h

Highly Critical Drupal Core Flaw Exposes PostgreSQL Sites to RCE Attacks

Drupal has released security updates for a "highly critical" security vulnerability in Drupal Core that could be exploited by attackers to achieve remote code execution, privilege

claude-code releases 21h

Claude Code v2.1.146

What's changed: Renamed /simplify to /code-review with an optional effort level (e.g.

openai — youtube 23h

Built with GPT-5.5: Abridge Clinical AI Notes

A better medical note starts with a very human reality: people do not tell stories in a straight line.

simon willison 1d

Quoting SpaceX S-1

We have the ability to use compute resources to support our proprietary AI applications (such as Grok 5, which is currently being trained at COLOSSUS II), while also providing acce

wednesday · 20 may 37
cline releases 1d

Cline nightly-dpc-sdk-migration-simpler-login-20260520212824-44c86ecc8987

Cline Nightly published from dpc/sdk-migration-simpler-login at 44c86…

github security 1d

Investigating unauthorized access to GitHub-owned repositories

If any impact is discovered, customers will be notified via established incident response and notification channels.

dark reading 1d

Cyber Pros Can't Decide If AI Is a Good or a Bad Thing

There is nothing cybersecurity professionals are more excited about, and nothing they fear more, than AI.

dark reading 1d

GitHub Confirms Breach, 4K Internal Repos Stolen

Open source software giant GitHub confirmed a data breach this week involving the theft of thousands of repos.

google ai blog 1d

We’re announcing new community investments in Missouri.

We’re helping build the state’s next-generation workforce and investing in energy programs.

r/MachineLearning 1d

OpenAI claims a general-purpose reasoning model found a counterexample to Erdos's unit-distance bound [D]

OpenAI posted a math result today claiming that one of its general-purpose reasoning models found a construction disproving the conjectured $n^{1+O(1/\log\log n)}$ upper bound in Erd

aws security 1d

CVE-2026-9133 - Arbitrary file read in rabbitmq-aws plugin

Bulletin ID: 2026-034-AWS · Scope: AWS · Content Type: Important (requires attention) · Publication Date: 05/20/2026 12:45 PM PDT · Description: rabbitmq-aws is a RabbitMQ plugin that reso

gemini news 1d

100 things we announced at I/O 2026

This year at Google I/O 2026, we announced Gemini Omni, Google Antigravity, Universal Cart and so much more.

openai — youtube 1d

The Erdős Breakthrough

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946.

github trending 1d

can1357/oh-my-pi — ⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

simon willison 1d

How fast is 10 tokens per second really?

Neat little HTML app by Mike Veerman (source code here) which simulates LLM token output speeds from 5/second to 800/second.

arxiv cs.CL 1d

Leveraging LLMs for Grammar Adaptation: A Study on Metamodel-Grammar Co-Evolution

In model-driven engineering, metamodel evolution leads to the need to adapt corresponding grammars to maintain consistency, which typically requires tedious manual work.

arxiv cs.AI 1d

Quality and Security Signals in AI-Generated Python Refactoring Pull Requests

As AI agents increasingly contribute to code development and maintenance, there is still limited empirical evidence on the quality and risk characteristics of their changes in real

the hacker news 1d

Microsoft Open-Sources RAMPART and Clarity to Secure AI Agents During Development

Microsoft has unveiled two new open-source tools called RAMPART and Clarity to assist developers in better testing the security of artificial intelligence (AI) agents.

arxiv cs.CR 1d

VIPER-MCP: Detecting and Exploiting Taint-Style Vulnerabilities in Model Context Protocol Servers

Model Context Protocol (MCP) has emerged as a standard interface for connecting LLM agents to external tools.

arxiv cs.CL 1d

Post-Hoc Understanding of Metaphor Processing in Decoder-Only Language Models via Conditional Scale Entropy

Metaphor requires a language model to resolve a token whose contextual meaning diverges from its basic literal sense.

google ai blog 1d

A new experiment brings better group meetings to Google Beam

See and hear your colleagues in true-to-life size and sound, making hybrid meetings feel more inclusive and connected.

simon willison 1d

Google I/O, Gemini Spark, Antigravity

It's hard to find much to write about Google I/O this year because I have a policy of not writing about anything that I can't try out myself, and a …

the hacker news 1d

Microsoft Takes Down Malware-Signing Service Behind Ransomware Attacks

Microsoft on Tuesday said it disrupted a malware-signing-as-a-service (MSaaS) operation that weaponized the company's Artifact Signing system to deliver malicious code and conduct

schneier on security 1d

On AI Security

Good report: Executive Summary: Let’s say you wanted to make sure that your AI is secure.

arxiv cs.CR 1d

Detecting Trojaned DNNs via Spectral Regression Analysis

Modern DNNs are repeatedly fine-tuned to incorporate new data and functionality.

cisa advisories 1d

CISA Adds Seven Known Exploited Vulnerabilities to Catalog

CISA has added seven new vulnerabilities to its Known Exploited Vulnerabilities (KEV) Catalog, based on evidence of active exploitation.

the hacker news 1d

Agent AI is Coming. Are You Ready?

New Industry Data Just Released Suggests Not.

astro releases 1d

Astro astro@6.3.6

Patch Changes: #16774 (8f77583), thanks @astrobot-houston! Fixes markdown images with empty alt text (![](image.jpg)) in content collections dropping the alt attribute entirely.

github trending 1d

frappe/erpnext — Free and Open Source Enterprise Resource Planning (ERP)

github trending 1d

anthropics/claude-plugins-official — Official, Anthropic-managed directory of high quality Claude Code Plugins.

vercel 1d

Grok Build 0.1 now available on Vercel AI Gateway

You can now access Grok Build 0.1 on Vercel's AI Gateway with no markup and no other provider accounts required.

product hunt — ai 1d

Hiro

openai — youtube 1d

Kunal vs Motorcycle | With ChatGPT

Kickstart your projects with ChatGPT. Credits · Director: Abhinav Pratiman · DOP: Tassaduq Hussain · Production House: Early Man Film · Creative agency: Hue & Why

cline releases 1d

cline/cline CLI v3.0.9

Speed up CLI startup with plugins by loading sandboxed plugins concurrently and caching plugin tool descriptors per plugin, provider, and model. Speed up plugin and tool config toggles by updating ...

codex releases 1d

openai/codex 0.132.0

New Features: The Python SDK now supports first-class authentication, including API key login, ChatGPT browser and device-code flows, account inspection, and logout APIs. (#23093) Python turn APIs ...

claude-code releases 1d

anthropics/claude-code v2.1.145

What's changed: Added claude agents --json to list live Claude sessions as JSON for scripting (tmux-resurrect, status bars, session pickers). Added agent_id and parent_agent_id attributes to claude_...

openai blog 1d

An OpenAI model has disproved a central conjecture in discrete geometry

An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven mathematics.

openai 1d

How Ramp engineers accelerate code review with Codex

How Ramp engineers use Codex with GPT-5.5 to review code and ship improvements, allowing them to get substantive feedback in minutes instead of hours.

openai blog 1d

The next phase of OpenAI’s Education for Countries

OpenAI advances Education for Countries, expanding AI adoption in schools with new partnerships, teacher training, and tools to improve global learning outcomes.

vercel 1d

Vercel AI Gateway plugin for WordPress

simon willison 1d

llm-gemini 0.32

LLM plugin to access Google's Gemini family of models

tuesday · 19 may 37
simon willison 2d

llm-gemini 0.32a0

LLM plugin to access Google's Gemini family of models

openai blog 2d

Introducing OpenAI for Singapore

OpenAI for Singapore launches a multi-year AI partnership to expand deployment, build local talent, and support businesses and public services with AI.

simon willison 2d

datasette-llm 0.1a8

LLM integration plugin for other plugins to depend on

hugging face 2d

OlmoEarth v1.1: A more efficient family of models

A Blog post by Ai2 on Hugging Face

github trending 2d

Alishahryar1/free-claude-code — Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

github trending 2d

multica-ai/andrej-karpathy-skills — A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

github trending 2d

msitarzewski/agency-agents — A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes, and proven deliverables.

github trending 2d

rtk-ai/rtk — CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

google deepmind — youtube 2d

Generating novel scientific hypotheses with Co-Scientist

In an era of information overload, the search for transformative scientific ideas has become a significant bottleneck for progress.

google deepmind — youtube 2d

Using AI to outsmart drug-resistant bacteria

Globally recognized as a silent pandemic, antimicrobial resistance continues to rise as bacteria outpace the development of new antibiotics.

google deepmind — youtube 2d

Understanding cancer at a genetic level with AI

In Uganda, the incidence of early-onset breast cancer is growing at an alarming rate.

google deepmind — youtube 2d

Predicting a historic storm earlier with WeatherNext

Tropical storms and hurricanes are notoriously volatile, changing structure and intensity in a matter of hours.

arxiv cs.AI 2d

Rethinking Visual Attribution for Chest X-ray Reasoning in Large Vision Language Models

Large Vision Language Models (LVLMs) show promise in medical applications, but their inability to faithfully ground responses in visual evidence raises serious concerns about clini

google ai blog 2d

Everything new in our Google AI subscriptions, fresh from I/O 2026

Introducing a $100 AI Ultra plan — plus, new features and benefits for Google AI Plus, Pro and Ultra subscribers.

gemini news 2d

Gemini 3.5: frontier intelligence with action

At Google I/O we released Gemini 3.5, our latest series of models combining frontier intelligence with action.

gemini news 2d

Gemini for Science: AI experiments and tools for a new era of discovery

Gemini for Science is a new collection of science tools and experiments to expand the scale and precision of scientific exploration.

google ai blog 2d

How AI Mode is changing the way people search in the U.S.

One year after launch, see how AI Mode’s users are shifting from keywords to natural language queries.

google ai blog 2d

I/O 2026: Welcome to the agentic Gemini era

The latest from Google I/O: See how we’re helping you get more done with Gemini.

gemini news 2d

Introducing Gemini Omni

Introducing Gemini Omni, which allows you to create anything from any input and edit naturally using conversational language.

gemini news 2d

Making it easier to understand how content was created and edited

We're expanding our tools to help you understand how content was created and edited across the web.

google ai blog 2d

New ways to create and get things done in Google Workspace

Announcing new voice capabilities in Gmail, Docs and Keep, a new design tool called Google Pics and updates to AI Inbox.

gemini news 2d

The Gemini app becomes more agentic, delivering proactive, 24/7 help

A look at how the Gemini app is becoming more agentic, delivering proactive, 24/7 help.

cline releases 2d

cline/cline CLI v3.0.8

Use Telegram numeric participant ids so renamed users stay linked to the same participant in the Telegram connector. Keep failed plugins visible in the config UI with their load/setup phase and err...

arxiv cs.AI 2d

Less Back-and-Forth: A Comparative Study of Structured Prompting

Large language models (LLMs) are widely used for open-ended tasks, but underspecified prompts can lead to low-quality answers and additional interaction. This paper studies whether structured prompt design improves response quality while reducing user effort. We compare three prompt conditions: a raw prompt, a checklist-improved prompt, and a clarifying-question prompt. We evaluate these condition

cline releases 2d

cline/cline v3.84.0

Added: Add SAP AI Core support for additional hosted models. Fixed: Disable the MCP "Restart Server" button when a server is toggled off. Changed: Remove the Cline Kanban launch modal and bundled ...

arxiv cs.CL 2d

PromptRad: Knowledge-Enhanced Multi-Label Prompt-Tuning for Low-Resource Radiology Report Labeling

Automatic report labeling facilitates the identification of clinical findings from unstructured text and enables large-scale annotation for medical imaging research.

cloudflare 2d

Announcing Claude Managed Agents on Cloudflare

Cloudflare has integrated with Anthropic's Claude Managed Agents to provide a fast, isolated execution environment for autonomous code delivery. This means builders can scale agent workflows globally while strictly controlling access to private backends and easily customizing their agent’s tools and runtimes.

openai blog 2d

Advancing content provenance for a safer, more transparent AI ecosystem

OpenAI advances AI content provenance with Content Credentials, SynthID, and a verification tool to help people identify and trust AI-generated media.

github trending 2d

ggml-org/llama.cpp — LLM inference in C/C++

vercel 2d

Gemini 3.5 Flash on AI Gateway

You can now access Gemini 3.5 Flash on Vercel's AI Gateway with no markup and no other provider accounts required.

product hunt — ai 2d

Thinnest AI

product hunt — ai 2d

Agora-1 by Odyssey

product hunt — devtools 2d

CLI Market

simon willison 2d

The last six months in LLMs in five minutes

I put together these annotated slides from my five minute lightning talk at PyCon US 2026, using the latest iteration of my annotated presentation tool. I presented this lightning …

claude-code releases 2d

anthropics/claude-code v2.1.144

What's changed: Added /resume support for background sessions: sessions started via claude --bg or agent view now appear alongside interactive ones, marked with bg. Added elapsed duration to backgr...

hugging face 2d

Introducing the Ettin Reranker Family

anthropic 2d

KPMG integrates Claude across its core business and workforce of more than 276,000 in strategic alliance

monday · 18 may 23
vercel 3d

Firewall‑mitigated traffic is free on Vercel

Vercel Firewall now waives CDN Requests and Fast Data Transfer for any traffic WAF rules deny, challenge, or rate-limit.

arxiv cs.AI 3d

DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention

Current hierarchical attention methods, such as NSA and InfLLMv2, select the top-k relevant key-value (KV) blocks based on coarse attention scores and subsequently apply fine-grained softmax attention on the selected tokens. However, the top-k operation assumes the number of relevant tokens for any query is fixed and it precludes the gradient flow between the sparse and dense stages. In this work,

astro releases 3d

withastro/astro astro@6.3.5

Patch Changes: #16771 (07c8805), thanks @ematipico! Fixes position prop on <Image> and <Picture> components breaking Content Security Policy (CSP). #16593 (50924ce), thanks @yanthomasdev! Improves...

github trending 3d

humanlayer/12-factor-agents — What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

github trending 3d

NVlabs/Sana — SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

github trending 3d

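ZhuLinsen/daily_stock_analysis — LLM-driven smart analysis of A-share, Hong Kong, and US stocks: multi-source market data + real-time news + LLM decision dashboard + multi-channel push notifications, scheduled runs at zero cost, entirely free. LLM-powered stock analysis system for A/H/US markets.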

github trending 3d

Imbad0202/academic-research-skills — Academic Research Skills for Claude Code: research → write → review → revise → finalize

codex releases 3d

openai/codex 0.131.0

New Features: The TUI now offers richer session controls and display: data-driven service-tier commands, blended token usage, permissions/approval mode, effective workspace roots, and responsive Ma...

arxiv cs.AI 3d

Ensembling Tabular Foundation Models - A Diversity Ceiling And A Calibration Trap

Tabular foundation models (TFMs) now match or beat tuned gradient-boosted trees on a growing fraction of tabular tasks, but no single TFM wins on every dataset. Ensembling is the go to fix here, and it works less well than expected. Six modern TFMs form a near-redundant pool: their mean pairwise Q-statistic is $0.961$, close enough to $1$ that any convex combination is bounded above. We benchmark

arxiv cs.CL 3d

Generative AI Advertising as a Problem of Trustworthy Commercial Intervention

Major deployed generative AI advertising systems preserve a visible boundary between commercial content and AI-generated responses. Yet empirical research shows that ads woven directly into large language model (LLM) outputs often go undetected by users. We argue that generative AI fundamentally changes advertising: rather than placing products into discrete slots, it enables interventions on the

hugging face 3d

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

A Blog post by NVIDIA on Hugging Face

hugging face 3d

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

A Blog post by PaddlePaddle on Hugging Face

astro releases 3d

withastro/astro astro@6.3.4

Patch Changes: #16723 (0f10bfe), thanks @matthewp! Adds fetchFile option to experimental.advancedRouting to customize or disable the entrypoint file: export default defineConfig({ experimental: { ...

astro releases 3d

withastro/astro astro-vscode@2.16.16

Patch Changes: #16719 (2b1df12), thanks @alexisintech! Fix MDX syntax highlighting for indented astro codeblocks.

simon willison 3d

Glaucous-winged Gull, Brown Pelican, Snowy Egret, Canada Goose

hugging face 3d

The Open Agent Leaderboard

A Blog post by IBM Research on Hugging Face

openai blog 3d

OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments

OpenAI and Dell partner to bring Codex to hybrid and on-premise environments, helping enterprises deploy AI coding agents securely across data and workflows.

github trending 3d

microsoft/ai-agents-for-beginners — 12 Lessons to Get Started Building AI Agents

github trending 3d

dograh-hq/dograh — Open Source Voice Agent Platform

product hunt — ai 3d

AnyFrame

product hunt — ai 3d

Voiser AI

product hunt — devtools 3d

M1 by Montage

cline releases 3d

cline/cline CLI v3.0.6

Fix ChatGPT provider model list to include the codex variants and the gpt-5.2, gpt-5.4, and gpt-5.4-mini subscription models. Full Changelog: cli-v3.0.5...cli-v3.0.6

sunday · 17 may 10
github trending 4d

plausible/analytics — Open source, privacy-first web analytics. Lightweight, cookie-free Google Analytics alternative. Self-hosted or cloud.

github trending 4d

knadh/listmonk — High performance, self-hosted, newsletter and mailing list manager with a modern dashboard. Single binary app.

github trending 4d

TryGhost/Ghost — Independent technology for modern publishing, memberships, subscriptions and newsletters.

github trending 4d

KeygraphHQ/shannon — Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities before they reach production.

github trending 4d

Light-Heart-Labs/DreamServer — Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

github trending 4d

BigBodyCobain/Shadowbroker — Open-source intelligence for the global theater. Track everything from the corporate/private jets of the wealthy, and spy satellites, to seismic events in one unified interface. Hook an AI agent up to have it parse through data and find previously unseen correlations. The knowledge is available to all but rarely aggregated in the open, until now.

github trending 4d

NirDiamant/agents-towards-production — End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.

github trending 4d

tech-leads-club/agent-skills — The secure, validated skill registry for professional AI coding agents. Extend Antigravity, Claude Code, Cursor, Copilot and more with absolute confidence.

github trending 4d

HKUDS/CLI-Anything — "CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/

simon willison 4d

GDS weighs in on the NHS's decision to retreat from Open Source

Terence Eden continues his coverage of the NHS' poorly considered decision to close down access to their open source repositories in response to vulnerabilities reported to them as part of …

saturday · 16 may 8
github trending 5d

colbymchenry/codegraph — Pre-indexed code knowledge graph for Claude Code — fewer tokens, fewer tool calls, 100% local

github trending 5d

Anil-matcha/Open-Generative-AI — Open-source alternative to AI video platforms — Free AI image & video generation studio with 200+ models (Flux, Midjourney, Kling, Sora, Veo). No content filters. Self-hosted, MIT licensed.

simon willison 5d

Quoting Julia Evans

[...] in the last 10 years I’ve learned to really love and respect CSS as a technology. So I decided years ago that I wanted to react to “CSS is …

github trending 5d

oven-sh/bun — Incredibly fast JavaScript runtime, bundler, test runner, and package manager – all in one

github trending 5d

anthropics/skills — Public repository for Agent Skills

openai blog 5d

OpenAI and Malta partner to bring ChatGPT Plus to all citizens

OpenAI and Malta partner to expand AI access, offering ChatGPT Plus and training to help citizens build practical AI skills and use AI responsibly.

simon willison 5d

inaturalist-clumper 0.1

Group iNaturalist sightings into clumps

claude-code releases 6d

anthropics/claude-code v2.1.143

What's changed Added plugin dependency enforcement: claude plugin disable now refuses when another enabled plugin depends on the target (with a copy-pasteable disable-chain hint), and claude plugi...

friday · 15 may 10
product hunt — ai 6d

ChatGPT for Personal Finance

github trending 6d

czlonkowski/n8n-mcp — A MCP for Claude Desktop / Claude Code / Windsurf / Cursor to build n8n workflows for you

github trending 6d

joeseesun/qiaomu-anything-to-notebooklm — Claude Skill: Multi-source content processor for NotebookLM. Supports WeChat articles, web pages, YouTube, PDF, Markdown, search queries → Podcast/PPT/MindMap/Quiz etc.

github trending 6d

NVIDIA-AI-Blueprints/video-search-and-summarization — Suite of reference architectures for building GPU-accelerated vision agents and AI-powered video analytics applications.

github trending 6d

ruvnet/RuView — π RuView turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — all without a single pixel of video.

product hunt — ai 6d

OpenIT

simon willison 6d

QR code generator

Generate QR codes from any URL or text with customizable styling options. This tool supports multiple design styles including square and liquid patterns, adjustable sizes, custom colors, and optional borders. …

openai blog 6d

A new personal finance experience in ChatGPT

Preview a new personal finance experience in ChatGPT for Pro users in the U.S. Securely connect your financial accounts and get AI-powered insights and guidance grounded in your financial context, goals, and priorities.

claude-code releases 1w

anthropics/claude-code v2.1.142

What's changed Added new claude agents flags: --add-dir, --settings, --mcp-config, --plugin-dir, --permission-mode, --model, --effort, and --dangerously-skip-permissions to configure dispatched ba...

simon willison 1w

Not so locked in any more

This Mitchell Hashimoto quote about Bun migrating from Zig to Rust reminded me of a similar conversation I had at a conference last week. I was talking to someone who …

14 may 11
arxiv cs.AI 1w

Self-Distilled Agentic Reinforcement Learning

Reinforcement learning (RL) has emerged as a central paradigm for post-training LLM agents, yet its trajectory-level reward signal provides only coarse supervision for long-horizon interaction. On-Policy Self-Distillation (OPSD) complements RL by introducing dense token-level guidance from a teacher branch augmented with privileged context. However, transferring OPSD to multi-turn agents proves pr

arxiv cs.CL 1w

Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use

Tool use extends large language models beyond parametric knowledge, but reliable execution requires balancing appropriate reasoning depth with strict structural validity. We approach this problem from a case-based perspective to present CAST, a case-driven framework that treats historical execution trajectories as structured cases. Instead of reusing raw exemplar outputs, CAST extracts case-derive

github trending 1w

github/spec-kit — 💫 Toolkit to help you get started with Spec-Driven Development

github trending 1w

mattpocock/skills — Skills for Real Engineers. Straight from my .claude directory.

github trending 1w

obra/superpowers — An agentic skills framework & software development methodology that works.

product hunt — ai 1w

Raindrop Workshop

product hunt — devtools 1w

Comie.dev

cline releases 1w

cline/cline nightly-dpc-sdk-migration-simpler-login-20260514034816-03e6884abce5

Cline Nightly published from dpc/sdk-migration-simpler-login at 03e68…

product hunt — ai 1w

Notion Developer Platform

simon willison 1w

Welcome to the Datasette blog

We have a bunch of neat Datasette announcements in the pipeline so we decided it was time the project grew an official blog. I built this using OpenAI Codex desktop, …

claude-code releases 1w

anthropics/claude-code v2.1.141

What's changed: Added terminalSequence field to hook JSON output so hooks can emit desktop notifications, window titles, and bells without a controlling terminal. Added CLAUDE_CODE_PLUGIN_PREFER_HTT...

13 may 16
arxiv cs.CL 1w

WARDEN: Endangered Indigenous Language Transcription and Translation with 6 Hours of Training Data

This paper introduces WARDEN, an early language model system capable of transcribing and translating Wardaman, an endangered Australian indigenous language into English. The significant challenge we face is the lack of large-scale training data: in fact, we only have 6 hours of annotated audio. Therefore, while it is common practice to train a single model for transcription and translation using l

github trending 1w

imthenachoman/How-To-Secure-A-Linux-Server — An evolving how-to guide for securing a Linux server.

github trending 1w

influxdata/telegraf — Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.

github trending 1w

K-Dense-AI/scientific-agent-skills — A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.

astro releases 1w

withastro/astro @astrojs/ts-plugin@1.10.8

Patch Changes #16716 04fdbb2 Thanks @delucis! - Drops support for versions of VS Code below 1.101.0 [May 2025]

cline releases 1w

cline/cline v3.83.0

Fixed: Show a clear "Searching..." state in the @-mention file picker. Improve @-mention file search performance. Allow write_to_file to create or overwrite files with empty content. Fix validation f...

simon willison 1w

Quoting Boris Mann

“11 AI agents” is meaningless as a phrase. If I said “I have 11 spreadsheets” or “I have 11 browser tabs” to do my work, it means about the same …

cloudflare 1w

Browser Run: now running on Cloudflare Containers, it’s faster and more scalable

We’ve enabled higher usage limits, faster performance, better reliability, and increased shipping velocity for our Browser Run product by rebuilding on top of Cloudflare’s Containers. Here’s how.

openai blog 1w

Building a safe, effective sandbox to enable Codex on Windows

Learn how OpenAI built a secure sandbox for Codex on Windows, enabling safe, efficient coding agents with controlled file access and network restrictions.

github trending 1w

anonfaded/FadCam — Open-source, ad-free Android multimedia recorder with background video recording, screen recording, live streaming, and remote camera control

github trending 1w

apernet/hysteria — Hysteria is a powerful, lightning fast and censorship resistant proxy.

product hunt — ai 1w

PitchDrop.ai

simon willison 1w

CSP Allow-list Experiment

bun releases 1w

oven-sh/bun Bun v1.3.14

To install Bun v1.3.14:
  curl -fsSL https://bun.sh/install | bash
  # or you can use npm
  # npm install -g bun
Windows:
  powershell -c "irm bun.sh/install.ps1|iex"
To upgrade to Bun v1.3.14:
  bun upgrade ...

cline releases 1w

cline/cline sdk-v0.0.40

Update to 0.0.40

vercel 1w

Trusted Sources for Deployment Protection

You can now authorize specific Vercel projects and external CI services to reach this project's protected deployments using short-lived OIDC tokens, without sharing a static bypass secret or opening the deployment to the public internet

12 may 8
claude-code releases 1w

anthropics/claude-code v2.1.140

What's changed: Improved Agent tool subagent_type matching to accept case- and separator-insensitive values (e.g. "Code Reviewer" resolves to code-reviewer). Updated agent color palette. Fixed /goal ...

vercel 1w

Create Vercel Firewall rules with natural language

Create Vercel WAF custom security and firewall rules using natural language. Describe your firewall needs and let AI generate the rule configuration.

arxiv cs.AI 1w

Learning, Fast and Slow: Towards LLMs That Adapt Continually

Large language models (LLMs) are trained for downstream tasks by updating their parameters (e.g., via RL). However, updating parameters forces them to absorb task-specific information, which can result in catastrophic forgetting and loss of plasticity. In contrast, in-context learning with fixed LLM parameters can cheaply and rapidly adapt to task-specific requirements (e.g., prompt optimization),

arxiv cs.AI 1w

Solve the Loop: Attractor Models for Language and Reasoning

Looped Transformers offer a promising alternative to purely feed-forward computation by iteratively refining latent representations, improving language modeling and reasoning. Yet recurrent architectures remain unstable to train, costly to optimize and deploy, and constrained to small, fixed recurrence depths. We introduce Attractor Models, in which a backbone module first proposes output embeddin

simon willison 1w

llm 0.32a2

Access large language models from the command-line

arxiv cs.CL 1w

TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection

We introduce TextSeal, a state-of-the-art watermark for large language models. Building on Gumbel-max sampling, TextSeal introduces dual-key generation to restore output diversity, along with entropy-weighted scoring and multi-region localization for improved detection. It supports serving optimizations such as speculative decoding and multi-token prediction, and does not add any inference overhea

arxiv cs.CL 1w

A Causal Language Modeling Detour Improves Encoder Continued Pretraining

When adapting an encoder to a new domain, the standard approach is to continue training with Masked Language Modeling (MLM). We show that temporarily switching to Causal Language Modeling (CLM) followed by a short MLM decay improves downstream performance. On biomedical texts with ModernBERT, this CLM detour outperforms MLM baselines trained on identical data and compute across 8 French and 11 Eng

openai blog 1w

How finance teams use Codex

See how finance teams can use Codex to build MBRs, reporting packs, variance bridges, model checks, and planning scenarios from real work inputs.

11 may 37
r/coolgithubprojects 1w

[TypeScript] ContainerFlow - Real-time Docker dashboard with accurate memory monitoring, Discord alerts, and config recommendations

r/coolgithubprojects 1w

A Dead-Simple Static Site Generator in Go

r/programming 1w

Froot Loops and a graphics card: reflecting on twenty years of programming

On a 2006 email from my dad's colleague, the parser I wrote in response, and what I got paid in.

r/programming 1w

Be careful with your Git: Investigating malware spreading through Git repositories

How a fake LinkedIn recruiter used a Google Drive Git repo, malicious hooks, and obfuscated JavaScript malware to compromise developers and steal files.

r/MachineLearning 1w

I tested reasoning models on the problems where surface-level thinking fails — AIME, proof sketches, and "why does this code have a subtle off-by-one", [D]

I've been running a somewhat unusual benchmark suite. Not the standard automated ones — I've been feeding different reasoning models a collection of ~120 problems that I've personally verified require "deep reasoning" rather than pattern matching. The mix: ~40 AIME-style competition math, ~30 GPQA-level scientific reasoning, ~25 ARC-style abstract reasoning, and ~25 "real world" problems (sub

r/coolgithubprojects 1w

RustNet Cross-platform network monitoring TUI with eBPF process attribution

simon willison 1w

Learning on the Shop floor

Tobias Lütke describes Shopify's internal coding agent tool, River, which operates entirely in public on their Slack: River does not respond to direct messages. She politely declines and suggests to …

r/MachineLearning 1w

Interactive Jensen–Shannon Divergence Visualisation [P]

An interactive visualisation of Jensen–Shannon divergence - the symmetric, always-finite cousin of KL. Shape two distributions and watch JSD, its ceiling of one bit, and the per-point contribution respond in real time. https://robotchinwag.com/posts/jensen-shannon-divergence-visualisation/ Feedback welcome.

r/programming 1w

8317277: Java language implementation of value classes and objects by MrSimms · Pull Request #31120 · openjdk/jdk

This pull request implements the first preview of JEP 401: Value Classes and Objects: JDK-8317277: Java language implementation of value classes and objects 8317277: Java language implementation ...

r/programming 1w

7 lines of code, 3 minutes: Implement a programming language

github trending 1w

AUTOMATIC1111/stable-diffusion-webui — Stable Diffusion web UI

github trending 1w

rasbt/LLMs-from-scratch — Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

github trending 1w

Lordog/dive-into-llms — "Dive into LLMs" (动手学大模型) hands-on programming tutorial series

github trending 1w

millionco/react-doctor — Your agent writes bad React. This catches it

github trending 1w

playcanvas/supersplat — 3D Gaussian Splat Editor

github trending 1w

bytedance/UI-TARS-desktop — The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

r/coolgithubprojects 1w

I built a complete IR-free self-hosting x86-64 toolchain from scratch (compiler + assembler + linker)

r/programming 1w

Roc & Zig: A Compiler Rewrite Story • Anjana Vakil & Richard Feldman

This interview was recorded for GOTO Unscripted. #GOTOcon #GOTOunscripted https://gotopia.tech Richard Feldman - Software Engineer at Zed Industries & Author o...

r/coolgithubprojects 1w

layercache: Unified multi-layer caching for Node.js with memory, Redis, stampede prevention, and invalidation helpers.

Unified multi-layer caching for Node.js with memory, Redis, stampede prevention, and invalidation helpers. - flyingsquirrel0419/layercache

r/programming 1w

Branch-Avoidant Quicksort in C - faster than std::sort and pdqsort

r/LocalLLaMA 1w

PSA: Watch out for extra spaces in chat-template-kwargs when using Qwen3.6 with llama-server

Hey folks, just a heads-up for anyone running Qwen3.6 through `llama-server`. I ran into an issue where the `preserve_thinking` parameter wasn't working as expected, even though I had it explicitly enabled in my `models.ini` config. After some digging, I found that **extra spaces in the JSON string are breaking the parser** for this specific parameter in my build. ❌ **Does NOT work:** `chat-te

r/MachineLearning 1w

V-JEPA 2.1's dense features are partitioned: a robustness study across all four model sizes [R]

I ran a pre-registered robustness study on Meta's V-JEPA 2.1 across all four released model sizes (80M → 2B). 322-cell sweep. Three findings worth flagging: **1. Dense features are partitioned.** M2 (representational drift between clean and perturbed clips, measured as cosine distance on temporal-gradient vectors) predicts downstream task failure on DAVIS for temporal corruption (frame drops r=0.3

r/programming 1w

CUDA Proves Nvidia Is a Software Company

There’s a deep, forbidding moat that surrounds Nvidia—and it has nothing to do with hardware.

github / Imbad0202 1w

academic research skills for claude code

dropping this in the team channel monday. good template for building our own skill packs.

openai developers 1w

codex notes for long-running engineering tasks

the section on context retention across hours is the only part that matters. read that — skip the rest until you need it…

r/programming 1w

cursor + agent IDE workflows from active teams

less hype than usual. real patterns from teams shipping with cursor — migrations, reviews, the boring stuff.

github / trending 1w

a practical MCP server template for internal tools

small, readable, doesn't try to be a framework. clone this instead of starting fresh on your next agent integration…

vercel 1w

Automate progressive rollouts with Vercel Flags

You can now automate time-based traffic movement for Vercel Flags using progressive rollouts. Unlike weighted splits which maintain stable traffic distribution for experiments, progressive rollouts automatically advance through a predefined...

r/coolgithubprojects 1w

Open-sourced our MCP server for GPU workload execution looking for feedback

r/LocalLLaMA 1w

ExLlamaV3 Major Updates!

r/LocalLLaMA 1w

Markdown browser for LLMs

I built a markdown web renderer for AI agents. Instead of taking expensive screenshots and piping them through vision models, TextWeb renders web pages as markdown that LLMs can reason about natively. Full JavaScript execution, interactive elements annotated. It provides a CLI and an MCP server. You can find it here: https://github.com/woheller69/textweb

r/coolgithubprojects 1w

I made a local real-time webcam stream instruct editor with Flux.2-Klein model and bunch of custom optimizations.

g5t.de 1w

task paralysis and AI

felt seen. short, sharp essay on planning-heavy work and where AI quietly hurts.

r/programming 1w

The Bottom-up Building of a Language for Subleq with Text Macros

vercel 1w

Vercel Sandbox firewall now supports request proxying and filtering

The Vercel Sandbox firewall now supports forwarding specific requests to a proxy. This can be useful to log, debug, or transform HTTP requests or responses initiated from a sandbox.

r/MachineLearning 1w

Why is human LLM annotation so expensive? [D]

Scale AI and similar services charge a lot for annotation. MTurk is cheap but the quality is horrible for anything requiring real domain understanding. For small teams that need a few thousand labeled examples to calibrate their evals or fine tune a model, there seems to be no good middle ground. How is everyone handling this? Are you doing it manually or has anyone found something that actually

google developers 1w

gemini file search is now multimodal — RAG over images, finally.

quietly the most useful gemini update in months. if you ship doc-heavy products, this is the one that matters.

10 may 8
product hunt — devtools 1w

CacheTray

r/coolgithubprojects 1w

Built a free, open-source Postgres desktop client in Rust + Tauri — no cloud, no telemetry, just raw speed

simon willison 1w

Quoting Andrew Quinn

One could say in the first quarter-century of my life, that while I was always fascinated by programming, I could never overcome the guilt of not really knowing whether the …

jarred sumner — x.com 1w

bun's experimental rust rewrite quietly hits 99.8% test compatibility

this is the actual headline of the week. ignore the model news for a second — if this lands, the JS toolchain conversation shifts for everyone. vite, esbuild, node-as-default.

vercel 1w

How Superset built the IDE for AI agents on Vercel

How Superset built the IDE for AI coding agents on Vercel, running a dozen parallel agents per developer and 600 preview deployments a day.

timothy gowers — wordpress 1w

a recent experience with ChatGPT 5.5 pro

the most honest model write-up i've read this year. not hype, not doom — a working mathematician walking through brilliance and breakage…

arxiv 1w

"LLMs corrupt your documents when you delegate"

uncomfortable read if you're shipping AI-edit features. the failure mode is subtle and consistent across model families…

codex releases 1w

openai/codex rust-v0.131.0-alpha.5

Release 0.131.0-alpha.5
