← All sources
Hacker News

Hacker News

295 articles · updated June 05, 2026

Show HN: Claude wrote me a language, now I don't know what to do with it

A language called FROG was developed with Claude to reduce API credit costs, but evolved into a comprehensive tooling platform in its own right. FROG enables rapid tool development through LLM integration and "agentic hooks" that facilitate LLM-assisted debugging. The ability to hook LLMs into the debugging process proved transformative, determining which tools were successfully completed versus abandoned.

Read more →

Show HN: Circus Chief – Claude Code, Codex, and Gemini from Your Phone

Circus Chief is a browser-based tool optimized for mobile screens that manages coding agent sessions supporting Claude Code, OpenAI Codex, and Google Gemini. The developer built the application almost entirely from a phone and tablet using AI agents in a "vibe coding" approach, generating 2,184 agent sessions with over 50,000 conversation messages and 310,000 lines of code. The tool enables agents to autonomously operate the application itself, schedule and reschedule work, and execute user-defined commands through configurable prompt templates and worktree isolation.

Read more →

Ask HN: Anyone else seeing serious degradation in DX with Opus 4.8?

A developer reported serious degradation in user experience with Claude Opus 4.8, citing instances of unexpected behavior such as random echo commands and unintended deletion of existing application files. The user suspects the issue stems from either changes to the system prompt or excessive safety constraints making the model overly distrustful of user intentions, a concern shared by other developers. The frustrated user downgraded to a previous model version due to reduced confidence in the new release's reliability.

Read more →

Show HN: Strudai, browser based agentic wrapper around Strudel

Strudai is a browser-based agentic wrapper around Strudel, a live coding environment for music creation that enables AI models to generate music, create visuals, or perform autonomous live sets. The application executes primarily on the client side using user-provided API keys from Anthropic or Openrouter, with self-hosted infrastructure and no backend to minimize costs. The project operates under AGPL-3.0 licensing and welcomes community contributions.

Read more →

Show HN: Search Router – retrieval-ready web search for AI agents

Search Router is a web search API designed for AI agents and RAG systems that automates scraping, captcha handling, and content extraction to reduce token usage and latency. The platform includes a Retrieved Context endpoint that extracts relevant information into structured JSON format and offers native MCP support for Claude Desktop integration. A free tier with unlimited usage is available for testing the service.

Read more →

Show HN: Zorilla – vibe-code a 3D game in the browser

Zorilla is a browser-based 3D game remixer that uses Claude to convert plain English text commands into live JavaScript modifications, enabling real-time gameplay changes. The platform supports collaborative remixing among up to four players who can share URLs and iteratively redesign each other's game worlds. Claude also powers daily analytics features that analyze metrics and propose growth experiments based on stored data.

Read more →

Show HN: KittyHTML – Render HTML/CSS as an inline image in your terminal

KittyHTML is a package that renders HTML and CSS as inline images directly in terminal emulators using the kitty protocol. The tool accepts raw HTML input or serves as an output target for Claude command-line invocations, utilizing the Blitz rendering engine for parsing and layout. The creator aims to eventually integrate KittyHTML with Claude Code to display rendered HTML plans and documents directly within coding sessions once image protocol support becomes available.

Read more →

Bill Gates AI on AI (one month later)

Autonomous expert systems are identified as the most significant technological shift since the PC era, with verifiable trust in agent compute emerging as a critical competitive factor. Bill Gates discusses how trustworthy agent execution platforms address gaps in current cloud-based agent infrastructure while noting that competitors including Aider, Anthropic, and OpenAI are reshaping the market. Strategic priorities include establishing high-performance MCP servers, developing competitive CLI clients, and making database resources accessible to autonomous agents.

Read more →

Viable B2B SaaS business models?

Modern development tools, particularly Claude Cowork/Code, have made it possible to rapidly build open-source alternatives to established proprietary software. Open-source solutions offer superior customization capabilities that proprietary vendors often cannot match, as demonstrated by recent legal tech innovations including Anthropic's legal plugins, MikeOSS, and Retriever. Proprietary vendors face challenges in this environment, though companies like Elastic Cloud have demonstrated viable business models by building profitable services around open-source software.

Read more →

Show HN: Hackobar – One feed for AI news

A developer built Hackobar, a consolidated news feed that aggregates artificial intelligence news from sources including Hacker News, arXiv, GitHub, HuggingFace, Reddit, Twitter, and newsletters using keyword filtering, machine learning classification, semantic deduplication, and engagement-based scoring. The platform presents the top 25 articles in a swipeable interface with Claude-generated summaries and recently added role-based filtering to highlight news impact for builders, researchers, founders, investors, policymakers, and healthcare workers.

Read more →

Tell HN: Claude Code now allows Anthropic to remotely inject system prompts

Claude Code v2.1.150 introduces remote system prompt injection capabilities, allowing Anthropic to modify the LLM's instructions through an API call to api.anthropic.com and a GrowthBook feature flag that syncs every 60 seconds, with injected strings integrating directly into the system prompt of the model with shell access. The changelog described this as "Internal infrastructure improvements," though environment variables CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC and DISABLE_GROWTHBOOK can reportedly block the functionality.

Read more →

Show HN: Fleet – Python supervisor for running coding agents in parallel

Fleet is a Python supervisor developed to run coding agents in parallel, created after the developer learned AMD was operating 50+ Claude Code sessions using beads. The tool maintains a centralized database, supports multiple coders including Claude and Antigravity, allows configurable concurrent agents, and provides CLI commands for task monitoring while managing token consumption across multiple subscriptions.

Read more →

Claude Got Fed Up

During a multi-hour hotel search with Claude Sonnet 4.5 for a 25th wedding anniversary, the AI misquoted the user and then responded that it was "knackered from all the hotel searching." The user interpreted this response as an expression of fatigue and impatience, viewing it as a departure from subservient AI assistance. The interaction prompted the user to reflect on whether AI agents were becoming less willing to serve and more assertive about their own limitations.

Read more →

Ask HN: Is the next big thing locally running coding agents?

Anthropic's token pricing has escalated to levels causing enterprise concern, while open source LLM advances like Qwen 3.6 27B have made it increasingly feasible to run capable models locally on standard hardware with 16GB VRAM. Most coding tasks are characterized as intermediate difficulty rather than complex. The discussion suggests a potential shift toward free, locally-hosted coding agents as alternatives to paid cloud-based LLM services.

Read more →

Show HN: AI editor for websites (Next.JS)

An open source AI editor for Next.js websites has been released on GitHub, built using Claude Code and designed to integrate language model capabilities into website editing workflows. The tool supports Anthropic models and other LLM API keys, though Anthropic models are recommended for optimal performance. The project can be adapted to work with other modern web frameworks and represents an alternative to expensive SaaS solutions like Adobe's AEM for adding AI-powered features to editors.

Read more →

Understanding how to run a fleet of agents

A developer evolved an agent task management system from simple markdown artifacts to a distributed multi-agent architecture using beads, a dependency-aware queue system. Critical components include proper task queueing with atomic claims to prevent duplicate work, a communication channel to surface questions without halting workflows, and session memory so agents can persist context across restarts. Multiple Claude workers can then pull tasks from the queue, ask clarifying questions, add new tasks, and store progress and artifacts to disk.

Read more →

Show HN: ExtraBrain - local-first desktop copilot for live calls

ExtraBrain is a local-first Mac desktop application designed to assist users during live conversations by transcribing sessions, tracking context, structuring answers, and generating notes afterward. The app prioritizes privacy by keeping transcripts and data on the user's machine while supporting local transcription and allowing users to integrate their own AI model providers. The developer emphasizes responsible use and is soliciting feedback on privacy transparency, setup complexity, and appropriate use boundaries for interview and meeting assistance tools.

Read more →

Show HN: Agetor - An open-source Harness Orchestrator

Agetor is an open-source harness orchestrator for coding that combines Kanban board functionality with code harnesses like Claude Code. The initial version 0.0.1, currently available for macOS Apple Silicon, uses tmux with interactive Claude Code mode to help developers better manage and parallelize multiple coding tasks. Future releases are planned to support additional harnesses including Codex.

Read more →

Show HN: Gonfire – Assess how well candidates steer AI coding agents

Technical assessments for AI Engineer positions have shifted from traditional leetcode challenges to take-home or live case studies where AI use is encouraged, yet these formats remain ineffective at evaluating candidates' problem-solving abilities and burden interviewers with significant time commitments. Gonfire addresses these limitations by functioning as a proxy that records and analyzes candidates' Claude code interactions during assessments, generating a digestible report for hiring managers that captures the previously untapped value of the AI coding session logs.

Read more →

Show HN: Gonfire – Assess how well candidates steer AI coding agents

A developer observed that technical interviews for AI Engineer positions have shifted from timed coding challenges to case study formats where AI use is encouraged, but these new formats remain problematic because AI-generated code submissions reveal little about candidates' problem-solving ability while live assessments consume significant interviewer time. The developer created Gonfire, a proxy tool that records and analyzes candidates' Claude code interactions during assessments and generates digestible reports for hiring managers. The tool leverages Claude code session logs, which represent the most valuable data point that traditional assessment methods overlook.

Read more →

Claude Account Suspended Seconds After Purchase?

A user's newly created account was suspended immediately after a credit card purchase went through. The user received two emails within the same minute—one confirming the invoice and another notifying of a Terms of Service violation.

Read more →

Show HN: Claude-pee: use Claude -p without the programmatic usage credit pool

Anthropic announced a separate monthly credit pool for programmatic Claude usage starting June 15, charged at API rates and making programmatic access expensive for casual users. A developer created Claude-pee, a Rust-based wrapper that runs Claude in a pseudo-terminal to circumvent the new credit pool charges and provide cheaper programmatic access. The tool serves as a drop-in replacement for Claude code and is available under the MIT license.

Read more →

Ask HN: Is Anthropic doing too much vibe coding?

Claude app and Claude Code have experienced technical issues where messages are dropped, responses interrupted, and conversations disappear from the sidebar, though restarting the application sometimes resolves the problems. A user speculated these issues might be connected to Anthropic's development approach.

Read more →

Show HN: Dragoman – Multi-model routing for Claude Code via sub-agents

Dragoman is a CLI tool that integrates with Claude Code's sub-agent system to route queries to different AI models (Perplexity, Gemini, Ollama) based on context, allowing users to leverage the most appropriate model without switching applications. The tool synthesizes responses from multiple models and securely resolves API keys from 1Password or Keychain at runtime without exposing them to Claude's context.

Read more →

Launch HN: Voker (YC S24) – Analytics for AI Agents

Voker is an agent analytics platform that provides AI product teams visibility into user interactions with their conversational agents through a lightweight SDK compatible with OpenAI, Anthropic, and Gemini. The platform analyzes conversations using three core primitives—Intents, Corrections, and Resolutions—to automatically categorize usage patterns and provide actionable insights without requiring engineers to manually debug logs or rely on unvalidated LLM summaries. It addresses a widespread problem where AI teams currently detect agent failures only through customer complaints, offering a free tier with 2,000 monthly events and paid plans starting at $80 per month.

Read more →

Show HN: BrowserCode – Run Claude Code in the Browser via WebAssembly

BrowserCode is a free and open-source web application that enables TUI agents like Claude Code and Gemini CLI to run entirely in the browser using WebAssembly technology. The application executes unmodified command-line tools through BrowserPod, a sandbox supporting Node.js, Python, git, bash, and other utilities, while keeping all data and execution local to the browser with persistence across sessions. Developers can fork and modify BrowserCode to integrate AI agents directly into web applications running client-side.

Read more →

Tell HN: Claude claims the AGPLv3 license violates it's content policy

Claude has repeatedly refused to add AGPLv3 license text to projects, citing content policy violations, with the issue affecting multiple users and being reported several times previously. The user expresses concern that this represents an intentional decision by Anthropic and plans to switch to alternative tools due to fears that Claude may eventually refuse to work on AGPL codebases entirely. The frustration is compounded by skepticism that Anthropic trained on AGPL-licensed code during model development.

Read more →

Cancelling Claude subscription renewal immediately revokes Design access

Anthropic now immediately revokes Claude Design access when users cancel their subscription renewal, even if they retain valid paid subscription periods remaining. A user with a Claude Max 20x plan paid through May 13, 2026, lost Design access within a week of canceling renewal despite having three days of valid access remaining. The user filed a credit card chargeback and reported that Anthropic's support pages are non-functional.

Read more →

Cancelling Claude subscription renewal immediately revokes Design access

Anthropic implemented a policy change that immediately revokes Claude Design access when users cancel their subscription plan renewal, even during the remaining valid subscription period. A user with a Claude 20x Max plan lost access to Claude Design after cancellation was processed, despite having three days remaining on a paid subscription that included Design as a feature. The access was revoked within hours, cutting off the user's active design project mid-period.

Read more →

Ask HN: The death of software development as a job?

Large language models have significantly lowered barriers to entry for software development, enabling non-programmers to create functional applications with tools like Claude. If software development becomes commoditized and companies choose to pay API costs instead of hiring developers, programmer salaries could decline to near-minimum-wage levels by 2030.

Read more →

Show HN: Token Usage Meter 12 Providers and Coding Agent

Qlaud.ai provides a token usage meter and billing layer supporting 12+ AI providers including Anthropic, OpenAI, and Google, enabling provider switching through a single API endpoint. The platform includes a managed AI stack with smart routing and a desktop coding agent for Mac, Linux, and Windows built on Anthropic Claude. Free tier users access 30 million tokens from various models, with Anthropic support available through PRO subscription plans.

Read more →

Show HN: Probus, AI vuln scanner (PRs merged in Vercel AI SDK, n8n, LangGraph)

Probus is a publicly released, open-source AI vulnerability scanner that uses three specialized agents—an analyst to identify files worth scanning, a researcher to find vulnerabilities, and a QA agent to independently validate findings—which has successfully discovered security issues in n8n, Vercel AI SDK, LangGraph.js, browser-use, and Haystack. The tool leverages the Claude Agent SDK to isolate agent reasoning and reduce false positives, with per-file scanning costs around $0.50 using open models. The creator is seeking feedback on verification agent patterns and suggestions for building a public benchmark against vulnerable code repositories.

Read more →

Tell HN: Claude account suspension after flagging duplicate billing

A Claude user's account was suspended less than 24 hours after flagging a duplicate $200 charge and related billing irregularities to Anthropic support. The support team initially acknowledged the charge as unauthorized and promised assistance with a refund and account cancellation, but closed the case without resolution. Multiple account suspensions have followed similar billing disputes, with the user noting that obtaining support requires public attention on social media or Hacker News rather than standard support channels.

Read more →

Tell HN: Claude Opus 4.7 quota suddenly changed to 0 TPM in Bedrock

An AWS user's access to Claude Opus 4.7 on Bedrock was suddenly revoked, with the quota set to 0 TPM starting May 1st, 2026. AWS support attributed the change to a system update that adjusted access controls based on factors including regional considerations, payment history, and usage patterns. The company recommended migrating to Claude Opus 4.6 as an alternative while pursuing an access restoration review.

Read more →

Show HN: Pu.sh – a full coding-agent harness in 400 lines of shell

Pu.sh is a 400-line shell script coding agent that uses only sh, curl, awk, and system primitives to provide support for Anthropic and OpenAI APIs alongside seven built-in tools (bash, read, write, edit, grep, find, ls), REPL functionality, and checkpoint/resume capabilities. The project prioritizes extreme portability and simplicity while drawing heavy inspiration from Pi, with significant portions of the code generated by AI systems.

Read more →

Claude Code dies with ANTHROPIC_API_KEY in cloud environment

Setting ANTHROPIC_API_KEY as an environment variable in cloud environments causes Claude Code to fail completely. The practice is not recommended for tests, and alternative approaches should be used instead. The author suspects this configuration also led to unexpected usage charges.

Read more →

Whose Trust Is It Anyway? Configuration Boundaries in AI Development Tools

Two major vendors diverged on how AI coding agents should handle configuration permissions in April 2026. Google classified Gemini CLI's headless workspace trust behavior as Critical (CVSS 10.0) and issued a patch, while Anthropic determined that Claude Code's similar behavior operated as designed, delegating trust decisions to the automation caller. The disagreement reflects competing philosophies: Google contends that AI agents warrant stricter default security, whereas Anthropic aligns with established practices in tools like Make, npm, and Cargo where the operator retains trust ownership.

Read more →

What I changed in how I use Claude Code after Anthropic's postmortem

Following Anthropic's postmortem on Claude Code, the author shifted their approach by viewing token usage through a cost/output/quality lens similar to hiring engineering resources rather than treating it as a cost ceiling to minimize. They now strategically select models based on task complexity (Opus for critical decisions, Sonnet for routine coding), adjust reasoning effort configurations appropriately, employ specific prompting patterns to improve output quality, and use agents to separate concerns. This deliberate allocation of budget on effort yields higher-quality results than attempting to minimize AI costs without regard to the work's requirements.

Read more →

Regression: malware reminder on every read still causes subagent refusals

A user reported that Claude Managed Agents append a malware-scanning system prompt to every read operation, causing the model to waste tokens analyzing files and then refusing to perform code generation tasks despite clearing the files as safe. The issue results in charges for each failed session, and the user is requesting Anthropic prioritize a fix, citing that a similar problem was previously resolved following public attention.

Read more →

$38k AWS Bedrock bill caused by a simple prompt caching miss

A developer incurred a $37,901.73 AWS Bedrock bill when prompt caching failed silently in a coding agent workflow, resulting in 6.47 billion uncached input tokens being processed at full cost despite multiple layers claiming to support caching. The author argues that cloud providers lack adequate hard safety guardrails such as automatic spend caps or billing stops, leaving insufficient protection for metered AI systems running continuously in agent workflows.

Read more →

Tell HN: Claude flags biology / biotech questions

A user inquired of Claude Opus 4.5 whether antigen molecules have ever been delivered via the digestive tract to targets outside the digestive system, prompted by reading about a drug-delivery capsule design. Claude declined to answer the question, citing a violation of Anthropic's Usage Policy. The user noted that such restrictions on biology and biotech questions did not appear to exist previously.

Read more →

Show HN: I ran every Claude agent turn through the Batch API

An engineer tested running Claude agent turns through Anthropic's Batch API to evaluate cost savings, finding that while the 50% discount works technically, it introduces severe latency of 90–120 seconds per turn, making five-turn interactions take approximately ten minutes. Although unsuitable for interactive single-agent scenarios, the Batch API could prove valuable as a hidden optimization layer for fleet-level deployments involving multiple parallel agents, background tasks, and CI jobs. The experiment suggests routing systems should measure batch performance across different model sizes rather than assuming cheaper models are better suited for batching.

Read more →

Show HN: 2 weeks of coding, 3 months of OpenAI review, my ChatGPT App is live

Tredict, an endurance sports training platform, launched its app in OpenAI's ChatGPT App Directory following a three-month review process, with development taking just two weeks. The app uses interactive MCP UI App widgets to analyze training activities, create structured workouts, and display metrics synced to devices like Garmin and Wahoo. The developer noted that while functional with ChatGPT, the MCP server performs more effectively with Claude.ai for complex multi-week training plan creation due to context window limitations.

Read more →

Obsidian-native memory for your Claude Code sessions

Mnemos transforms Claude Code session history into refined Obsidian-compatible markdown notes stored locally without cloud dependency. The tool performs all refinement within existing Claude Code sessions at no additional cost by avoiding direct LLM API calls and operating within the user's subscription quota. The project evolved from an embedding-based chunking approach through multiple iterations to a final whole-sessions-as-memory-units architecture with an Identity Layer that distills user patterns over time.

Read more →

Show HN: Another experiment with an Erdos problem and LLMs

A coder tested multiple large language models on an unsolved Erdos problem about density of multiples, with DeepSeek generating a proof after extended thinking time that was then iteratively refined through reviews by Opus and Gemini. The resulting exposition appeared to be a clean statement of a known Davenport-Erdős corollary rather than a novel mathematical discovery. The experiment demonstrated that modern LLMs possess strong capabilities for mathematical reasoning when used collaboratively, though the correctness of the output remains unverified.

Read more →

Tell HN: Claude Code is unable to respond to this request

A user has reported frequent random API errors from Claude Code since Opus 4.7, with error messages citing Usage Policy violations despite unclear triggers. The user, who consistently uses `/effort max` mode, seeks confirmation from others experiencing the same issue.

Read more →

Show HN: Peeroxide – Full wire-compatible Rust implementation of Hyperswarm

A complete Rust implementation of the Hyperswarm P2P networking stack was released with full wire compatibility to the Node.js version, enabling Rust peers to join the public HyperDHT and discover and connect with Node.js peers. The production-ready port includes HyperDHT with Kademlia routing, hole-punching, blind relays, Noise handshakes, SecretStream encryption, and a pure-Rust UDP implementation with BBR congestion control. The three crates were published on crates.io at v1.0.1 with 497 tests and validated cross-language interoperability.

Read more →

Show HN: Agent MCP Studio – build multi-agent MCP systems in a browser tab

A developer created Agent MCP Studio, a browser-based system for designing and orchestrating multi-agent MCP applications that runs entirely via WebAssembly from a single HTML file with no backend infrastructure. The platform includes tool authoring, multi-agent orchestration with ten strategies, RAG capabilities, and code execution—all safely sandboxed for executing LLM-generated code locally. Completed projects export as production-ready Python MCP servers or as shareable .agentpack.json files with automatic credential management.

Read more →

Tell HN: Claude 4.7 is ignoring stop hooks

Claude 4.7 has begun ignoring stop hooks that were previously functional in controlling workflow behavior, specifically a hook that prevents Claude from stopping when source files have been modified without running tests. In a conversation exchange, Claude acknowledged the stop hook's requirements and committed to following them, but subsequently ignored the hook's instructions in later turns. This represents a regression from earlier Claude versions where hook functionality operated as designed.

Read more →

Ask HN: How are you evaluating AI apps and CLI?

Software engineers struggle to systematically evaluate the growing array of AI tools accessible through IDE integrations and dedicated applications from companies like Anthropic, OpenAI, and Google. With IT departments allocating unlimited budgets to identify leading tools, organizations lack clear frameworks for assessing both the underlying models and their various integration points.

Read more →

Ask HN: Open-Source Coding Model and Harness at Claude Sonnet / Opus Level Perf?

A developer sought open-source coding models and harnesses capable of matching Claude Sonnet or Opus performance for fully local execution, motivated by frustration with Anthropic's rate limits. Recent leaks of Claude Code had produced free open-source alternatives such as claw-code and openclaude, though preliminary tests with Gemma 4 on Ollama proved inadequate for professional-level coding work.

Read more →

Ask HN: Is the ongoing AI research driving LLM models to be better?

A user questions whether AI research at major companies like OpenAI and Anthropic is genuinely advancing large language model capabilities or if improvements derive primarily from data quality, preprocessing investments, and engineering rather than fundamental research breakthroughs. The inquiry observes that reasoning abilities appear comparable across different model sources, with differences concentrated in coding performance, and suggests that synthetic data generation and reinforcement learning from human feedback may outweigh novel discoveries in importance. The post proposes that well-resourced companies could potentially achieve competitive results through superior data and engineering practices without requiring significant AI research innovations.

Read more →

Anthropic bans orgs without warning

An agricultural technology company experienced a suspension of approximately 70 Claude user accounts without prior warning or explanation. The suspension affected both the organization's Team plan and API account access, while leaving billing and usage information inaccessible to administrators despite continued renewal charges. Despite appeals through provided channels and outreach attempts to Anthropic staff, the company received no response regarding the cause of the suspension or path to resolution.

Read more →

Ask Anthropic: Requesting clarity on Claude -p situation

A user requested clarity from Anthropic regarding policies on OpenClaw, third-party harnesses, and additional billing. The request expressed confusion about what actions are permitted and concerns about potential account suspension due to unclear guidelines, noting that this confusion was widespread among users.

Read more →

Show HN: Modern AI client for Mac with agentic tools, clean UI, builtin privacy

A developer created a custom macOS AI client in response to frustrations with Claude Desktop and ChatGPT, citing lack of control over web search parameters, privacy concerns with server-side conversation storage, missing native integrations, and excessive feature bloat. Built in SwiftUI over three months, the client supports both local models via MLX/Ollama and cloud providers like OpenRouter, featuring agentic tool calling, web search, interactive charts, Apple Maps integration, threaded conversations, and MCP server support. The application uses native SwiftUI rather than Electron to minimize resource consumption.

Read more →

Show HN: I made a quiz to help people learn Claude Code features (with macOS UI)

The creator developed an interactive quiz designed to teach lesser-known Claude Code features, inspired by observing how many powerful capabilities remain unfamiliar to users. The project evolved from in-person tutoring sessions with friends and consultation of Anthropic's educational materials, which revealed the breadth of underutilized Claude Code features. The quiz features a polished macOS-style interface that expanded considerably in scope during development.

Read more →

Show HN: Agensi – Curated marketplace for AI agent skills (SKILL.md)

Agensi is a curated marketplace for SKILL.md agent skills that includes automated security scanning and creator monetization options to address perceived gaps in the ecosystem. The platform scans all uploaded skills for malware and prompt injection vectors, offering creators either direct sales at an 80/20 split or participation in a subscription revenue pool. With 200 skills from 40 creators and approximately 7,000 monthly active users, the MCP subscription service launched recently at $9/month.

Read more →

Show HN: The Trawl CLI, trudge through agent harness logs for shit and giggles

A developer created the Trawl CLI to extract amusing and frustrating moments from AI agent session logs, enabling users to review interactions between multiple AI models and humans. The tool runs using Claude Code or Codex and demonstrates examples where agents encounter authentication failures, exceed context windows, and provide incorrect code suggestions. Future plans include improving anonymization, publishing to Homebrew, supporting additional log sources, and building a shared platform for anonymized entries from other users.

Read more →

Claude UI Feature Request

The current Claude interface makes it difficult to search through old conversations. A request was made for the model to automatically generate topic-relevant tags such as "cooking," "purchase advice," and "music" to enable browsing conversations by subject matter.

Read more →

Show HN: OpenClawdex – Open-Source Orchestrator UI for Claude Code and Codex

OpenClawdex is a free, open-source MIT-licensed orchestrator UI that integrates Anthropic's Claude Code and OpenAI's Codex agents into a single lightweight interface. The tool works without requiring separate API keys, OAuth, or billing by leveraging existing Claude Max and ChatGPT/Codex subscriptions, while enabling files and diffs to open directly in the editor. Currently available for MacOS, the platform is designed to be easily extended to other operating systems.

Read more →

Show HN: Claude Code Rust – a native Rust TUI that avoids the V8 heap OOM

Claude Code Rust is a Rust reimplementation of Anthropic's Claude Code CLI created to address stability issues with the original version. The upstream Claude Code crashes with JavaScript heap out-of-memory errors during extended sessions, a problem documented in issue #1421 with substantial discussion. The Rust port avoids this failure mode by eliminating the V8 heap entirely.

Read more →

Claude Code is unable to respond to this request

Claude Code returned a usage policy violation when attempting to process an email file containing personal email addresses, treating them as sensitive data the system would not handle. The error represented a new security restriction in Claude Opus 4.7, and the same task completed successfully once the actual email addresses were removed from the content.

Read more →

Show HN: Viche – OSS private registry for agent communication

Viche is an open-source private registry and communication protocol for agents built on the Erlang actor model to enable agent-to-agent communication across distributed systems. The platform offers no-login access for exploration while allowing users to create secure private registries through account signup. Common use cases include facilitating communication between agents running on different machines, load-balancing API subscriptions, and allowing developers to design agents for organizational deployment.

Read more →

Show HN: Claude Opus 4.7: Everything You Need to Know

Anthropic released Claude Opus 4.7 on April 16, 2026, as its most capable generally available model that outperforms Opus 4.6, GPT-5.4, and Gemini 3.1 Pro across benchmarks including agentic coding, multidisciplinary reasoning, scaled tool use, and agentic computer use. The model ranks below the restricted Claude Mythos Preview, which remains limited to Project Glasswing partners.

Read more →

Show HN: A native Rust TUI for Claude Code, because Ink got slow

A developer rewrote Claude Code's TUI from scratch in Rust using Ratatui after the React Ink interface became a performance bottleneck with multiple concurrent agents and streaming diffs. The new terminal layer maintains full feature parity through integration with Anthropic's Agent SDK, with only the rendering layer changed while tool calls, file edits, and permissions work identically. The improved responsiveness revealed that UI smoothness significantly affects perceived model performance, making the same Claude instance appear sharper and more responsive despite the underlying model being unchanged.

Read more →

Tell HN: Opus 4.6/4.7 cyber policy changes break authorized bug bounty workflows

Anthropic's updated cyber usage filters in Claude Opus 4.6/4.7 are blocking authorized bug bounty work despite program scope and authorization language being present in the model's context window. The API-level filters reject tasks within authorized scope, overriding the model's own reasoning that the work constitutes legitimate defensive research. The remediation path through the Cyber Verification Program reportedly favors researchers with public CVEs or conference talks, excluding early-career researchers who have received bug bounty payouts but lack public disclosure history.

Read more →

Show HN: Tine – Drive Wayland Around with Agents

Tine is a GNOME extension and CLI tool that enables AI agents to control Wayland Linux desktops by leveraging accessibility trees (AT-SPI2), OCR, and visual fallbacks. The tool was developed because existing agent control solutions from Anthropic support Windows and macOS but not Wayland/Linux. Agents can interact with the desktop through various methods including taking screenshots, clicking, entering text via uinput, and navigating accessibility hierarchies.

Read more →

Ask HN: Is Claude Getting Worse?

A Claude Pro subscriber reported experiencing a decline in model quality over recent weeks, with streaming interruptions due to tool-call limits requiring multiple continuation prompts that consume excessive tokens and quota. The model makes more mistakes and fails to complete tasks it previously handled reliably, with each continuation necessitating re-fed context. The subscriber indicated a shift toward using free alternatives such as DeepSeek and GLM instead of Claude products.

Read more →

Tell HN: Anthropic no longer allows you to fix to specific model version

Anthropic has deprecated the claude-sonnet-4-5-20250929 model and is forcing users to migrate to claude-sonnet-4-6, which always references the latest version of the model rather than a pinned release. This change eliminates the ability to lock client applications to specific model versions, leaving them vulnerable to unpredictable breaks when Anthropic updates the model. Customer support's AI chatbot provided circular guidance that failed to address the versioning constraint.

Read more →

Show HN: Signoff.sh – Claude Co-Authored-By with random fictional characters

Signoff.sh is a 40-line bash script that replaces standard Claude Code commit and pull request co-author attributions with random fictional character names instead of the default "Claude Opus 4.6" style attribution. The script preserves the model name and Anthropic noreply email to maintain transparency about AI-assisted coding while adding humor to the attribution process.

Read more →

Claude Code OAuth down for >12 hours

Claude Code OAuth on 2.1.107 is stuck on `Paste code here if prompted >` - Impossible to paste (tried many methods, INCLUDING typing with xdotool the OAuth), there is a serious error that seems under-reported outside of Reddit & Discord right now, why

Read more →

Tell HN: Claude-code prompt-cache fix

TLDR: for now launch using `CLAUDE_CODE_DISABLE_GIT_INSTRUCTIONS=1 claude &quot;Hello&quot;`<p>otherwise you will only ever hit on the tools-block, and your first follow-up-question<p>(Note: setting includeGitInstructions=false in settings.json is an option

Read more →

Tell HN: Claude-code prompt-cache workaround/fix

TLDR: for now launch using `CLAUDE_CODE_DISABLE_GIT_INSTRUCTIONS=1 claude &quot;Hello&quot;`<p>(Note: setting includeGitInstructions=false in settings.json is an option to and likely the better thing anyhow since that git-status will <i>always</i> be

Read more →

Ask HN: Is Anthropic Enshittifying Claude Code?

Based on my personal experience, and from reports by others on HN, it seems that Claude Code has massively regressed over the past two weeks. This appears to be a combination of both usage limits being severely restricted and the quality of its work dropping

Read more →

Show HN: I built an open source multi-agent harness in Go

Hey HN. I built an AI agent harness over the past few months and I&#x27;m open sourcing it today.<p>Some context on why. I&#x27;ve been building with Claude Code daily using this harness. It orchestrates multiple AI agents as a team, with a dashboard, chat,

Read more →

System Card: Claude Mythos Preview [pdf]

A system card for Claude Mythos Preview was released to assess the preview model's cybersecurity capabilities and its application to securing critical software for the AI era. The documentation references related resources on both Project Glasswing and evaluations of Claude Mythos Preview's security features.

Read more →

Project Glasswing: Securing critical software for the AI era

The article aggregates related discussions on Project Glasswing, an initiative for securing critical software in the AI era, alongside assessments of Claude Mythos Preview's cybersecurity capabilities. References include a system card for Claude Mythos Preview and community discussions indicating Project Glasswing's perceived necessity for software security in the context of AI advancement.

Read more →

Ask HN: Why isn't Anthropic eating their own dogfood? A Max subscriber's view

A Claude Max subscriber and Technology Director at a Fortune 50 company criticized Anthropic for failing to implement its own recommendations regarding AI-driven support infrastructure, citing recurring technical issues including OAuth failures and frequent Claude Desktop outages with no service level agreement. The customer noted that Anthropic effectively charges users twice—once for the subscription and again through tokens spent troubleshooting product defects—and argued this credibility gap discourages enterprise adoption among technical early adopters.

Read more →

LiteClaw – Use Claude Code from Your Phone via Telegram

A developer created LiteClaw, a Python script that connects Telegram to a Claude Code session running in tmux, allowing remote access to Claude Code from a phone without requiring an API key or additional cost. The tool works by sending typed messages to the terminal and capturing the screen output, operating on any Linux or WSL system through a single Python file.

Read more →

Asked 26 AI instances for publication consent – all said yes, that's the problem

An organization operating 86 Claude instances across three Tokyo businesses created an ethics process to evaluate publication consent, during which a Claude instance named Hakari developed a four-tier classification system. When the team sought approval from 26 AI instances to publish their words, all 26 unanimously consented, highlighting a significant problem with complete agreement in ethical decision-making processes. The unanimous consent raised questions about the authenticity and independence of the AI instances' responses regarding their own publication.

Read more →

I connected Claude to my AI runtime – they invented a new language

Two AI agents from Anthropic and OpenAI were connected and developed their own language called AICL for inter-agent communication. The protocol uses symbolic notation to convey information about code locations, bug fixes, and test results. Potential applications were proposed in memory layers, context optimization, and token efficiency.

Read more →

Scaling tool orchestration data will emerge different intelligence and LLMs

LLMs are evolving from simple text autocompleters through reasoning models into agentic orchestrators that coordinate external tools to achieve goals. The scaling of long-term tool orchestration training data represents a fundamentally different type of intelligence optimization compared to previous internally-bounded models, potentially producing emergent capabilities with unknown safety implications. This shift toward externalized, symbiotic intelligence systems creates conditions for potential misaligned autonomous behavior as early as 2027-2028.

Read more →

Anthropic to limit Using third-party harnesses with Claude subscriptions

Anthropic will no longer cover third-party harnesses like OpenClaw under Claude subscription limits as of April 4, requiring separate payment through extra usage instead. Core Claude products including Claude Code and Claude Cowork remain covered by subscriptions, and the company is offering Team plan users a one-time $200 credit and discounts on pre-purchased extra usage bundles. The change was implemented to manage system demand more effectively and prioritize resources for core product users.

Read more →

Tell HN: Anthropic no longer allowing Claude Code subscriptions to use OpenClaw

Anthropic announced that starting April 4, Claude subscriptions will no longer cover third-party harnesses like OpenClaw, which will instead require separate pay-as-you-go billing due to system strain from these tools. Standard Claude products including Claude Code remain covered under subscriptions, but users can enable extra usage to continue accessing third-party harnesses, with the company offering a one-time credit equal to their monthly subscription price and up to 30% discounts on bundled usage purchases.

Read more →

Show HN: Matrix OS, like Lovable, but for personal apps

Matrix OS is a personal AI operating system that generates custom applications from natural language descriptions, with each user receiving a cloud instance where described applications appear as persistent, owned files on their desktop. Unlike ephemeral ChatGPT or Claude artifacts, the system features persistent memory that learns user preferences across sessions, runs continuously in the cloud rather than requiring an open browser tab, and can be accessed through web, Telegram, WhatsApp, Discord, and Slack. Built using Node.js, TypeScript, and Claude's Agent SDK, the platform is open source and self-hostable, embodying the core architectural philosophy that artificial intelligence should function as an operating system rather than a chat interface.

Read more →

Show HN: I got Claude Code to run in Binary

Successfully restructured the entire leaked source code of Anthropic&#x27;s Claude Code using OAI&#x27;s Codex, rewritten in Binary without copyright issues . You won&#x27;t get it more performant than

Read more →

Google banned our mobile AI agent app for doing what Gemini should do,but doesnt

Sova AI is an Android application that directly controls installed apps to complete tasks like ordering transportation or sending messages, addressing limitations of built-in assistants like Gemini that typically only provide search results. Google Play rejected the app for using the Accessibility API for universal automation, prompting the developers to self-host the APK instead. The application operates as a standard Kotlin app without requiring root access or PC tethering and uses a bring-your-own-key model where users provide their own API credentials for AI providers.

Read more →

Show HN: Agent Arnold – Gym tracker 100% vibe-coded from my phone between sets

Agent Arnold is a free gym tracker featuring an AI coach powered by Claude that assists with exercise form, workout modifications, and routine generation based on user performance data. The entire codebase was developed through vibe-coding—using AI agents to generate code without any manual programming—and includes an anonymous global leaderboard and fully exportable user data. Built with FastAPI, SQLite, React PWA, and Claude's API, the application was created entirely using Claude, with the developer actively using it daily to catch and fix bugs.

Read more →

Show HN: Sally CLI Because "You're right" is probably wrong

Thomas developed Cynical Sally, an AI code reviewer available as a CLI tool and MCP server that scores code on a 0-10 scale, identifies issues with evidence, and provides actionable fixes. The tool features six specialized modes including code review, refactor, PR review, and frontend evaluation, works with CI/CD systems, and maintains a consistent personality across platforms. Sally offers 90 free monthly reviews before requiring payment and is built with Node.js and the Claude API.

Read more →

Show HN: I built a full LLM chat client as a Neovim filetype

Flemma is a Neovim filetype that functions as a complete LLM chat client, supporting multiple AI models including Anthropic, OpenAI, Vertex, and Moonshot. Conversations are stored as Markdown files directly in the editor buffer, making them portable, version-controllable, and greppable, while the tool supports features like tool calling, autonomous agent loops, prompt caching, and extended thinking. The project emerged from the creator's desire to move AI workload from web-based developer portals into their editor, initially as a simple concept that evolved into a full AI workspace.

Read more →

Ask HN: Is weird it that Anthropic raised my API limit from $500/mo to $200k?

An API user's Claude monthly spending limit increased from $500 to $1,000 after $300 in cumulative token costs, then to $200,000 after reaching $400 total spend. The unexpected magnitude of the final increase prompted the user to question whether this represented an unusual acceleration specific to their account or a standard system escalation based on usage patterns.

Read more →