Google banned our mobile AI agent app for doing what Gemini should do,but doesnt

Sova AI is an Android application that directly controls installed apps to complete tasks like ordering transportation or sending messages, addressing limitations of built-in assistants like Gemini that typically only provide search results. Google Play rejected the app for using the Accessibility API for universal automation, prompting the developers to self-host the APK instead. The application operates as a standard Kotlin app without requiring root access or PC tethering and uses a bring-your-own-key model where users provide their own API credentials for AI providers.

Detailed Analysis

Sova AI, an Android application developed by two independent developers, has been banned from the Google Play Store after Google rejected its submission for leveraging the Android Accessibility API to enable "universal automation" — the ability to map, click, scroll, and type within other installed applications without requiring root access, ADB connections, or third-party frameworks. The app, distributed freely on a bring-your-own-key (BYOK) model, allows users to issue natural language commands — such as ordering an Uber or sending a Telegram message — and have the agent execute those tasks end-to-end by reading the device's UI node tree and simulating human interaction. The developers support a range of AI backends including OpenAI, Anthropic's Claude, Google's Gemini, and DeepSeek, and are pursuing further support for locally-hosted models via Ollama and LM Studio. With Google Play distribution blocked, Sova is now offered as a sideloaded APK directly from the developers' website.

The central irony the developers highlight is a pointed one: Google banned Sova for doing precisely what Gemini — Google's own flagship AI assistant, deeply integrated into Android at the OS level — promises but consistently fails to deliver. When users ask Gemini to perform agentic tasks like booking a ride or sending a message through a third-party application, the assistant typically responds with web search results or a prompt to open the app manually, stopping well short of task execution. This gap between Gemini's marketed capabilities and its actual on-device agentic performance is not merely a perception issue; it reflects a structural tension in how platform-controlling companies approach first-party versus third-party automation. Google's Play Store policy restricts Accessibility API use to assistive purposes, yet its own assistant occupies that same agentic space — creating a regulatory asymmetry that effectively shields Gemini from direct competition on Android.

This episode underscores a broader and increasingly visible divergence in agentic AI strategy among major providers. Anthropic's Claude, one of the AI backends Sova supports, has drawn favorable comparisons to Gemini in precisely the dimensions most relevant to agentic execution: instruction fidelity, multi-step reasoning, and reliable output structuring. Independent benchmarks and professional assessments have noted Claude's advantages on tasks requiring sustained context handling, code generation with tool integration, and transparent chain-of-thought reasoning — qualities that translate directly into more reliable agentic behavior when an AI must navigate dynamic UI states across thousands of device configurations. Google's Gemini, by contrast, draws its competitive strength from native multimodal processing, ecosystem integration with Google Workspace, and speed — capabilities that favor ambient, conversational use cases over deep task execution.

The technical challenges Sova's developers describe illuminate why mobile AI agency remains an unsolved problem even for well-resourced incumbents. Translating LLM output into accurate X/Y coordinates on dynamically rendered Android screens — across a fragmented ecosystem of device resolutions, manufacturer UI skins, and app-level rendering behaviors — represents a significant engineering obstacle. The developers acknowledge their solution is imperfect, framing the public APK release as an open invitation to stress-test edge cases. This approach, combining BYOK economics with community-driven adversarial testing, reflects a product philosophy that prioritizes functional deployment over polished certification — a contrast to the cautious, policy-compliant rollout strategies of major AI labs.

The Sova ban sits within a larger pattern of platform gatekeeping emerging as agentic AI capabilities mature. As AI agents move from answering questions to executing real-world tasks within existing application ecosystems, they inevitably brush against the distribution and API policies of the platforms those apps inhabit. Google's rejection of Sova is unlikely to be an isolated incident; similar tensions are likely to arise as other developers attempt to build cross-app automation layers on iOS and Android. The episode raises unresolved questions about whether the dominant mobile platform owners will ultimately build genuinely capable agentic assistants themselves, acquire the developers who do, or continue enforcing policies that structurally limit third-party competition in the agentic layer — a layer that may prove to be among the most commercially significant battlegrounds in consumer AI over the next several years.

Read original article →

Detailed Analysis

Don't Miss a Deploy