Google just tested a bunch of new AI models for Android app coding – here are the rankings
AI-summarised brief · reviewed before publication
Google has updated its "Android Bench" rankings of AI models for Android app development. GPT 5.5 now leads, outperforming GPT 5.4 and Gemini 3.1 Pro by nearly 2%. The benchmark assesses models on common Android development tasks and best practices, and the update adds details on tokens used and costs. GPT 5.5's lead comes at a price: it costs over twice as much as Gemini 3.1 Pro. The update also adds new "open-weight" models such as Gemma, Qwen, and DeepSeek, with GLM 5.1 scoring highest among them. Google updates "Android Bench" monthly as new models emerge.
💡 Why It Matters
- GPT 5.5's lead underscores OpenAI's dominance in AI-powered coding, setting a high bar for competitors.
- GPT 5.5's higher cost may hinder adoption, leaving room for alternatives like Gemini 3.1 Pro to gain traction.