
Grok 3.5 vs Grok 3: How Elon Musk’s AI Is Evolving Fast
Fresh benchmarks and new features reveal Grok 3.5’s push toward AI supremacy.
Grok 3.5 Surpasses Its Predecessor with Key Improvements
xAI’s latest AI chatbot, Grok 3.5, arrives with notable upgrades, setting a new benchmark for Elon Musk’s AI ambitions. As an evolution of Grok 3, the new model shows measurable advances in technical tasks, particularly in mathematics and coding.
Benchmark Results Highlight Clear Performance Gains
According to official testing, Grok 3.5 achieved a 95.8% score on the AIME 2025 math benchmark, outperforming Grok 3’s 93.3%. In LiveCodeBench, which measures programming skills, Grok 3.5 improved slightly from 79.4% to 80.4%. While there was a minor dip in the GPQA physics benchmark—from 84.6% to 84.0%—the overall trajectory points upward.
New Capabilities: Vision Input, Memory, and Image Editing
Grok 3.5 introduces multimodal features, enabling users to input visuals through device cameras. It also supports real-time voice interactions enriched with visual context. On top of that, xAI plans to roll out Google Drive integration and conversational memory—giving Grok 3.5 a major edge over Grok 3.
Technical Backbone: Large Context Window and Expected Speed Boosts
Like its predecessor, Grok 3.5 maintains a massive 1-million-token context window, supporting complex tasks and extended conversations. While Grok 3’s generation speed was clocked at 42.8 tokens per second, no official speed figures have been shared yet for Grok 3.5, though improvements are expected based on performance metrics.
Grok 3.5 Access and Pricing Details
While Grok 3 was priced at $6 per million tokens, including $3 for inputs and $15 for outputs, Grok 3.5’s pricing remains unconfirmed. Nevertheless, its expanded feature set should provide better value for power users and SuperGrok subscribers. Grok has also transitioned from being exclusive to the X platform into a standalone app available on iOS, Android, macOS, and Windows.
Grok 3.5 Marks a Strategic Leap Forward in AI
With better technical scores, added multimodal intelligence, and broader platform availability, Grok 3.5 signals xAI’s serious bid to compete head-to-head with ChatGPT and Gemini. For users needing both high-level computation and creative visual processing, Grok 3.5 stands as a compelling choice.
Responses