Google quietly just re-lit the “reasoning race.”

This week, Google rolled out a major upgrade to Gemini 3 “Deep Think”—and the benchmark jumps are… hard to ignore.

What changed (highlights):

  • 84.6% on ARC-AGI-2 (verified by the ARC Prize Foundation, per Google) and 48.4% on Humanity’s Last Exam (no tools)
  • 3,455 Elo on Codeforces, plus gold-medal-level performance across Olympiad-style evaluations
  • Introduction of Aletheia, a math research agent designed to iteratively generate + verify + revise proofs—aimed at pushing beyond “competition math” into research workflows

Access:
Deep Think’s upgrade is live for Google AI Ultra users in the Gemini app, and Google is opening early access via the Gemini API to researchers/selected partners.

Why this matters (my take):
For much of early 2026, the narrative has been “OpenAI vs Anthropic.” But Google is still a heavyweight—and reasoning + math/science agents are starting to look like the next platform shift (not just better chat). If Aletheia-style systems keep improving, we’ll measure progress less by “can it answer?” and more by “can it discover, verify, and iterate with minimal supervision?”

Questions I’m watching next:

  • Do these gains translate to reliability in real engineering work (not just scoreboards)?
  • How quickly do we get accessible APIs + enterprise controls for these reasoning modes?
  • What does “human review” look like when the system can verify and revise its own proofs?

If you’re building anything in AI-assisted engineering, math, or research ops, 2026 is going to get weird—in a good way.

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-deep-think

More coverage

Gemini 3 Deep Think is Google's 'most advanced reasoning feature' - and it's available now

This tiny AI startup just crushed Google's Gemini 3 on a key reasoning test - here's what we know

xAI’s New Direction: Scaling AI Beyond Earth

xAI just held its first all-hands meeting since merging with SpaceX, and Elon Musk laid out a bold reorganization and roadmap aimed at pushing the company to the front of the AI race.

Key takeaways:
• xAI is restructuring into four focused teams: Grok (chat & voice), Coding, Imagine, and Macrohard (agent-based companies) to scale execution faster.
• Infrastructure ambitions now extend beyond Earth, with plans to leverage lunar resources and solar energy for future AI satellite and data infrastructure.
• SpaceX is also exploring an electromagnetic mass driver concept to launch AI hardware components for deep-space data centers.

Why it matters:
Musk’s timelines often stretch, but the message is clear — xAI wants to solve AI’s future compute and energy needs by expanding beyond Earth’s resource limits, not just competing within them. Whether practical or aspirational, this vision pushes the conversation about how far AI infrastructure might ultimately scale.

The AI race is no longer just about better models — it’s about who can build the infrastructure to sustain them long term.

What do you think: visionary roadmap or science fiction marketing?

#AI #xAI #SpaceX #Infrastructure #Innovation #FutureOfAI

Leadership Shifts at xAI Raise Questions Amid Expansion Push

Two more founding members of Elon Musk’s artificial intelligence venture, xAI, have announced their departures, adding to a growing list of early leaders exiting the company at a critical moment in its evolution.

Co-founders Tony Wu and Jimmy Ba confirmed their exits this week, becoming the fourth and fifth founding members to leave the startup. Their departures come shortly after xAI’s high-profile strategic alignment and operational integration with SpaceX, a move intended to accelerate the company’s infrastructure and model-development ambitions.

Key Departures

Tony Wu, who led reasoning model development for xAI’s flagship Grok system, shared on X that it was “time for my next chapter,” emphasizing his belief that small, AI-empowered teams can “move mountains and redefine what’s possible.” Wu joined xAI in 2023 after leaving Google and worked closely with Musk, reportedly reporting directly to him.

Jimmy Ba, another founding figure, also confirmed his departure, suggesting he intends to focus on what he described as a pivotal period ahead for AI and society. He noted that 2026 could become one of the most consequential years for humanity due to rapid advances in artificial intelligence.

Neither executive publicly cited specific reasons for leaving.

Pressure Around Product Timelines

Their exits come amid reports that Musk has grown frustrated with delays in rolling out updated versions of Grok, including the anticipated Grok 4.20 release, which has yet to materialize. The competitive pressure in AI model development has intensified dramatically as rivals accelerate releases and enterprise adoption expands.

For startups operating at frontier scale, delays can quickly translate into competitive risk, particularly as major players pour billions into compute infrastructure and model training.

Expansion Meets Organizational Strain

At the same time, xAI’s ambitions are expanding rapidly. Its integration efforts with SpaceX signal plans for large-scale computing infrastructure, including space-enabled data operations — an unprecedented scale jump for an already ambitious startup.

But expansion brings complexity. Leadership churn at this stage often raises questions about execution pace, strategic direction, and internal pressure.

Leadership turnover is not unusual in hypergrowth startups, especially those pushing technological boundaries. Still, multiple high-level exits in close succession can trigger concern among investors, partners, and employees about long-term stability.

Why It Matters

xAI operates in one of the most competitive technology races in history. AI model capabilities are advancing quickly, regulatory scrutiny is intensifying, and public concerns around misinformation and deepfakes continue to grow. Managing rapid innovation while addressing societal concerns already poses enormous challenges.

Layer on leadership turnover and infrastructure expansion, and the stakes become even higher.

Yet Musk has repeatedly navigated turbulence at Tesla, SpaceX, and other ventures, often steering companies through periods of skepticism and operational chaos toward eventual breakthroughs.

Whether this latest wave of departures represents normal startup evolution or signals deeper organizational challenges remains to be seen. What is clear is that xAI’s next year will be pivotal — not only for the company, but potentially for the broader AI landscape.

The industry will be watching closely.

ByteDance’s Seedance 2.0 Signals a New Leap in AI Video Generation

Chinese tech giant ByteDance is drawing global attention with the early rollout of Seedance 2.0, a next-generation AI video model that is rapidly gaining traction across social media for its cinematic quality, visual consistency, and synchronized audio output.

Currently in beta, Seedance 2.0 is being positioned as a major step forward in generative video, with early testers suggesting it rivals or even surpasses many of today’s leading publicly available systems.

What Makes Seedance 2.0 Different?

Seedance 2.0 is designed as a multimodal system capable of handling text, image, audio, and video inputs, enabling creators to generate videos across a wide range of styles and formats. Early demonstrations show the model performing well in areas traditionally difficult for AI video systems, including:

  • Smooth action and fight sequences
  • Character and scene consistency across shots
  • Animation and motion graphics
  • User-generated content and social media-style clips

The model also introduces native audio generation, allowing synchronized sound to be produced alongside visuals rather than added separately. Outputs reportedly support 2K resolution videos with lengths of up to 15 seconds, currently accessible through ByteDance’s Jimeng AI video platform.

Alongside Seedance 2.0, ByteDance appears to have quietly previewed a new image model, Seedream 5.0, on select third-party applications, positioning it as a competitor to other emerging high-end image generation systems.

Fierce Competition in China’s AI Video Race

The timing of Seedance 2.0’s release is notable. It arrives just days after competitor Kuaishou introduced Kling 3.0, another powerful AI video model. Together, these launches suggest Chinese AI labs are moving quickly toward the cutting edge of generative video technology.

Competition in this space is accelerating globally, with models now pushing beyond simple short clips toward cinematic storytelling, animation, marketing visuals, and creator-driven content production.

Why This Matters

Video generation has long been one of AI’s most difficult challenges due to issues like motion consistency, scene continuity, and believable audio synchronization. Progress in these areas could significantly disrupt creative industries by lowering production costs and enabling entirely new forms of digital content creation.

Seedance 2.0’s early demonstrations—featuring fluid action scenes, animated sequences, and polished motion graphics—hint at a future where professional-quality video production becomes accessible to individuals and small teams.

If performance holds as access widens, Seedance 2.0 may represent the next major leap in AI-generated video, with implications stretching from social media and advertising to entertainment and digital storytelling.

The AI video race is clearly entering a new phase—and ByteDance appears determined to lead it.

https://www.scmp.com/tech/article/3342932/bytedances-new-model-sparks-stock-rally-chinas-ai-video-battle-escalates

Everything is Robot

In 2011, Marc Andreessen wrote that software would eat the world. He was right. Fourteen years later, the world runs on software. Now he’s saying something bigger. “Robotics is going to be the biggest industry in the history of the planet. There are going to be hundreds of billions of robots of all shapes and sizes.” But here’s the uncomfortable truth: software isn’t done eating. It hasn’t even had breakfast. Software will keep eating the world, and the next course on the table is the physical world of atoms.

We think of robots as humanoids. Boston Dynamics. Tesla Optimus. Figure. Machines that walk and wave and weld. But that’s just one shape. Robots don’t need arms and legs. Tesla is already building robots on wheels. Zipline is building robots in the sky. Gecko Robotics builds wall-crawling robots that inspect power plants and refineries. A robot is not a body type. A robot is autonomy wrapped in hardware.

That’s the real shift. Hardware is the entry point. Autonomy is the multiplier. The body without the brain is nothing. The chassis is the shell. The moat is the mind.

Take Tesla. On the surface, it looks like a car company. But its moat isn’t the vehicle. It’s Autopilot, Full Self-Driving, and the massive neural nets trained on billions of miles of data. Tesla is a software AI company that happens to sell hardware. Over the next decade, cars will transform from tools of transportation to autonomous robots. And the companies that own the autonomy will own the industry.

Or look at Apple. The iPhone in 2025 isn’t what keeps people locked in. It’s not the aluminum or the glass. The real moat is iOS, the integration across apps, payments, cloud, and services. The iPhone is not a phone. It’s a software ecosystem you can’t leave.

NVIDIA is another example. The GPU was a leap in hardware engineering. But the thing that made NVIDIA untouchable wasn’t the chip. It was CUDA, the software layer that locked in developers and created an ecosystem. That’s why NVIDIA went from graphics cards to the most important AI company in the world.

This is the pattern. Hardware is a wedge. Software is the empire.