Benchmark·April 28, 2026·5 min read

Grok Imagine is the "Polisher" model. Hand off the early rounds, bring it in for refinement.

Contra Labs ran xAI's Grok Imagine through every phase of ad video production: ideation, mockup, refinement. It produced the most dramatic phase-over-phase improvement of any video model in the study.

Contra Labs
Contra Labs
Research

Contra Labs ran xAI's Grok Imagine through every phase of ad video production: ideation, mockup, refinement. It produced the most dramatic phase-over-phase improvement of any video model in the study.

Grok Imagine climbs from 3rd at ideation to 1st at refinement: 46% → 44% → 56% win rate.

Surge at refinement

By refinement, every major theme flipped positive. Motion Quality (+16), Usability (+23), Realism (+20), Prompt Adherence (+11). Win rate: 56%, ahead of every other video model we tested. Evaluators kept reaching for the same four words: organic, smooth, natural, controlled.

Every major refinement-phase theme flips positive for Grok Imagine: motion, usability, realism, prompt adherence.
The most usable and quality generation, free of visual oddities and unrealistic elements.
The camera movement showcases the product well. The movements of the clouds in the sky and the breeze are well-matched.
The neon edge illumination is beautifully lit, the camera movement pans in slowly. The close-up shot is golden. A nice ad.

Where it earns the win

Refinement is the phase where creatives stop generating and start polishing. Smooth camera moves, natural product framing, controlled lighting transitions. Grok handles all three with consistency. Motion quality (30 vs 18 across competitors), usability (24 vs 18), and lighting (17 vs ~3) all sit clearly in the green.

Grok Imagine wins refinement head-to-head against every other video model in the evaluation.

Where the next gains pay off

Two signals from the data point to where additional work compounds.

Ideation is rougher. Net Realism (−15) and Scene Coherence (−8) take hits from technical errors: object duplication, hands appearing out of frame, physics that doesn't quite hold. Grok seems to need material to iterate on before it shines.

There are two hands appearing in the shot instead of one, and they are interacting with themselves in a very unnatural way.
The portafilter is duplicated upon lock-in. Coffee pours in the device's sink.

Refinement variance is high. 42% first-place finishes alongside 25% fourth-place. When Grok hits, it leads the field. When it misses, it misses by a lot. Smoothing that variance is where the gains compound.

The practical implication

Bring Grok Imagine in once you have a draft worth iterating on. It takes you from "almost there" to "client-ready" better than anything else we tested.

Knowing which model to use at each phase is now a real creative skill.

Continue reading
All research
Research
The creative process has 3 phases. AI performs very differently in each.
Contra Labs has been studying how working creatives integrate AI into their workflows. What emerged is a consistent 3-stage structure: ideation, mockup, refinement. The way creatives use AI shifts significantly at each one.
April 23, 2026 · 5 min
Research
Solo creatives are earning more with AI and staying independent.
The majority of independent creatives surveyed report higher earning potential since adopting AI. They're taking on more projects, charging more, and hiring no one.
April 21, 2026 · 5 min
Benchmark
Veo 3.1 is the "Creative Director" model. Use it early, but hand off before refinement.
Contra Labs ran Google Veo 3.1 through every phase of ad video production: ideation, mockup, refinement. The data produced the clearest model profile we've recorded.
April 22, 2026 · 6 min

Connecting with the missing signal: taste

Contra connects top creative minds with AI teams training models to understand taste. This is expert input, not crowd labor. It's the creative layer powering the next generation of AI.

Designers

Writers

Marketers

Engineers

Social Media Experts

Video Editors & Animators

Music & Audio Engineers

1.5M+

creative experts

400+

Skills and tools represented

$250M+

verified expert earnings

Connecting with the missing signal: taste

Contra connects top creative minds with AI teams training models to understand taste. This is expert input, not crowd labor. It's the creative layer powering the next generation of AI.

Designers

Writers

Marketers

Engineers

Social Media Experts

Video Editors & Animators

Music & Audio Engineers

1.5M+

creative experts

400+

Skills and tools represented

$250M+

verified expert earnings

Connecting with the missing signal: taste

Contra connects top creative minds with AI teams training models to understand taste. This is expert input, not crowd labor. It's the creative layer powering the next generation of AI.

Designers

Writers

Marketers

Engineers

Social Media Experts

Video Editors & Animators

Music & Audio Engineers

1.5M+

creative experts

400+

Skills and tools represented

$250M+

verified expert earnings