ChatGPT's Image Model Thinks Before Drawing

PLUS: Ideogram's custom model training for brands and GPT-5.5's omnimodal debut

In partnership with

How Jennifer Aniston’s LolaVie brand grew sales 40% with CTV ads

The DTC beauty category is crowded. To break through, Jennifer Aniston’s brand LolaVie, worked with Roku Ads Manager to easily set up, test, and optimize CTV ad creatives. The campaign helped drive a big lift in sales and customer growth, helping LolaVie break through in the crowded beauty category.

ChatGPT’s image generator just took the largest lead in Image Arena history — and the reason isn’t just better output quality. ChatGPT Images 2.0 introduces a “thinking” step before generation, where the model researches, plans, and reasons about your image before creating a single pixel.

Sam Altman called the gap between old and new “like going from GPT-3 to GPT-5 all at once.” Whether that holds up across real-world creative workflows remains to be seen — but the architecture shift toward planning-first generation could set a new standard for what AI image tools are expected to do.

Today in AI:
  • ChatGPT Images 2.0 adds thinking-powered generation for all users

  • Ideogram launches custom models trained on your brand assets

  • GPT-5.5 debuts as OpenAI’s first natively omnimodal model

What’s new? OpenAI launched ChatGPT Images 2.0, built on the new gpt-image-2 model, introducing thinking-powered generation — the model researches, plans, and reasons about image structure before producing any output. It immediately took the largest lead in Image Arena history and is now live across all ChatGPT plans.

What matters?

  • The model applies agentic reasoning before generating, breaking down complex scene requirements and self-checking its plan — significantly improving results for multi-subject compositions, precise lighting, and layered scenarios.

  • gpt-image-2 delivers near-perfect multilingual text rendering for non-Latin scripts including Japanese, Korean, Chinese, Hindi, and Bengali — a persistent weak point in AI image tools until now.

  • The API unlocks 2K resolution and 8 coherent images from a single prompt, with web search integration so the model can fact-check visual content in real time.

Why it matters?

Planning before drawing mirrors how professional designers work — and embedding that reasoning loop into the model could close the gap between AI output and deliberate creative direction. Combined with accurate multilingual text and real-time web search, ChatGPT Images 2.0 is built for production use, not just demos.

GUIDE

What’s new? Ideogram launched Custom Branded Models, a fine-tuning feature that trains a dedicated image generation model on 15–100 of your own images — learning your exact visual identity, art direction, product geometry, and color systems. Available now on Pro and Team plans and via API.

What matters?

  • The custom model evolves with your brand: users feed their best outputs back into training, and the model continuously refines its understanding of your aesthetic without starting over.

  • Supported use cases span products, characters, mascots, illustration styles, and photography styles — any consistent visual identity that can be demonstrated across images qualifies for training.

  • Enterprise customers get on-site collaboration with Ideogram’s team and PLM and DAM integrations, embedding the custom model directly into design production and asset management pipelines.

Why it matters?

Brand-consistent image generation has been the hardest problem in AI creative tools — prompting your way to a specific shade of blue or proprietary character design rarely works reliably. Custom Branded Models solve this by training the model to know your brand intrinsically, making Ideogram a viable production tool for creative teams rather than just a personal creative assistant.

SPONSORED BY WISPR FLOW

You think 4x faster than you type. Your IDE should keep up.

Wispr Flow lets you dictate prompts, acceptance criteria, and bug reproductions inside Cursor or Warp — with automatic file name and variable recognition. Say user_id, get user_id. Say useEffect, get useEffect.

Paste directly into GitHub, Jira, or Linear. Give coding agents the full context they need without typing a novel.

89% of messages sent with zero edits. Millions of developers use Flow daily, including teams at OpenAI, Vercel, and Clay. Free on Mac, Windows, and iPhone.

What’s new? OpenAI launched GPT-5.5, the first model in the GPT-5 family to process text, images, audio, and video in a single unified architecture — no separate pipelines. Available via API now and rolling out to Plus, Pro, Business, and Enterprise users in ChatGPT and Codex.

What matters?

  • GPT-5.5 scores 82.7% on Terminal-Bench and uses 40% fewer tokens than GPT-5.4, making complex multi-step tasks significantly more efficient for developers building on the API.

  • Input pricing doubled to $5 per million tokens, but OpenAI says the effective cost increase is around 20% when accounting for token efficiency gains — batch endpoint pricing mirrors GPT-5.4’s standard rates.

  • The model excels at interpreting vague, multi-part prompts, understanding intent across tools and taking action in code editors, data files, spreadsheets, and live web content without step-by-step instructions.

Why it matters?

Native omnimodality — one model that genuinely handles text, images, audio, and video together — opens new territory for multimodal creative applications that previously required stitching together separate specialized models. Developers building tools that blend voice, visuals, and code in a single workflow now have a single model that handles the entire stack natively.

Everything else in AI

Deezer revealed that 75,000 AI-generated tracks now flood its platform every day — 44% of all new uploads — yet AI music accounts for only 1–3% of total streams, with 85% of those streams flagged as fraudulent.

SpaceX struck a deal giving it the option to acquire AI coding startup Cursor for $60 billion, pairing Cursor’s coding AI with SpaceX’s Colossus supercomputer and preempting a $2 billion funding round Cursor was about to close.

Anthropic confirmed it is investigating unauthorized access to Mythos, its restricted cybersecurity model, after a Discord group guessed the deployment URL using Anthropic’s naming conventions from previous model releases.

Microsoft made Copilot agent mode the default experience in Word, Excel, and PowerPoint, enabling multi-step document automation for Microsoft 365 subscribers — with Excel engagement up 67% during the preview period.

Essential AI Guides - Reading List:

Let us know!

Work with us

Reach 100k+ engaged Tech Professionals, Engineers, Managers and decision makers. Join brands like MorningBrew, HubSpot, Prezi, Nike, Ahref, Roku, 1440, Superhuman, and others in showcasing your product to our audience. Get in touch now →