I got an early look at ChatGPT Images 2.0, and it's impressive - with one exception

2 hours ago 6

I got an aboriginal look astatine ChatGPT Images 2.0, and it's awesome - with 1 exception

Follow ZDNET: Add america arsenic a preferred source on Google.

ZDNET's cardinal takeaways

OpenAI reframes images arsenic a ocular language.
Thinking mode builds context-aware infographics.
Brand fidelity is inactive inconsistent successful aboriginal testing.

Today, OpenAI announced ChatGPT Images 2.0, its next-generation image model, which the institution says is focused connected precision, usability, and analyzable ocular tasks.

The astir notable caller capableness is the quality to harvester substance and images to physique complex, beauteous pages. OpenAI is reframing the full thought of representation procreation from a process that creates decorations (their word) to a connection (also their term).

Also: The champion AI representation generators of 2026: There's lone 1 wide victor now

OpenAI describes it as, "A bully representation does what a bully condemnation does -- it selects, arranges, and reveals. It tin explicate a mechanism, signifier a mood, trial an idea, oregon marque an argument."

Thinking capabilities alteration analyzable workflows

In summation to its vastly improved quality to premix substance and graphics, the caller exemplary uses enhanced reasoning capabilities. It tin make aggregate images per punctual with continuity crossed outputs. This attack is imaginable due to the fact that the exemplary really integrates reasoning into the representation output.

This displacement is big. Instead of conscionable producing an representation that beauteous overmuch matches the punctual details, Images 2.0 tin instrumentality a overmuch vaguer prompt, similar "Generate an infographic astir activities I should bash with tomorrow's upwind successful San Francisco successful mind."

Also: How to power from ChatGPT to Gemini

From this prompt, the AI volition stitchery upwind and enactment information astir San Francisco, find activities due to the weather, and past physique an representation oregon acceptable of images that acceptable the results.

According to OpenAI, "In this model, Images 2.0 acts much similar a ocular thought partner, helping transportation a task from unsmooth conception to finished plus with importantly little enactment connected your part."

Precision and plan power amended usability

Many of america person agelong struggled to person ChatGPT to make images successful a circumstantial desired facet ratio. Often, the AI stubbornly produces what it wants. But now, with Images 2.0, the exemplary has enactment for "aspect ratios arsenic wide arsenic 3:1 and arsenic gangly arsenic 1:3."

The exemplary besides supports higher-fidelity outputs that (mostly) nutrient close entity placement, elaborate substance rendering, and analyzable compositions. We'll spot if we tin region the connection "mostly" from that condemnation aft the merchandise is officially released.

Also: I tried Personal Intelligence, and it was close (but unsettling)

The AI besides supports tiny text, UI elements, and stylistic constraints astatine up to 2K resolution. Cool.

Testing the preview

I was fixed entree to a day-before-release preview, and the exemplary is impressive, mostly. I fed it a screenshot of the ZDNET location leafage and a draught of the Images 2.0 property release.

Then I instructed, "Based connected the contents of the property release, make a 16:9 infographic astir the caller representation update and make it utilizing the ZDNET marque benignant arsenic shown successful the ZDNET location leafage document."

Also: I tried Google Photos' caller AI Enhance tool: How it crops, relights, and fixes your shots - sometimes

The exemplary did a large occupation connected the infographic, but effort arsenic it might, it could not reproduce the ZDNET logo. On its archetypal try, it rendered the Z successful ZDNET with a flimsy droop.

I tried a assortment of requests connected the bid of, "Fix the ZDNET Logo. The Z droops successful your mentation but is not droopy successful the existent logo." But Images 2.0 ne'er managed to hole it.

So I started a caller session. This time, I included the instruction, "Use peculiar attraction to reproduce the ZDNET logo accurately."

Also: I tested ChatGPT Plus vs. Gemini Pro to spot which is amended - and if it's worthy switching

Here's wherever things got precise odd. For its archetypal run, the exemplary someway dug up a transcript of ZDNET's logo from earlier our 2022 redesign. This logo is obscurity to beryllium recovered connected our existent location page. Weirdly, it rendered that aged logo utilizing the existent colour scheme. The exemplary past pushed the logo and the infographic accusation disconnected the near borderline of the image. It besides chose a airy bluish for "Images 2.0" that's not a ZDNET marque color.

I tried mightily to person it to usage the existent logo. I managed to get it to propulsion the representation to the right, truthful thing was chopped off. But adding the prompt, "Use the ZDNET logo that is connected the provided page. Do not hunt for an alternate logo," did thing to hole the problem.

I took 1 much changeable astatine the situation earlier deciding to spell backmost to finishing up this article. Once again, I started a caller league truthful the AI didn't person musculus representation from its erstwhile miscalculations.

Also: This almighty Gemini mounting made my AI results mode much idiosyncratic and accurate

The exemplary messed up the logo again. This time, the AI decided to adhd a rudder signifier to the stem of the stretched-out superior D.

To beryllium fair, I'm utilizing a pre-release mentation of Images 2.0. I'll beryllium backmost with a overmuch much broad trial tally of the exemplary aft the authoritative merchandise release.

I besides tried a akin trial utilizing a antithetic papers with Google's Nano Banana Pro, but due to the fact that it didn't grip the synthesis the mode that this caller mentation of OpenAI's merchandise does, it wasn't truly capable to repetition the results I got here. We'll cognize much arsenic we bash much precocious tests

Pricing and availability

The caller exemplary is disposable contiguous to each ChatGPT and Codex users. Advanced outputs and the reasoning capableness are disposable to ChatGPT Plus, Pro, Business, and Enterprise users. Be definite to prime "Thinking" from the ChatGPT dropdown barroom astatine the apical of the screen.

At the clip of writing, earlier release, the caller Images 2.0 exemplary is lone disposable connected the desktop. But OpenAI promises that these capabilities volition beryllium successful the mobile mentation arsenic well, on with the quality to finger-select images utilizing your mobile touchscreen.

The images are besides disposable via API utilizing the gpt-image-2 model. API pricing varies depending connected the quality, thinkiness (my word), and desired representation resolution.

If an AI tin grip layout and contented successful combination, volition that alteration however you attack plan projects? Let america cognize successful the comments below.

You tin travel my day-to-day task updates connected societal media. Be definite to subscribe to my play update newsletter, and travel maine connected Twitter/X astatine @DavidGewirtz, connected Facebook astatine Facebook.com/DavidGewirtz, connected Instagram astatine Instagram.com/DavidGewirtz, connected Bluesky astatine @DavidGewirtz.com, and connected YouTube astatine YouTube.com/DavidGewirtzTV.

Read Entire Article