Ask the common individual what they use AI for, and so they’ll in all probability rattle off the standard suspects: drafting emails, writing up fast LinkedIn posts, summarizing assembly notes, perhaps debugging a line of code, and producing photos. From creating photos of you hugging your previous self to turning your self right into a Pixar character to designing whole product mock-ups and advertising property, AI picture era is basically not a celebration trick.
Google’s been main this area with its Gemini Nano Banana mannequin for an excellent bit now, however since OpenAI dropped ChatGPT Photos 2.0 on the twenty first of April, Google has some critical competitors. I have been utilizing Nano Banana because it launched and have seen it develop into what it’s at this time. I have been testing ChatGPT Photos 2.0 for the reason that day it launched and, in fact, evaluating it to Nano Banana at each flip. The outcomes genuinely stunned me.
So, what are these two fashions?
A fast refresher earlier than we get into the enjoyable stuff
I do know lots of people who simply open the respective device, immediate it to generate no matter picture they want, and by no means actually take into consideration what’s occurring underneath the hood. So, I assumed I might start with a fast breakdown of what every mannequin really is and what makes them completely different. Google introduced their first picture mannequin powered by Gemini referred to as Nano Banana again in August 2025, constructed on the Gemini 2.5 Flash structure. It went viral nearly instantly. The quirky title caught, folks had been cracking jokes in regards to the banana brand, and it rapidly turned the go-to for AI picture era and enhancing.
Then in November, they launched Nano Banana Professional, providing superior intelligence and studio-quality artistic management. And in February 2026, Google launched Nano Banana 2, which mixes the superior options of Nano Banana Professional with the velocity of Gemini Flash fashions. Nano Banana 2 can pull from Gemini’s real-world information base, powered by real-time info and pictures from net search to extra precisely render particular topics. It might generate correct, legible textual content for advertising mock-ups or greeting playing cards, and even translate and localize textual content inside a picture. It helps as much as true 4K decision as a part of the usual providing, and it is quite a bit higher at following directions in comparison with the earlier fashions. The mannequin is at present the default picture era expertise throughout Google’s merchandise.
ChatGPT Photos 2.0 is a reasonably new launch, and was introduced throughout the identical week as GPT-5.5’s launch. It is OpenAI’s first picture mannequin with native pondering capabilities, that means it is able to really planning, looking the net, and checking its personal outputs earlier than finalizing a picture. It runs in two modes: Instantaneous and Considering. The previous is free for everybody, whereas the latter is reserved for paid ChatGPT subscribers. Together with the pondering capabilities, the mannequin can deal with textual content rendering throughout languages like Japanese, Korean, Hindi, and Bengali with near-perfect accuracy and helps as much as 2K decision.
It might generate as much as 10 photos from a single immediate. The brand new mannequin has a extra “up-to-date understanding” and a information cutoff of December 2025. Sam Altman described the mannequin as “going from GPT-3 to GPT-5” , which is a reasonably daring declare to make. That stated, ChatGPT’s preliminary picture era mannequin is one thing that I (and a whole lot of different folks) discovered fairly underwhelming. I might principally by no means attain for it over Nano Banana. So, the truth that Photos 2.0 has genuinely pulled me again says quite a bit about how massive of a leap that is.
Each the fashions have completely different picture kinds
You’ll be able to spot which mannequin made straight away
Each LLM has considerably of its personal persona. As an example, I discover that Claude fashions are much more conversational and ChatGPT fashions really feel extra assured and structured. You’ll be able to inform a distinction even if you give them the identical immediate. The identical applies to their picture fashions. Give ChatGPT Photos 2.0 and Nano Banana 2 the very same immediate, and you will get two noticeably different-looking photos. This is not simply due to the info it is skilled on or due to the mannequin’s underlying structure. It is as a result of every mannequin has a default aesthetic they only appear to gravitate towards.
In my testing, I’ve discovered that ChatGPT Photos 2.0 finally ends up producing extra grounded and naturalistic outputs. The outputs appear like actual pictures which were professionally edited. The lighting feels a bit imperfect in a great way, textures have a variation, and the picture simply seems very polished in all the fitting methods. Nano Banana 2, then again, leans more durable into vibrant, saturated, eye-catching visuals. The colours are deeper, the distinction is punchier, and the whole lot tends to really feel extra stylized. However they do not really feel very real looking.
This clearly is not simply my opinion both. As an example, Reddit consumer u/Inevitable_Gur_461 posted a GPT-Picture 2 vs Nano Banana 2 comparability on the r/ChatGPT subreddit. He used a reasonably in-depth immediate the place he needed to generate a black and white classic wedding ceremony pictures from the Fifties. He generated 2 photos from ChatGPT Photos 2.0, whereas the final picture he generated was from Nano Banana 2. I might’ve recognized the Nano Banana 2 picture with no double look or needing to see the feedback — it simply felt very… Nano Banana-ey. It simply has a sure AI look to it!
As an example, this is an instance I ran myself. There was this Instagram pattern happening the place you’d give picture fashions pictures of youthful and present you, after which ask it to generate a picture of each variations of you sitting collectively. I gave each fashions the identical immediate, the identical reference pictures, and requested for a similar comfortable, cinematic, studio-style look.
Whereas I admittedly wasn’t the most important fan of ChatGPT’s consequence (which is extra so due to the way in which my very own photos turned out), Nano Banana 2’s consequence simply felt very blatantly overdone. It had that telltale over-smoothed pores and skin, barely too-perfect lighting, and a normal “AI sheen” that made it apparent at first look. It felt extra akin to knowledgeable photoshoot, which wasn’t the vibe I had requested for in any respect.
That stated, I am not saying one is best than the opposite. It comes down to non-public desire and, extra importantly, what you are making an attempt to create. In the event you want one thing that appears prefer it was pulled from an actual digicam roll, I might advocate ChatGPT Photos 2.0. If you’d like one thing that is instantly eye-catching, say for a social media put up, Nano Banana 2’s type is what you want.
ChatGPT’s actual benefit is not simply extra real looking photos
It is significantly better
Whereas extra natural-looking photos is definitely one thing you may discover straight away, it is not actually what retains me reaching for ChatGPT Photos 2.0 over Nano Banana 2. The true benefit, in my eyes, is context. ChatGPT Photos 2.0 is quite a bit higher than Gemini at remembering precisely what you are engaged on. As an example, I’ve this trademark hamster sticker I have been utilizing on messaging apps (together with Slack) that I ship to everybody at any given second. If I am freaking out, I am going to ship it. If I am completely happy, I am going to ship it. If I am in tears, you already know what I am sending. I as soon as determined, why not go forward and convert the sticker to a Google Meet background?
From there on, I have been continually producing variations of the sticker related to the scenario I am in. A hamster (or the hamsters) crying, indignant over one thing, cramming for an examination, and even celebrating my birthday by blowing candles and carrying a cap. The hamster sticker is principally purported to characterize… me. I began this custom off with Nano Banana 2 (earlier than GPT Photos 2.0 launched) and whereas the outcomes had been at all times spectacular (they do not must be “real looking”), I might have to connect the reference picture once more, re-describe the character, and virtually begin the dialog over each few messages. If I simply gave it the directions by describing what I needed (even when I referred to the pic), it might both generate one thing utterly off or simply default to a generic hamster that seemed nothing like my unique sticker. The context simply would not stick. With ChatGPT Photos 2.0 although, I simply dropped the unique reference picture as soon as, and I’ve merely been telling it what to do from there.
So, for example, I requested the mannequin to maneuver all of the hamsters to a college and present that they are finding out. I did not embody the reference picture, or any further particulars. Simply the immediate, and that is it. I then requested it to make it appear like all of the hamsters had been begging and saying “pls????” as a result of I needed to ship it to my editor. At one level, somebody referred to as the hamsters mice, so naturally I had ChatGPT generate a complete indignant protest scene the place the hamsters had been screaming “WE ARE NOT MICE!!!” by means of tears. The purpose is, I stored constructing on the identical operating joke while not having to repeatedly clarify the characters, their vibe, or their look. ChatGPT Photos 2.0 remembered the hamster universe surprisingly nicely. The hamsters nonetheless seemed like my hamsters and had been in the identical scene within the greater image, even because the situations turned progressively extra unhinged.
One other instance is the one I touched on above — the youthful and older pattern. I dropped a screenshot of the Instagram Reel I noticed about this to offer Nano Banan 2 some reference, and instructed it to make the output just like it. As a substitute of utilizing the screenshot as stylistic inspiration, the mannequin simply gave me the identical picture barely edited.
Similar outfits, similar state of affairs, similar folks within the unique picture with barely completely different faces. It utterly modified how the older woman seemed. Gemini gave her curls, which I discover humorous as a result of I haven’t got curls, that means the mannequin was clearly not trying to duplicate me!
ChatGPT makes enhancing photos ridiculously easy
The half Nano Banana 2 actually must compensate for
As I simply talked about, I’ve discovered that Gemini’s Nano Banana 2 mannequin is not the perfect at retaining context and I discover that I must continually clarify the identical factor many times and re-upload reference photos. So, you may think about what it is like refining a picture the mannequin produced. There would not appear to be a simple strategy to simply say “change this one factor” and have it really work.
Most of the time, you may discover that it’s essential to obtain the picture, add it, after which request your modifications. ChatGPT Photos 2.0, then again, makes this complete course of really feel easy. You click on on a generated picture, and also you get two choices: you may both describe your edit instantly within the dialog panel, or use a range device to focus on a particular a part of the picture after which describe what you need modified. The mannequin holds onto the whole lot else, and solely touches what you requested it to. This would possibly sound minor, but it surely makes a large distinction.
ChatGPT Photos 2.0 wins this spherical honest and sq.
Whereas I did not actually anticipate I might ever be saying this, ChatGPT’s newest picture mannequin positively wins this spherical. It is an important mannequin, produces scarily spectacular photos, and takes time to suppose by means of the picture and develop it (whereas Gemini appears to be in a rush at all times).
That stated, I/O 2026 is correct across the nook, and a brand new mannequin is anticipated. Google I/O kicks off on Might nineteenth, and a number of shops are speculating that Nano Banana might get a big replace alongside what’s anticipated to be a serious Gemini mannequin announcement. So whereas ChatGPT Photos 2.0 has the sting proper now, I would not depend Google out simply but.