Googles New “Text To IMAGE Model” Just CHANGED Everything (Now RELEASED!)
Google’s new “Text To Image Model” has just been released and it’s a game changer. This advanced text to image technology, known as IMEN 2, is being hailed as the best text to image generator currently available. The release of IMEN 2, combined with the rise of Google’s Gemini Pro, shows that Google is serious about the AI race.
This text to image technology is different from what we’ve seen before, and it’s clear that Google has put a lot of effort into making it stand out. The focus on photo realism in the generated images is truly impressive. Google has implemented a specialized image aesthetics model based on human preferences, which has greatly improved the quality of the generated images.
One of the standout features of IMEN 2 is its ability to generate realistic hands, a detail that has historically been a challenge for AI technology. The inclusion of out painting and in painting features is also noteworthy, allowing users to zoom out and add elements to images with ease.
While the availability of IMEN 2 is limited to certain countries at the moment, there are ways to access it from other regions. Overall, Google’s IMEN 2 is a major leap forward in text to image technology, and its release has set a new standard for image generation.
Watch the video by TheAIGRID
Video Transcript
So Google actually just surprisingly released imen 2 which is their most advanced text to image technology and I’m not being over the top when I say this that this might actually be the very best text to image generator that we do have and I’ll explain the reasons
Why in just a moment now this is very very good in fact it’s shockingly good because we didn’t know that this was coming and with the rise of Google’s Gemini Pro it shows us that Google is taking this AI race very very seriously so right here you can see that image and
Two are most advanced text image technology now of course everybody knows that text to image isn’t something that is new it’s not something that’s you know we’ve seen for the first time but this is very very different because how Google have gone about this text image generator and how they’ve implemented it
Into their website is very different and is exactly what I’ve been asking for for quite some time so let’s take a look at some of the key features why this actually stands out and of course how you can get started so one of the first things you need to understand is that
This isn’t currently available in every single country although it is available in most of the countries so you should be able to use this so if you are in any of the economic European areas Switzerland and the UK you’re not going to be able to use the image generation
Because what they’ve done is they’ve actually added it to b b has been updated today since it is the 1st of February and you would have known that b is getting an update if you did watch our last video now under here you can see that I’ve put all of the countries
That are in the European economic area which according to the recent post by Google it isn’t going to be out in these countries just yet however there are a few ways that I’ll show you later that you can still access this really cool software so let’s actually take a look
At the image generation and how good this is because image generation is one thing but once I show you guys how they’ve implemented this then that’s when you guys are going to see why this is really cool so here we can see that we have four highquality images and it’s
Clear that Google’s focus on this was actually on photo realism so I’m going to show you guys some images in a moment um and you’re going to see just how good these images are because Google’s photo realism is actually pretty pretty good and remember guys one thing I want you
Guys to remember is that this is Google’s IM in 2 because Google had IM in one and IM in two so just remember that they’re on their second iteration of this model which just goes to show how big of a leap this is um and with these prompts we can see Mosaic inspired
Portrait of a person their features are formed by a collection of small colorful towels modern house on the coastal Cliff that is pretty cool Studio portrait of a man with a grizzly bear eating a sandwich that is pretty cool and I think these ones are pretty diverse in nature
And show us exactly how good this image generation software is now you might be thinking okay this is good but we’ve seen this before but let’s get into some of the other stuff because I can promise you guys this is much better than you guys think so this is the photo realism
Section now um the images right here they aren’t you know scaled properly that’s why the images don’t look as high quality but in terms of the actual photo realism if I you know upscale these properly these are pretty pretty good now if you remember if we actually
Compare this to you know some of the other ones for example like Dar 3 and I do have some comparisons coming on later on in the video so you’ll be able to see exactly how they compare to DAR 3 but I do think that the photo realism is really good because essentially what
Google did was they actually trained this to be very photorealistic in fact in the notes they say that they said that we trained A specialized image Aesthetics model based on human preferences for qualities like good lighting framing exposure sharpness and more each image was given an aesthetic score which helped condition image and
Two to give more weight to images in its training data set the line with qualities that humans prefer and this technique improves images to ability to generate high quality images so essentially they were able to get this model to perform really well because they fine-tuned it on what humans really
Like and something that I find as a little really small cool detail is the fact that of course they’ve shown us that hands are something that do work and if you aren’t familiar why that is a thing it’s because earlier on AI wasn’t really able to generate hands properly
And they would always be messed up we can see clearly here that the hands do look 100% realistic so there’s not really a way to be able to tell from that now let’s take a look at some of more images and I can’t wait to show you
Guys some of the other features cuz it’s absolutely you know really just amazing and then I’m going to show you guys the demo so you can see right here that this is a really really nice photorealistic image um quite surprising actually just how the lighting is like I said I
Wouldn’t know that this is AI generated it doesn’t look AI generated to me um but in fact this next image does look a bit more generated because it’s got that I’m not sure if this is like a bouquet effect or what kind of effect it is but
Um it definitely is like I guess you could say have that a generated thing but Google’s photo realism it is definitely most 100% there now we do need to take a look at something else because this was one of the features that you may have not seen so here we
Have out painting from Google Now in other Google products that they did talk about they have actually mentioned um out painting and in painting before essentially if you don’t know what that is that’s essentially where you zoom out and and essentially where that image you can just essentially just increase the
Size of that now that is something that we did see in mid Journey but it is good that we are now getting this from Google as well and I think this will be better implemented in Google just because of how um they’ve made the software and of
Course like I said they also did include in painting which is of course if you want to add something like right here in this example they wanted to add a shelf and it looks pretty pretty cool um and you can see they just set a Shelf with a
Couple of books on and vases hanging on green wall and then of course they’ve just managed to add that right into the image so I feel like that is basically you know it’s essentially just photoshop’s generative fill if you do remember photoshop’s generative fill it was essentially just this which you can
Access natively in Photoshop and of course um that was something that definitely went down well with a lot of people and I’m sure that now that Google has it especially with its photo realism it could definitely compete because we know that generative fi while is okay it sometimes doesn’t always give you the
Best results so if Google manages to get this area right um and currently um you’re going to see in vertex that I don’t think they gave us this feature but you’re going to see that it actually is really cool now what we also did get was we also did get text rendering
Support So essentially you can see right here that there is text being able to be put into these images with a remarkable degree of accuracy so the prompt STS a tube of toothpaste with the word symbol written on it on a bathroom counter advertisement and you can see that it’s
Got the words all the way down to the blur really really perfectly then of course a cup of strawberry yogurt with the words delicious written on the side sitting on a wooden tabletop next to a cup of yogurt is a plate with a toast and a glass of orange juice and you can
See that the text here looks very very accurate and it even manages to do it with like the you know the blur going on there and of course with different fonts that look really really creative and stylistic so I think that text rendering support is definitely one of the things
That was really hard it was only recently achieved by I think it was Dar 3 and mid Journey so now that Google has added this as well it goes to show us that they are not messing around when it comes to this and I’m sure that this
Image release is going to be there because they want Google’s Gemini to be very very good when it does release in its complete model now Gemini Pro is actually pretty good but of course Gemini Ultra they want that to be absolutely astounding now here is where we talk about the intuitive editing so
This ladies and gentlemen is image effects so essentially if you don’t know what this is this is from Google’s Test Kitchen and this is why I said this is probably even better than mid Journey if you are using this so essentially what you can do you can see that when you
Generate a photorealistic image you can essentially break up the words into different sections and these different sections can be changed to your preference if you want to adjust how that image is so you can see that he changed it from you know a jungle to a
City um and you can just simply break that down and it allows for greater Crea Freedom so I really like this so much because being able to switch from City to Jungle and just instantly get um a new generation I think that that is a feature that we have been asking for for
Quite some time in many of these AI generators for example um and this is no hate to Mid Journey or whatever but it’s just a comparison like mid journey is absolutely incredible if you don’t know what it is it’s arguably the state-of-the-art model in terms of image
Generation Um but you know just how it is on Discord it is a little bit harder to use than just a simple like drag and drop you know a simple you know uh drop and just you know change things with you know just a click of a button and I know
That they’ve made it easier and I know that they are working on a website but if Google manages to roll this out you know globally I think this will become more adopted because of how easier it is to use so I think something like this
Goes to show us as well um and I’ll show you guys the demo of this later because like I said you can actually use this with a link in the description so you can see right here there’s other um things at the bottom you know HDR 1990s
All of these things you have to remember that the average person who’s using an image generation software they don’t know about every single you know photo style they don’t know you know what warm coloring is you know uh you know certain types of film style you know they know
HDR 1990s Renaissance style they don’t you know immediately have all of these in their head so it is very very useful that we get something that can actually you know give us that ability to you know brainstorm for us and then we can see okay we can change this part we can
Change that part we can change from landscape to this um and I think it’s just going to give us uh a greater degree of creativeness to you know completing whatever project or whatever you know outcome that we do desire so I think that this is really cool um and
This is definitely something that is an improvement over other models that we haven’t seen before so Google actually have done a great job on that bit now in addition you can see that right here instead of photo real portrait you can see handdrawn abstract impressionist so
If you want a handdrawn one you can simply change it and then of course you can see that is how it looks you can then of course as well you can share this and you can get the seed as well so it’s pretty much like mid journey in the
Sense that you can um get the same seed which allows you to generate really similar images you know which are borderline consistent so I think this is you know one of the things that like I said from Google coming from Google and their image effects so if you want to
Actually use this this is going to be different just to make the distinction clear what we’re seeing here is actually Google’s image effects and essentially what that means is that this is in Google’s Test Kitchen so if you don’t know what Google’s Test Kitchen is the
Test kitchen is the area in which you can test new Google releases before they are essentially put out to the mainstream so I guess you could say it’s like an alpha version of um you know the AI stuff before it is wildly rolled out and there’s a few different things there
Like a music generator and an image generator um and they’re actually still pretty good but I’m guessing that Google just you know wants to test them get some feedback um and then of course um roll them out um completely so this is image effects and this is not Google B
But you can use this now um and I’m guessing that this is just a more advanced area so um like I said I am a real real fan of this because like I said I think this helps out people more than you do realize so there was also
This as well so logo generation and you could see that they actually talked about how Google’s you know new image generator image and 2 which is of course now powering pod essentially prompt a clean minimal emblem style logo for an ice cream shop cream background you can
See that that looks really cool then an abstract logo representing intelligence for an Enterprise platform vertex AI written under the logo so I think that this is also once again you know of course you can do this with chat gbt chat gbt’s D 3 is actually pretty good
But like I said Google’s coming over to here and as far as I know when I was testing this out in Google’s bod there wasn’t really a limit on the amount of image Generations I could have so I think that this is really effective for
Those of you who don’t have a gbt 4 subscription and you don’t want to subscribe to anything you just want to have your Google account you want to go ahead you want to create an image you can go ahead and B um and you can just
Do this straight away so I think this is uh something that’s pretty effective and and um yeah another thing um is that you know these are pretty clean as well because the text looks pretty decent um and I would say that this is a win this is definitely a very good win now
There’s also something that I wanted to include as well and that’s of course the seeds like mid Journey so if you don’t know uh mid what mid Journey has mid Journey essentially has seeds a mid-journey seed number is basically just the number that allows you to direct the consisten in the image
Generation process so essentially it’s just the starting point for the AI to generate the field of visual noise and basic basically it’s just a reference point that allows you to create more realistic SLC consistent results across your work so that’s why I said that this is really cool because you know right
Now from Dar 3 we don’t actually get any seed numbers um and even through you know discorded mid Journey the way how you get the seed is you have to put slash type in you know a lot of people just don’t want to do that like I know
It’s fairly simple to someone who’s using it all the time but someone who’s just getting started you’re really just going to want to be able to click a button just copy and paste a seed and because that’s how majority of the things apps that we use are now there
Was something uh that I think is good because image and 2 includes built-in safety precautions to help ensure that generated images aligned with Google’s responsible AI principles and essentially it’s you know watermarked with Google synth ID which allows you to generate invisible watermarks and verify images generated by this software so if
You don’t know what synth ID is essentially it just generates a synthetic ID so it’s basically a digital Watermark directly into the pixels of these air generated images that are imperceptible to the human eye so if you ever want to check the validity of an image you can do so with this software
Now essentially what’s good about this as well is that they’ve managed to make it so that even if you edit the image so even if you you know add some filters add some exposure The Watermark ID is still completely there so um you know there’s no way and you can see that we
Designed synth ID so it doesn’t compromise image quality and allows the water mark to remain detectable even after modifications like adding filters changing color saving with the various lossy compression schemes most commonly used for jpeg so I think this is really good because right now we are entering
The era of you know images which we don’t know are real or not and if Google does manage to become the number one in terms of image generation it’s going to be good to be able to have something that we can actually back check and then
Go back and say okay that is AI generated and that isn’t AI generated um and I think that other companies should take a note from Google because this is something that we do need in the future and right now whilst it’s not that big of a problem I think in the future it
Will become a larger problem as more people start to realize so let’s go ahead now and take a look at some of these images as well because I wanted to look at some of the individual images so this is a really cool realistic one like I said the photo realism with Google is
Really really good so the prompt is generate a realistic photo of a person looking off camera during sunset portrait mode so the background is faded then of course we have this one right here I really do like this one I don’t know why it just you know ushers a sense
Of peace I guess you could say and it says generate a collage art with photorealistic images of oceans and plants with muted colors and 3D shading that’s mixed media then we have this one I I find that this one is really cool as well um write a social media post and
Generate a mouthwatering image that I can use for a buffalo wing Festival that actually does look remarkably realistic um and I wouldn’t be able to tell that that is AI generated maybe my eyes are awful but um just the lighting the shading um definitely uh pretty pretty incredible stuff there and then of
Course image in two which is generate an image of a fashion show and a steampunk style digital art zoom in on their face looks very very effective I mean some of these uh right here don’t look that you know accurate but in the digital art style that is actually really cool so if
You seen the digital art style you know why that actually looks really really cool so um essentially what they’re trying to do with these these images is they’re trying to show how good the range is on these images because essentially what they’re showing us is that you know there’s a diverse um uh
You know a style set that you can apply to this um and it just goes to show us how effective this is and generate an image of a futuristic car driving through the mountain and then of course generate a vibrant and Lively image depicting an elephant parting in the
Heart of a lar vibrant jungle the elant should be in various colors and be adorned with fun now um I’m going to show you one more image here and then what I want to show you guys is the comparisons to darly 3 um and like I said before even though I’m doing these
Comparisons don’t forget that this is like I said Google’s second iteration whereas uh you know this is I guess you say darly 3’s thirdd iteration because before they had darly 2 they had darly then they had darly 2 then they got darly 3 so I would say you know just
Just REM just remember that because um I think that is you know the only fair way to compare it so um this is just Genera an image of a cluttered Alchemist Workshop filled with bubbling flasks gluing crystals and the tiny luminous swirling within the bottle so then of
Course we have imagin two and dar three on the right hand side um and I think this one here if for being honest this one darly 3 does look pretty cool but I would say a tiny luminous World Sr L within the bottle um this one here kind
Of does I mean the nuances here actually do look better from image and 2 I just think the understanding is better but um we never know when they’re going to drop darly four so it will be interesting to see what is done there we also do have
This comparison so imag and two um on the left and then Dar 3 on the right so I guess they have different interpretations of you know what collage is um I guess that one’s open up for interpretation and of course we got darly 3 here as well um in terms of
Generate photo realistic person looking off camera during sunset um and I think it’s it’s clear that you know um image 2 in terms of photo realism just really really wins in that sense because I think you know the way that they train the system in order to generate um how
That looks is particularly stunning so we’re going to take a look at some of these more and then once we take a look at these then I’m going to show you guys how it actually works and how you can actually use this so you can see um this
One again futuristic car driving through an old mountain surrounded by Nature um I think definitely Dar 3 does look a little bit better here but that’s completely fine um and then of course we have here I think it’s just different interpretations of what we’re looking at
Because like I said here this one seems to be a little bit more playful like a like a kids thing and then this one here seems to be a bit more artistic in terms of what we’re seeing and of course this one right here as well so you’ve got
This um this is you know like an image of uh some buffalo wings um and of course this one on the right here doesn’t really look as realistic and then this one on the left you know Google’s one does look really realistic so um right now what I’m going to show
You guys is a quick de demo of The Bard which you can actually use right now then I’m going to show you the image effects which is essentially this one which is the intuitive one which is uh you know like I guess you could say a
Soup up version of what mid journey is kind of supposed to be so um I’ll take a look and I’ll show you guys how those are so here you can see we are now in Google’s Bard and I just simply asked it to make me an image of a car and these
Are both powered by the same system so when I go and show you guys the images from image effects it’s all going to be powered by the same system so I said make me an image of a car um and it’s chosen to essentially make me an image
Of some BMW now I didn’t really do any kind of you know prompting really good prompting there I just simply said make me an image of a car and what I find cool is that you can generate one image and generate three pretty quickly um and
I think it does generate a lot of images pretty quickly like for example um this image of an egg that doesn’t look as realistic um but this definitely does like I would say that yeah that’s a real Burger the sauce and everything right there and for this second iteration I
Would argue that it’s pretty pretty realistic so we click generate and then you guys are going to see how long it does take I think it is pretty pretty quick I’m I’m not sure how how they managed to get it to be this fast but um the first time I generated these
Images it was really really quick like surprisingly quick and then of course you can see here that um I said generate an image of photo realistic food and then of course we do have those images here so I think that um you know it it does really really really well um and
I’m going to say um create a picture create a picture of a realistic soccer field and then I’m just and then I’m just going to see how that works so um I think this is good because like I said one thing about this is that it doesn’t
Seem to be any limit it doesn’t say that you know um you’ve generated the maximum images right now it just seems that you know you can keep on keep on going which is uh really good in terms of you know whatever you want to use it for so this
Realistic image of a football field I like this one except that the the lights do look a little bit weird they actually kind of look like letters there um and of course we’ve got this one I mean these these ones kind of look like an Arty style then of course like I said
You can click generate more and then it is going to generate some more images based on what you want now I really don’t have any ideas on what to generate but um yeah I I think uh it’s it’s something that is pretty cool based on
What you can do now I’m going to show you guys image in 2 in vertex AI because I think that that one is going to be much more effective so here we are um in image effects um and like I said this is Google’s test lab so this is where you
Can test out different stuff um you can just sign in it’s free and you can see there’s text effects music effects um and of course here’s where we go so um you can see it says silver coin with a smiling cat five words catnip so I’m
Just going to type in tab I’m just going to go ahead and click generate um and down here like I was talking about before this is where you can click these Styles photo realistic 35 mm film minimal sketchy handmade you can also see the settings here you can see the
Seed um and you know can you know do whatever with that so I’m going to go ahead and click generate um and then it’s going to optimize my prompts going to go ahead and see exactly how long this takes it only seems to be taking a
Few seconds but um okay so that is pretty cool wow that is actually really really cool wow that is that is uh that is surprising so that was pretty quickly they generated three image and then it was a silver coin with a smiling cat and words five catnips and you can see five
Catnips was really really it I mean it’s done pretty well I mean I mean yeah um this is actually kind of shocking because how easy this is to use um I can just change it to 5 10 20 I can take it from words symbols shapes and colors um
I can change it from a coin to a met let me let me change it to a badge and I’m going to click generate um and let’s see exactly how that changes so this is what I’m saying guys like being able to change it to you know simple words that
Are related to what your thing is and then you can see it kind of changes it I mean I guess this kind of looks like a badge and then we’ve got a coin here so um and a five catnips and then we can change it to let’s do 50 catnips right
Here um this was the initial problem that they gave me so um there’s no strange obsession with catnips for that matter just so you guys um understand and then of course 50 catnips 50 catnips 50 catnips so um yeah I think this is really really cool because you know you
Can um then you can change this so you can do dream aesthetic uh let me put dream aesthetic and let’s add that I mean I don’t know about you guys but um I find that this is just way easier to use in terms of image Generation Um than
Like actually trying to think hm what do I want to use what do I want to do what kind of you know uh things you know how do I do this um and I think when this even gets even better this is going to be something that is uh you know much
More effective than some of the other things we’re using and I would argue that this looks really really good in terms of you know just how quickly this is just um how quickly you can change the seed um so yeah you can see that right here this is really really really
Good to use in terms of being able to generate something really really cool then I’m going to do one more but like I was saying I think I think this is uh just going to show that Google uh truly understands that not only are they making an image generation software they
Are actually also making themselves something that is of course a product so I think that that is one of the most important things that you do need to understand here and I think that that is something that is very important because they previously talked about how you
Know people can make better models they can make better software but if you can’t make a better product then people just won’t use it so I think that this is uh just a key showing us that you know if you have something so we can change this to SteamPunk then we can
Generate um I think that just shows us that this is something that we definitely do want so um I’m going to do steampunk City and then I’m going to see what this is like like and then we got to SteamPunk City here and then there’s four Images um and that’s yeah I I
Really like what we’re getting here so um let me know what you guys thoughts are I think that this is definitely um fascinating I think that Google has you know done very well here in terms of you know image and 2 I think you know they
Definitely are on par with some of the other state-ofthe-art models that we do see let me know how you guys get on in terms of using this for yourself I think that this is going to really change the game because other you know companies are going to now I realized that yo this
User interface is pretty crazy it’s so easier to use a lot of people are probably going to use this once they get the chance to um and yeah let me know what you guys syn if you guys enjoy the video I guess we’ll see you for the next update
Video “Googles New “Text To IMAGE Model” Just CHANGED Everything (Now RELEASED!)” was uploaded on 02/02/2024 to Youtube Channel TheAIGRID