AI Image Generation


Captain Suave

Caesar si viveret, ad remum dareris.
5,253
8,953
Watching various prompts from people on Midjourney, I often see "4K", "8K", etc. Does this actually make a difference? The resulting images aren't actually that resolution.
Adding to what Mist said, they're using the prompt to pull attributes of training images that were flagged as 4K/8K, etc. It could be as vague an association as "high-res images are generally produced by more competent artists and are qualitatively better along some metric I care about". Prompts include camera models and such for similar reasons.
 
  • 1Like
Reactions: 1 user

Tmac

Adventurer
<Aristocrat╭ರ_•́>
9,969
16,984
Inpainting could be super powerful but the AI has a lot of trouble with it.

It seems like you have to inpaint significant real estate, bc if it’s too small the AI doesn’t know what to do with it.
 

Lambourne

Ahn'Qiraj Raider
2,862
6,828
New Midjourney function lets you zoom out images, i.e. generate new content around an existing image (presently only MJ generated stuff though). Helpful for optimizing images that looked good but were cropped a little wrong or had the wrong aspect ratio. Even works on images generated earlier.

Base:

1687693111145.jpeg


Zoomed out:

1687693151042.jpeg


You can stack this several times, this one was done 3 times:

1687693185608.jpeg


1687693199623.jpeg
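
The zoom-out idea above can be sketched as geometry: shrink the existing image, centre it on a canvas of the same size, and let the model fill the border. This is only a rough sketch. Midjourney's actual implementation isn't public, and the names `Box`, `zoom_out_layout` and `generated_fraction`, as well as the 2x factor used in the usage note, are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Pixel rectangle occupied by the shrunken original (hypothetical helper)."""
    left: int
    top: int
    right: int
    bottom: int

    @property
    def area(self) -> int:
        return (self.right - self.left) * (self.bottom - self.top)


def zoom_out_layout(width: int, height: int, factor: float) -> Box:
    """Return where the original sits after shrinking by `factor` and centring."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    inner_w = round(width / factor)
    inner_h = round(height / factor)
    left = (width - inner_w) // 2
    top = (height - inner_h) // 2
    return Box(left, top, left + inner_w, top + inner_h)


def generated_fraction(width: int, height: int, factor: float, passes: int = 1) -> float:
    """Fraction of the final canvas that is new content after `passes` zoom-outs."""
    box = zoom_out_layout(width, height, factor ** passes)
    return 1 - box.area / (width * height)
```

Assuming a 2x zoom on a 1024x1024 image, one pass leaves the original in the middle quarter of the canvas, and stacking it three times leaves the original covering only 1/64 of the frame, which is why the stacked results look like a completely new scene.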
 
  • 10Like
Reactions: 9 users

Pasteton

Blackwing Lair Raider
2,733
1,919
pretty sick



hopefully i get to make my own virtual world soon. I will release an mmo before pantheon
 
  • 4Like
Reactions: 3 users

Kharzette

Watcher of Overs
5,341
4,072
I wonder if it could generate clean collision geometry. Those environments would be chaos in a game engine.
 
  • 1Like
Reactions: 1 user

Lambourne

Ahn'Qiraj Raider
2,862
6,828
New Midjourney function lets you expand images left and right. Unlike the earlier zoom-out function, this is new content generated at the same resolution. You can also modify the prompt.

For example, I started with a prompt for a fantasy style female thief by a campfire

1689251853047.jpeg


Then expanded the image but modified the prompt to male knight.

1689251899085.jpeg


You can redo this of course, so if you wanted to change the male figure you can just go off the base image again but with a different prompt.

1689252710661.jpeg




Definitely a lot of potential here because you get a lot more direction during the creation process. Before you'd have to just prompt for a male and a female figure by a campfire and hope for the best, now you can get one part of the image right before working on the rest. The images are also expanded rather than resampled, so you don't lose any resolution.

Combined, you can create higher resolution images than before. I'm having to scale these down a lot to post them here. The one below is like 2.5 MB uncompressed and 2700px wide.

1689251940897.jpeg


1689252000288.jpeg
 
  • 9Like
  • 1Wow!
Reactions: 9 users

Kharzette

Watcher of Overs
5,341
4,072
That above looks so good. I love how it uses the campfire light.

I just got a LoRA to work finally. I'm running about 9 seconds per iteration, so four and a half minutes per 512x512 image. Not sure what I did to make it slower. It feels like it is memory thrashing.
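
The timing above is just iterations multiplied by seconds per iteration. A quick sketch of the arithmetic follows. The step count isn't stated in the post, so the 30 steps mentioned in the usage note are inferred from the two numbers given, and the function names are made up for illustration.

```python
def seconds_per_image(seconds_per_iteration: float, steps: int) -> float:
    """Total sampling time for one image, ignoring model load and VAE decode."""
    return seconds_per_iteration * steps


def implied_steps(seconds_per_iteration: float, total_seconds: float) -> float:
    """Work backwards from the total time to the number of sampler steps."""
    return total_seconds / seconds_per_iteration
```

With 9 seconds per iteration and 4.5 minutes (270 seconds) per image, `implied_steps(9, 270)` works out to 30 sampler steps.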

This is the peace sign LoRA. It still mangles the fingers:
16079-417097083-beautiful girl, white babydoll dress, small cute pointy ears, pale skin, short...png
16078-417097082-beautiful girl, white babydoll dress, small cute pointy ears, pale skin, short...png
 
  • 3Like
Reactions: 2 users

Kharzette

Watcher of Overs
5,341
4,072
Before Overwatch League started I ran some experiments with the vtuber-poses LoRA. This sort of generates what I think the Live2D program wants to make a 2D avatar. If you prompt feet, you get a really nice almost orthographic front view.

I think this is what I've been looking for to generate good ortho templates for making 3D characters, if I can get the arms to either V or T. My stuff supports V arms now as I've got capsules working for bone bounds.

Can't post any pictures because I make the templates nude so I can see muscle details and the like.
 

Kharzette

Watcher of Overs
5,341
4,072
Here's some with "catsuit". First the default athletic slender build:
16105-1887667889-beautiful girl, catsuit, small cute pointy ears, standing, pale skin, bald, d...png


A more middle ground. This one the feet weren't emphasized enough so it cut them off:
16108-2633469206-beautiful girl, catsuit, small cute pointy ears, standing, pale skin, bald, d...png


And a couple porky big booba builds:
16114-101384094-beautiful girl, catsuit, small cute pointy ears, standing, pale skin, bald, de...png
16115-101384095-beautiful girl, catsuit, small cute pointy ears, standing, pale skin, bald, de...png


And this one gave me 2 sides. No idea why, haven't been able to reproduce it, but I love it:
16109-2345945730-beautiful girl, catsuit, small cute pointy ears, standing, pale skin, bald, d...png
 
  • 1Like
Reactions: 1 user

Rajaah

Honorable Member
<Gold Donor>
12,512
16,532
Whoa, those environments are super nice. If this tech had been around 25 years ago it would have been amazing to be able to create stuff like this as a teenager, back when I was obsessed with writing / drawing fantasy stuff. I would have been all in with AI art and it could have given life to a lot of ideas that I wasn't talented enough at drawing to create the way I wanted.
 
  • 1Like
Reactions: 1 user

Mist

REEEEeyore
<Gold Donor>
31,197
23,354
I'm from the future. Come with me if you want to be fake and gay.

1689214904358.png
 
  • 4Like
Reactions: 3 users

Kharzette

Watcher of Overs
5,341
4,072
Well I gave up on loras. They just seem to have the upper arms pinned to the torso, like that maneuver every young girl learns.

I realized that I had not gotten a single out of memory error all day long. That, combined with the slowness that felt like paging, made me realize they must have added a memory manager to this thing.

So now when you fail an allocation, it will just find the least recently used chunk and free it, then defrag to get a chunk of the size requested. DirectX does this automatically for textures / normals, but these AI coders have always been too lazy to bother.
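
The eviction behaviour described above can be sketched in a few lines. This is a toy model and not the actual memory manager in any Stable Diffusion tool: the class name `LruPool` and its methods are hypothetical, sizes are abstract units, and the defrag step is modelled only by tracking total free space, as if freed chunks were always compacted.

```python
from collections import OrderedDict
from typing import List


class LruPool:
    """Toy memory pool that evicts least recently used chunks on a failed allocation."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.used = 0
        # Insertion order doubles as recency order: oldest entry first.
        self._chunks: "OrderedDict[str, int]" = OrderedDict()

    def touch(self, name: str) -> None:
        """Mark a chunk as recently used so it is evicted last."""
        self._chunks.move_to_end(name)

    def allocate(self, name: str, size: int) -> List[str]:
        """Reserve `size` units and return the names of any evicted chunks."""
        if size > self.capacity:
            raise MemoryError("request is larger than the whole pool")
        if name in self._chunks:
            raise ValueError(f"{name!r} is already allocated")
        evicted: List[str] = []
        # Free the least recently used chunks until the request fits.
        while self.capacity - self.used < size:
            old_name, old_size = self._chunks.popitem(last=False)
            self.used -= old_size
            evicted.append(old_name)
        self._chunks[name] = size
        self.used += size
        return evicted
```

In this model an allocation only fails outright when the request is bigger than the whole pool. Everything else succeeds at the cost of evicting something, which would match the symptom of no out of memory errors but a lot of slowness from reloading evicted data.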

So, ControlNet here I come! I fired it up and sure enough it runs. The results were terrible, but it does work. Instead of using it directly, which is locked to the standard model with DDIM, I'm integrating it with the AUTOMATIC1111 UI via a plugin. I've got several gigs of models to download but I shall report back with my findings.
 
  • 1Like
Reactions: 1 user

pharmakos

soʞɐɯɹɐɥd
<Bronze Donator>
16,305
-2,234
Been experimenting using album covers or other famous images as input images for Stable Diffusion 2.1. Did low noise for these DSotM remixes.
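
For anyone wondering what "low noise" does with an input image, here is a minimal sketch. It assumes the common img2img convention (used, for example, by the Hugging Face diffusers img2img pipeline) where a strength between 0 and 1 decides how many of the scheduled denoising steps actually run. The tool used for the images above isn't stated, so the function name and parameters are illustrative only.

```python
def img2img_steps(num_inference_steps: int, strength: float) -> int:
    """Number of denoising steps applied to the noised input image.

    Low strength adds little noise and runs few steps, so the output stays
    close to the input; strength 1.0 runs the full schedule.
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    return min(int(num_inference_steps * strength), num_inference_steps)
```

Under that convention, a 40-step schedule at strength 0.25 only runs 10 steps on a lightly noised copy of the album cover, which is why the composition survives and only the details change.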

18feNM3NNORRZ8k8or5f--3--uw9d9.jpg


FIGe51vnbzrriVsv7IWA--2--b2hp7_2x.jpg


18feNM3NNORRZ8k8or5f--grid.jpg


ZUrYLbCsVdbyVV6HtbKv--grid.jpg


FIGe51vnbzrriVsv7IWA--grid.jpg
 
  • 2Like
Reactions: 1 user

Mist

REEEEeyore
<Gold Donor>
31,197
23,354
This is what I was talking about in another thread. Surprised it's happening so quickly, though not really. Just a full AI rendering pipeline, fuck polygons.



EDIT: Eh I went to their site, it's just a hack, it doesn't really work. "This demo is a conceptual demonstration of what could soon be the generation of experiences/games in the near future." It's pregenerated textures, generated by stable diffusion, and then stretched over a bunch of AI generated polygons.
 
  • 1Like
Reactions: 1 user

Pasteton

Blackwing Lair Raider
2,733
1,919
Help me understand what the roadblock is to AI creating 3D worlds. Does it need more data? Not enough compute?