HMOF_AI - Best AI Art & Models | Tensor.Art

Primary inspiration for primary prompts, or: Why your prompt is the most important tool in imagegen

So for those that follow me, you might notice that I very heavily use positive prompts in my gens, much moreso than other users, and almost always have a very "standard" negative prompt. Further, I also don't make a lot of use out of LORAs, only having between 1-3 per image, where some users will pile them on. And that's not a criticism! Ultimately, the wonderful thing about AI image generation, and art in general, is that you do what works for you, you develop your own style, and you eventually figure out how you can generate the images you want to see with regularity.But why do I do that? It's simple. Every model, every interface, every website you use to generate images is going to have a positive prompt entry field of some kind. Whether it's text, img2img, img2video, or what, there will be some place to enter your first idea. But not every place you go will have, even a negative prompt. Definitely not every place will have LORAs, or inpainting. So, in my mind, if you can get really good with your positive prompts, that's a skill that will take you anywhere.With that in mind, then, this article is definitely something that has worked for me, but ultimately it's all my theories. If you know of ways that work better for you, continue to use those! ^_^In most ways, imagegen models very much are not intelligent and not human, I wanna be clear about that, heh. I don't suddenly think there's gonna be a robot uprising or anything LOL.But, in some ways, imagegen models can almost be thought of as "learning" as more data is pumped into them. What I mean by that is, when you're prompting a model for an image, let's just do a simple, singular concept, like a "House", what the model will do is go back to the patterns of noise it has in its database that are tagged "House" and then start generating up from there. Again as you know, models don't have the raw images stored in their databases - that would mean even tiny models would have massive sizes, but it's patterns in the data that it pulls from when you prompt.So, with our prompt of "house", model will pull from all the images it's been trained on that are tagged "house..."Ones in the city, ones in a suburb, ones in the country, two-story ones, flat, ranch-style ones, ones with fences, ones without, ones with steep roofs...etc etc etc.That's why, if you're functionally using the same identical prompt for an image, heck, even if you're using the same seed, you'll still get some variation, because the model goes through and recombines your results in slightly different ways. Let's take a single character, singular, fairly simple prompt, and single LORA for an example: Freya Crescent from Final Fantasy.Initial Prompt:"high detail, clean lines, detailed background, depth of field, very aesthetic, high resolution, detailed scenery, newest, best face, ((detailed eyes)), expressive eyes, good anatomy, (character outline, solid outline), digital art (style), sharp shadows, (((flat shading))),1girl, standing, full-body, full length portrait, solo, dynamic posing, action pose, posing, final fantasy, burmecian, murid, rodent, (((freya crescent))), ears, tail, clothed, fully clothed, staff, holding staff, holding weapon,"All images generated using base model Nova Furry XL 5.0, Seed: 1234567890, Sampler/Scheduler: dpmpp_2s_ancestral/karras, Sampling Steps 25, CFG scale 5, and upscaled using R-ESRGAN Anime 6B for consistency.I picked Freya (aside from her cuteness) because she's a fairly well-known furry character, but her unique design means that she's not necessarily going to come out 100% accurately with pure model prompting alone, though as you can see, we got pretty close...Then, when you add a LORA in, you can get results like this:But again, even if you're using the same prompt, you'll get small variations in things like her posing, the background environment, and other fine details like that. Why is that? Remember, the model doesn't just pull noise/data from some area that is, say, “an outline around Freya's body”; it pulls the noising data...for the entire image. For every single tag. For “1girl”. For “standing”. For “action pose”. For “rodent”. For absolutely everything. And anything that a LORA won't “know” about in its dataset, is going to fall back to the base model. For example, most images that served as the training data for Freya probably were solo pics...I am entirely guessing on that, but that's my theory. But even with that, then that explains why some of your pics would come out 3/4 angled, or straight-on front, or even the occasional profile or over-the-shoulder pic, because if it's not specified in the prompt, the model will just guess. It knows I want a pic of her in a standing position, and I want it to be a full-length portrait, but beyond that it doesn't “know” what I want for her pose, so it just throws things at me.Now, we'll get more into prompting in a minute, and how small word changes can have big effects on the final image, but I want to detour slightly and talk about LORAs and LORA strength for a little bit. You'll notice in the image I linked above, even though my quality prompts call for flat shading, a digital art style, and a character outline, the image still looks like something more “traditionally” AI-art generated. That's because I had the LORA set to strength 1.0. Not a bad thing, but if you're running into a LORA that has a very strong identity, or maybe is even a little overtuned, or overtrained, it will override your prompt to generate images that are in line with its own data. If you're relying on LORAs for significant parts of your image, then you're going to run into trouble. But if you're able to make pure prompting the majority of your image, then you can tone the LORA strength down fairly significantly to still get its effects without ruining your image's intent. For example:This image is with the LORA strength turned down to 0.4 – it's still visibly identifiable as Freya with her clothes, outfit, and unique design. But it also takes into account my intentions for the image, the flat shading/2d-esque art style.But that's not all a prompt can do. You'll note in the main prompt, I only prompted that I wanted Freya to be “clothed, fully clothed”, which, of course. But with that in mind, and remembering that the model assumes “defaults” only if you don't have prompts that override its own assumptions, let's put Freya into something more casual, say, a red hoodie and bluejeans:Full prompt: “high detail, clean lines, detailed background, depth of field, very aesthetic, high resolution, detailed scenery, newest, best face, ((detailed eyes)), expressive eyes, good anatomy, (character outline, solid outline), digital art (style), sharp shadows, (((flat shading))),1girl, standing, full-body, full length portrait, solo, dynamic posing, action pose, posing, final fantasy, burmecian, murid, rodent, (((freya crescent))), ears, tail, clothed, fully clothed, staff, holding staff, holding weapon, hoodie, red hoodie, blue jeans, bluejeans, hood up, ear holes,”Illustrious-based models are incredibly smart. They have a wonderful understanding of anatomy, and not just in the “certain” fun ways. They all immediately understand that: Humanoid characters are supposed to follow a general body-plan. They have a head, that sits on a neck, that sits on shoulders, that two arms attach to, that a torso attaches to, that has legs, and some form of feet/hands/paws/claws at the ends of those arms and legs. So what that means for prompting is, you don't have to specify that “A red hoodie is a long sleeved garmet that covers Freya's torso...” You can just say “red hoodie”, and the model does the rest. I prompted for “hood up, ear holes” because that's just peak furgonomic design~ LOLBut that's not all! While her outfit is certainly much more casual and comfy, she still looks like she's ready to take on a horde, rather than something that is more appropriate for her attire. So what if we do this:Prompt: “high detail, clean lines, detailed background, depth of field, very aesthetic, high resolution, detailed scenery, newest, best face, ((detailed eyes)), expressive eyes, good anatomy, (character outline, solid outline), digital art (style), sharp shadows, (((flat shading))),interior, indoors, library, ornate, opulent, luxurious,1girl, standing, full-body, full length portrait, solo, posing, final fantasy, burmecian, murid, rodent, (((freya crescent))), ears, tail, clothed, fully clothed, staff, holding staff, holding weapon, hoodie, red hoodie, blue jeans, bluejeans, hood up, ear holes, blush, nose blush, shy smile, slight smile, one hand behind head,”So. What changes did I make? I added environmental prompts, which overrides the default “outdoors” setting that it was putting Freya in, and instead put her in a luxurious, opulent, library interior. She's doing some research into magical artifacts on a casual weekend! I changed her pose, with the “blush, nose blush, shy smile, slight smile, one hand behind head,” set of tags, because maybe she just rolled out of bed and she's a little sleepy yet.And as you can see, you can entirely change the tone and feel of an image, using just prompting alone and a few tags. Think of it this way: An imagegen model is like really fast, multilayer Photoshop, in a sense. You start with the “base” image that it thinks it should be, and you work up from there. Any changes you make in the prompt will be adhered to, to the best of its ability, 100% (e.g., environment changes override the default, outfit changes override the default, expression changes override the default, etc).But like I said at the start, if you're getting results you like, definitely continue to work with your own style! More creativity is always better, and more artists offering their thoughts into the space will make things more vibrant and wonderful for the entire community!

HMOF_AI

Semi-photoreal generations using Illustrious as a base - a brief guide:

Okay, so this is going to by no means be comprehensive, but I wanted to kinda make a mini-guide on AI art, and how I approach it, and different settings that I've discovered. I've found bits-and-pieces to this here and there, but I think this is the first time it's being put all together like this. Hopefully this is something someone finds helpful in their genning journey, or maybe even brings new people into the fold!Nova Furry Pony XL - Illustrious v2.0, (https://tensor.art/models/804656732992289076) that's the primary base model I use for most of my generations. It's versatile, flexible, and, being based off of Illustrious, rather than Stable Diffusion, Flux, or a raw Pony model, has simpler prompting requirements, better prompt adherence, and is much easier to generate multi-character interactions with than those other models. And, in terms of emphasizing or de-emphasizing certain things you do or don't want to see in the image, it's a simple matter of parentheses (), brackets, [], or braces {}, and the number of each determines the weighting for the prompt. No more memorizing carat and number usage for certain tags and not others, it's all streamlined. And now, LoRAs can be entirely used for changing the image to your liking, rather than being necessary to get something working in the first place.My base prompt for the images I'm using as examples, including the title image is as follows:---masterpiece, 8k, high detail, clean lines, detailed background, depth of field, best quality, amazing quality, very aesthetic, high resolution, ultra-detailed, absurdres, detailed scenery, volumetric lighting, score9_up, score8_up, score7_up, newest, realistic skin texture, realistic facial details, realistic face, best facial details, best face, good anatomy, expressive eyes, detailed hair, DSLR, ((realistic)), (photorealistic), ((realism)), (photorealism), sharp detail, ((HDR)), bright lighting, sharp shadows,close-up, outdoors, outside, city,interspecies, straight, human on anthro, human male on anthro female,BREAKCHARACTER 1: "1girl, source_furry, female, (((anthro))), furry, detailed fluffy fur, fuzzy, fox, muscular, red fur, white markings, white hair, fluffy hair, eyeliner, wingtip eyeliner, eyeshadow, smoky eyeshadow, bedroom eyes, fully clothed, armor, cyberpunk, futuristic armor, imminent kiss, squint, looking at partner,"BREAKCHARACTER 2: "1boy, human, ((human male)), man, dark skin, african american, black, buzzcut, hoodie, (hood up), imminent kiss, squint, looking at partner,"---I'm not sure what "separators" (breaks, using the word BREAK, putting things in quotes etc) actually work vs what is placebo effect for my own brain, but it does appear to "read" tags in groups. So like, if you put all your character tags in one batch, and then your environment tags in a batch, and then your second character tags in a batch, or something like that, you might have some success when generating multiple characters interacting in various ways.One key way you can change the feel of an image while keeping the same overall "style" is through changing your sampler and scheduler. I've not gone through every possible combination, but as you can see, changing the scheduler will make minor changes to the image, while changing the sampler will result in larger changes.Showcase of Samplers - AlbumHowever, making much larger changes than just samplers, are the models themselves. Each image is labeled with the model used to create it. The link to the full album of what I consider "good" Illustrious models of course is below, and then I have some thoughts about selected models beneath that. I was actually able to find some models that generate photoreal out of the gate, no LoRAs needed, but as you'll soon see, there are reasons I still prefer the more animesque models...Showcase of Animesque Models - AlbumModel - novaFurryXL_illustriousV30 - The latest version of NovaFurryXL, I actually may switch to this one in the future as my base model.Model - novaUnrealXL - Another good animesque model - produces quality stuff, but as you can see, it flipped the prompt - gave us a red-furred fox guy and a human girlModel - ntrMIXIllustriousXL_v40 - Same prompt, same seed, very different results. Intimate, romantic, not going in for a kiss thoughModel - WAI-Nsfw-Illustrious-09 - The most detailed of the animesque models, the clean lines are like something out of a comic book - definitely fun to play around with if you're doing human gens, but I'm not a fan of how it likes to stick big-ol' lips on furries...Showcase of Realistic Models - AlbumModel - alchemistMix_v40, Model - alchemistMixCreative_v10, Model - novaAnimalPony_v40, Model - ponyRealism_v22, Model - pornmasterPro_noobv01 and v02, Model - realismIllustriousBy_v22.I don't have a whole lot of individual thoughts about these because, for most of them, they don't even adhere to the prompt. Pony Realism is one of those that I kept in as an example of that. It just gave me two humans. Some other ones, even when not prompting for sex at all, were going very sexual with their results. Good, quality results - good anatomy and everything, just not even close to what I'm prompting.But, for the ones that do adhere to the prompt, they all have the same problem.It looks like, #1, you're just sticking a fox's head on a human's body, they don't really look "furry/anthro" enough, and #2, the expressions are so bland. Like look at these faces, do they look like lovers sharing a moment? Or do they look like plastic mannequins? I know what they look like to me LOL. And it's not good.So with that in mind, I stick to the more animesque base models as a starting point, and then build from there. So, I mix in LoRAs, to get the prompt adherence and the expressiveness I want, but with the photorealism that makes it all look good. Now of course with this, it's highly experimental and definitely "set-to-taste" however your preferences are, but here's some examples of the LoRAs I've been using, and various settings as well. As you can see, at high values (above 1.0), you can trigger almost as large of a change as you would by changing your model. And the same is true for other LoRAs as well, not just photorealistic ones.Showcase of LoRAs - Album(Not sure why Imgur flagged this as 18+, nothing in there that is...)So, with all that in mind, then, my workflow goes something like this:Generate a bunch of images with my chosen base model, settings, and LoRA valuesFind one that best matches what I'm looking forRun the final result through an upscaler, both to bring up the resolution, and to clean up details, especially in the facial area.Workflow walkthrough - Final productHopefully this helps someone if they're wanting to know how to generate more expressive photoreal or semi-photoreal generations, or even if you're looking to get into AI genning for the first time. My DMs are always open if you want assistance, guidance, or even just to chat and chill. I'm really no better than anybody else in the community, I just figured I'd get my thoughts down, because I always believe that helping each other out makes us all better, and especially in this community, hiding things like prompts, workflows, generation styles, etc., all that does is hold us all back.

HMOF_AI