Does anyone know what FLUX 1.1 has been trained on?
I generated almost a hundred images on the pro model using "camera filename + simple word" two-word prompts, and they all look like photos from someone's phone. Like, unless an image has text in it, I would not even stop to consider any of these images AI. They sometimes look cropped. A lot of food pictures, messy tables and apartments, etc.
Did they scrape public Facebook posts? Snapchat? Vkontakte? Buy private images from OneDrive/Dropbox? If I put a female name as the second word, it almost always triggers the NSFW filter. So I assume the images in the training set are quite private.
[edit] See for yourself (autoplay music warning):
people: https://vm.tiktok.com/ZGdeXEhMg/
food and stuff: https://vm.tiktok.com/ZGdeXEBDK/
signs: https://vm.tiktok.com/ZGdeXoAgy/
Looking at these images feels uneasy, like I am looking at someone's private photos. There is not enough "guidance" in a prompt like "IMG00012.JPG forbid" to account for these images, so it must all come from the training data.
I do not believe FLUX 1.1 pro has a radically different training set than the previous open models, even if it is more prone to this kind of generation.
It feels really off, so, again, is there any info on the training data used for these models?
It's not just Flux; you can do the same with other models, including Stable Diffusion.
These two reddit threads [1][2] explore this convention a bit.
DSC_0001-9999.JPG - Nikon Default
DSCF0001-9999.JPG - Fujifilm Default
IMG_0001-9999.JPG - Generic Image
P0001-9999.JPG - Panasonic Default
CIMG0001-9999.JPG - Casio Default
PICT0001-9999.JPG - Sony Default
Photo_0001-9999.JPG - Android Photo
VID_0001-9999.mp4 - Generic Video
Edit: I also created a version for 3D software filenames (all of them tested; only a few had any effect):
Autodesk Filmbox (FBX): my_model0001-9999.fbx
Stereolithography (STL): Model0001-9999.stl
3ds Max: 3ds_Scene0001-9999.max
Cinema 4D: Project0001-9999.c4d
Maya (ASCII): Animation0001-9999.ma
SketchUp: SketchUp0001-9999.skp
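If you want to try the sweep yourself, here is a minimal sketch using the open-weight FLUX.1 [schnell] checkpoint through Hugging Face diffusers' FluxPipeline (FLUX 1.1 pro itself is API-only, so this only approximates the experiment). The filename prefixes and word list are just illustrative picks, not anything confirmed about the training data.

    # Sketch: sweep "camera filename + simple word" prompts through an open FLUX.1
    # checkpoint via diffusers. Assumes diffusers >= 0.30 with FluxPipeline available.
    import random
    import torch
    from diffusers import FluxPipeline

    # Illustrative camera-default prefixes and filler words (not from the thread).
    PREFIXES = ["DSC_", "DSCF", "IMG_", "CIMG", "PICT", "Photo_"]
    WORDS = ["coffee", "kitchen", "birthday", "receipt", "dog"]

    pipe = FluxPipeline.from_pretrained(
        "black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16
    )
    pipe.enable_model_cpu_offload()  # keeps VRAM usage manageable

    for i in range(5):
        # Build a two-word prompt like "DSC_0423.JPG coffee"
        prompt = f"{random.choice(PREFIXES)}{random.randint(1, 9999):04d}.JPG {random.choice(WORDS)}"
        image = pipe(
            prompt,
            num_inference_steps=4,   # schnell is distilled for few steps
            guidance_scale=0.0,      # schnell ignores guidance
            generator=torch.Generator("cpu").manual_seed(i),
        ).images[0]
        image.save(f"sample_{i:02d}.png")
        print(prompt)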
I’m not sure what saar means here, but these images are fairly standard and a drop in the bucket compared to the hideous number of porn fine-tunes published daily on Civitai, if that’s what you’re looking for.
I highly doubt it’s a product of the raw training dataset, because I had the opposite problem: the token "background" introduced intense blur across the whole image almost regardless of how it was used in the prompt, which is interesting because the model’s prompt interpretation is otherwise much better.
It seems likely that they did heavy calibration of the text as well as a lot of tuning effort to make the model prefer images that are "flux-y".
Whatever process they’re following, they’ve inadvertently made the model overly sensitive to certain terms, to the point at which their mere inclusion is stronger than a LoRA.
The photos you’re showing aren’t especially noteworthy in the scheme of things. It doesn’t take a lot of effort to “escape” the basic image formatting and get something hyper realistic. Personally I don’t think they’re trying to hide the hyper realism so much as trying to default to imagery that people want.