Illustrious XL v2.0：1536分辨率时代最佳的训练基础模型

Angelbottomless2025年3月15日大约 5 分钟

Illustrious XL v2.0：1536分辨率时代最佳的训练基础模型

简介

Illustrious XL 1.0-2.0系列旨在稳定1536分辨率的原生生成，同时显著提高自然语言理解能力。

虽然用户有时会观察到在1024x1536分辨率下能成功生成，但这些并不稳定。同样，512x512分辨率的生成偶尔也会产生不必要的伪影。

早期版本为何不稳定？

这些不一致的根本原因很简单：模型未在这些分辨率上进行有效泛化或训练。使用小数据集填补这些空白往往会导致在某些分辨率上过拟合。这意味着模型会将特定分辨率与特定概念关联起来，使其在多样化生成时变得不可靠。

一个有用的比喻是"广角效果"。如果数据集通常包含广角镜头，当给定广角分辨率时，模型自然会生成更小的人物，因为这是它学习泛化的方式。

为了解决这个问题，Illustrious XL v2.0需要大规模数据集和大规模训练——与原始v0.1训练相当——以消除各分辨率和数据集之间的偏差。

prompt: "stylish, no humans, city light, black theme, dim lighting, high contrast, night sky, masterpiece, absurdres, depth of field, butterflies, extremely aesthetic, absurdres, wallpaper, panorama, city background, neon, milky way, photo background, 512x512 generation"

"prompt: "The image features two characters,each with distinct black and white outfits,standing back-to-back. The character on the left wears a white coat with black accents,black pants,and boots,and is chained at the wrists and ankles. The character on the right is dressed in a black coat with white accents,black pants,and boots,also chained at the wrists and ankles. Both characters have spiked black hair and wield large key-shaped weapons. The background is white,and the text "Wielder Of The Key" and "Controls Light & Darkness" is displayed above and below the characters,respectively" Negative prompt: "worst quality, low quality, lowres, low details, bad quality, poorly drawn, bad anatomy, multiple views, bad hands, blurry, artist sign" Steps: 28, Sampler: Euler a, Schedule type: Automatic, CFG scale: 7.5, Seed: 3420215296, Size: 1248x1824*

prompt: "Generate a highly detailed anime-style illustration of a young man floating serenely above a sprawling, futuristic cityscape. The boy has dark, messy hair and piercing blue eyes. He's wearing a long, flowing white coat over dark, streamlined clothing – think a mix of traditional Japanese garments and futuristic techwear. His expression is calm and confident, almost detached. He is surrounded by a faint, glowing aura of light, possibly blue or white. Below him is a vast sci-fi city, filled with towering skyscrapers, holographic advertisements, and flying vehicles. The city should have a vibrant color palette – neon blues, purples, and pinks contrasting with darker metallic structures. There should be a sense of depth and scale, with buildings receding into the distance. The overall atmosphere should be epic and awe-inspiring, suggesting a powerful and mysterious character overlooking a technologically advanced world. Focus on dynamic lighting and detailed textures to create a visually stunning image. wlop, quasarcake, masterpiece"