2025-03-23
Image generation using AI that can run locally on a desktop or even a laptop has seen a tremendous leap in the last few years. When I first wrote about it after the public release of Stable Diffusion, getting everything installed was quite an involved process, involving configuring the right dependencies and terminal commands. This was especially true for those who wanted to experiment with different models (the systems that enable the generation of images from prompts) and front-ends (the user interfaces with which one can interact with the models and its results) that have since been released.
Thankfully there are now tools, such as Stability Matrix, that make the process of installing and using multiple models and front-ends very straightforward. In this article, I will walk through using Stability matrix to install a new frontend and a model to generate some images of characters as they would exist in a cyberpunk matrix.
The matrix is a staple concept in the Cyberpunk genre. It refers to a large immersive virtual reality network that users can access, most often through direct neural interfaces. In this virtual reality, data, knowledge, virtual avatars and AI can exist as tangible entities. The concept has been popularized in the book Neuromancer by William Gibson. Since then, it has been a staple in the Cyberpunk genre, appearing in the Matrix movies, the Shadowrun universe, Cyberpunk 2077 and more. In general, the concept is used to explore the effects of increasing digitization on society, human identity, and the implications of AI existing entirely within a virtual world.
To start, we first need to install Stability Matrix, which can be downloaded from its website or from its GitHub releases page. The installation process is pretty straightforward and I highly recommend opting for a portable install. This keeps all data, such as models, within a single directory, making it much easier to transfer everything to a different drive or computer in the future.
Once the application is installed, the next step is to set up a UI for generating images from prompts. There are multiple front-ends available from the packages menu. For this article, we will use Stable Diffusion WebUI Forge, as it is easy to use and more than sufficient for our very basic image generation use case.
We also need a model to generate images. For this we will use the Model Browser to find the right model for our use case. After searching for a model that is fine-tuned to generate cyberpunk imagery, I have settled on using the Splatter Punk - Neon Waves
model. Installing it is just a click away.
Now, with both a model and a front-end installed, we can start generating the images that we need. We want to generate pictures for four characters in a Cyberpunk themed world, based on four archetypes common in stories: the Hero, the Friend of the Hero, a Minion/Underling, and the Antagonist. While we could manually craft and fine tune the prompts, ChatGPT does a decent job of generating them for these archetypes:
"A brooding, determined man standing in a neon-lit alley of a futuristic city, wearing a sleek black jacket with subtle glowing circuitry patterns. His eyes reflect a mix of sadness and determination, and a faint holographic interface hovers around his wrist. The background features towering skyscrapers with digital ads, and rain softly blurs the neon glow. The atmosphere should feel tense and introspective, symbolizing betrayal and a relentless quest for justice."
"A cheerful and energetic woman in her late twenties, standing in a cluttered workshop filled with glowing wires, cybernetic components, and floating holograms. She wears mismatched tech goggles on her forehead, with brightly colored clothes accented by glowing accessories. A small, hovering drone buzzes beside her, reflecting her inventive and playful nature. The setting should be cozy yet chaotic, highlighting her technical genius and hidden vulnerabilities."
"A shadowy, humanoid figure made of fragmented code and shifting digital glitches, with piercing, glowing eyes. The body appears semi-transparent, constantly morphing between solid and pixelated forms. The background is a cold, digital void filled with streams of raw data flowing like rivers of light. The mood should be ominous and relentless, emphasizing Cipher’s role as an adaptive enforcer bound by hidden desires for autonomy."
"A powerful, composed woman standing at the center of a vast control room filled with holographic screens and pulsing data streams. She wears a high-collar, futuristic coat with minimalist design elements, her silver hair tied back in a severe style. Her expression is cold and calculating, with a subtle smirk of superiority. Behind her, the Matrix’s digital skyline stretches endlessly, symbolizing her control over the virtual world and her ambition for perfection."
The images above were generated using the default parameters with the listed prompts. In fact, the portraits of Jax and Nova were created on the first attempt, while the others were selected from multiple generated images. For this particular model, the positive prompts were mostly sufficient, however adding a negative prompt, such as one to filter out NSFW content, can be useful.
It has been incredible to see how much better the tooling and the models have become since these locally installable image generation models first were released. In the early days setups usually involved installing and managing many dependencies from the commandline, which is no longer required. Whether for storytelling, concept art, or creative exploration, AI-generated imagery even from a desktop can be just one install away.