What is text to image?
Text to image is the process of generating images from textual descriptions or prompts using machine learning algorithms. These algorithms are trained on large datasets of images and corresponding textual descriptions, and can generate new images based on text input. This technology has applications in various fields such as art, design, and advertising, and can also be used to generate realistic images for virtual reality or gaming environments. Text to image can also be used to generate images based on descriptions for people who are visually impaired or blind.
Example for SD2.0
I use Stable Diffusion versions 1.5, 2.0, and 2.1 to produce my artwork. My artwork is created without the use of filters, relying solely on the native SD versions.
My prompt is available for purchase. I utilize PromptyBrain as my primary prompt generator, which is specifically designed for use with Stable Diffusion and DallE2. With this user-friendly tool, you can easily generate well-written prompts simply by typing in your imaginative ideas. Additionally, by incorporating OpenAI’s API, you can swiftly generate stunning prompts in mere seconds. I encourage you to give it a try! (You need Gmail account and OpenAI API key to use PromptyBrain.)
Text to image involves using an enriched English vocabulary to create a prompt. Each English word carries its own meaning, and when combined effectively, they produce a great result.
I created the beautiful images above using just 36 words and 3 emojis. Here’s a tip to get started with creating: first, you need to decide on the subject of your creation, such as a woman, man, girl, boy, cat, dog, etc. This will help give you a clear direction for your project.
Here are 20 possible subject words for your project: man, woman, cat, dog, bird, tree, river, mountain, cityscape, landscape, beach, sunset, moon, starry night, unicorn, dragon, mermaid, pirate, robot, and alien.
Add adjective to describe a girl. I choose “kawaii”.
“kawaii” is a Japanese adjective that is commonly used to describe a cute or adorable girl or woman. So, “a kawaii girl” would refer to a girl who is perceived as cute or charming in a cute way.
A kawaii girl
Here are 20 words similar to “kawaii”: charming, adorable, cute, lovely, sweet, attractive, delightful, enchanting, appealing, winsome, engaging, alluring, captivating, endearing, precious, angelic, beguiling, bewitching, radiant, and graceful.
Add hair and eye color
Next, you’ll need to inform the AI of the subject’s hairstyle, such as short, long, or wavy. Additionally, providing details about the hair color will result in a more precise output for your image.
A kawaii girl with long wavy blond hair and blue eyes.
Great progress has been made! The AI has grasped my instructions. Just 8 words are sufficient for creating an impressive image. Simplicity is key, so clean and concise language is recommended.
Hair color list:
Beach blond hair, milk tea hair, ash brown hair, cherry red hair, caramel highlights, strawberry blonde hair, lavender hair, jet black hair, rose gold hair, platinum blonde hair, burgundy hair, mahogany hair, copper hair, chocolate brown hair, blue-black hair, silver hair, golden blonde hair, pastel pink hair, and dark blonde hair
Eye color list:
Baby blue eyes, hazel eyes, green eyes, brown eyes, gray eyes, amber eyes, blue-gray eyes, dark brown eyes, light blue eyes, blue-green eyes, black eyes, light brown eyes, dark blue eyes, turquoise eyes, violet eyes, olive green eyes, honey brown eyes, steel blue eyes, light green eyes, and deep-set brown eyes
Add Magic Keywords
The trick is to append computer graphics technical terms. It gives the image a professional and dramatic change. Instruct AI to apply the techniques for image enhancement.
A kawaii girl with long wavy blond hair and blue eyes. Delicate features and intricate details. Ultra realistic, bokeh, sharp features, back lighting. Studio background. Soft light, octane render, unreal engine, blender render
Adding technical terms can significantly alter the image. AI can intelligently utilize these terms to enhance images.
Here is a list of technical terms that AI can understand. Copy and paste them at the end of your sentence to drastically change your image.
- Unreal Engine: A popular game engine and real-time 3D creation tool used for creating interactive experiences, such as video games, virtual reality, and augmented reality applications.
- Octane Render: A GPU-based rendering engine that produces high-quality, photorealistic images and animations.
- Bokeh: A term used in photography to describe the aesthetic quality of the blur in out-of-focus areas of an image.
- Vray: A widely-used rendering engine that provides high-quality and realistic rendering for architectural visualization, product design, and other applications.
- Houdini Render: A rendering engine used in the 3D animation software Houdini for creating high-quality, photorealistic images and animations.
- Quixel Megascans: A library of high-quality 3D assets, including textures, materials, and models, that can be used in various 3D applications.
- Arnold Render: A rendering engine used for creating high-quality, photorealistic images and animations in film, television, and visual effects.
- 8K UHD: A high-resolution video format that has a resolution of 7680 x 4320 pixels.
- Raytracing: A rendering technique that simulates the behavior of light as it interacts with objects in a 3D scene, resulting in realistic lighting and shadows.
- CGI: Stands for Computer Generated Imagery, which refers to the use of computer graphics to create images, animations, and visual effects.
- Lumen Reflections: A feature in Unreal Engine 5 that enables real-time global illumination and accurate reflections in 3D scenes.
- CGSociety: A community of digital artists and animators who share their work, knowledge, and expertise.
- Ultra Realistic: A term used to describe computer graphics or digital art that looks extremely realistic and indistinguishable from real-life images.
- 100mm: Refers to the focal length of a camera lens, which is often used for portrait photography.
- Film Photography: Refers to the practice of capturing images on photographic film, which is processed to create prints or digital images.
- DSLR: Stands for Digital Single-Lens Reflex, which refers to a type of camera that uses a mirror and prism system to provide the photographer with an optical viewfinder.
- Cinema4D: A 3D modeling, animation, and rendering software used for creating motion graphics, visual effects, and other 3D content.
- Studio Quality: A term used to describe high-quality images or videos that have been produced in a professional studio environment.
- Film Grain: Refers to the texture or grain-like appearance that is often visible in film photography, and is sometimes added to digital images to give them a more organic and film-like look.
A mesmerizing kawaii girl with long wavy blond hair and captivating blue eyes, with intricate details on her delicate features and a stunningly realistic studio background. Using advanced techniques for lighting and texture mapping, this high-resolution image is both lifelike and otherworldly. The bokeh effect and back lighting create a mystical, dreamlike atmosphere, while the Unreal Engine adds a level of interactivity and depth to the image.
When working with Stable Diffusion to create an image, selecting the appropriate style is a crucial aspect of the process. It is important to provide clear instructions to the AI regarding the desired style of the image. Here are some examples of ideas for different styles that you can use as a reference.
analog photo, polaroid, macro photography, overglaze, volumetric fog, depth of field (or dof), silhouette, motion lines, motion blur, fisheye, ultra-wide angle
digital art, digital painting, trending on artstation, golden ratio, evocative, award winning, shiny, smooth, surreal, divine, celestial, elegant, oil painting (helps improve multiple styles), soft, fascinating, fine art, official art, keyvisual
color page, halftone, character design, concept art, symmetry, pixiv fanbox (for anime/manga), trending on dribbble (for vector graphics), precise lineart, tarot card
These terms are all related to landscape and scenery photography and art.
- Depth of field: The range of distance in an image, from the nearest to the farthest objects, that appear acceptably sharp and in focus.
- High dynamic range (HDR): A technique used in photography and cinematography to capture and display a greater range of brightness and luminosity levels than traditional imaging methods.
- Bokeh: The aesthetic quality of the blur produced in the out-of-focus parts of an image, often used to create a sense of depth or to draw attention to a subject.
- Minimalism: A design style characterized by simplicity, minimal elements, and a focus on negative space.
- Symmetry: A visual balance achieved by arranging elements in a way that is equal on either side of a central point or axis.
- Texture: The perceived surface quality of an object, created by the arrangement of its elements and materials.
- Monochromatic: A color scheme consisting of variations of a single color, often used to create a sense of unity and simplicity.
- Contrast: The difference between light and dark areas in an image, often used to create a sense of depth, drama, or emphasis.
- Silhouette: An outline or shape of an object or person filled in with a solid color, typically black, against a contrasting background.
- Rule of thirds: A compositional guideline in photography and design that involves dividing an image into thirds vertically and horizontally, and placing the most important elements along the intersections of these lines.
- Golden ratio: A mathematical concept used in art and design that involves using a ratio of approximately 1.618 to create visually appealing compositions and proportions.
- Leading lines: A compositional technique that involves using lines in an image to draw the viewer’s eye towards a particular point or subject.
- Reflection: The phenomenon of light bouncing off a surface and returning to the viewer, often used to create symmetry, depth, or a sense of illusion.
- Motion blur: A technique used to capture the movement of a subject by intentionally blurring parts of the image, often used in sports or action photography.
- Double exposure: A photographic technique that involves overlaying two or more images on top of each other to create a single image, often used to create a sense of surrealism or symbolism.
- Tilt-shift: A technique used to manipulate perspective by selectively focusing on a specific part of an image and blurring the rest, often used to create a miniature effect or to exaggerate the scale of a subject.
- Long exposure: A technique used to capture an image over an extended period of time, often resulting in a blurred or surreal effect, often used in night photography or to create a sense of movement in water or clouds.
- Time-lapse: A technique used to capture a series of images over a period of time and then compile them into a video, often used to show the changes in nature or the growth of a city skyline.
- High-speed photography: A technique used to capture fast-moving subjects or events, often used in sports, wildlife, or scientific photography.
- Infrared: A type of photography that captures light in the infrared spectrum, resulting in a surreal, otherworldly effect often used in landscape photography.
- Panorama: A wide-angle view of a landscape or cityscape that captures a larger field of view than a traditional image, often achieved through stitching together multiple images.
These terms are related to visual arts, design, and digital media. They are used to describe the quality, style, and level of detail of digital images or physical artworks.
- Angelic: Refers to having qualities or appearance of an angel. Often associated with purity and lightness.
- Breathtaking: A term used to describe an image or artwork that inspires awe or amazement in the viewer.
- Crystalline: A term used to describe an image or artwork that contains clear, sharp, and geometric shapes resembling crystals.
- Decorations: Objects or elements used to enhance the aesthetic appeal of a space or object.
- Embellishments: Decorative details or elements added to an object or design to enhance its appearance or appeal.
- Embroidery: The art of decorating fabric or other materials with needle and thread, often creating intricate and detailed designs.
- Filigree: A delicate ornamental design made of intertwined wire, often used in jewelry or architecture.
- Hard edge: A style of painting characterized by distinct, sharp edges and solid colors.
- HQ: An abbreviation for “high quality,” used to describe an image or artwork that is of exceptionally good quality.
- Hyperrealism: A style of art or photography that aims to create an extremely realistic and lifelike depiction of a subject.
- Insanely detailed: A term used to describe an image or artwork that contains an exceptionally high level of detail.
- Intricate: A term used to describe an image or artwork that contains complex and detailed elements.
- Lush detail: A term used to describe an image or artwork that contains rich and detailed elements, often associated with nature or organic forms.
- Masterpiece: A work of art that is considered the greatest achievement of an artist or a particular style or period.
- Max detail: A term used to describe an image or artwork that contains the maximum possible level of detail.
- Ornate: Elaborately decorated with intricate and complex designs, often associated with architecture or decorative arts.
- Perfectionism: A tendency to strive for flawlessness and exactness in one’s work or creations.
- Poster: A printed advertisement or announcement displayed in public spaces.
- Sharp focus: A technique used in photography to create a clear and crisp image with distinct edges and details.
- Spirals: A curved shape that forms a continuous line winding around a central point, often used in art and design.
- Tendrils: A slender, curling extension or growth, often found in plants or vines and used in art and design.
- Wallpaper: An image or pattern used as a decorative background on a computer screen or mobile device.
These terms are related to lighting and photography techniques used in visual arts, computer graphics, and cinematography. They are used to create realistic and aesthetically pleasing visual effects.
- Global illumination: A lighting technique that simulates how light reflects and interacts with objects in an environment.
- Cinematic lighting: A style of lighting used in movies to create a specific mood or atmosphere.
- Rim lighting: A technique used to create a halo-like effect around the edges of a subject, separating it from the background.
- God rays: A lighting effect that simulates beams of sunlight shining through clouds or other objects.
- Volumetric lighting: A technique used to create atmospheric effects, such as dust, fog, or haze.
- Subsurface scattering: A lighting technique that simulates how light penetrates and scatters within translucent materials, such as skin or wax.
- Luminescence: The emission of light from a material or object without being heated.
- Hard shadows: A sharp and distinct shadow created by a direct and focused light source.
- Ektachrome: A type of color film that produces bright, saturated colors.
- Diffused lighting: A soft and even lighting that lacks harsh shadows, often achieved by using a diffuser or bouncing light off a reflective surface.
- Bioluminescent details: Details that emit light naturally, often found in underwater creatures or plants.
- Halo: A circular lighting effect that appears around a light source.
- Shimmering light: A flickering or sparkling effect created by light reflecting off a textured or uneven surface.
- Iridescent: A type of lustrous color effect created by diffraction, interference, or selective absorption of certain wavelengths of light.
- Backlighting: A technique where the main light source is placed behind the subject, creating a silhouette effect.
- Radiant light rays: A lighting effect that simulates rays of light shining from a single point.
- Caustics: A lighting effect that creates patterns of light and shadow, often found in refractive surfaces like glass or water.
- Specular lighting: A technique used to create highlights and shininess on surfaces.
- Soft lighting: A gentle and diffused lighting that lacks harsh shadows, often used to create a natural and organic look.
- Studio lighting: Lighting techniques used in a controlled environment, often used in portrait or product photography.
- Translucency: A material or object’s ability to transmit and scatter light, creating a semi-transparent effect.
These words are related to color processing and manipulation in visual media such as photography, cinematography, and graphic design.
- Vibrant: Describes bright, lively, and intense colors that appear bold and eye-catching.
- Muted colors: Refers to colors that have been desaturated or toned down, resulting in a softer, more subdued appearance.
- Vivid color: Refers to colors that are intense, bright, and striking, often used to create a sense of energy and excitement.
- Post-processing: Refers to the editing and manipulation of images or videos after they have been captured or recorded, often using software tools.
- Color grading: The process of adjusting and manipulating the colors of an image or video to create a desired look or mood.
- Tone mapping: A technique used to compress the dynamic range of an image, allowing details in both bright and dark areas to be visible.
- Lush: Refers to colors that are rich, vibrant, and full of life, often associated with nature or organic forms.
- Low contrast: Refers to an image with minimal difference between the lightest and darkest areas, resulting in a muted, subdued appearance.
- Vintage: Refers to colors and styles that are reminiscent of an earlier time period, often with a nostalgic or sentimental appeal.
- Aesthetic: Refers to the visual style or appearance of an image, often associated with a particular art movement or design philosophy.
- Psychedelic: Refers to bright, bold, and often surreal colors and patterns, often associated with 1960s counterculture and psychedelic art.
- Monochrome: A color scheme consisting of a single color or shades of a single color, often used to create a sense of simplicity and elegance.
- Saturation: The intensity or purity of a color, often used to create a sense of vibrancy or depth in an image.
- Desaturation: The process of reducing the saturation of colors in an image, often used to create a muted or understated look.
- Color balance: The adjustment of the colors in an image to achieve a desired overall tone or mood.
- Color temperature: The perceived warmth or coolness of a color, often adjusted in post-processing to achieve a desired effect.
- Cross-processing: A technique of developing photographic film in a chemical solution intended for a different type of film, resulting in unique color shifts and contrasts.
- Tint: A slight color cast added to an image to create a particular mood or atmosphere.
- Gradient: A gradual transition between two or more colors, often used to create a smooth and subtle color shift in an image.
out of frame, bad anatomy, deformed hands, blurry, disfigured, deformed, cartoon, toy, figure, framed