AI Design Glossary

All the AI design terms you need to know, explained simply and clearly

All Basic Concepts (8)Generation Techniques (36)Model Architectures (12)Advanced Techniques (11)General Concepts (8)

A B C D E F G I K L M N O P Q R S T U V W Z75 terms

AI 3D Modeling

Generation Techniques

Technology that creates three-dimensional models from text or images using AI. It significantly accelerates the traditional 3D modeling process.

AI Animation

Generation Techniques

Technology that transforms static images or character designs into animated motion using artificial intelligence.

AI Art

General Concepts

Artworks created using AI technologies or where AI plays an active role in the creative process. It enables producing digital art without needing traditional artistic skills.

AI Avatar

Generation Techniques

Digital human representatives created with AI. They are used to produce video content with realistic facial expressions, speech, and gestures.

AI Background Removal

Generation Techniques

Technology that automatically removes or replaces backgrounds in photos using AI. It precisely separates the foreground through object segmentation.

AI Colorization

Generation Techniques

Technology that automatically colorizes black-and-white photos and videos using AI. It predicts accurate colors through historical context and object recognition.

AI Design

General Concepts

A broad field encompassing the integration of AI technologies into design processes. It includes applications ranging from UI/UX design to graphic design, architectural visualization to product design.

AI Face Swap

Generation Techniques

Technology that replaces a person's face in a photo or video with another face using AI.

AI Hallucination

Basic Concepts

The tendency of AI models to generate non-real, fabricated, or incorrect information/content. It manifests differently in text, image, and video generation.

AI Image Editing

General Concepts

A general term covering AI-powered photo and image editing tools. Includes automated retouching, object removal, style transformation, and more.

AI Inpainting & Outpainting

Generation Techniques

Inpainting edits/replaces specific areas of an image with AI, while outpainting extends image boundaries to create new content.

AI Interior Design

Generation Techniques

Technology that uses AI to redesign interior photos in different styles or create interior visuals from scratch.

AI Logo Design

Generation Techniques

Technology that automatically creates logos for brands and businesses using AI. It generates professional logo alternatives from text descriptions.

AI Music Generation

Generation Techniques

Technology that generates original music compositions, songs, and sound effects using AI. It creates music from text descriptions or melodies.

AI Presentation Design

Generation Techniques

Technology that creates professional presentations using AI. It automatically generates slides, visual layouts, and design suggestions from text content.

AI Product Photography

Generation Techniques

Technology that creates professional product photographs or enhances existing product visuals using AI.

AI Super Resolution

Generation Techniques

Advanced enhancement technology that uses AI to increase image resolution far beyond the original and reconstruct lost details.

AI Upscaling

Generation Techniques

Technology that uses AI to increase the resolution of low-resolution images. It intelligently predicts and reconstructs lost details.

AI Video Editing

Generation Techniques

Technologies that automate and accelerate the video editing process using AI. Covers automated cutting, color correction, subtitles, and effect application.

AI Video Generation

General Concepts

General term for technologies that create new video content from text, image, or video inputs using AI.

AI Voice Cloning

Generation Techniques

Technology that produces synthetic speech by cloning a person's voice using AI. It creates realistic voice copies from short audio samples.

Attention Mechanism

Model Architectures

The attention mechanism is an AI component that allows neural networks to selectively focus on different parts of input data.

Batch Processing

General Concepts

The automatic processing of multiple images or operations simultaneously or sequentially. Instead of manually processing items one by one, it enables the management of hundreds or thousands of items through an automated pipeline.

CLIP

Model Architectures

A multimodal AI model developed by OpenAI that can represent text and images in the same vector space. Used as a prompt understanding layer in image generation tools.

CLIP Score

Advanced Techniques

An evaluation metric that measures how well a generated image aligns with the given text prompt. It quantifies prompt alignment by calculating the cosine similarity between text and image embeddings of the CLIP model.

Conditioning

Advanced Techniques

Conditioning is the process of guiding an AI model to generate outputs based on specific inputs.

ControlNet

Advanced Techniques

A neural network architecture that adds additional control layers to diffusion models to specify structural conditions like pose, edges, and depth maps during image generation.

Cross-Attention

Model Architectures

Cross-attention is a specialized attention mechanism where computations are performed between two different data sequences.

Deepfake

Generation Techniques

A technique that uses deep learning technology to realistically superimpose a person's face, voice, or movements onto another person.

Denoising

Generation Techniques

Denoising is the fundamental principle of diffusion models; the model removes random noise step by step to form a coherent image.

Diffusion Model

Model Architectures

A deep learning model that generates images by gradually denoising. It starts from random noise and step by step creates a meaningful image.

DreamBooth

Advanced Techniques

A fine-tuning method developed by Google that customizes AI models to a specific subject or style using just a few photos.

Embedding

Model Architectures

The process of converting text, images, or other data types into dense, fixed-size numerical vectors. Used for semantic similarity calculation and model input representation.

Face Restoration

Generation Techniques

Technology for improving and sharpening low-quality, blurry, or damaged face images using artificial intelligence. It automatically repairs facial components like eyes, mouth, and skin, producing naturally-looking results.

Fine-Tuning

Advanced Techniques

The process of customizing a pre-trained AI model by providing additional training on a specific task, style, or dataset.

Fréchet Inception Distance

Basic Concepts

The FID Score is a standard evaluation metric used to measure the quality and realism of generated images.

GAN (Generative Adversarial Network)

Model Architectures

A deep learning model where two neural networks are trained against each other: a generator and a discriminator. The generator tries to produce realistic data, while the discriminator tries to distinguish between real and fake data.

Generative AI

General Concepts

The general term for AI systems that can produce new and original content based on patterns learned from training data. It covers text, image, video, music, and code generation.

Guidance Scale

Generation Techniques

Guidance Scale controls how strictly generation adheres to the text prompt in diffusion models.

Image Captioning

Generation Techniques

An AI technology that automatically describes the content of an image in text format. It expresses objects, scenes, colors, and relationships in the image with natural language sentences.

Image-to-Image

Generation Techniques

A technique that generates or transforms a new image using AI by referencing an existing image. The structure of the input image is preserved while style, content, or details can be modified.

img2img

Generation Techniques

Abbreviation for image-to-image. In the Stable Diffusion ecosystem, it refers to the mode of generating new images using a reference image. It transforms while preserving the structure of the original image.

Inference

Basic Concepts

The process where a trained AI model makes predictions or generates output on new inputs. In image generation, it corresponds to converting a prompt into an image.

Inpainting

Generation Techniques

A technique for regenerating or editing a specific area of an image by masking it with AI. Used for removing unwanted objects or modifying specific areas.

Knowledge Distillation

Advanced Techniques

Knowledge distillation transfers knowledge from a large teacher model to a smaller student model.

Latent Consistency Model

Model Architectures

LCM reduces traditional diffusion models' dozens of steps to just 4-8 steps.

Latent Space

Model Architectures

A multidimensional space where data is compressed and mathematically represented. Diffusion models perform image generation in this compressed space for computational efficiency.

LoRA (Low-Rank Adaptation)

Advanced Techniques

A method for efficiently fine-tuning large AI models by adding small, trainable matrices. The original model weights remain unchanged.

Mask

Generation Techniques

A binary or grayscale layer in image processing that selects certain regions while excluding others. In operations like inpainting and segmentation, it defines which pixels will be changed.

Model Merging

Advanced Techniques

Model merging combines weights of two or more models through mathematical methods to create a hybrid model.

Motion Brush

Generation Techniques

A tool in video generation platforms that allows selecting specific regions of an image and defining custom movement direction and intensity for those regions. It enables creating more professional and intentional videos with selective motion control.

Multi-Modal AI

General Concepts

Multi-modal AI describes systems capable of processing different data types like text, images, audio, and video simultaneously.

Negative Prompt

Basic Concepts

A text command that defines unwanted elements in AI image generation. The model generates images while avoiding the elements specified in the negative prompt.

Outpainting

Generation Techniques

A technique that extends the boundaries of an existing image using AI. It enlarges the image by creating new areas that are consistent with the original content.

Prompt

Basic Concepts

A text-based instruction or command given to AI models. It is used to describe the desired output and guides the model's generation process.

Prompt Engineering

Basic Concepts

A discipline that encompasses techniques and strategies for writing prompts to get the best results from AI models. It includes proper word choice, structuring, and parameter usage.

Quantization

Advanced Techniques

Quantization reduces model size and speeds up inference by converting numerical values to lower bit formats.

Region Prompt

Advanced Techniques

An advanced prompt technique that assigns different text instructions to different regions of an image, providing independent content control for each area. It allows precise control over composition and layout.

Stable Diffusion XL (SDXL)

Model Architectures

SDXL is an advanced diffusion model released by Stability AI in 2023, offering 1024x1024 native resolution.

Style Transfer

Generation Techniques

A technique that applies the artistic style of one image while preserving the content of another. It has uses such as reinterpreting photographs in the style of famous painters.

Super Resolution

Generation Techniques

Technology for creating a high-resolution version from a low-resolution image using artificial intelligence. Similar to upscale but can also fill in non-original areas of the image through hallucination.

Temporal Consistency

Advanced Techniques

The preservation of visual consistency between consecutive frames in video and animation generation. It encompasses the techniques used to ensure objects, characters, and backgrounds appear consistent from frame to frame.

Text-to-Image

Generation Techniques

Technology that generates images from natural language text descriptions using artificial intelligence. The prompt written by the user is interpreted by the AI model and converted into an image.

Text-to-Video

Generation Techniques

Technology that generates video content from natural language text descriptions using artificial intelligence. It converts text prompts into moving, consistent frame sequences.

Text-to-Video (Txt2Vid)

Generation Techniques

Technology that automatically generates video from text prompts using AI. It converts written descriptions into moving image sequences.

Tiling

Generation Techniques

A technique for generating tileable textures or patterns where the edges of an image connect seamlessly with each other. Used in games, graphic design, and fabric design for creating repeating patterns.

Token

Basic Concepts

The basic unit used by AI models when processing text. It can be a word, word fragment, or character. Prompt length and model capacity are measured in token count.

Transformer

Model Architectures

A deep learning architecture based on the attention mechanism with parallel processing capability. It forms the foundation of both language and visual models.

txt2img

Generation Techniques

Abbreviation for text-to-image. In the Stable Diffusion ecosystem, it refers to the mode of generating images from text prompts. Unlike img2img, it produces images from scratch.

Upscaling

Generation Techniques

The process of enlarging low-resolution images using AI without quality loss or with quality enhancement. It differs from traditional resizing with its ability to add detail and sharpen.

VAE (Variational Autoencoder)

Model Architectures

A probabilistic deep learning model that encodes data into a compressed latent space and can generate new data from this space. Used in the image encoding layer of diffusion models.

Video Diffusion

Model Architectures

The extension of diffusion models to the time dimension for use in video generation. By denoising in both spatial and temporal dimensions, it creates consistent and fluid video sequences.

Video-to-Video (Vid2Vid)

Generation Techniques

Technology that transforms an existing video into a new one using AI while taking it as reference. The video's structure and motion are preserved while style, content, or visual quality is modified.

Watermark Detection

General Concepts

AI technology that detects visible or invisible watermarks in images and in some cases removes them. Used in content verification, copyright protection, and AI-generation tracking fields.

Zero-Shot Learning

Basic Concepts

Zero-shot learning is the ability to perform tasks never seen in training data using only general knowledge.