AI Design Glossary

All the AI design terms you need to know, explained simply and clearly

All Basic Concepts (8)Generation Techniques (36)Model Architectures (12)Advanced Techniques (11)General Concepts (8)

A C D E G L S T V12 terms

Attention Mechanism

Model Architectures

The attention mechanism is an AI component that allows neural networks to selectively focus on different parts of input data.

CLIP

Model Architectures

A multimodal AI model developed by OpenAI that can represent text and images in the same vector space. Used as a prompt understanding layer in image generation tools.

Cross-Attention

Model Architectures

Cross-attention is a specialized attention mechanism where computations are performed between two different data sequences.

Diffusion Model

Model Architectures

A deep learning model that generates images by gradually denoising. It starts from random noise and step by step creates a meaningful image.

Embedding

Model Architectures

The process of converting text, images, or other data types into dense, fixed-size numerical vectors. Used for semantic similarity calculation and model input representation.

GAN (Generative Adversarial Network)

Model Architectures

A deep learning model where two neural networks are trained against each other: a generator and a discriminator. The generator tries to produce realistic data, while the discriminator tries to distinguish between real and fake data.

Latent Consistency Model

Model Architectures

LCM reduces traditional diffusion models' dozens of steps to just 4-8 steps.

Latent Space

Model Architectures

A multidimensional space where data is compressed and mathematically represented. Diffusion models perform image generation in this compressed space for computational efficiency.

Stable Diffusion XL (SDXL)

Model Architectures

SDXL is an advanced diffusion model released by Stability AI in 2023, offering 1024x1024 native resolution.

Transformer

Model Architectures

A deep learning architecture based on the attention mechanism with parallel processing capability. It forms the foundation of both language and visual models.

VAE (Variational Autoencoder)

Model Architectures

A probabilistic deep learning model that encodes data into a compressed latent space and can generate new data from this space. Used in the image encoding layer of diffusion models.

Video Diffusion

Model Architectures

The extension of diffusion models to the time dimension for use in video generation. By denoising in both spatial and temporal dimensions, it creates consistent and fluid video sequences.