The quality of an AI-generated architecture diagram depends almost entirely on the quality of the prompt. A vague prompt like "draw a microservices architecture" produces a generic diagram. A specific prompt that describes services, connections, network layers, and data flows produces a diagram accurate enough to share in a design review.
This guide collects 30+ ready-to-use prompts organized by category. Copy, adapt, and paste them into ArchitectureDiagram.ai to generate diagrams in seconds.
What makes a great architecture diagram prompt?
Four things separate a good prompt from a bad one:
- Named components — specific service names (ECS Fargate, not just "compute"; PostgreSQL, not just "database")
- Request flow — walk a request from client to data store and back
- Network boundaries — VPCs, subnets, namespaces, or logical groupings the AI should draw as containers
- Security and observability — WAF, IAM roles, CloudWatch, Prometheus — the layers most teams forget to include
AWS architecture diagram prompts
Three-tier web application (ECS + RDS + CloudFront)
"Three-tier AWS web app in us-east-1. Route 53 -> CloudFront -> internet-facing ALB in public subnets. ALB routes to ECS Fargate tasks in private subnets across two AZs. Tasks read/write RDS PostgreSQL Multi-AZ in isolated subnets and cache in ElastiCache Redis. Static assets in S3 served via CloudFront. NAT gateway for outbound. Show security groups, VPC boundaries, and IAM task role."
Serverless API (Lambda + API Gateway + DynamoDB)
"Serverless AWS API: Route 53 -> CloudFront -> API Gateway HTTP API with Cognito JWT authorizer. API Gateway routes to 5 Lambda functions (create, read, update, delete, search). All functions write to DynamoDB global table. Search function also queries OpenSearch. SNS fan-out to SQS for async notifications. Lambda uses X-Ray tracing and CloudWatch Logs."
Event-driven data pipeline (Kinesis + Glue + Athena)
"AWS event-driven data pipeline: IoT devices publish JSON events to Kinesis Data Stream (3 shards). Kinesis Data Firehose batches events to S3 raw bucket with Snappy compression. AWS Glue crawler builds Data Catalog every 6 hours. Athena queries catalog for ad-hoc analysis. A separate Glue ETL job transforms raw S3 to Parquet in a curated bucket. Failed records route to SQS dead-letter queue. CloudWatch alarms on Kinesis IteratorAge."
Multi-region active-active
"Active-active AWS in us-east-1 and eu-west-1. Route 53 latency routing. Each region: CloudFront -> ALB -> ECS tasks -> Aurora Global Database (writer in us-east-1, reader in eu-west-1). S3 cross-region replication for static assets. ElastiCache Redis per region with no cross-region sync. Show the Aurora replication link and latency routing arrows between regions."
EKS with Istio service mesh
"EKS cluster in us-east-1 with Istio service mesh. Ingress: AWS Load Balancer Controller -> Istio ingress gateway -> VirtualService. Services: user-service, order-service, payment-service, notification-service — each with Envoy sidecar. mTLS between all services. Kiali for mesh observability, Prometheus + Grafana for metrics, Jaeger for distributed tracing. RDS PostgreSQL outside the cluster via VPC endpoint."
Azure architecture diagram prompts
Azure AKS microservices
"Azure AKS cluster in East US with three node pools: system, app, and GPU. Azure Application Gateway Ingress Controller in front. Microservices: api-gateway, catalog, cart, checkout, payment, shipping — each in separate Kubernetes namespace. Azure Service Bus for async messaging between checkout and payment. Azure Database for PostgreSQL Flexible Server per service (database-per-service pattern). Azure Key Vault for secrets. Azure Monitor + Application Insights for observability."
Azure Functions + Cosmos DB + Event Grid
"Azure serverless: API Management -> Azure Functions (Consumption plan). Functions read/write Cosmos DB (SQL API, multi-region writes). Cosmos DB change feed triggers a separate Function that publishes to Event Grid. Event Grid routes to three subscribers: Blob Storage (archival), Logic Apps (email notification), Service Bus (downstream processing). Application Insights tracks all Function executions."
GCP architecture diagram prompts
GCP Cloud Run + Cloud SQL + Pub/Sub
"GCP: Cloud Load Balancing -> Cloud Run (us-central1, 0-100 instances). Cloud Run connects to Cloud SQL PostgreSQL via Cloud SQL Auth Proxy. Background jobs publish to Pub/Sub topic. Two Pub/Sub subscriptions: one Cloud Run job (batch processor), one BigQuery subscription (analytics sink). Cloud Armor WAF in front of load balancer. Cloud Logging + Cloud Monitoring for observability."
Kubernetes architecture diagram prompts
GitOps with Argo CD
"Kubernetes GitOps: GitHub repo with Helm charts and Kustomize overlays. Argo CD watches the repo and syncs manifests to EKS cluster. Argo CD manages three Applications: frontend, backend, infrastructure. External Secrets Operator syncs secrets from AWS Secrets Manager. Cert-manager issues TLS certificates from Let's Encrypt. NGINX Ingress Controller with external-dns updating Route 53."
Multi-tenant platform with namespace isolation
"Multi-tenant Kubernetes: each tenant gets a dedicated namespace with NetworkPolicy enforcing namespace isolation. Shared ingress controller routes by host (tenant-a.example.com -> namespace-a). Shared Prometheus with per-namespace dashboards in Grafana. Tenant namespaces have ResourceQuota (4 CPUs, 8Gi RAM) and LimitRange. OPA Gatekeeper enforces policy: no latest image tags, required labels on all workloads."
Microservices architecture diagram prompts
API gateway with backend for frontend (BFF)
"Microservices with BFF pattern: mobile client hits mobile-bff, web client hits web-bff. Both BFFs call the same downstream services: auth-service (JWT), product-catalog, inventory, cart, checkout. Checkout calls payment-service (Stripe) and notification-service. notification-service sends email (SendGrid) and SMS (Twilio). All services log to ELK stack, trace with Jaeger."
CQRS with event sourcing
"CQRS + Event Sourcing: command side writes domain events to EventStore DB. Events are projected to three read models: PostgreSQL (relational queries), Redis (session/cache), Elasticsearch (full-text search). Read services query their respective read models. Event processor subscribes to EventStore and updates projections. Commands go through validation and business rules before writing events."
Saga pattern for distributed transactions
"Orchestration-based saga for order processing. Saga orchestrator coordinates: (1) inventory-service reserves stock, (2) payment-service charges card, (3) shipping-service creates shipment, (4) notification-service sends confirmation. If payment fails, compensation: inventory-service releases reservation. If shipping fails, compensation: payment-service issues refund + inventory releases. Show the happy path and both compensation flows."
AI / ML architecture diagram prompts
RAG pipeline (Retrieval-Augmented Generation)
"RAG pipeline: user query -> embedding model (text-embedding-3-small) -> vector similarity search in Pinecone. Top-5 chunks retrieved. Chunks + original query assembled into prompt -> Claude claude-sonnet-4-6 for generation. Ingestion pipeline: PDF/Markdown documents -> text chunker -> embedding model -> Pinecone upsert. Show both the ingestion path and the query path."
Multi-agent AI system
"Multi-agent AI: orchestrator agent receives user task, decomposes it, and assigns subtasks to specialized agents: web-search-agent (Tavily API), code-agent (Python sandbox), file-agent (read/write local files), email-agent (Gmail API). Agents return results to orchestrator. Orchestrator synthesizes final response. All agents share a Redis-backed memory store. Langfuse for observability of all LLM calls."
LLM fine-tuning and inference pipeline
"MLOps: training data in S3 -> preprocessing job (Spark on EMR) -> cleaned dataset in S3. Fine-tuning job on SageMaker (4x A100 GPUs). Model artifacts saved to S3 Model Registry. SageMaker endpoint hosts the fine-tuned model with auto-scaling (2-10 instances). API Gateway + Lambda adapter in front of SageMaker endpoint. CloudWatch monitors endpoint latency and error rate, triggers re-training Lambda if drift detected."
Security architecture diagram prompts
Zero trust network architecture
"Zero trust: all users authenticate through Okta SSO (SAML/OIDC). Cloudflare Access enforces device posture checks before any resource is reached. Internal services are never directly internet-facing — all traffic routes through Cloudflare Tunnel. SIEM (Splunk) aggregates logs. EDR (CrowdStrike) on all endpoints. Privileged access to AWS via IAM Identity Center with session recordings in S3."
OAuth 2.0 / OIDC authorization flow
"OAuth 2.0 Authorization Code flow with PKCE: (1) user clicks login in SPA, (2) SPA generates code_verifier + code_challenge, redirects to authorization server (Auth0), (3) user authenticates, (4) authorization server redirects with auth code, (5) SPA exchanges code + code_verifier for access_token + refresh_token, (6) SPA calls API with access_token in Authorization header, (7) API validates JWT with JWKS endpoint, returns data. Show all 7 steps with numbered arrows."
Tips for refining AI-generated diagrams
After generating a first draft, use these chat editing commands to polish it:
- "Add a legend explaining the color coding" — useful when presenting to mixed audiences
- "Group components by team ownership with dashed boundaries" — shows organizational responsibility without changing the architecture
- "Add numbered arrows (1→2→3) to show request sequence" — clarifies order-of-operations in complex flows
- "Simplify to show only the critical path, remove observability details" — for executive presentations
- "Add failure mode annotations on single points of failure" — for resilience reviews
Try it
Pick any prompt above, customize the service names and regions for your system, and paste it into ArchitectureDiagram.ai. You get 2 free diagrams with no credit card required. See also: AWS diagram guide, Kubernetes diagram examples, RAG architecture diagrams.