Build ứng dụng AI trên Cloudflare Build AI applications on Cloudflare

Bạn muốn thêm inference, RAG hoặc gateway tới nhiều model mà không tự vận hành GPU cluster. You want inference, RAG, or multi-model gateways without operating your own GPU clusters.

Tài liệu Cloudflare (use case) → Official Cloudflare use case docs →

Kiến trúc gợi ý Suggested architecture

User → Worker/Pages → AI Gateway / Workers AI → Vectorize + R2/KV User → Worker/Pages → AI Gateway / Workers AI → Vectorize + R2/KV

Sơ đồ tham chiếu (Cloudflare Docs) Reference diagrams (Cloudflare Docs)

Figure 1: Knowledge seeding — Retrieval Augmented Generation (RAG) Retrieval Augmented Generation (RAG)

RAG kết hợp retrieval (Vectorize/KV) với Workers AI để chatbot trả lời chính xác hơn — seeding knowledge và query path tách biệt. RAG combines retrieval with generative models for better text. It uses external knowledge to create factual, relevant responses, improving coherence and accuracy in NLP tasks like chatbots.

Thuật ngữ: Concepts: RAG · Vectorize · Workers AI · Knowledge seeding · Embeddings

Sơ đồ chính thức ↗ Official diagram ↗ · AI Artificial Intelligence (AI)

Multi-vendor AI architecture — Multi-vendor AI observability and control Multi-vendor AI observability and control

By shifting features such as rate limiting, caching, and error handling to the proxy layer, organizations can apply unified configurations across services and inference service providers. By shifting features such as rate limiting, caching, and error handling to the proxy layer, organizations can apply unified configurations across services and inference service providers.

Sơ đồ chính thức ↗ Official diagram ↗ · AI Artificial Intelligence (AI)

Controls & stack Controls & stack

Workers AI cho inference tại edge Workers AI for edge inference
AI Gateway: routing, cache, observability tới LLM providers AI Gateway: routing, caching, observability to LLM providers
Vectorize cho RAG embeddings Vectorize for RAG embeddings
Durable Objects cho session/stateful chat Durable Objects for session/stateful chat
R2/KV cho documents & config R2/KV for documents and configuration

Tình huống khác (cùng lộ trình) Other scenarios (same track)

← Tất cả tình huống lộ trình này ← All scenarios in this track · Ba nhóm tình huống All three groups

Next step Next step

Tiếp tục hành trình học của bạn. Continue your learning journey.

Bài học Workers AI Workers AI lesson