Related learning track

Developer Platform

Study this track

Build AI applications on Cloudflare

You want to add inference, RAG, or a gateway to multiple models without operating your own GPU cluster.

Official Cloudflare use case docs →

Suggested architecture

User → Worker/Pages → AI Gateway / Workers AI → Vectorize + R2/KV

Reference diagrams (Cloudflare Docs)

Figure 1: Knowledge seeding

Retrieval Augmented Generation (RAG)

RAG combines retrieval (Vectorize/KV) with generative models on Workers AI so that a chatbot can ground its answers in external knowledge, which makes responses more factual and relevant. Knowledge seeding and the query path are kept separate.

Concepts: RAG · Vectorize · Workers AI · Knowledge seeding · Embeddings

Official diagram ↗ · Artificial Intelligence (AI)
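The separation between knowledge seeding and the query path described for RAG can be sketched in a few lines. This is a minimal in-memory illustration, not Cloudflare's API: `embed`, `InMemoryIndex`, and `buildPrompt` are hypothetical names, the word-count "embedding" stands in for an embedding model on Workers AI, and the array-backed index stands in for Vectorize.

```typescript
// Hypothetical stand-ins: a real deployment would call an embedding model on
// Workers AI and store vectors in Vectorize. Everything here is in-memory.
type Vector = Map<string, number>;

interface SeededChunk {
  id: string;
  text: string;
  vector: Vector;
}

// Toy embedding: lowercase word counts. Real embeddings are dense float
// arrays produced by a model.
function embed(text: string): Vector {
  const vector: Vector = new Map();
  const words = text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
  for (const word of words) {
    vector.set(word, (vector.get(word) ?? 0) + 1);
  }
  return vector;
}

function norm(v: Vector): number {
  let sum = 0;
  v.forEach((weight) => { sum += weight * weight; });
  return Math.sqrt(sum);
}

// Cosine similarity between two sparse vectors.
function cosine(a: Vector, b: Vector): number {
  let dot = 0;
  a.forEach((weight, word) => { dot += weight * (b.get(word) ?? 0); });
  const denom = norm(a) * norm(b);
  return denom === 0 ? 0 : dot / denom;
}

class InMemoryIndex {
  private chunks: SeededChunk[] = [];

  // Seeding path: runs when documents are added, not on every user request.
  seed(docs: { id: string; text: string }[]): void {
    for (const doc of docs) {
      this.chunks.push({ id: doc.id, text: doc.text, vector: embed(doc.text) });
    }
  }

  // Query path: embed the question and return up to topK similar chunks.
  query(question: string, topK: number): SeededChunk[] {
    const q = embed(question);
    return this.chunks
      .map((chunk) => ({ chunk, score: cosine(q, chunk.vector) }))
      .filter((hit) => hit.score > 0)
      .sort((x, y) => y.score - x.score)
      .slice(0, topK)
      .map((hit) => hit.chunk);
  }
}

// Builds the prompt that would be sent to the text-generation model.
function buildPrompt(question: string, context: SeededChunk[]): string {
  const sources = context.map((c) => `[${c.id}] ${c.text}`).join("\n");
  return `Answer using only the context below.\n\nContext:\n${sources}\n\nQuestion: ${question}`;
}
```

In this sketch `seed` would run from an ingestion job, while `query` and `buildPrompt` run inside the request handler, so the two paths can be scaled and deployed independently.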

Multi-vendor AI architecture

Multi-vendor AI observability and control

By shifting features such as rate limiting, caching, and error handling to the proxy layer, organizations can apply unified configurations across services and inference service providers.

Official diagram ↗ · Artificial Intelligence (AI)
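To make the proxy-layer idea concrete, here is a minimal sketch of one component that applies a single configuration for caching, rate limiting, and fallback across several providers. It is an illustration only and does not reflect how AI Gateway is implemented or configured: `ProxyLayer`, `GatewayConfig`, and the synchronous `complete` functions are hypothetical, and the current time is passed in explicitly so the behaviour is deterministic.

```typescript
// Hypothetical provider shape; real inference calls would be asynchronous.
interface ProviderEntry {
  name: string;
  complete: (prompt: string) => string;
}

// One configuration shared by every provider behind the proxy.
interface GatewayConfig {
  maxRequestsPerWindow: number;
  windowMs: number;
  cacheTtlMs: number;
}

type GatewayResult =
  | { ok: true; text: string; provider: string; cached: boolean }
  | { ok: false; reason: "rate_limited" | "all_providers_failed" };

class ProxyLayer {
  private cache = new Map<string, { text: string; provider: string; expiresAt: number }>();
  private windowStart = 0;
  private countInWindow = 0;

  constructor(
    private providers: ProviderEntry[],
    private config: GatewayConfig,
  ) {}

  request(prompt: string, now: number): GatewayResult {
    // Caching: an unexpired entry is returned without calling any provider
    // and without consuming rate-limit budget.
    const hit = this.cache.get(prompt);
    if (hit !== undefined && hit.expiresAt > now) {
      return { ok: true, text: hit.text, provider: hit.provider, cached: true };
    }

    // Rate limiting: one fixed window shared by all providers.
    if (now - this.windowStart >= this.config.windowMs) {
      this.windowStart = now;
      this.countInWindow = 0;
    }
    if (this.countInWindow >= this.config.maxRequestsPerWindow) {
      return { ok: false, reason: "rate_limited" };
    }
    this.countInWindow += 1;

    // Error handling: try providers in order and fall back when one throws.
    for (const provider of this.providers) {
      try {
        const text = provider.complete(prompt);
        this.cache.set(prompt, {
          text,
          provider: provider.name,
          expiresAt: now + this.config.cacheTtlMs,
        });
        return { ok: true, text, provider: provider.name, cached: false };
      } catch {
        // Provider failed; continue with the next one in the list.
      }
    }
    return { ok: false, reason: "all_providers_failed" };
  }
}
```

Because the limits and cache policy live in the proxy rather than in each caller, swapping or adding a provider only changes the `providers` list.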

Controls & stack

  • Workers AI for edge inference
  • AI Gateway: routing, caching, and observability for LLM providers
  • Vectorize for RAG embeddings
  • Durable Objects for session/stateful chat
  • R2/KV for documents and configuration
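As an illustration of the session/stateful chat item above, the sketch below keeps a bounded history per session ID and always routes the same ID to the same instance, which is the property a Durable Object would provide. The classes `ChatSession` and `SessionRegistry` are hypothetical and purely in-memory; a real Durable Object would persist turns through its own storage rather than a local array.

```typescript
interface ChatTurn {
  role: "user" | "assistant";
  content: string;
}

// Holds the conversation for one session and caps how much history is kept,
// so the prompt sent to the model stays bounded.
class ChatSession {
  private turns: ChatTurn[] = [];

  constructor(private maxTurns: number) {}

  append(turn: ChatTurn): void {
    this.turns.push(turn);
    if (this.turns.length > this.maxTurns) {
      // Drop the oldest turns first.
      this.turns = this.turns.slice(-this.maxTurns);
    }
  }

  // Returns a copy so callers cannot mutate the stored history.
  history(): ChatTurn[] {
    return this.turns.slice();
  }
}

// In-memory stand-in for routing a session ID to a single stateful instance.
class SessionRegistry {
  private sessions = new Map<string, ChatSession>();

  constructor(private maxTurns: number) {}

  get(sessionId: string): ChatSession {
    let session = this.sessions.get(sessionId);
    if (session === undefined) {
      session = new ChatSession(this.maxTurns);
      this.sessions.set(sessionId, session);
    }
    return session;
  }
}
```

A request handler would look up the session by ID, append the user's turn, send `history()` to the model, and append the reply.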
