Epic: E1 — TW RAG + Model Service
Priority: P0
Role: Engineer
User Story
As an Engineer, I want to build an end-to-end ingestion pipeline that reads guidance materials from GCS, chunks and embeds them, and indexes them in Vertex AI Vector Search, so that the RAG layer can retrieve relevant passages based on student/teacher context.
Context
Combines stories 1.7 (GCS ingestion connection) and 1.8 (chunking + embedding pipeline). GCS is the source-of-truth for guidance materials. Documents are read from the bucket, chunked with configurable size/overlap, embedded via Vertex AI, and indexed in Vector Search using @google-cloud/aiplatform (Vertex RAG Engine). Chunk size and embedding model choices affect retrieval quality — tune during implementation.
Acceptance Criteria
Dependencies
Combines #12 (1.7) and #13 (1.8)
📄 PRD: Part 1 — Glow CI PRD
Epic: E1 — TW RAG + Model Service
Priority: P0
Role: Engineer
User Story
As an Engineer, I want to build an end-to-end ingestion pipeline that reads guidance materials from GCS, chunks and embeds them, and indexes them in Vertex AI Vector Search, so that the RAG layer can retrieve relevant passages based on student/teacher context.
Context
Combines stories 1.7 (GCS ingestion connection) and 1.8 (chunking + embedding pipeline). GCS is the source-of-truth for guidance materials. Documents are read from the bucket, chunked with configurable size/overlap, embedded via Vertex AI, and indexed in Vector Search using
@google-cloud/aiplatform(Vertex RAG Engine). Chunk size and embedding model choices affect retrieval quality — tune during implementation.Acceptance Criteria
Dependencies
📄 PRD: Part 1 — Glow CI PRD