Course Outline

Introduction to Retrieval-Augmented Generation (RAG)

  • Understanding RAG and its significance for enterprise AI.
  • Core components of a RAG system: retriever, generator, and document store.
  • Comparing RAG with standalone LLMs and vector search.
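
To make the three components above concrete, here is a minimal sketch, assuming a toy in-memory document store and a word-overlap retriever. The names `DOCUMENT_STORE`, `retrieve`, and `build_prompt` are hypothetical and chosen for illustration. The generator is represented only by the prompt it would receive, since the outline does not name a specific model at this point.

```python
from collections import Counter

# Hypothetical in-memory document store: document id -> text.
DOCUMENT_STORE = {
    "doc1": "Expense reports must be submitted within 30 days.",
    "doc2": "The VPN requires multi-factor authentication.",
    "doc3": "Annual leave requests are approved by your line manager.",
}


def tokenize(text):
    """Lowercase whitespace tokens with surrounding punctuation stripped."""
    return [token.strip(".,?!").lower() for token in text.split()]


def retrieve(query, store, top_k=2):
    """Toy retriever: rank documents by the number of tokens shared with the query."""
    query_counts = Counter(tokenize(query))
    scored = []
    for doc_id, text in store.items():
        overlap = sum((query_counts & Counter(tokenize(text))).values())
        scored.append((overlap, doc_id))
    # Highest overlap first; ties broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for overlap, doc_id in scored[:top_k] if overlap > 0]


def build_prompt(query, doc_ids, store):
    """Assemble the text a generator model would receive (the model call is omitted)."""
    context = "\n".join(f"[{doc_id}] {store[doc_id]}" for doc_id in doc_ids)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


question = "How do I submit expense reports?"
hits = retrieve(question, DOCUMENT_STORE)
print(build_prompt(question, hits, DOCUMENT_STORE))
```

A standalone LLM would see only the question. Here the retrieved passage is placed in the prompt, which is the main difference the comparison above refers to. A production retriever would typically use dense embeddings rather than word overlap.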

Setting Up a RAG Pipeline

  • Installing and configuring Haystack or similar frameworks.
  • Document ingestion and preprocessing.
  • Connecting retrievers to vector databases (e.g., FAISS, Pinecone).
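
As one illustration of the preprocessing step, the sketch below splits a document into overlapping word-based chunks before indexing. The function name `chunk_words` and the default sizes are assumptions made for this example. Frameworks such as Haystack ship their own splitters, whose APIs are not shown here.

```python
def chunk_words(text, chunk_size=50, overlap=10):
    """Split text into chunks of up to chunk_size words, each sharing `overlap` words
    with the previous chunk so that sentences cut at a boundary stay retrievable."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current chunk already reaches the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks


sample = " ".join(f"w{i}" for i in range(10))
for chunk in chunk_words(sample, chunk_size=4, overlap=1):
    print(chunk)
```

Each resulting chunk would then be embedded and written to the vector database. Chunk size and overlap are tuning choices that depend on the embedding model and the documents.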

Fine-Tuning the Retriever

  • Training dense retrievers using domain-specific data.
  • Leveraging sentence transformers and contrastive learning.
  • Evaluating retriever quality via top-k accuracy.
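
Top-k accuracy can be sketched in a few lines, assuming the common definition: the fraction of queries for which at least one relevant document appears among the first k retrieved results. The function name and the sample ids below are hypothetical.

```python
def top_k_accuracy(ranked_ids, relevant_ids, k):
    """Fraction of queries with at least one relevant document in the top k results.

    ranked_ids:   one ranked list of document ids per query.
    relevant_ids: one set of relevant document ids per query.
    """
    if len(ranked_ids) != len(relevant_ids):
        raise ValueError("ranked_ids and relevant_ids must have the same length")
    if not ranked_ids:
        raise ValueError("at least one query is required")

    hits = 0
    for ranked, relevant in zip(ranked_ids, relevant_ids):
        if any(doc_id in relevant for doc_id in ranked[:k]):
            hits += 1
    return hits / len(ranked_ids)


# Illustrative retrieval results for three queries.
ranked = [["d3", "d1", "d7"], ["d2", "d9", "d4"], ["d5", "d6", "d8"]]
relevant = [{"d1"}, {"d4"}, {"d0"}]

for k in (1, 2, 3):
    print(k, top_k_accuracy(ranked, relevant, k))
```

Comparing this number before and after fine-tuning on the same held-out queries gives a simple check on whether the retriever actually improved.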

Fine-Tuning the Generator

  • Selecting base models (e.g., BART, T5, FLAN-T5).
  • Distinguishing between instruction tuning and supervised fine-tuning.
  • Utilizing LoRA and PEFT methods for efficient updates.
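
The efficiency of LoRA can be illustrated with simple arithmetic. Instead of updating a full weight matrix of shape d_out x d_in, LoRA trains two low-rank factors of shapes d_out x r and r x d_in. The sketch below compares the parameter counts. The layer sizes and rank are illustrative assumptions, not values from the course.

```python
def full_params(d_out, d_in):
    """Parameters updated when fine-tuning the whole weight matrix."""
    return d_out * d_in


def lora_trainable_params(d_out, d_in, rank):
    """Parameters in the two low-rank factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


# Illustrative sizes for a single projection layer (assumed for this example).
d_out, d_in, rank = 4096, 4096, 8

full = full_params(d_out, d_in)
lora = lora_trainable_params(d_out, d_in, rank)
print(f"full fine-tuning: {full:,} parameters")
print(f"LoRA (rank {rank}): {lora:,} parameters")
print(f"trainable fraction: {lora / full:.4%}")
```

The same calculation applies to each adapted layer, so the overall saving depends on which layers receive adapters and on the chosen rank.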

Evaluation and Optimization

  • Key metrics for evaluating RAG output, such as BLEU, exact match (EM), and token-level F1.
  • Managing latency, retrieval quality, and reducing hallucinations.
  • Experiment tracking and iterative improvement.
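
Exact match and token-level F1 are straightforward to compute by hand. The following is a minimal sketch, assuming the normalization commonly used for extractive question answering: lowercasing, then removing punctuation and English articles. The function names are hypothetical, and BLEU is left out because it is usually taken from an existing library.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, otherwise 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction, reference):
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


print(exact_match("The Eiffel Tower.", "eiffel tower"))
print(token_f1("Paris, France", "Paris"))
```

These scores measure agreement with a reference answer only. They do not show whether an answer is supported by the retrieved passages, so hallucination checks need a separate evaluation.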

Deployment and Real-World Integration

  • Deploying RAG in internal search engines and chatbots.
  • Addressing security, data access, and governance considerations.
  • Integrating with APIs, dashboards, or knowledge portals.

Case Studies and Best Practices

  • Enterprise use cases in finance, healthcare, and legal sectors.
  • Managing domain drift and knowledge base updates.
  • Exploring future directions in retrieval-augmented LLM systems.

Summary and Next Steps

Requirements

  • A solid understanding of natural language processing (NLP) concepts.
  • Experience with transformer-based language models.
  • Familiarity with Python and basic machine learning workflows.

Audience

  • NLP engineers.
  • Knowledge management teams.

Duration

  • 14 hours.
