Get in Touch

Course Outline

Introduction to Retrieval-Augmented Generation (RAG)

  • Understanding RAG and its significance for enterprise AI
  • Key components of a RAG system: retriever, generator, and document store
  • Comparison with standalone LLMs and vector search solutions

Setting Up a RAG Pipeline

  • Installing and configuring Haystack or comparable frameworks
  • Document ingestion and preprocessing techniques
  • Connecting retrievers to vector databases (e.g., FAISS, Pinecone)

Fine-Tuning the Retriever

  • Training dense retrievers using domain-specific data
  • Utilizing sentence transformers and contrastive learning
  • Evaluating retriever quality via top-k accuracy

Fine-Tuning the Generator

  • Selecting base models (e.g., BART, T5, FLAN-T5)
  • Distinguishing between instruction tuning and supervised fine-tuning
  • Applying LoRA and PEFT methods for efficient updates

Evaluation and Optimization

  • Metrics for assessing RAG performance (e.g., BLEU, EM, F1)
  • Addressing latency, retrieval quality, and hallucination reduction
  • Experiment tracking and iterative improvement strategies

Deployment and Real-World Integration

  • Deploying RAG within internal search engines and chatbots
  • Considering security, data access, and governance aspects
  • Integrating with APIs, dashboards, or knowledge portals

Case Studies and Best Practices

  • Enterprise use cases in finance, healthcare, and legal sectors
  • Managing domain drift and knowledge base updates
  • Future trends in retrieval-augmented LLM systems

Summary and Next Steps

Requirements

  • A solid understanding of natural language processing (NLP) concepts
  • Experience with transformer-based language models
  • Familiarity with Python and fundamental machine learning workflows

Target Audience

  • NLP engineers
  • Knowledge management teams
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories