๐ก WRAP-UP
Built a complete RAG system with:
โ
Optimized retrieval (k=3, 86.67% precision)
โ
Evaluated prompts (8.0/10 quality)
โ
Real-time monitoring (7 charts)
โ
Full Docker deployment
โ
Hallucination prevention
#LLMZOOMCAMP #BuildInPublic
Posts by
๐ REPRODUCIBILITY
Everything needed to run this:
๐ฆ requirements.txt with pinned versions
๐ณ Docker Compose for one-command deploy
๐ Complete documentation
๐ฏ Sample data included
Clone, configure API key, run. That's it!
#LLMZOOMCAMP
โฑ๏ธ PERFORMANCE NUMBERS
โข Retrieval: < 1 second
โข Processing: 1,400 chunks/min
โข Batch size: 5,000 docs
โข Dataset: 10+ technical books (15,354 chunks)
Fast enough for real-time queries!
#LLMZOOMCAMP #Performance
๐ฅ SMART INGESTION
Auto-detects existing vector DB or creates new one
Handles PDFs + TXT files
Batch processing for large collections
Graceful error handling
Set it and forget it!
#LLMZOOMCAMP #DataEngineering
๐ EVALUATION FRAMEWORK
Retrieval: Precision + keyword relevance
LLM: Quality scoring (accuracy, depth, honesty)
Ran ~50 test queries across both evaluations.
Measure everything. Improve what matters.
#LLMZOOMCAMP #MLOps
๐ก๏ธ PREVENTING HALLUCINATIONS
Tested with out-of-scope questions.
System correctly says "I cannot tell you based on the provided context" instead of making things up.
Honesty > Confidence
#LLMZOOMCAMP #AIEthics
๐จ USER EXPERIENCE
Two-tab Streamlit interface:
1. Q&A System with source previews
2. Analytics Dashboard
Auto-initialization on startup = zero-config for users
Good UX = better adoption!
#LLMZOOMCAMP #UX
SCALING CHALLENGES
Hit API limits at 15,000+ document chunks!
Solution: Batch processing (5000 chunks/batch)
Result: ~1,400 chunks/min processing speed
Always plan for scale from day one.
#LLMZOOMCAMP #Scaling
๐ณ FULL CONTAINERIZATION
Docker Compose with:
โข Named volumes for persistence
โข Health checks
โข Resource limits (2 CPU, 4GB RAM)
โข Non-root user for security
โข Auto-restart policies
One command deploy!
#LLMZOOMCAMP #DevOps #Docker
๐ MONITORING MATTERS
Built an integrated dashboard with 7 real-time charts:
- Feedback distribution
- Response times
- Query volume
- Activity patterns
User feedback: ๐/๐ buttons after every answer
#LLMZOOMCAMP #DataViz
๐ค PROMPT ENGINEERING
Tested 4 prompt templates on quality:
โข Expert Technical: 8.0/10 โญ
โข Detailed Context: 7.9/10
โข Structured: 7.0/10
โข Concise: 6.2/10
Comprehensive wins over brevity for technical Q&A!
#LLMZOOMCAMP #PromptEngineering
๐ RETRIEVAL OPTIMIZATION
Evaluated 4 different approaches:
โข Semantic (k=3): 86.67% precision โ
โข Semantic (k=5): 84.00%
โข Semantic (k=10): 84.00%
โข MMR (k=5): 84.00%
Less is more! k=3 won with best relevance.
#LLMZOOMCAMP #MachineLearning
๐ ๏ธ TECH STACK
โข LLM: Google Gemini 2.5 Pro
โข Embeddings: text-embedding-004
โข Vector DB: ChromaDB
โข Framework: LangChain
โข UI: Streamlit
โข Container: Docker
All production-ready with monitoring!
#LLMZOOMCAMP #TechStack
๐ THE PROBLEM
Ever spent hours searching through multiple technical PDFs for one piece of info? Me too!
DocuMind solves this with AI-powered semantic search. Ask questions in natural language, get instant answers with sources.
#LLMZOOMCAMP #RAG
๐ Just completed my #DataTalksClub LLM Zoomcamp project: DocuMind - an end-to-end RAG system for technical documents!
Built with Google Gemini, LangChain, ChromaDB & Streamlit.
Let me share what I learned... ๐งต
#LLMZOOMCAMP #BuildInPublic #AI
๐ Built a comprehensive search evaluation system this week! Learned to compare multiple search approaches systematically. Now I can evaluate any search system with confidence! #LLMZOOMCAMP #SearchEvaluation #VectorSearch
โก Key learning: Different search methods have different strengths! Learned when to use exact text search vs semantic vector search vs scalable vector databases. Context matters! #LLMZOOMCAMP
๐ฏ Explored ROUGE evaluation for text generation quality! Learned how to measure how well generated text matches reference text - crucial skill for building better RAG systems! #LLMZOOMCAMP
๐ Discovered cosine similarity as a powerful metric for measuring semantic similarity between texts. Learned how to compare embeddings and understand how well our models preserve meaning! #LLMZOOMCAMP
๐ Got hands-on with Qdrant vector database! Learned how to set up vector collections, create embeddings, and perform semantic search at scale. Vector databases are the future of search! #LLMZOOMCAMP
๐ง Explored vector search this week! Learned how TF-IDF + SVD can create powerful embeddings that capture semantic meaning. Vector search vs text search - understanding when each shines! #LLMZOOMCAMP
๐ Week 5 of #LLMZOOMCAMP: Learned how to evaluate search quality! Discovered Hit Rate and MRR metrics - essential tools for measuring how well our search systems perform. ๐
๐ Completed the Agents module of #LLMZoomcamp! From basic function descriptions to full MCP implementations. Ready to build AI agents that don't just talk, but actually DO things in the real world! ๐ฏ
Next stop: Production-ready AI agents! ๐
#LLMZoomcamp #AIAgents #MachineLearning
๐ MCP (Model Context Protocol) is revolutionizing how we build AI tools! Instead of hardcoding functions, agents can now discover and use tools dynamically. It's like having a plugin system for AI! ๐
#MCP #LLMZoomcamp #AIInnovation
๐ก Key insight from #LLMZoomcamp: The real power of AI agents isn't in the model itself, but in how well they can interact with external systems. Function calling bridges the gap between AI reasoning and real-world actions!
#AIEngineering #LLMZoomcamp
๐ง Built my first MCP server with FastMCP! Creating tools that AI agents can discover and use automatically. The JSON-RPC protocol makes it feel like building microservices for AI.
```python
@mcp.tool
def get_weather(city: str) -> float:
```
#FastMCP #LLMZoomcamp #PythonDev
๐ ๏ธ Function calling in LLMs is a game-changer! Instead of just generating text, my AI can now:
- Query databases
- Call APIs
- Execute tools
- Return structured data
From chatbot to action-bot! ๐ช
#OpenAI #FunctionCalling #LLMZoomcamp
๐ก Discovered Model Context Protocol (MCP) today! It's like creating a universal language for AI agents to communicate with tools. Built a weather agent that can both GET and SET temperature data. Mind = blown! ๐ก๏ธโ๏ธ
#MCP #LLMZoomcamp #AIAgents
๐ Just completed Module 4 of #LLMZoomcamp! Learned how to build AI agents with function calling - teaching LLMs to interact with external tools and APIs. The future of AI is not just chat, it's intelligent action! ๐คโก
#AI #MachineLearning #FunctionCalling
๐ Completed the MLOps Zoomcamp project! Built end-to-end ML system with:
โ
Data pipeline
โ
Model training & tracking
โ
API deployment
โ
Monitoring & alerting
โ
Cloud infrastructure
Thanks Datatalksclub for this amazing course!