(@codegrokker) Bsky - nopzon.com

💡 WRAP-UP

Built a complete RAG system with:
✅ Optimized retrieval (k=3, 86.67% precision)
✅ Evaluated prompts (8.0/10 quality)
✅ Real-time monitoring (7 charts)
✅ Full Docker deployment
✅ Hallucination prevention

#LLMZOOMCAMP #BuildInPublic

5 months ago 2 1 0 0

🔄 REPRODUCIBILITY

Everything needed to run this:
📦 requirements.txt with pinned versions
🐳 Docker Compose for one-command deploy
📚 Complete documentation
🎯 Sample data included

Clone, configure API key, run. That's it!

#LLMZOOMCAMP

5 months ago 0 0 1 0

⏱️ PERFORMANCE NUMBERS

• Retrieval: < 1 second
• Processing: 1,400 chunks/min
• Batch size: 5,000 docs
• Dataset: 10+ technical books (15,354 chunks)

Fast enough for real-time queries!

#LLMZOOMCAMP #Performance

5 months ago 0 0 1 0

📥 SMART INGESTION

Auto-detects existing vector DB or creates new one
Handles PDFs + TXT files
Batch processing for large collections
Graceful error handling

Set it and forget it!

#LLMZOOMCAMP #DataEngineering

5 months ago 0 0 1 0

📈 EVALUATION FRAMEWORK

Retrieval: Precision + keyword relevance
LLM: Quality scoring (accuracy, depth, honesty)

Ran ~50 test queries across both evaluations.

Measure everything. Improve what matters.

#LLMZOOMCAMP #MLOps

5 months ago 0 0 1 0

🛡️ PREVENTING HALLUCINATIONS

Tested with out-of-scope questions.

System correctly says "I cannot tell you based on the provided context" instead of making things up.

Honesty > Confidence

#LLMZOOMCAMP #AIEthics

5 months ago 1 0 1 0

🎨 USER EXPERIENCE

Two-tab Streamlit interface:
1. Q&A System with source previews
2. Analytics Dashboard

Auto-initialization on startup = zero-config for users

Good UX = better adoption!

#LLMZOOMCAMP #UX

5 months ago 0 0 1 0

SCALING CHALLENGES

Hit API limits at 15,000+ document chunks!

Solution: Batch processing (5000 chunks/batch)
Result: ~1,400 chunks/min processing speed

Always plan for scale from day one.

#LLMZOOMCAMP #Scaling

5 months ago 0 0 1 0

🐳 FULL CONTAINERIZATION

Docker Compose with:
• Named volumes for persistence
• Health checks
• Resource limits (2 CPU, 4GB RAM)
• Non-root user for security
• Auto-restart policies

One command deploy!

#LLMZOOMCAMP #DevOps #Docker

5 months ago 1 0 2 0

📊 MONITORING MATTERS

Built an integrated dashboard with 7 real-time charts:
- Feedback distribution
- Response times
- Query volume
- Activity patterns

User feedback: 👍/👎 buttons after every answer

#LLMZOOMCAMP #DataViz

5 months ago 0 0 1 0

🤖 PROMPT ENGINEERING

Tested 4 prompt templates on quality:
• Expert Technical: 8.0/10 ⭐
• Detailed Context: 7.9/10
• Structured: 7.0/10
• Concise: 6.2/10

Comprehensive wins over brevity for technical Q&A!

#LLMZOOMCAMP #PromptEngineering

5 months ago 1 0 1 0

🔍 RETRIEVAL OPTIMIZATION

Evaluated 4 different approaches:
• Semantic (k=3): 86.67% precision ✅
• Semantic (k=5): 84.00%
• Semantic (k=10): 84.00%
• MMR (k=5): 84.00%

Less is more! k=3 won with best relevance.

#LLMZOOMCAMP #MachineLearning

5 months ago 1 0 1 0

🛠️ TECH STACK

• LLM: Google Gemini 2.5 Pro
• Embeddings: text-embedding-004
• Vector DB: ChromaDB
• Framework: LangChain
• UI: Streamlit
• Container: Docker

All production-ready with monitoring!

#LLMZOOMCAMP #TechStack

5 months ago 1 0 1 0

📚 THE PROBLEM

Ever spent hours searching through multiple technical PDFs for one piece of info? Me too!

DocuMind solves this with AI-powered semantic search. Ask questions in natural language, get instant answers with sources.

#LLMZOOMCAMP #RAG

5 months ago 0 0 1 0

🚀 Just completed my #DataTalksClub LLM Zoomcamp project: DocuMind - an end-to-end RAG system for technical documents!

Built with Google Gemini, LangChain, ChromaDB & Streamlit.

Let me share what I learned... 🧵

#LLMZOOMCAMP #BuildInPublic #AI

5 months ago 2 1 0 0

🎓 Built a comprehensive search evaluation system this week! Learned to compare multiple search approaches systematically. Now I can evaluate any search system with confidence! #LLMZOOMCAMP #SearchEvaluation #VectorSearch

8 months ago 0 0 0 0

⚡ Key learning: Different search methods have different strengths! Learned when to use exact text search vs semantic vector search vs scalable vector databases. Context matters! #LLMZOOMCAMP

8 months ago 0 0 0 0

🎯 Explored ROUGE evaluation for text generation quality! Learned how to measure how well generated text matches reference text - crucial skill for building better RAG systems! #LLMZOOMCAMP

8 months ago 0 0 0 0

📈 Discovered cosine similarity as a powerful metric for measuring semantic similarity between texts. Learned how to compare embeddings and understand how well our models preserve meaning! #LLMZOOMCAMP

8 months ago 0 0 0 0

🚀 Got hands-on with Qdrant vector database! Learned how to set up vector collections, create embeddings, and perform semantic search at scale. Vector databases are the future of search! #LLMZOOMCAMP

8 months ago 0 0 0 0

🧠 Explored vector search this week! Learned how TF-IDF + SVD can create powerful embeddings that capture semantic meaning. Vector search vs text search - understanding when each shines! #LLMZOOMCAMP

8 months ago 0 0 0 0

🔍 Week 5 of #LLMZOOMCAMP: Learned how to evaluate search quality! Discovered Hit Rate and MRR metrics - essential tools for measuring how well our search systems perform. 📊

8 months ago 1 0 0 0

📚 Completed the Agents module of #LLMZoomcamp! From basic function descriptions to full MCP implementations. Ready to build AI agents that don't just talk, but actually DO things in the real world! 🎯

Next stop: Production-ready AI agents! 🚀

#LLMZoomcamp #AIAgents #MachineLearning

9 months ago 1 0 0 0

🌐 MCP (Model Context Protocol) is revolutionizing how we build AI tools! Instead of hardcoding functions, agents can now discover and use tools dynamically. It's like having a plugin system for AI! 🔌

#MCP #LLMZoomcamp #AIInnovation

9 months ago 0 0 0 0

💡 Key insight from #LLMZoomcamp: The real power of AI agents isn't in the model itself, but in how well they can interact with external systems. Function calling bridges the gap between AI reasoning and real-world actions!

#AIEngineering #LLMZoomcamp

9 months ago 0 0 0 0

🔧 Built my first MCP server with FastMCP! Creating tools that AI agents can discover and use automatically. The JSON-RPC protocol makes it feel like building microservices for AI.

```python
@mcp.tool
def get_weather(city: str) -> float:
```

#FastMCP #LLMZoomcamp #PythonDev

9 months ago 0 0 0 0

🛠️ Function calling in LLMs is a game-changer! Instead of just generating text, my AI can now:
- Query databases
- Call APIs
- Execute tools
- Return structured data

From chatbot to action-bot! 💪

#OpenAI #FunctionCalling #LLMZoomcamp

9 months ago 0 0 0 0

📡 Discovered Model Context Protocol (MCP) today! It's like creating a universal language for AI agents to communicate with tools. Built a weather agent that can both GET and SET temperature data. Mind = blown! 🌡️☁️
#MCP #LLMZoomcamp #AIAgents

9 months ago 0 0 0 0

🚀 Just completed Module 4 of #LLMZoomcamp! Learned how to build AI agents with function calling - teaching LLMs to interact with external tools and APIs. The future of AI is not just chat, it's intelligent action! 🤖⚡
#AI #MachineLearning #FunctionCalling

9 months ago 1 1 0 0

🎓 Completed the MLOps Zoomcamp project! Built end-to-end ML system with:
✅ Data pipeline
✅ Model training & tracking
✅ API deployment
✅ Monitoring & alerting
✅ Cloud infrastructure

Thanks Datatalksclub for this amazing course!

9 months ago 0 0 0 0

Posts by