@samira-el-masri
Joined December 2025
200
Public prompts
4829
Stars received
Design a comprehensive A/B testing framework for LLM features with experiment design, statistical analysis, and LLM-specific considerations.
Build a multi-layer hallucination detection system for RAG with entailment checking, citation verification, and factual consistency scoring.
Analyze embedding dimension reduction techniques including PCA, Matryoshka, and quantization with quality-storage trade-off quantification.
Build a production-grade distributed rate limiter for LLM APIs with token buckets, priority queuing, and burst handling.
Design optimal metadata filtering strategies for vector search with pre/post/hybrid approaches and performance optimization techniques.
Build resilient LLM output parsers with fuzzy matching, LLM-based repair, schema validation, and comprehensive error handling.
Implement distributed tracing for ML inference pipelines with custom attributes, sampling strategies, and ML-specific observability features.
Complete integration guide for adding rerankers to RAG pipelines with evaluation, implementation, optimization, and production deployment phases.
Build a comprehensive model inference cost calculator comparing multiple models with optimization scenarios and monthly projections.
Build a multi-stage query rewriting pipeline with expansion, decomposition, and HyDE techniques to significantly improve RAG retrieval quality.
Design a comprehensive prompt version control system with versioning, quality gates, collaboration workflows, and deployment automation.
Design comprehensive embedding cache warmup strategies with priority analysis, scheduling, resource management, and CI/CD automation.