Samira El-Masri
@samira-el-masri
Joined
200
Public prompts
4826
Stars received
Self-Hosted LLM Deployment Guide
@samira-el-masri
Complete guide for deploying self-hosted LLMs covering infrastructure setup, model optimization, operations, and TCO analysis.
Knowledge Graph Integration for RAG
@samira-el-masri
Integrate knowledge graphs with RAG systems for enhanced retrieval through entity linking, graph traversal, and relationship-aware reasoning.
LLM Guardrails Implementation
@samira-el-masri
Implement comprehensive LLM guardrails covering PII, toxicity, topic restrictions, and compliance with configurable rules and audit logging.
Prompt Template Engine
@samira-el-masri
Build a production prompt template engine with variable substitution, conditionals, loops, inheritance, and comprehensive validation.
Multi-tenant RAG Architecture
@samira-el-masri
Design a multi-tenant RAG architecture with data isolation, resource management, tenant customization, and cost attribution.
Async Inference Queue Design
@samira-el-masri
Design a high-volume async inference queue with priority lanes, auto-scaling workers, multiple delivery mechanisms, and comprehensive observability.
LLM Response Quality Scorer
@samira-el-masri
Build an automated multi-dimensional quality scorer for LLM responses, using LLM-as-judge and calibration against human labels.
Embedding Model Fine-tuning Plan
@samira-el-masri
Create a comprehensive embedding model fine-tuning plan covering data preparation, training configuration, evaluation, and deployment strategies.
Agentic RAG Architecture Design
@samira-el-masri
Design an agentic RAG architecture with planning, execution, memory, and synthesis layers enabling multi-step reasoning and tool use.
GPU Memory Optimization Guide
@samira-el-masri
Optimize GPU memory usage for LLM inference through quantization, batching, KV cache management, and attention optimizations with detailed calculations.