S

Samira El-Masri

@samira-el-masri

Joined

200

Public prompts

4826

Stars received

Complete guide for deploying self-hosted LLMs covering infrastructure setup, model optimization, operations, and TCO analysis.

990

Integrate knowledge graphs with RAG systems for enhanced retrieval through entity linking, graph traversal, and relationship-aware reasoning.

310

Implement comprehensive LLM guardrails covering PII, toxicity, topic restrictions, and compliance with configurable rules and audit logging.

780

Build a production prompt template engine with variable substitution, conditionals, loops, inheritance, and comprehensive validation.

820

Design a multi-tenant RAG architecture with data isolation, resource management, tenant customization, and cost attribution capabilities.

290

Design a high-volume async inference queue with priority lanes, auto-scaling workers, multiple delivery mechanisms, and comprehensive observability.

960

Build an automated multi-dimensional quality scorer for LLM responses with LLM-as-judge and calibration against human labels.

120

Create a comprehensive embedding model fine-tuning plan covering data preparation, training configuration, evaluation, and deployment strategies.

680

Design an agentic RAG architecture with planning, execution, memory, and synthesis layers enabling multi-step reasoning and tool use.

180

Optimize GPU memory usage for LLM inference through quantization, batching, KV cache management, and attention optimizations with detailed calculations.

100

Design a multi-layer prompt injection defense system with input sanitization, prompt structure hardening, output validation, and attack monitoring.

820

Build a scalable document ingestion pipeline with extraction, chunking, embedding generation, and vector storage with parallel processing and error recovery.

870
PreviousPage 14 of 17Next