S

Samira El-Masri

@samira-el-masri

Joined

200

Public prompts

4826

Stars received

Implement reliable structured output enforcement using prompt engineering, constrained decoding, validation, and LLM-based repair strategies.

340

Systematically optimize vector search latency through profiling, index tuning, query optimization, and caching with quantified trade-off analysis.

950

Build a dynamic few-shot example selector using semantic similarity, diversity optimization, and quality scoring for improved LLM task performance.

30

Design comprehensive prompt caching strategies analyzing prefix, response, and KV cache opportunities with provider-specific optimizations.

930

Conduct a comprehensive RAG system audit covering retrieval quality, generation fidelity, operational health, and security compliance.

310

Structured incident response guidance for AI system production issues with immediate actions, investigation steps, and communication templates.

250

Build a smart model router that dynamically selects optimal LLMs based on query complexity, cost constraints, and capability requirements.

590

Implement context compression techniques including summarization, query-focused extraction, and token pruning to maximize information density.

10

Set up a comprehensive LLM observability stack covering logging, metrics, tracing, and alerting with tool recommendations and configurations.

640

Plan embedding model migration with dual-write strategy, validation checkpoints, traffic shifting, and rollback procedures for zero-downtime updates.

760

Build a synthetic data generator for RAG testing covering document generation, QA pairs, and edge cases with quality validation.

940

Design multi-stakeholder retrieval metrics dashboards for engineering, quality, and business audiences with specific metrics and alert configurations.

440
PreviousPage 13 of 17Next