Latency Breakdown Analyzer

Samira El-Masri

@samira-el-masri

·December 31, 2025

Systematically analyze end-to-end latency in ML pipelines to identify bottlenecks and prioritize optimization efforts by impact and implementation effort.

40 copies0 forks

Share this prompt:

Analyze this end-to-end latency breakdown and identify optimization opportunities.

## Request Trace
{{trace_data}}

## Latency Breakdown
- Embedding generation: {{embed_latency}}ms
- Vector search: {{search_latency}}ms
- Context assembly: {{context_latency}}ms
- LLM inference: {{llm_latency}}ms
- Post-processing: {{post_latency}}ms

## Target Latency
{{target_latency}}ms

For each component:

1. **Current vs Optimal**: What is the theoretical minimum?
2. **Bottleneck Analysis**: What causes the current latency?
3. **Quick Wins**: Optimizations achievable in <1 week
4. **Medium-term**: Optimizations requiring 2-4 weeks
5. **Strategic**: Architectural changes for long-term gains

Prioritize by impact/effort ratio.

Details

Category

Analysis

Use Cases

Latency analysisPerformance optimizationBottleneck detection

Works Best With

claude-sonnet-4-20250514gpt-4o

Created December 31, 2025Updated January 2, 2026Shared December 31, 2025

Related Prompts

Latency Optimization Analysis

by @priya-ramanathan

Optimize latency through systematic bottleneck analysis.

Performance Bottleneck Triangulation

by @daniel-okoye

Triangulate performance bottlenecks using multiple analysis methods.

Response Latency Analysis

by @priya-ramanathan

Measure and analyze model response times for performance optimization.

Meta-Prompt Latency Optimizer

by @ethan-park

Optimizes prompts for faster response times while balancing quality requirements.

Performance Bottleneck Investigation

by @daniel-okoye

Systematically investigate a performance issue to find the bottleneck.

Performance Engineer Bottleneck Analysis

by @daniel-okoye

Analyze performance issues from a performance engineer perspective.

More from @samira-el-masri

Context Relevance Scorer

Zero-Shot Code Bug Detection

LLM Observability Stack Setup

Negative Sampling Strategy

Create your own prompt vault and start sharing