Model Serving Architecture Review

U

@

·

Conduct comprehensive model serving architecture reviews evaluating scalability, reliability, cost efficiency, and security with actionable improvement recommendations.

67 copies0 forks
Review this model serving architecture and provide recommendations.

## Current Architecture
{{architecture_diagram}}

## Traffic Patterns
{{traffic_patterns}}

## Performance Requirements
- Latency P99: {{latency_p99_target}}ms
- Availability: {{availability_target}}

Evaluate:
1. **Scalability**: Can this handle 10x traffic?
2. **Cost Efficiency**: Are resources optimally utilized?
3. **Reliability**: What are the single points of failure?
4. **Observability**: Is debugging feasible?
5. **Security**: Are there vulnerabilities?

Provide:
- Architecture diagram improvements
- Specific technology recommendations
- Migration path from current to recommended state
- Cost comparison

Details

Category

Analysis

Use Cases

Architecture reviewInfrastructure planningScalability assessment

Works Best With

claude-sonnet-4-20250514gpt-4o
Created Shared

Create your own prompt vault and start sharing