Review this model serving architecture and provide recommendations. ## Current Architecture {{architecture_diagram}} ## Traffic Patterns {{traffic_patterns}} ## Performance Requirements - Latency P99: {{latency_p99_target}}ms - Availability: {{availability_target}} Evaluate: 1. **Scalability**: Can this handle 10x traffic? 2. **Cost Efficiency**: Are resources optimally utilized? 3. **Reliability**: What are the single points of failure? 4. **Observability**: Is debugging feasible? 5. **Security**: Are there vulnerabilities? Provide: - Architecture diagram improvements - Specific technology recommendations - Migration path from current to recommended state - Cost comparison
Model Serving Architecture Review
Conduct comprehensive model serving architecture reviews evaluating scalability, reliability, cost efficiency, and security with actionable improvement recommendations.
67 copies0 forks
Share this prompt:
Details
Category
AnalysisUse Cases
Architecture reviewInfrastructure planningScalability assessment
Works Best With
claude-sonnet-4-20250514gpt-4o
Created Updated Shared