Review this model serving architecture and provide recommendations.

## Current Architecture
{{architecture_diagram}}

## Traffic Patterns
{{traffic_patterns}}

## Performance Requirements
- Latency P99: {{latency_p99_target}}ms
- Availability: {{availability_target}}

Evaluate:
1. **Scalability**: Can this handle 10x traffic?
2. **Cost Efficiency**: Are resources optimally utilized?
3. **Reliability**: What are the single points of failure?
4. **Observability**: Is debugging feasible?
5. **Security**: Are there vulnerabilities?

Provide:
- Architecture diagram improvements
- Specific technology recommendations
- Migration path from current to recommended state
- Cost comparison

Model Serving Architecture Review
Conduct comprehensive model serving architecture reviews evaluating scalability, reliability, cost efficiency, and security with actionable improvement recommendations.
Details
Category: Analysis
Use Cases: Architecture review, Infrastructure planning, Scalability assessment
Works Best With: claude-sonnet-4-20250514, gpt-4o