Measure {{model}} performance stability on {{metrics}} over {{time_period}}.

Collect metric samples at regular intervals:
- Record {{sample_count}} measurements
- Calculate rolling mean and variance
- Detect trend and seasonality

Report stable performance estimate. Flag if variance indicates instability. Recommend monitoring frequency based on observed variance.
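
To make the statistics named in the prompt concrete, here is a minimal Python sketch of the rolling mean and variance, a linear trend estimate, and a variance-based instability flag. It is an illustration under assumptions, not part of the template: the function names (`rolling_stats`, `linear_trend`, `is_unstable`), the window size, and the 0.1 coefficient-of-variation threshold are all hypothetical choices, and seasonality detection is left out.

```python
from statistics import mean, pvariance


def rolling_stats(samples, window):
    """Return a (mean, population variance) pair for each full window."""
    if window < 1 or window > len(samples):
        raise ValueError("window must be between 1 and len(samples)")
    stats = []
    for end in range(window, len(samples) + 1):
        chunk = samples[end - window:end]
        stats.append((mean(chunk), pvariance(chunk)))
    return stats


def linear_trend(samples):
    """Least-squares slope of the samples against their index.

    The result is in metric units per sampling interval; a value near
    zero suggests no linear trend.
    """
    n = len(samples)
    if n < 2:
        return 0.0
    x_bar = (n - 1) / 2
    y_bar = mean(samples)
    num = sum((i - x_bar) * (y - y_bar) for i, y in enumerate(samples))
    den = sum((i - x_bar) ** 2 for i in range(n))
    return num / den


def is_unstable(samples, window, cv_threshold=0.1):
    """Flag instability when any window's coefficient of variation
    (standard deviation divided by |mean|) exceeds cv_threshold.

    The 0.1 default is an assumed, illustrative threshold.
    Windows with a zero mean are skipped.
    """
    for m, v in rolling_stats(samples, window):
        if m != 0 and (v ** 0.5) / abs(m) > cv_threshold:
            return True
    return False
```

One possible use: call `rolling_stats(scores, window=5)` on the recorded measurements, then check `linear_trend(scores)` and `is_unstable(scores, window=5)` before deciding how often to monitor. A higher observed variance would argue for a shorter sampling interval.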
Details
Category: Analysis
Use Cases: Metric stability, Trend detection, Monitoring design
Works Best With: claude-opus-4.5, gpt-5.2, gemini-2.0-flash