Contextual Bandits for Model Selection

Implement contextual bandits for adaptive LLM model selection using LinUCB or Thompson Sampling with online learning updates.

Implement contextual bandits for adaptive model selection.

## Model Options
{{model_options}}

## Context Features
{{context_features}}

## Optimization Goal
{{optimization_goal}}

Build the bandit system:

```python
from dataclasses import dataclass
from typing import Dict, List

import numpy as np


@dataclass
class ModelStats:
    """Per-model selection statistics."""
    pulls: int
    mean_reward: float
    confidence_width: float


class ModelSelectionBandit:
    def __init__(self, models: List[str], context_dim: int):
        pass

    def select_model(self, context: np.ndarray, exploration_rate: float) -> str:
        """
        Choose a model for the given context. Candidate algorithms:
        - LinUCB
        - Thompson Sampling
        - Epsilon-greedy
        """
        pass

    def update(self, context: np.ndarray, model: str, reward: float) -> None:
        """Update the selected model's estimates with the observed reward."""
        pass

    def get_model_stats(self) -> Dict[str, ModelStats]:
        """Return per-model selection counts, mean rewards, and confidence."""
        pass
```
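Of the three candidate algorithms, LinUCB is the most common starting point for contextual selection. A minimal sketch of how `select_model` and `update` could work under LinUCB (the class name `LinUCB` and the default `alpha` are illustrative choices, not part of the prompt's spec):

```python
import numpy as np


class LinUCB:
    """Disjoint LinUCB: one ridge-regression model per arm.

    Score for arm a: theta_a @ x + alpha * sqrt(x @ A_a^{-1} @ x),
    i.e. estimated reward plus an exploration bonus that shrinks
    as the arm accumulates observations.
    """

    def __init__(self, arms, context_dim, alpha=1.0):
        self.arms = list(arms)
        self.alpha = alpha
        # A_a = d x d design matrix (ridge prior = identity), b_a = reward-weighted context sum
        self.A = {a: np.eye(context_dim) for a in self.arms}
        self.b = {a: np.zeros(context_dim) for a in self.arms}

    def select(self, context: np.ndarray) -> str:
        scores = {}
        for a in self.arms:
            A_inv = np.linalg.inv(self.A[a])
            theta = A_inv @ self.b[a]                      # ridge estimate of reward weights
            bonus = self.alpha * np.sqrt(context @ A_inv @ context)
            scores[a] = theta @ context + bonus
        return max(scores, key=scores.get)

    def update(self, context: np.ndarray, arm: str, reward: float) -> None:
        # Rank-one update of the design matrix and reward vector
        self.A[arm] += np.outer(context, context)
        self.b[arm] += reward * context
```

With `alpha` small, the bandit quickly converges to the empirically best arm for a given context; larger `alpha` keeps exploring longer.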

Include:
- Reward function design
- Exploration vs exploitation tuning
- Cold start handling
- Online learning updates
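To make the reward-design and cold-start bullets concrete, here is one possible sketch. The penalty caps, weights, and `min_pulls` threshold are assumptions to tune for your workload, not values specified by the prompt:

```python
import math
import random


def compute_reward(quality: float, latency_s: float, cost_usd: float,
                   w_latency: float = 0.2, w_cost: float = 0.3) -> float:
    """Blend answer quality (0-1) with capped latency and cost penalties."""
    latency_penalty = min(latency_s / 10.0, 1.0)   # assumption: 10 s counts as very slow
    cost_penalty = min(cost_usd / 0.05, 1.0)       # assumption: $0.05/request counts as expensive
    return quality - w_latency * latency_penalty - w_cost * cost_penalty


def pick_with_cold_start(stats: dict, min_pulls: int = 5,
                         eps0: float = 0.5, decay: float = 0.01) -> str:
    """Force-explore under-sampled models, then epsilon-greedy with decaying epsilon.

    `stats` maps model name -> (pulls, mean_reward).
    """
    # Cold start: any model with too few observations is tried before exploiting.
    cold = [m for m, (pulls, _) in stats.items() if pulls < min_pulls]
    if cold:
        return random.choice(cold)
    total_pulls = sum(pulls for pulls, _ in stats.values())
    eps = eps0 * math.exp(-decay * total_pulls)    # exploration rate decays as data accumulates
    if random.random() < eps:
        return random.choice(list(stats))
    return max(stats, key=lambda m: stats[m][1])   # exploit the best mean reward
```

The online-learning loop then amounts to: select a model, observe quality/latency/cost, fold them into `compute_reward`, and feed the result back through the bandit's `update`.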

## Details

- **Category:** Coding
- **Use cases:** Adaptive selection, model optimization, online learning
- **Works best with:** claude-sonnet-4-20250514, gpt-4o