Negative Sampling Strategy

U

@

·

Design comprehensive negative sampling strategies for retrieval training with easy, medium, and hard negative mining and curriculum learning.

30 copies0 forks
Design a negative sampling strategy for training/evaluating our retrieval system.

## Retrieval Task
{{retrieval_task}}

## Current Dataset
{{dataset_description}}

## Evaluation Goals
{{evaluation_goals}}

Develop sampling strategies:

**Easy Negatives**
- Random sampling
- Use cases and limitations

**Medium Negatives**
- BM25 top-k that are irrelevant
- Same-topic different answers

**Hard Negatives**
- Semantic near-misses
- Factually incorrect but plausible
- Cross-encoder filtered

**Mining Implementation**
```python
class NegativeMiner:
    def mine_hard_negatives(self, query: str, positive: str, corpus: List[str], k: int) -> List[str]:
        pass
```

Include:
- Ratio recommendations
- Quality filtering
- Curriculum learning approach

Details

Category

Analysis

Use Cases

Negative samplingRetrieval trainingEvaluation design

Works Best With

claude-sonnet-4-20250514gpt-4o
Created Shared

Create your own prompt vault and start sharing