Consider two broad goals in sampling:
- Avoid oversampling or undersampling of data
- Ensure that overall data collection doesn't exceed acceptable limits

These goals can be treated separately. Doing so may yield a simpler and more flexible conceptual foundation for sampling.

Define a balancer to be a sampler that does the following for each input trace:
- Assign a "frequency" score to the trace.