Contextual Bandit (LinUCB)
Optimize | Price: $0.02/call | Latency: <1ms | Complexity: O(d^2 * k)
Make context-aware decisions using feature vectors. The bandit learns which arm performs best given the current context (time of day, energy level, etc.).
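The page does not say which LinUCB variant is used. As a rough guide, assuming the common "disjoint" formulation (one linear model per arm, ridge regularization of 1), the score of each arm a for the current context x can be written as follows. The symbols A_a, b_a, and theta_a are introduced here for illustration and do not appear in the schema; x_t and r_t are the context and reward entries of history for that arm, and alpha is the exploration parameter.

```latex
\begin{aligned}
A_a &= I_d + \sum_{t \,:\, a_t = a} x_t x_t^{\top}, \qquad
b_a = \sum_{t \,:\, a_t = a} r_t \, x_t, \qquad
\hat{\theta}_a = A_a^{-1} b_a, \\
\text{score}_a(x) &= \underbrace{\hat{\theta}_a^{\top} x}_{\text{expected reward}}
  \;+\; \underbrace{\alpha \sqrt{x^{\top} A_a^{-1} x}}_{\text{confidence width}}
\end{aligned}
```

The arm with the highest score is selected. Under this reading, d in the stated complexity would be the length of the context vector and k the number of arms, which is consistent with an O(d^2) scoring step per arm.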
Input Schema
arms: Array of {id, name}
context: number[] (feature vector)
history: Array of {armId, reward, context}
alpha: number (exploration parameter)
Output Fields
selected
score
expectedReward
confidenceWidth
algorithm
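To make the schema concrete, here is a minimal TypeScript sketch of how such a call could be computed. It is not the service's implementation. It assumes the standard disjoint LinUCB variant with an identity prior, that selected is the chosen arm object, that confidenceWidth already includes the alpha factor, and that algorithm is a plain string label. The type names and the linUcb function are hypothetical.

```typescript
// Hypothetical types mirroring the Input Schema and Output Fields listed above.
interface Arm { id: string; name: string }
interface Observation { armId: string; reward: number; context: number[] }
interface LinUcbInput { arms: Arm[]; context: number[]; history: Observation[]; alpha: number }
interface LinUcbOutput {
  selected: Arm;            // assumption: the chosen arm object
  score: number;            // expectedReward + confidenceWidth
  expectedReward: number;
  confidenceWidth: number;  // assumption: includes the alpha factor
  algorithm: string;        // assumption: a plain label
}

const dot = (u: number[], v: number[]): number => u.reduce((s, ui, i) => s + ui * v[i], 0);
const matVec = (m: number[][], v: number[]): number[] => m.map((row) => dot(row, v));

function identity(d: number): number[][] {
  return Array.from({ length: d }, (_, i) =>
    Array.from({ length: d }, (_, j) => (i === j ? 1 : 0)));
}

// Sherman-Morrison rank-one update: given A^-1 (symmetric), return (A + x x^T)^-1 in O(d^2).
function shermanMorrison(aInv: number[][], x: number[]): number[][] {
  const u = matVec(aInv, x);
  const denom = 1 + dot(x, u);
  return aInv.map((row, i) => row.map((v, j) => v - (u[i] * u[j]) / denom));
}

function linUcb(input: LinUcbInput): LinUcbOutput {
  const d = input.context.length;
  let best: LinUcbOutput | null = null;
  for (const arm of input.arms) {
    // Rebuild this arm's model from its own history entries only.
    let aInv = identity(d);
    const b: number[] = new Array(d).fill(0);
    for (const h of input.history) {
      if (h.armId !== arm.id) continue;
      aInv = shermanMorrison(aInv, h.context);
      h.context.forEach((xi, i) => { b[i] += h.reward * xi; });
    }
    const theta = matVec(aInv, b);
    const expectedReward = dot(theta, input.context);
    const confidenceWidth =
      input.alpha * Math.sqrt(dot(input.context, matVec(aInv, input.context)));
    const score = expectedReward + confidenceWidth;
    if (best === null || score > best.score) {
      best = { selected: arm, score, expectedReward, confidenceWidth, algorithm: "LinUCB" };
    }
  }
  if (best === null) throw new Error("arms must not be empty");
  return best;
}
```

Because this sketch replays history on every call, its cost also grows with the history length. A service that keeps the per-arm inverse matrices between calls would only pay the O(d^2) scoring step per arm. Ties go to the arm listed first.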