Contextual Bandit (LinUCB)

Optimize
Price: $0.02/call
Latency: <1ms
Complexity: O(d^2 * k)

Make context-aware decisions using feature vectors. The bandit learns which arm performs best given the current context (time of day, energy level, etc.).
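
The page does not spell out the scoring rule. As a sketch, assuming the standard disjoint LinUCB formulation with an identity (ridge) prior, each arm a keeps its own statistics built from the history entries where that arm was played, and the arm with the highest score for the current context x is selected. The symbols A_a, b_a, and \hat{\theta}_a are notation introduced here, not names from the API; d is the length of the context vector.

```latex
\begin{aligned}
A_a &= I_d + \sum_{t:\,a_t = a} x_t x_t^{\top}, \qquad
b_a = \sum_{t:\,a_t = a} r_t\, x_t, \qquad
\hat{\theta}_a = A_a^{-1} b_a \\[4pt]
\text{score}_a(x) &= \underbrace{\hat{\theta}_a^{\top} x}_{\text{expected reward}}
  \;+\; \underbrace{\alpha \sqrt{x^{\top} A_a^{-1} x}}_{\text{confidence width}}
\end{aligned}
```

Under this reading, a larger alpha widens the confidence term and favors arms with little history in the current region of context space, while alpha = 0 reduces the rule to picking the arm with the highest estimated reward.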

Input Schema

arms: Array of {id, name}
context: number[] (feature vector)
history: Array of {armId, reward, context}
alpha: number (exploration parameter)

Output Fields

selected
score
expectedReward
confidenceWidth
algorithm
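
To make the schema concrete, here is a minimal TypeScript sketch of a LinUCB selection step that accepts the input fields and returns the output fields named on this page. It is not the service's implementation. Several details are assumptions: that `selected` is the chosen arm object (it could equally be an id), that `confidenceWidth` already includes the `alpha` factor, that `algorithm` is a label string, and that each arm starts from an identity matrix. The function name `linUcb` and the interface names are hypothetical.

```typescript
// Types mirror the Input Schema and Output Fields sections of this page.
interface Arm { id: string; name: string }
interface Observation { armId: string; reward: number; context: number[] }
interface LinUcbInput { arms: Arm[]; context: number[]; history: Observation[]; alpha: number }
interface LinUcbOutput {
  selected: Arm;            // assumption: the chosen arm object
  score: number;            // expectedReward + confidenceWidth
  expectedReward: number;
  confidenceWidth: number;  // assumption: already scaled by alpha
  algorithm: string;        // assumption: a label such as "LinUCB"
}

// Multiply a d x d matrix by a length-d vector.
function matVec(m: number[][], v: number[]): number[] {
  return m.map(row => row.reduce((sum, mij, j) => sum + mij * v[j], 0));
}

function dot(a: number[], b: number[]): number {
  return a.reduce((sum, ai, i) => sum + ai * b[i], 0);
}

function linUcb(input: LinUcbInput): LinUcbOutput {
  const d = input.context.length;
  let best: LinUcbOutput | null = null;

  for (const arm of input.arms) {
    // Per-arm state: inverse design matrix (starts as identity) and reward vector b.
    const aInv = Array.from({ length: d }, (_, i) =>
      Array.from({ length: d }, (_, j) => (i === j ? 1 : 0)));
    const b = new Array<number>(d).fill(0);

    for (const obs of input.history) {
      if (obs.armId !== arm.id) continue;
      // Sherman-Morrison rank-one update of (A + x x^T)^-1, O(d^2) per observation.
      const ax = matVec(aInv, obs.context);
      const denom = 1 + dot(obs.context, ax);
      for (let i = 0; i < d; i++) {
        for (let j = 0; j < d; j++) aInv[i][j] -= (ax[i] * ax[j]) / denom;
        b[i] += obs.reward * obs.context[i];
      }
    }

    // Ridge estimate of the arm's weights, then the upper confidence bound.
    const theta = matVec(aInv, b);
    const expectedReward = dot(theta, input.context);
    const confidenceWidth =
      input.alpha * Math.sqrt(dot(input.context, matVec(aInv, input.context)));
    const score = expectedReward + confidenceWidth;

    // Strict comparison: ties go to the arm listed first.
    if (best === null || score > best.score) {
      best = { selected: arm, score, expectedReward, confidenceWidth, algorithm: "LinUCB" };
    }
  }

  if (best === null) throw new Error("at least one arm is required");
  return best;
}
```

Scoring k arms costs O(d^2) each once the per-arm state exists, which is consistent with the O(d^2 * k) complexity quoted above. This sketch rebuilds the state from `history` on every call for simplicity; a long-running service would more plausibly cache it between calls.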