Hi, how do you understand the problem of reinforcement learning applied to dynamic pricing in e-commerce?