Development of learning environment simulation: reinforcement learning powered play behavior model and comparative lab experiments