A reinforcement learning-based link quality estimation strategy for RPL and its impact on topology management