Pouyan, M., Golzari, S., Mousavi, A., Hatam, Ahmad. “Improving Q-Learning Using Simultaneous Updating and Adaptive Policy Based on Opposite Action.” Nashriyyah -i Muhandisi -i Barq va Muhandisi -i Kampyutar -i Iran, vol. 14, no. 2, 2016, pp. 137-146.