Evolution of Reinforcement Learning in Foraging Bees: A Simple Explanation for Risk Averse Behavior

Yael Niv, Daphna Joel, Isaac Meilijson and Eytan Ruppin

Reinforcement learning is a fundamental process by which organisms learn to achieve goals from their interactions with the environment. We use Evolutionary Computation techniques to derive (near-)optimal neuronal learning rules in a simple neural network model of decision-making in simulated bumblebees foraging for nectar. The resulting bees exhibit efficient reinforcement learning. The evolved synaptic plasticity dynamics give rise to varying exploration/exploitation levels and to the well-documented foraging strategy of risk aversion This behavior is shown to emerge directly from optimal reinforcement learning, providing a biologically founded, parsimonious and novel explanation of risk-averse behavior.

(Short version - MSWord 292K)

(Full text - ps file 218K)