K. Doya, Metalearning and neuromodulation, Neural Networks, vol.15, issue.4-6, pp.495-506, 2002.
DOI : 10.1016/S0893-6080(02)00044-8

D. Barraclough, M. Conroy, and D. Lee, Prefrontal cortex and decision making in a mixed-strategy game, Nature Neuroscience, vol.7, issue.4, pp.404-414, 2004.
DOI : 10.1038/nn1209

E. Procyk, Y. Tanaka, and J. Joseph, Anterior cingulate activity during routine and non-routine sequential behaviors in macaques, Nature Neuroscience, vol.3, issue.5, pp.502-510, 2000.
DOI : 10.1038/74880

URL : https://hal.archives-ouvertes.fr/inserm-00132133

G. Aston-jones and J. Cohen, Adaptive gain and the role of the locus coeruleus-norepinephrine system in optimal performance, The Journal of Comparative Neurology, vol.30, issue.1, pp.99-110, 2005.
DOI : 10.1002/cne.20723

J. Brown and T. Braver, Learned Predictions of Error Likelihood in the Anterior Cingulate Cortex, Science, vol.307, issue.5712, pp.1118-1139, 2005.
DOI : 10.1126/science.1105783

. Dosenbach, K. Visscher, E. Palmer, F. , M. Wenger et al., A Core System for the Implementation of Task Sets, Neuron, vol.50, issue.5, pp.799-812, 2006.
DOI : 10.1016/j.neuron.2006.04.031

M. Matsumoto, K. Matsumoto, H. Abe, and K. Tanaka, Medial prefrontal cell activity signaling prediction errors of action values, Nature Neuroscience, vol.93, issue.5, pp.647-56, 2007.
DOI : 10.1126/science.1069504

R. Quilodran, M. Rothe, and E. Procyk, Behavioral Shifts and Action Valuation in the Anterior Cingulate Cortex, Neuron, vol.57, issue.2, pp.314-339, 2008.
DOI : 10.1016/j.neuron.2007.11.031

URL : https://hal.archives-ouvertes.fr/inserm-00906686

R. Sutton and A. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

P. Dominey, M. Arbib, and J. Joseph, A Model of Corticostriatal Plasticity for Learning Oculomotor Associations and Sequences, Journal of Cognitive Neuroscience, vol.28, issue.12, pp.311-336, 1995.
DOI : 10.1146/annurev.ps.40.020189.001203

M. Khamassi, L. Martinet, and A. Guillot, Combining self-organizing maps with mixture of epxerts : Application to an Actor-Critic model of reinforcement learning in the basal ganglia, Proceedings of the 9th International Conference on the Simulation of Adaptive Behavior (SAB), pp.394-405, 2006.

W. Schultz, P. Dayan, and P. Montague, A Neural Substrate of Prediction and Reward, Science, vol.275, issue.5306, pp.1593-1602, 1997.
DOI : 10.1126/science.275.5306.1593

K. Gurney, T. Prescott, and P. Redgrave, A computational model of action selection in the basal ganglia. I. A new functional anatomy, Biological Cybernetics, vol.84, issue.6, pp.401-411, 2001.
DOI : 10.1007/PL00007984

B. Girard, V. Cuzin, A. Guillot, K. Gurney, and T. Prescott, A BASAL GANGLIA INSPIRED MODEL OF ACTION SELECTION EVALUATED IN A ROBOTIC SURVIVAL TASK, Journal of Integrative Neuroscience, vol.02, issue.02, pp.179-200, 2003.
DOI : 10.1142/S0219635203000299

URL : https://hal.archives-ouvertes.fr/hal-00016392

E. Procyk and P. Goldman-rakic, Modulation of Dorsolateral Prefrontal Delay Activity during Self-Organized Behavior, Journal of Neuroscience, vol.26, issue.44, pp.11313-11336, 2006.
DOI : 10.1523/JNEUROSCI.2157-06.2006

URL : https://hal.archives-ouvertes.fr/inserm-00132158

S. Dehaene and J. Changeux, A neuronal model of a global workspace in effortful cognitive tasks, Proc Natl Acad Sci, pp.95-14529, 1998.

J. Cohen, G. Aston-jones, and S. Gilzenut, A systems-level perspective on attention and cognitive control, Cognitive Neuroscience of Attention, pp.71-90, 2004.