TY - GEN
T1 - Reward gain model describes cortical use-dependent plasticity
AU - Mawase, Firas
AU - Wymbs, Nicholas
AU - Uehara, Shintaro
AU - Celnik, Pablo
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/10/13
Y1 - 2016/10/13
N2 - Consistent repetitions of an action lead to plastic change in the motor cortex and cause shift in the direction of future movements. This process is known as use-dependent plasticity (UDP), one of the basic forms of the motor memory. We have recently demonstrated in a physiological study that success-related reinforcement signals could modulate the strength of UDP. We tested this idea by developing a computational approach that modeled the shift in the direction of future action as a change in preferred direction of population activity of neurons in the primary motor cortex. The rate of the change follows a modified temporal difference reinforcement learning algorithm, in which the learning policy is based on comparison between what reward the population experiences on a particular trial, and what it had expected on the basis of its previous learning. By using this model, we were able to characterize the nature of learning and retention of UDP. Exploring the relationship between reinforcement and UDP constitutes a crucial step toward understanding the basic blocks involved in the formation of motor memories.
AB - Consistent repetitions of an action lead to plastic change in the motor cortex and cause shift in the direction of future movements. This process is known as use-dependent plasticity (UDP), one of the basic forms of the motor memory. We have recently demonstrated in a physiological study that success-related reinforcement signals could modulate the strength of UDP. We tested this idea by developing a computational approach that modeled the shift in the direction of future action as a change in preferred direction of population activity of neurons in the primary motor cortex. The rate of the change follows a modified temporal difference reinforcement learning algorithm, in which the learning policy is based on comparison between what reward the population experiences on a particular trial, and what it had expected on the basis of its previous learning. By using this model, we were able to characterize the nature of learning and retention of UDP. Exploring the relationship between reinforcement and UDP constitutes a crucial step toward understanding the basic blocks involved in the formation of motor memories.
UR - http://www.scopus.com/inward/record.url?scp=85009084224&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85009084224&partnerID=8YFLogxK
U2 - 10.1109/EMBC.2016.7590626
DO - 10.1109/EMBC.2016.7590626
M3 - Conference contribution
C2 - 28268267
AN - SCOPUS:85009084224
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
SP - 5
EP - 8
BT - 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2016
Y2 - 16 August 2016 through 20 August 2016
ER -