Explanipedia

Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models Open

Alec Solway · 2024

Reinforcement learning is used to align language models with human preference signals after first pre-training the model to predict the next token of text within a large corpus using likelihood maximization. Before being deployed in a spec…

Anterior cingulate cortex lesions impair multiple facets of task engagement not mediated by dorsomedial striatum neuron firing Open

Daniela Vázquez, Norma Peña-Flores, Sean Maulhardt, Alec Solway, Caroline J. Charpentier , et al. · 2024

The anterior cingulate cortex (ACC) has been implicated across multiple highly specialized cognitive functions—including task engagement, motivation, error detection, attention allocation, value processing, and action selection. Here, we a…

Information Uncertainty Influences Learning Strategy from Sequentially Delayed Rewards Open

Sean Maulhardt, Alec Solway, Caroline J. Charpentier · 2024

When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for human solutions given a complex and uncertain environme…

A machine-learning approach for differentiating borderline personality disorder from community participants with brain-wide functional connectivity Open

Juha M. Lahnakoski, Tobias Nolte, Alec Solway, Iris Vilares, Andreas Hula , et al. · 2024

Spatially distributed functional connectivity patterns are moderately predictive of BPD despite heterogeneity of the patient population.

Optogenetic Inhibition of Rat Anterior Cingulate Cortex Impairs the Ability to Initiate and Stay on Task Open

Daniela Vázquez, Sean Maulhardt, Thomas A. Stalnaker, Alec Solway, Caroline J. Charpentier , et al. · 2024

Our prior research has identified neural correlates of cognitive control in the anterior cingulate cortex (ACC), leading us to hypothesize that the ACC is necessary for increasing attention as rats flexibly learn new contingencies during a…

Perceptual Decision Impairments Linked to Obsessive-Compulsive Symptoms are Substantially Driven by State-Based Effects Open

Claire M. Kaplan, Alec Solway · 2022

Computational models of decision making have identified a relationship between obsessive-compulsive symptoms (OCS), both in the general population and in patients, and impairments in perceptual evidence accumulation. Some studies have inte…

Conflict and competition between model-based and model-free control Open

Yuqing Lei, Alec Solway · 2022

A large literature has accumulated suggesting that human and animal decision making is driven by at least two systems, and that important functions of these systems can be captured by reinforcement learning algorithms. The “model-free” sys…

The relationships between subclinical OCD symptoms, beta/gamma-band power, and the rate of evidence integration during perceptual decision making Open

Alec Solway, Isabella Schneider, Yuqing Lei · 2022

Previous studies have demonstrated that the rate of evidence integration during perceptual decision making, a specific computationally defined parameter, is negatively correlated with both subclinical symptoms of OCD measured on a continuu…

Reinforcement Learning Disruptions in Individuals With Depression and Sensitivity to Symptom Change Following Cognitive Behavioral Therapy Open

Vanessa M. Brown, Lusha Zhu, Alec Solway, John M. Wang, Katherine McCurry , et al. · 2021

In this study, the mapping of reinforcement learning components to symptoms of major depression revealed mechanistic features associated with these symptoms and points to possible learning-based therapeutic processes and targets.

Multiscale classification reveals a multivariate functional connectivity marker for borderline personality disorder Open

Juha M. Lahnakoski, Tobias Nolte, Alec Solway, Iris Vilares, Andreas Hula , et al. · 2021

BackgroundFunctional connectivity measures have garnered interest as possible biomarkers of psychiatric disorders including borderline personality disorder (BPD). However, small sample sizes and lack of within-study replications have led t…

Multiscale classification reveals a multivariate functional connectivity marker for borderline personality disorder Open

Juha M. Lahnakoski, Tobias Nolte, Alec Solway, Iris Vilares, Andreas Hula , et al. · 2021

BackgroundFunctional connectivity measures have garnered interest as possible biomarkers of psychiatric disorders including borderline personality disorder (BPD). However, small sample sizes and lack of within-study replications have led t…

Transfer of information across repeated decisions in general and in obsessive–compulsive disorder Open

Alec Solway, Zhen Lin, Ekansh Vinaik · 2020

Significance Real-life decisions are often repeated. Whether considering taking a new job, or doing something mundane like checking if the stove is off, people frequently revisit decisions. This mode of behavior takes a particularly pathol…

Loss Aversion Correlates With the Propensity to Deploy Model-Based Control Open

Alec Solway, Terry Lohrenz, P. Read Montague · 2019

Reward-based decision making is thought to be driven by at least two different types of decision systems: a simple stimulus-response cache-based system which embodies the common-sense notion of "habit," for which model-free reinforcement l…

Simulating future value in intertemporal choice Open

Alec Solway, Terry Lohrenz, P. Read Montague · 2017

Evidence integration in model-based tree search Open

Alec Solway, Matthew Botvinick · 2015

Significance Recent behavioral research has made rapid progress toward revealing the processes by which we make choices based on judgments of subjective value. A key insight has been that this process unfolds incrementally over time, as we…

Optimal Behavioral Hierarchy Open

Alec Solway, Carlos Diuk, Natalia I. Córdova, Debbie Yee, Andrew G. Barto , et al. · 2014

Human behavior has long been recognized to display hierarchical structure: actions fit together into subtasks, which cohere into extended goal-directed activities. Arranging actions hierarchically has well established benefits, allowing be…

Neural Activity in Human Hippocampal Formation Reveals the Spatial Context of Retrieved Memories Open

Jonathan Miller, Markus Neufang, Alec Solway, Armin Brandt, M. Trippel , et al. · 2013

Remembrance of Places Past The hippocampus has two major roles in cognition. Place-responsive neurons form a context-sensitive cognitive map, firing more strongly when an animal traverses specific regions of its environment. Both humans an…

Direct recordings of grid-like neuronal activity in human spatial navigation Open

Joshua Jacobs, Christoph T. Weidemann, Jonathan Miller, Alec Solway, John F. Burke , et al. · 2013

PandaEPL: A library for programming spatial navigation experiments Open

Alec Solway, Jonathan Miller, Michael J. Kahana · 2013

Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates. Open

Alec Solway, Matthew Botvinick · 2012

Recent work has given rise to the view that reward-based decision making is governed by two key controllers: a habit system, which stores stimulus-response associations shaped by past reward, and a goal-oriented system that selects actions…

Positional and temporal clustering in serial order memory Open

Alec Solway, Bennet B. Murdock, Michael J. Kahana · 2011

A Neural Signature of Hierarchical Reinforcement Learning Open

José J. F. Ribas-Fernandes, Alec Solway, Carlos Diuk, Joseph T. McGuire, Andrew G. Barto , et al. · 2011

PyParse: A semiautomated system for scoring spoken recall data Open

Alec Solway, Aaron S. Geller, Per B. Sederberg, Michael J. Kahana · 2010

Alec Solway YOU? Author Swipe