Alec Solway
YOU?
Author Swipe
View article: Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models
Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models Open
Reinforcement learning is used to align language models with human preference signals after first pre-training the model to predict the next token of text within a large corpus using likelihood maximization. Before being deployed in a spec…
View article: Anterior cingulate cortex lesions impair multiple facets of task engagement not mediated by dorsomedial striatum neuron firing
Anterior cingulate cortex lesions impair multiple facets of task engagement not mediated by dorsomedial striatum neuron firing Open
The anterior cingulate cortex (ACC) has been implicated across multiple highly specialized cognitive functions—including task engagement, motivation, error detection, attention allocation, value processing, and action selection. Here, we a…
View article: Information Uncertainty Influences Learning Strategy from Sequentially Delayed Rewards
Information Uncertainty Influences Learning Strategy from Sequentially Delayed Rewards Open
When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for human solutions given a complex and uncertain environme…
View article: A machine-learning approach for differentiating borderline personality disorder from community participants with brain-wide functional connectivity
A machine-learning approach for differentiating borderline personality disorder from community participants with brain-wide functional connectivity Open
Spatially distributed functional connectivity patterns are moderately predictive of BPD despite heterogeneity of the patient population.
View article: Optogenetic Inhibition of Rat Anterior Cingulate Cortex Impairs the Ability to Initiate and Stay on Task
Optogenetic Inhibition of Rat Anterior Cingulate Cortex Impairs the Ability to Initiate and Stay on Task Open
Our prior research has identified neural correlates of cognitive control in the anterior cingulate cortex (ACC), leading us to hypothesize that the ACC is necessary for increasing attention as rats flexibly learn new contingencies during a…
View article: Perceptual Decision Impairments Linked to Obsessive-Compulsive Symptoms are Substantially Driven by State-Based Effects
Perceptual Decision Impairments Linked to Obsessive-Compulsive Symptoms are Substantially Driven by State-Based Effects Open
Computational models of decision making have identified a relationship between obsessive-compulsive symptoms (OCS), both in the general population and in patients, and impairments in perceptual evidence accumulation. Some studies have inte…
View article: Conflict and competition between model-based and model-free control
Conflict and competition between model-based and model-free control Open
A large literature has accumulated suggesting that human and animal decision making is driven by at least two systems, and that important functions of these systems can be captured by reinforcement learning algorithms. The “model-free” sys…
View article: The relationships between subclinical OCD symptoms, beta/gamma-band power, and the rate of evidence integration during perceptual decision making
The relationships between subclinical OCD symptoms, beta/gamma-band power, and the rate of evidence integration during perceptual decision making Open
Previous studies have demonstrated that the rate of evidence integration during perceptual decision making, a specific computationally defined parameter, is negatively correlated with both subclinical symptoms of OCD measured on a continuu…
View article: Reinforcement Learning Disruptions in Individuals With Depression and Sensitivity to Symptom Change Following Cognitive Behavioral Therapy
Reinforcement Learning Disruptions in Individuals With Depression and Sensitivity to Symptom Change Following Cognitive Behavioral Therapy Open
In this study, the mapping of reinforcement learning components to symptoms of major depression revealed mechanistic features associated with these symptoms and points to possible learning-based therapeutic processes and targets.
View article: Multiscale classification reveals a multivariate functional connectivity marker for borderline personality disorder
Multiscale classification reveals a multivariate functional connectivity marker for borderline personality disorder Open
BackgroundFunctional connectivity measures have garnered interest as possible biomarkers of psychiatric disorders including borderline personality disorder (BPD). However, small sample sizes and lack of within-study replications have led t…
View article: Multiscale classification reveals a multivariate functional connectivity marker for borderline personality disorder
Multiscale classification reveals a multivariate functional connectivity marker for borderline personality disorder Open
BackgroundFunctional connectivity measures have garnered interest as possible biomarkers of psychiatric disorders including borderline personality disorder (BPD). However, small sample sizes and lack of within-study replications have led t…
View article: Transfer of information across repeated decisions in general and in obsessive–compulsive disorder
Transfer of information across repeated decisions in general and in obsessive–compulsive disorder Open
Significance Real-life decisions are often repeated. Whether considering taking a new job, or doing something mundane like checking if the stove is off, people frequently revisit decisions. This mode of behavior takes a particularly pathol…
View article: Loss Aversion Correlates With the Propensity to Deploy Model-Based Control
Loss Aversion Correlates With the Propensity to Deploy Model-Based Control Open
Reward-based decision making is thought to be driven by at least two different types of decision systems: a simple stimulus-response cache-based system which embodies the common-sense notion of "habit," for which model-free reinforcement l…
View article: Simulating future value in intertemporal choice
Simulating future value in intertemporal choice Open
View article: Evidence integration in model-based tree search
Evidence integration in model-based tree search Open
Significance Recent behavioral research has made rapid progress toward revealing the processes by which we make choices based on judgments of subjective value. A key insight has been that this process unfolds incrementally over time, as we…
View article: Optimal Behavioral Hierarchy
Optimal Behavioral Hierarchy Open
Human behavior has long been recognized to display hierarchical structure: actions fit together into subtasks, which cohere into extended goal-directed activities. Arranging actions hierarchically has well established benefits, allowing be…
View article: Neural Activity in Human Hippocampal Formation Reveals the Spatial Context of Retrieved Memories
Neural Activity in Human Hippocampal Formation Reveals the Spatial Context of Retrieved Memories Open
Remembrance of Places Past The hippocampus has two major roles in cognition. Place-responsive neurons form a context-sensitive cognitive map, firing more strongly when an animal traverses specific regions of its environment. Both humans an…
View article: Direct recordings of grid-like neuronal activity in human spatial navigation
Direct recordings of grid-like neuronal activity in human spatial navigation Open
View article: PandaEPL: A library for programming spatial navigation experiments
PandaEPL: A library for programming spatial navigation experiments Open
View article: Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates.
Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates. Open
Recent work has given rise to the view that reward-based decision making is governed by two key controllers: a habit system, which stores stimulus-response associations shaped by past reward, and a goal-oriented system that selects actions…
View article: Positional and temporal clustering in serial order memory
Positional and temporal clustering in serial order memory Open
View article: A Neural Signature of Hierarchical Reinforcement Learning
A Neural Signature of Hierarchical Reinforcement Learning Open
View article: PyParse: A semiautomated system for scoring spoken recall data
PyParse: A semiautomated system for scoring spoken recall data Open