Reasoning Elicitation in Language Models via Counterfactual Feedback
2024 · Open Access
· DOI: https://doi.org/10.48550/arxiv.2410.03767
Despite the increasing effectiveness of language models, their reasoning capabilities remain underdeveloped. In particular, causal reasoning through counterfactual question answering is lacking. This work aims to bridge this gap. We first derive novel metrics that balance accuracy in factual and counterfactual questions, capturing a more complete view of the reasoning abilities of language models than traditional factual-only based metrics. Second, we propose several fine-tuning approaches that aim to elicit better reasoning mechanisms, in the sense of the proposed metrics. Finally, we evaluate the performance of the fine-tuned language models in a variety of realistic scenarios. In particular, we investigate to what extent our fine-tuning approaches systemically achieve better generalization with respect to the base models in several problems that require, among others, inductive and deductive reasoning capabilities.
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2410.03767, https://arxiv.org/pdf/2410.03767
- OA Status: green
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4403928893
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4403928893 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2410.03767 (Digital Object Identifier)
- Title: Reasoning Elicitation in Language Models via Counterfactual Feedback (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2024 (year of publication)
- Publication date: 2024-10-02 (full publication date if available)
- Authors: Alihan Hüyük, Xinnuo Xu, Jacqueline Maasch, Aditya V. Nori, Javier González (list of authors in order)
- Landing page: https://arxiv.org/abs/2410.03767 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2410.03767 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2410.03767 (direct OA link when available)
- Concepts: Counterfactual thinking, Computer science, Natural language processing, Linguistics, Psychology, Social psychology, Philosophy (top concepts (fields/topics) attached by OpenAlex)
- Cited by: 0 (total citation count in OpenAlex)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| field | value |
|---|---|
| id | https://openalex.org/W4403928893 |
| doi | https://doi.org/10.48550/arxiv.2410.03767 |
| ids.doi | https://doi.org/10.48550/arxiv.2410.03767 |
| ids.openalex | https://openalex.org/W4403928893 |
| fwci | |
| type | preprint |
| title | Reasoning Elicitation in Language Models via Counterfactual Feedback |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10181 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9591000080108643 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Natural Language Processing Techniques |
| topics[1].id | https://openalex.org/T10028 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9401999711990356 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Topic Modeling |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C108650721 |
| concepts[0].level | 2 |
| concepts[0].score | 0.9179164171218872 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q1783253 |
| concepts[0].display_name | Counterfactual thinking |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.5959025621414185 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C204321447 |
| concepts[2].level | 1 |
| concepts[2].score | 0.4254884421825409 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[2].display_name | Natural language processing |
| concepts[3].id | https://openalex.org/C41895202 |
| concepts[3].level | 1 |
| concepts[3].score | 0.3321949243545532 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q8162 |
| concepts[3].display_name | Linguistics |
| concepts[4].id | https://openalex.org/C15744967 |
| concepts[4].level | 0 |
| concepts[4].score | 0.2475643754005432 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q9418 |
| concepts[4].display_name | Psychology |
| concepts[5].id | https://openalex.org/C77805123 |
| concepts[5].level | 1 |
| concepts[5].score | 0.1075977087020874 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q161272 |
| concepts[5].display_name | Social psychology |
| concepts[6].id | https://openalex.org/C138885662 |
| concepts[6].level | 0 |
| concepts[6].score | 0.10251912474632263 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q5891 |
| concepts[6].display_name | Philosophy |
| keywords[0].id | https://openalex.org/keywords/counterfactual-thinking |
| keywords[0].score | 0.9179164171218872 |
| keywords[0].display_name | Counterfactual thinking |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.5959025621414185 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/natural-language-processing |
| keywords[2].score | 0.4254884421825409 |
| keywords[2].display_name | Natural language processing |
| keywords[3].id | https://openalex.org/keywords/linguistics |
| keywords[3].score | 0.3321949243545532 |
| keywords[3].display_name | Linguistics |
| keywords[4].id | https://openalex.org/keywords/psychology |
| keywords[4].score | 0.2475643754005432 |
| keywords[4].display_name | Psychology |
| keywords[5].id | https://openalex.org/keywords/social-psychology |
| keywords[5].score | 0.1075977087020874 |
| keywords[5].display_name | Social psychology |
| keywords[6].id | https://openalex.org/keywords/philosophy |
| keywords[6].score | 0.10251912474632263 |
| keywords[6].display_name | Philosophy |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2410.03767 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2410.03767 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2410.03767 |
| locations[1].id | doi:10.48550/arxiv.2410.03767 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2410.03767 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5026858918 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-5621-5698 |
| authorships[0].author.display_name | Alihan Hüyük |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Hüyük, Alihan |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5011220230 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Xinnuo Xu |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Xu, Xinnuo |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5114470000 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Jacqueline Maasch |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Maasch, Jacqueline |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5111937381 |
| authorships[3].author.orcid | |
| authorships[3].author.display_name | Aditya V. Nori |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Nori, Aditya V. |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5107318885 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Javier González |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | González, Javier |
| authorships[4].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2410.03767 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2024-11-01T00:00:00 |
| display_name | Reasoning Elicitation in Language Models via Counterfactual Feedback |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10181 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9591000080108643 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Natural Language Processing Techniques |
| related_works | https://openalex.org/W4391375266, https://openalex.org/W2899084033, https://openalex.org/W2748952813, https://openalex.org/W3201448254, https://openalex.org/W4286970243, https://openalex.org/W2066431708, https://openalex.org/W4384133558, https://openalex.org/W3025615835, https://openalex.org/W173210993, https://openalex.org/W2390660599 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2410.03767 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2410.03767 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2410.03767 |
| primary_location.id | pmh:oai:arXiv.org:2410.03767 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2410.03767 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2410.03767 |
| publication_date | 2024-10-02 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 43, 90 |
| abstract_inverted_index.In | 12, 95 |
| abstract_inverted_index.We | 29 |
| abstract_inverted_index.in | 37, 72, 89, 115 |
| abstract_inverted_index.is | 20 |
| abstract_inverted_index.of | 4, 47, 51, 75, 84, 92 |
| abstract_inverted_index.to | 25, 67, 99, 111 |
| abstract_inverted_index.we | 60, 80, 97 |
| abstract_inverted_index.aim | 66 |
| abstract_inverted_index.and | 39, 123 |
| abstract_inverted_index.our | 102 |
| abstract_inverted_index.the | 1, 48, 73, 76, 82, 85, 112 |
| abstract_inverted_index.This | 22 |
| abstract_inverted_index.aims | 24 |
| abstract_inverted_index.base | 113 |
| abstract_inverted_index.gap. | 28 |
| abstract_inverted_index.more | 44 |
| abstract_inverted_index.than | 54 |
| abstract_inverted_index.that | 34, 65, 118 |
| abstract_inverted_index.this | 27 |
| abstract_inverted_index.view | 46 |
| abstract_inverted_index.what | 100 |
| abstract_inverted_index.with | 109 |
| abstract_inverted_index.work | 23 |
| abstract_inverted_index.among | 120 |
| abstract_inverted_index.based | 57 |
| abstract_inverted_index.first | 30 |
| abstract_inverted_index.novel | 32 |
| abstract_inverted_index.sense | 74 |
| abstract_inverted_index.their | 7 |
| abstract_inverted_index.better | 69, 107 |
| abstract_inverted_index.bridge | 26 |
| abstract_inverted_index.causal | 14 |
| abstract_inverted_index.derive | 31 |
| abstract_inverted_index.elicit | 68 |
| abstract_inverted_index.extent | 101 |
| abstract_inverted_index.models | 53, 88, 114 |
| abstract_inverted_index.remain | 10 |
| abstract_inverted_index.Despite | 0 |
| abstract_inverted_index.Second, | 59 |
| abstract_inverted_index.achieve | 106 |
| abstract_inverted_index.balance | 35 |
| abstract_inverted_index.factual | 38 |
| abstract_inverted_index.metrics | 33 |
| abstract_inverted_index.models, | 6 |
| abstract_inverted_index.others, | 121 |
| abstract_inverted_index.propose | 61 |
| abstract_inverted_index.respect | 110 |
| abstract_inverted_index.several | 62, 116 |
| abstract_inverted_index.through | 16 |
| abstract_inverted_index.variety | 91 |
| abstract_inverted_index.Finally, | 79 |
| abstract_inverted_index.accuracy | 36 |
| abstract_inverted_index.complete | 45 |
| abstract_inverted_index.evaluate | 81 |
| abstract_inverted_index.lacking. | 21 |
| abstract_inverted_index.language | 5, 52, 87 |
| abstract_inverted_index.metrics. | 58, 78 |
| abstract_inverted_index.problems | 117 |
| abstract_inverted_index.proposed | 77 |
| abstract_inverted_index.question | 18 |
| abstract_inverted_index.require, | 119 |
| abstract_inverted_index.abilities | 50 |
| abstract_inverted_index.answering | 19 |
| abstract_inverted_index.capturing | 42 |
| abstract_inverted_index.deductive | 124 |
| abstract_inverted_index.inductive | 122 |
| abstract_inverted_index.realistic | 93 |
| abstract_inverted_index.reasoning | 8, 15, 49, 70, 125 |
| abstract_inverted_index.approaches | 64, 104 |
| abstract_inverted_index.fine-tuned | 86 |
| abstract_inverted_index.increasing | 2 |
| abstract_inverted_index.questions, | 41 |
| abstract_inverted_index.scenarios. | 94 |
| abstract_inverted_index.fine-tuning | 63, 103 |
| abstract_inverted_index.investigate | 98 |
| abstract_inverted_index.mechanisms, | 71 |
| abstract_inverted_index.particular, | 13, 96 |
| abstract_inverted_index.performance | 83 |
| abstract_inverted_index.traditional | 55 |
| abstract_inverted_index.capabilities | 9 |
| abstract_inverted_index.factual-only | 56 |
| abstract_inverted_index.systemically | 105 |
| abstract_inverted_index.capabilities. | 126 |
| abstract_inverted_index.effectiveness | 3 |
| abstract_inverted_index.counterfactual | 17, 40 |
| abstract_inverted_index.generalization | 108 |
| abstract_inverted_index.underdeveloped. | 11 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 5 |
| citation_normalized_percentile | |
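
The `abstract_inverted_index.*` rows in the payload store the abstract as a mapping from each token to the zero-based positions where it occurs. The plain text can be recovered by placing every token at its positions and joining them in order. The following is a minimal sketch under that reading of the rows; the function name `rebuild_abstract` and the `sample` dictionary are illustrative assumptions, and `sample` copies only positions 0 to 11 from the table rather than the full index.

```python
def rebuild_abstract(inverted_index):
    """Rebuild plain text from a token -> list-of-positions mapping."""
    by_position = {}
    for token, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = token
    # Join tokens in ascending position order, separated by single spaces.
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Hypothetical excerpt: only the entries for positions 0-11, copied from
# the payload rows (other positions of "the", "of", etc. are omitted).
sample = {
    "Despite": [0],
    "the": [1],
    "increasing": [2],
    "effectiveness": [3],
    "of": [4],
    "language": [5],
    "models,": [6],
    "their": [7],
    "reasoning": [8],
    "capabilities": [9],
    "remain": [10],
    "underdeveloped.": [11],
}

print(rebuild_abstract(sample))
```

Run on the excerpt, this reproduces the first sentence of the abstract shown near the top of the record. Passing the complete index (positions 0 to 126) should yield the whole abstract in the same way.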