How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs
2024 · Open Access · DOI: https://doi.org/10.48550/arxiv.2410.13857
Despite the remarkable success of Transformer-based large language models (LLMs) across various domains, understanding and enhancing their mathematical capabilities remains a significant challenge. In this paper, we conduct a rigorous theoretical analysis of LLMs' mathematical abilities, with a specific focus on their arithmetic performances. We identify numerical precision as a key factor that influences their effectiveness in arithmetical tasks. Our results show that Transformers operating with low numerical precision fail to address arithmetic tasks, such as iterated addition and integer multiplication, unless the model size grows super-polynomially with respect to the input length. In contrast, Transformers with standard numerical precision can efficiently handle these tasks with significantly smaller model sizes. We further support our theoretical findings through empirical experiments that explore the impact of varying numerical precision on arithmetic tasks, providing valuable insights for improving the mathematical reasoning capabilities of LLMs.
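
The effect of limited precision on iterated addition can be seen in a small, self-contained sketch. This is not the paper's construction or its experiments; it only illustrates, under the assumption that "low precision" is modeled by rounding every intermediate sum to IEEE 754 half precision, how a running sum can stall once it exceeds what the format can represent. The helper names `to_fp16` and `iterated_sum_fp16` are hypothetical.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    # The "e" format code packs a float into 16 bits (round-half-to-even).
    return struct.unpack("<e", struct.pack("<e", x))[0]


def iterated_sum_fp16(values):
    """Add values one at a time, rounding every partial sum to half precision."""
    total = 0.0
    for v in values:
        total = to_fp16(total + to_fp16(v))
    return total


values = [1.0] * 4096

exact_total = sum(values)                        # double precision: 4096.0
low_precision_total = iterated_sum_fp16(values)  # stalls at 2048.0

print(exact_total, low_precision_total)
```

Half precision has an 11-bit significand, so above 2048 the spacing between representable values is 2. Adding 1 to 2048 gives 2049, which rounds back to 2048, and the running sum never advances past that point even though every individual operand is exactly representable.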
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2410.13857, https://arxiv.org/pdf/2410.13857
- OA Status: green
- Cited By: 2
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4403580251
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4403580251 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2410.13857 (Digital Object Identifier)
- Title: How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2024 (year of publication)
- Publication date: 2024-10-17 (full publication date if available)
- Authors: Guhao Feng, Kai Yang, Yuntian Gu, Xinyue Ai, Shengjie Luo, Jiacheng Sun, Di He, Zhenguo Li, Liwei Wang (list of authors in order)
- Landing page: https://arxiv.org/abs/2410.13857 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2410.13857 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2410.13857 (direct OA link when available)
- Concepts: Computer science, Management science, Economics (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 2 (total citation count in OpenAlex)
- Citations by year (recent): 2025: 1, 2024: 1 (per-year citation counts, last 5 years)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| id | https://openalex.org/W4403580251 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2410.13857 |
| ids.doi | https://doi.org/10.48550/arxiv.2410.13857 |
| ids.openalex | https://openalex.org/W4403580251 |
| fwci | |
| type | preprint |
| title | How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T13523 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9527000188827515 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1703 |
| topics[0].subfield.display_name | Computational Theory and Mathematics |
| topics[0].display_name | Mathematics, Computing, and Information Processing |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.43097883462905884 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C539667460 |
| concepts[1].level | 1 |
| concepts[1].score | 0.3331190347671509 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q2414942 |
| concepts[1].display_name | Management science |
| concepts[2].id | https://openalex.org/C162324750 |
| concepts[2].level | 0 |
| concepts[2].score | 0.19056087732315063 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q8134 |
| concepts[2].display_name | Economics |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.43097883462905884 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/management-science |
| keywords[1].score | 0.3331190347671509 |
| keywords[1].display_name | Management science |
| keywords[2].id | https://openalex.org/keywords/economics |
| keywords[2].score | 0.19056087732315063 |
| keywords[2].display_name | Economics |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2410.13857 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2410.13857 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2410.13857 |
| locations[1].id | doi:10.48550/arxiv.2410.13857 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2410.13857 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5114338158 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Guhao Feng |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Feng, Guhao |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5006316216 |
| authorships[1].author.orcid | https://orcid.org/0000-0003-1059-0705 |
| authorships[1].author.display_name | Kai Yang |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Yang, Kai |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5091095118 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-0430-0348 |
| authorships[2].author.display_name | Y. Gu |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Gu, Yuntian |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5114338159 |
| authorships[3].author.orcid | |
| authorships[3].author.display_name | Xinyue Ai |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Ai, Xinyue |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5114338160 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Shengjie Luo |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Luo, Shengjie |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5048528530 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-7992-8092 |
| authorships[5].author.display_name | Jian Sun |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Sun, Jiacheng |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5024485817 |
| authorships[6].author.orcid | https://orcid.org/0000-0002-5703-6616 |
| authorships[6].author.display_name | David He |
| authorships[6].author_position | middle |
| authorships[6].raw_author_name | He, Di |
| authorships[6].is_corresponding | False |
| authorships[7].author.id | https://openalex.org/A5103196797 |
| authorships[7].author.orcid | https://orcid.org/0000-0002-8492-3069 |
| authorships[7].author.display_name | Zhenguo Li |
| authorships[7].author_position | middle |
| authorships[7].raw_author_name | Li, Zhenguo |
| authorships[7].is_corresponding | False |
| authorships[8].author.id | https://openalex.org/A5100705123 |
| authorships[8].author.orcid | https://orcid.org/0009-0005-7631-9207 |
| authorships[8].author.display_name | Lijia Wang |
| authorships[8].author_position | last |
| authorships[8].raw_author_name | Wang, Liwei |
| authorships[8].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2410.13857 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2024-10-21T00:00:00 |
| display_name | How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T13523 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9527000188827515 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1703 |
| primary_topic.subfield.display_name | Computational Theory and Mathematics |
| primary_topic.display_name | Mathematics, Computing, and Information Processing |
| related_works | https://openalex.org/W4391375266, https://openalex.org/W2899084033, https://openalex.org/W2748952813, https://openalex.org/W2390279801, https://openalex.org/W4391913857, https://openalex.org/W2358668433, https://openalex.org/W4396701345, https://openalex.org/W2376932109, https://openalex.org/W2001405890, https://openalex.org/W4396696052 |
| cited_by_count | 2 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 1 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2410.13857 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2410.13857 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2410.13857 |
| primary_location.id | pmh:oai:arXiv.org:2410.13857 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2410.13857 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2410.13857 |
| publication_date | 2024-10-17 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 20, 28, 37, 49 |
| abstract_inverted_index.In | 23, 93 |
| abstract_inverted_index.We | 44, 110 |
| abstract_inverted_index.as | 48, 75 |
| abstract_inverted_index.in | 56 |
| abstract_inverted_index.of | 4, 32, 123, 139 |
| abstract_inverted_index.on | 40, 127 |
| abstract_inverted_index.to | 70, 89 |
| abstract_inverted_index.we | 26 |
| abstract_inverted_index.Our | 59 |
| abstract_inverted_index.and | 14, 78 |
| abstract_inverted_index.can | 100 |
| abstract_inverted_index.for | 133 |
| abstract_inverted_index.key | 50 |
| abstract_inverted_index.low | 66 |
| abstract_inverted_index.our | 113 |
| abstract_inverted_index.the | 1, 82, 90, 121, 135 |
| abstract_inverted_index.fail | 69 |
| abstract_inverted_index.show | 61 |
| abstract_inverted_index.size | 84 |
| abstract_inverted_index.such | 74 |
| abstract_inverted_index.that | 52, 62, 119 |
| abstract_inverted_index.this | 24 |
| abstract_inverted_index.with | 36, 65, 87, 96, 105 |
| abstract_inverted_index.LLMs' | 33 |
| abstract_inverted_index.LLMs. | 140 |
| abstract_inverted_index.focus | 39 |
| abstract_inverted_index.grows | 85 |
| abstract_inverted_index.input | 91 |
| abstract_inverted_index.large | 6 |
| abstract_inverted_index.model | 83, 108 |
| abstract_inverted_index.tasks | 104 |
| abstract_inverted_index.their | 16, 41, 54 |
| abstract_inverted_index.these | 103 |
| abstract_inverted_index.(LLMs) | 9 |
| abstract_inverted_index.across | 10 |
| abstract_inverted_index.factor | 51 |
| abstract_inverted_index.handle | 102 |
| abstract_inverted_index.impact | 122 |
| abstract_inverted_index.models | 8 |
| abstract_inverted_index.paper, | 25 |
| abstract_inverted_index.sizes. | 109 |
| abstract_inverted_index.tasks, | 73, 129 |
| abstract_inverted_index.tasks. | 58 |
| abstract_inverted_index.unless | 81 |
| abstract_inverted_index.Despite | 0 |
| abstract_inverted_index.address | 71 |
| abstract_inverted_index.conduct | 27 |
| abstract_inverted_index.explore | 120 |
| abstract_inverted_index.further | 111 |
| abstract_inverted_index.integer | 79 |
| abstract_inverted_index.length. | 92 |
| abstract_inverted_index.remains | 19 |
| abstract_inverted_index.respect | 88 |
| abstract_inverted_index.results | 60 |
| abstract_inverted_index.smaller | 107 |
| abstract_inverted_index.success | 3 |
| abstract_inverted_index.support | 112 |
| abstract_inverted_index.through | 116 |
| abstract_inverted_index.various | 11 |
| abstract_inverted_index.varying | 124 |
| abstract_inverted_index.addition | 77 |
| abstract_inverted_index.analysis | 31 |
| abstract_inverted_index.domains, | 12 |
| abstract_inverted_index.findings | 115 |
| abstract_inverted_index.identify | 45 |
| abstract_inverted_index.insights | 132 |
| abstract_inverted_index.iterated | 76 |
| abstract_inverted_index.language | 7 |
| abstract_inverted_index.rigorous | 29 |
| abstract_inverted_index.specific | 38 |
| abstract_inverted_index.standard | 97 |
| abstract_inverted_index.valuable | 131 |
| abstract_inverted_index.contrast, | 94 |
| abstract_inverted_index.empirical | 117 |
| abstract_inverted_index.enhancing | 15 |
| abstract_inverted_index.improving | 134 |
| abstract_inverted_index.numerical | 46, 67, 98, 125 |
| abstract_inverted_index.operating | 64 |
| abstract_inverted_index.precision | 47, 68, 99, 126 |
| abstract_inverted_index.providing | 130 |
| abstract_inverted_index.reasoning | 137 |
| abstract_inverted_index.abilities, | 35 |
| abstract_inverted_index.arithmetic | 42, 72, 128 |
| abstract_inverted_index.challenge. | 22 |
| abstract_inverted_index.influences | 53 |
| abstract_inverted_index.remarkable | 2 |
| abstract_inverted_index.efficiently | 101 |
| abstract_inverted_index.experiments | 118 |
| abstract_inverted_index.significant | 21 |
| abstract_inverted_index.theoretical | 30, 114 |
| abstract_inverted_index.Transformers | 63, 95 |
| abstract_inverted_index.arithmetical | 57 |
| abstract_inverted_index.capabilities | 18, 138 |
| abstract_inverted_index.mathematical | 17, 34, 136 |
| abstract_inverted_index.effectiveness | 55 |
| abstract_inverted_index.performances. | 43 |
| abstract_inverted_index.significantly | 106 |
| abstract_inverted_index.understanding | 13 |
| abstract_inverted_index.multiplication, | 80 |
| abstract_inverted_index.Transformer-based | 5 |
| abstract_inverted_index.super-polynomially | 86 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 9 |
| citation_normalized_percentile | |
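
The `abstract_inverted_index.*` rows in the payload store the abstract as a mapping from each word to the positions where it occurs. A minimal sketch of turning such a mapping back into running text follows. The function name `rebuild_abstract` is hypothetical, and `sample_index` is an illustrative subset of the rows above, truncated to positions 0 through 9 rather than the full record.

```python
def rebuild_abstract(inverted_index: dict[str, list[int]]) -> str:
    """Rebuild text from a word -> positions mapping."""
    words_by_position = {}
    for word, positions in inverted_index.items():
        for position in positions:
            words_by_position[position] = word
    # Emit the words in ascending position order.
    return " ".join(words_by_position[p] for p in sorted(words_by_position))


# Subset of the payload, limited to the first ten positions (assumption:
# the remaining positions of "the" and "of" are dropped for brevity).
sample_index = {
    "the": [1],
    "of": [4],
    "large": [6],
    "(LLMs)": [9],
    "models": [8],
    "Despite": [0],
    "success": [3],
    "language": [7],
    "remarkable": [2],
    "Transformer-based": [5],
}

print(rebuild_abstract(sample_index))
# Despite the remarkable success of Transformer-based large language models (LLMs)
```

Applied to the full set of rows, the same procedure should reproduce the abstract shown at the top of this record.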