Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2511.20798
Recent advances in mechanistic interpretability have revealed that large language models (LLMs) develop internal representations corresponding not only to concrete entities but also to distinct, human-understandable abstract concepts and behaviour. Moreover, these hidden features can be directly manipulated to steer model behaviour. However, it remains an open question whether this phenomenon is unique to models trained on inherently structured data (i.e., language, images) or whether it is a general property of foundation models. In this work, we investigate the internal representations of a large physics-focused foundation model. Inspired by recent work identifying single directions in activation space for complex behaviours in LLMs, we extract activation vectors from the model during forward passes over simulation datasets for different physical regimes. We then compute "delta" representations between the two regimes. These delta tensors act as concept directions in activation space, encoding specific physical features. By injecting these concept directions back into the model during inference, we can steer its predictions, demonstrating causal control over physical behaviours, such as inducing or removing a particular physical feature from a simulation. These results suggest that scientific foundation models learn generalised representations of physical principles rather than merely relying on superficial correlations and patterns in the simulations. Our findings open new avenues for understanding and controlling scientific foundation models and have implications for AI-enabled scientific discovery.
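The delta-and-inject procedure described in the abstract can be illustrated on a toy network. The sketch below is not the authors' implementation. The two-layer NumPy model, the synthetic "regime" datasets, the use of a mean-activation difference as the delta, and the names `hidden`, `forward`, `steer`, and `alpha` are all assumptions made for illustration; the abstract does not specify how the delta is computed or where it is injected.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy "model": one tanh hidden layer and a linear readout.
# It stands in for a real foundation model purely for illustration.
W1 = rng.normal(size=(8, 16))
W2 = rng.normal(size=(16, 4))


def hidden(x):
    """Hidden-layer activations for a batch x of shape (n, 8)."""
    return np.tanh(x @ W1)


def forward(x, steer=None, alpha=1.0):
    """Forward pass; optionally add alpha * steer to the hidden activations."""
    h = hidden(x)
    if steer is not None:
        h = h + alpha * steer  # inject the concept direction
    return h @ W2


# Two synthetic input sets standing in for two physical regimes (assumed data).
regime_a = rng.normal(loc=0.0, size=(64, 8))
regime_b = rng.normal(loc=1.0, size=(64, 8))

# Assumed definition of the "delta": difference of mean hidden activations.
delta = hidden(regime_b).mean(axis=0) - hidden(regime_a).mean(axis=0)

# Steer regime-A inputs towards regime B by injecting the delta.
baseline = forward(regime_a)
steered = forward(regime_a, steer=delta, alpha=1.0)
shift = steered - baseline
```

Because the readout in this toy model is linear, every row of `shift` equals `delta @ W2`, and scaling `alpha` scales the shift proportionally. A real model with further nonlinear layers after the injection point would not respond this simply.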
- Type: preprint
- Landing Page: http://arxiv.org/abs/2511.20798 (PDF: https://arxiv.org/pdf/2511.20798)
- OA Status: green
- OpenAlex ID: https://openalex.org/W4416778131
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4416778131 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2511.20798 (Digital Object Identifier)
- Title: Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model (work title)
- Type: preprint (OpenAlex work type)
- Publication year: 2025 (year of publication)
- Publication date: 2025-11-25 (full publication date if available)
- Authors: Payel Mukhopadhyay, Michael T. McCabe, Miles Cranmer (list of authors in order)
- Landing page: https://arxiv.org/abs/2511.20798 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2511.20798 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2511.20798 (direct OA link when available)
- Cited by: 0 (total citation count in OpenAlex)
Full payload
| id | https://openalex.org/W4416778131 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2511.20798 |
| ids.doi | https://doi.org/10.48550/arxiv.2511.20798 |
| ids.openalex | https://openalex.org/W4416778131 |
| fwci | |
| type | preprint |
| title | Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | |
| locations[0].id | pmh:oai:arXiv.org:2511.20798 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2511.20798 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2511.20798 |
| locations[1].id | doi:10.48550/arxiv.2511.20798 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2511.20798 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5097372488 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Payel Mukhopadhyay |
| authorships[0].author_position | middle |
| authorships[0].raw_author_name | Mukhopadhyay, Payel |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5039169226 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-5715-9563 |
| authorships[1].author.display_name | Michael T. McCabe |
| authorships[1].author_position | last |
| authorships[1].raw_author_name | McCabe, Michael |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5078731429 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-6458-3423 |
| authorships[2].author.display_name | Miles Cranmer |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Cranmer, Miles |
| authorships[2].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2511.20798 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-11-28T00:00:00 |
| display_name | Physics Steering: Causal Control of Cross-Domain Concepts in a Physics Foundation Model |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-28T23:14:17.795251 |
| primary_topic | |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2511.20798 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2511.20798 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2511.20798 |
| primary_location.id | pmh:oai:arXiv.org:2511.20798 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2511.20798 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2511.20798 |
| publication_date | 2025-11-25 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 66, 81, 173 |
| abstract_inverted_index.By | 141 |
| abstract_inverted_index.In | 72 |
| abstract_inverted_index.We | 118 |
| abstract_inverted_index.an | 44 |
| abstract_inverted_index.as | 131, 164 |
| abstract_inverted_index.be | 34 |
| abstract_inverted_index.by | 87 |
| abstract_inverted_index.do | 189 |
| abstract_inverted_index.if | 63 |
| abstract_inverted_index.in | 2, 93, 99, 134, 198 |
| abstract_inverted_index.is | 50, 65 |
| abstract_inverted_index.it | 42, 64 |
| abstract_inverted_index.of | 69, 80, 185 |
| abstract_inverted_index.on | 55, 193 |
| abstract_inverted_index.or | 62, 166 |
| abstract_inverted_index.to | 18, 37, 52 |
| abstract_inverted_index.we | 75, 101, 152 |
| abstract_inverted_index.Our | 201 |
| abstract_inverted_index.act | 130 |
| abstract_inverted_index.and | 27, 196, 208, 213 |
| abstract_inverted_index.but | 21 |
| abstract_inverted_index.can | 33, 153 |
| abstract_inverted_index.for | 96, 114, 206, 216 |
| abstract_inverted_index.has | 214 |
| abstract_inverted_index.its | 155 |
| abstract_inverted_index.new | 204 |
| abstract_inverted_index.not | 16, 190 |
| abstract_inverted_index.the | 77, 106, 124, 148, 199 |
| abstract_inverted_index.two | 125 |
| abstract_inverted_index.(ie. | 59 |
| abstract_inverted_index.They | 188 |
| abstract_inverted_index.also | 22 |
| abstract_inverted_index.back | 146 |
| abstract_inverted_index.data | 58 |
| abstract_inverted_index.from | 105, 172 |
| abstract_inverted_index.have | 5 |
| abstract_inverted_index.into | 147 |
| abstract_inverted_index.only | 17 |
| abstract_inverted_index.open | 45, 203 |
| abstract_inverted_index.over | 111, 160 |
| abstract_inverted_index.rely | 192 |
| abstract_inverted_index.some | 168 |
| abstract_inverted_index.such | 163 |
| abstract_inverted_index.that | 7, 178 |
| abstract_inverted_index.then | 119 |
| abstract_inverted_index.this | 48, 73 |
| abstract_inverted_index.work | 89 |
| abstract_inverted_index.LLMs, | 100 |
| abstract_inverted_index.These | 127, 175 |
| abstract_inverted_index.delta | 128 |
| abstract_inverted_index.large | 8, 82 |
| abstract_inverted_index.learn | 182 |
| abstract_inverted_index.model | 39, 107, 149 |
| abstract_inverted_index.space | 95 |
| abstract_inverted_index.steer | 38, 154 |
| abstract_inverted_index.these | 30, 143 |
| abstract_inverted_index.work, | 74 |
| abstract_inverted_index.(LLMs) | 11 |
| abstract_inverted_index.Recent | 0 |
| abstract_inverted_index.causal | 158 |
| abstract_inverted_index.during | 108, 150 |
| abstract_inverted_index.hidden | 31 |
| abstract_inverted_index.merely | 191 |
| abstract_inverted_index.model. | 85 |
| abstract_inverted_index.models | 10, 53, 181, 212 |
| abstract_inverted_index.passes | 110 |
| abstract_inverted_index.recent | 88 |
| abstract_inverted_index.single | 91 |
| abstract_inverted_index.space, | 136 |
| abstract_inverted_index.unique | 51 |
| abstract_inverted_index."delta" | 121 |
| abstract_inverted_index.avenues | 205 |
| abstract_inverted_index.between | 123 |
| abstract_inverted_index.complex | 97 |
| abstract_inverted_index.compute | 120 |
| abstract_inverted_index.concept | 132, 144 |
| abstract_inverted_index.control | 159 |
| abstract_inverted_index.develop | 12 |
| abstract_inverted_index.extract | 102 |
| abstract_inverted_index.feature | 171 |
| abstract_inverted_index.forward | 109 |
| abstract_inverted_index.general | 67 |
| abstract_inverted_index.images) | 61 |
| abstract_inverted_index.models. | 71 |
| abstract_inverted_index.remains | 43 |
| abstract_inverted_index.results | 176 |
| abstract_inverted_index.suggest | 177 |
| abstract_inverted_index.tensors | 129 |
| abstract_inverted_index.trained | 54 |
| abstract_inverted_index.vectors | 104 |
| abstract_inverted_index.whether | 47 |
| abstract_inverted_index.However, | 41 |
| abstract_inverted_index.Inspired | 86 |
| abstract_inverted_index.abstract | 25 |
| abstract_inverted_index.advances | 1 |
| abstract_inverted_index.concepts | 26 |
| abstract_inverted_index.concrete | 19 |
| abstract_inverted_index.datasets | 113 |
| abstract_inverted_index.directly | 35 |
| abstract_inverted_index.encoding | 137 |
| abstract_inverted_index.entities | 20 |
| abstract_inverted_index.features | 32 |
| abstract_inverted_index.findings | 202 |
| abstract_inverted_index.inducing | 165 |
| abstract_inverted_index.internal | 13, 78 |
| abstract_inverted_index.language | 9 |
| abstract_inverted_index.patterns | 197 |
| abstract_inverted_index.physical | 116, 139, 161, 170, 186 |
| abstract_inverted_index.property | 68 |
| abstract_inverted_index.question | 46 |
| abstract_inverted_index.regimes. | 117, 126 |
| abstract_inverted_index.removing | 167 |
| abstract_inverted_index.revealed | 6 |
| abstract_inverted_index.specific | 138 |
| abstract_inverted_index.Moreover, | 29 |
| abstract_inverted_index.different | 115 |
| abstract_inverted_index.distinct, | 23 |
| abstract_inverted_index.features. | 140 |
| abstract_inverted_index.injecting | 142 |
| abstract_inverted_index.language, | 60 |
| abstract_inverted_index.AI-enabled | 217 |
| abstract_inverted_index.activation | 94, 103, 135 |
| abstract_inverted_index.behaviour. | 28, 40 |
| abstract_inverted_index.behaviours | 98 |
| abstract_inverted_index.directions | 92, 133, 145 |
| abstract_inverted_index.discovery. | 219 |
| abstract_inverted_index.foundation | 70, 84, 180, 211 |
| abstract_inverted_index.inference, | 151 |
| abstract_inverted_index.inherently | 56 |
| abstract_inverted_index.particular | 169 |
| abstract_inverted_index.phenomenon | 49 |
| abstract_inverted_index.scientific | 179, 210, 218 |
| abstract_inverted_index.simulation | 112 |
| abstract_inverted_index.structured | 57 |
| abstract_inverted_index.behaviours, | 162 |
| abstract_inverted_index.controlling | 209 |
| abstract_inverted_index.generalised | 183 |
| abstract_inverted_index.identifying | 90 |
| abstract_inverted_index.investigate | 76 |
| abstract_inverted_index.manipulated | 36 |
| abstract_inverted_index.mechanistic | 3 |
| abstract_inverted_index.principles. | 187 |
| abstract_inverted_index.simulation. | 174 |
| abstract_inverted_index.superficial | 194 |
| abstract_inverted_index.correlations | 195 |
| abstract_inverted_index.implications | 215 |
| abstract_inverted_index.predictions, | 156 |
| abstract_inverted_index.simulations. | 200 |
| abstract_inverted_index.corresponding | 15 |
| abstract_inverted_index.demonstrating | 157 |
| abstract_inverted_index.understanding | 207 |
| abstract_inverted_index.physics-focused | 83 |
| abstract_inverted_index.representations | 14, 79, 122, 184 |
| abstract_inverted_index.interpretability | 4 |
| abstract_inverted_index.human-understandable | 24 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile |
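
The `abstract_inverted_index.*` rows above store the abstract as a map from each word to the positions where it occurs. As a minimal sketch, the text can be rebuilt by placing each word at its positions and joining the words in order. The function name `rebuild_abstract` is hypothetical, and the sample holds only the entries for positions 0 to 6 from the table (the further positions listed for "in" are omitted for brevity).

```python
def rebuild_abstract(inverted_index):
    """Rebuild text from a {word: [positions]} inverted index."""
    by_position = {}
    for word, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = word
    # Join the words in ascending position order.
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Subset of the payload rows above (positions 0 to 6 only).
sample = {
    "Recent": [0],
    "advances": [1],
    "in": [2],
    "mechanistic": [3],
    "interpretability": [4],
    "have": [5],
    "revealed": [6],
}

text = rebuild_abstract(sample)
print(text)  # Recent advances in mechanistic interpretability have revealed
```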