Listen, Adjust, Act: Adding Communication to Pre-Trained Agents via Goal Adjustments Article Swipe
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.3233/faia250586
Effective coordination among intelligent agents is challenging, particularly in complex environments–often tackled with Multi-agent Deep Reinforcement Learning (MADRL). Communication is key to facilitate coordination, yet manually designing communication mechanisms is impractical. Instead, Comm-MADRL allows agents to learn meaningful communication without predefined semantics. However, conventional Comm-MADRL methods require jointly optimising communication and behaviour, which complicates training, often reducing its applicability to complex environments. Alternatively, we consider extending already trained agents without communication capabilities. In this paper we introduce a method that does so by extending pre-trained Goal-conditioned Reinforcement Learning (GCRL) agents, treating communication as modifications of the latent goal embeddings. The agent first trains in communication-less tasks, and then transfers its knowledge of the environment to tasks with communication. We evaluate the technique in a complex environment: MA-Minecraft, on two tasks that involve communication, showing significant performance improvements when coordination is required and inconclusive when it is helpful but optional. Our results suggest communication implemented as goal modifications in MADRL can bridge current methods towards richer, real-world-like MA scenarios.
Related Topics
- Type
- book-chapter
- Language
- en
- Landing Page
- https://doi.org/10.3233/faia250586
- OA Status
- hybrid
- OpenAlex ID
- https://openalex.org/W4414434676
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4414434676Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.3233/faia250586Digital Object Identifier
- Title
-
Listen, Adjust, Act: Adding Communication to Pre-Trained Agents via Goal AdjustmentsWork title
- Type
-
book-chapterOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2025Year of publication
- Publication date
-
2025-09-22Full publication date if available
- Authors
-
Oriol Miro-Lopez-Feliu, Adrián Tormos, Víctor Giménez-ÁbalosList of authors in order
- Landing page
-
https://doi.org/10.3233/faia250586Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
hybridOpen access status per OpenAlex
- OA URL
-
https://doi.org/10.3233/faia250586Direct OA link when available
- Cited by
-
0Total citation count in OpenAlex
Full payload
| id | https://openalex.org/W4414434676 |
|---|---|
| doi | https://doi.org/10.3233/faia250586 |
| ids.doi | https://doi.org/10.3233/faia250586 |
| ids.openalex | https://openalex.org/W4414434676 |
| fwci | 0.0 |
| type | book-chapter |
| title | Listen, Adjust, Act: Adding Communication to Pre-Trained Agents via Goal Adjustments |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10456 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.989300012588501 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Multi-Agent Systems and Negotiation |
| topics[1].id | https://openalex.org/T12128 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.927299976348877 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | AI in Service Interactions |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | doi:10.3233/faia250586 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4210201731 |
| locations[0].source.issn | 0922-6389, 1879-8314 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | False |
| locations[0].source.issn_l | 0922-6389 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Frontiers in artificial intelligence and applications |
| locations[0].source.host_organization | |
| locations[0].source.host_organization_name | |
| locations[0].license | cc-by-nc |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | book-chapter |
| locations[0].license_id | https://openalex.org/licenses/cc-by-nc |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Frontiers in Artificial Intelligence and Applications |
| locations[0].landing_page_url | https://doi.org/10.3233/faia250586 |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5119704913 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Oriol Miro-Lopez-Feliu |
| authorships[0].countries | ES |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[0].affiliations[0].raw_affiliation_string | Barcelona Supercomputing Center |
| authorships[0].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[0].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[0].institutions[0].type | facility |
| authorships[0].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[0].institutions[0].country_code | ES |
| authorships[0].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[0].institutions[1].id | https://openalex.org/I9617848 |
| authorships[0].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[0].institutions[1].type | education |
| authorships[0].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[0].institutions[1].country_code | ES |
| authorships[0].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Oriol Miro-Lopez-Feliu |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Barcelona Supercomputing Center |
| authorships[1].author.id | https://openalex.org/A5001312222 |
| authorships[1].author.orcid | https://orcid.org/0000-0003-1658-9393 |
| authorships[1].author.display_name | Adrián Tormos |
| authorships[1].countries | ES |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[1].affiliations[0].raw_affiliation_string | Barcelona Supercomputing Center |
| authorships[1].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[1].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[1].institutions[0].type | facility |
| authorships[1].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[1].institutions[0].country_code | ES |
| authorships[1].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[1].institutions[1].id | https://openalex.org/I9617848 |
| authorships[1].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[1].institutions[1].type | education |
| authorships[1].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[1].institutions[1].country_code | ES |
| authorships[1].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Adrian Tormos |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Barcelona Supercomputing Center |
| authorships[2].author.id | https://openalex.org/A5081818290 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-4514-6145 |
| authorships[2].author.display_name | Víctor Giménez-Ábalos |
| authorships[2].countries | ES |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[2].affiliations[0].raw_affiliation_string | Barcelona Supercomputing Center |
| authorships[2].institutions[0].id | https://openalex.org/I2799803557 |
| authorships[2].institutions[0].ror | https://ror.org/05sd8tv96 |
| authorships[2].institutions[0].type | facility |
| authorships[2].institutions[0].lineage | https://openalex.org/I2799803557, https://openalex.org/I9617848 |
| authorships[2].institutions[0].country_code | ES |
| authorships[2].institutions[0].display_name | Barcelona Supercomputing Center |
| authorships[2].institutions[1].id | https://openalex.org/I9617848 |
| authorships[2].institutions[1].ror | https://ror.org/03mb6wj31 |
| authorships[2].institutions[1].type | education |
| authorships[2].institutions[1].lineage | https://openalex.org/I9617848 |
| authorships[2].institutions[1].country_code | ES |
| authorships[2].institutions[1].display_name | Universitat Politècnica de Catalunya |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Victor Gimenez-Abalos |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Barcelona Supercomputing Center |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.3233/faia250586 |
| open_access.oa_status | hybrid |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Listen, Adjust, Act: Adding Communication to Pre-Trained Agents via Goal Adjustments |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10456 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.989300012588501 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Multi-Agent Systems and Negotiation |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | doi:10.3233/faia250586 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4210201731 |
| best_oa_location.source.issn | 0922-6389, 1879-8314 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | 0922-6389 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | Frontiers in artificial intelligence and applications |
| best_oa_location.source.host_organization | |
| best_oa_location.source.host_organization_name | |
| best_oa_location.license | cc-by-nc |
| best_oa_location.pdf_url | |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | book-chapter |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by-nc |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Frontiers in Artificial Intelligence and Applications |
| best_oa_location.landing_page_url | https://doi.org/10.3233/faia250586 |
| primary_location.id | doi:10.3233/faia250586 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4210201731 |
| primary_location.source.issn | 0922-6389, 1879-8314 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | False |
| primary_location.source.issn_l | 0922-6389 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Frontiers in artificial intelligence and applications |
| primary_location.source.host_organization | |
| primary_location.source.host_organization_name | |
| primary_location.license | cc-by-nc |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | book-chapter |
| primary_location.license_id | https://openalex.org/licenses/cc-by-nc |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Frontiers in Artificial Intelligence and Applications |
| primary_location.landing_page_url | https://doi.org/10.3233/faia250586 |
| publication_date | 2025-09-22 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 77, 123 |
| abstract_inverted_index.In | 72 |
| abstract_inverted_index.MA | 166 |
| abstract_inverted_index.We | 118 |
| abstract_inverted_index.as | 92, 154 |
| abstract_inverted_index.by | 82 |
| abstract_inverted_index.in | 8, 103, 122, 157 |
| abstract_inverted_index.is | 5, 19, 29, 139, 145 |
| abstract_inverted_index.it | 144 |
| abstract_inverted_index.of | 94, 111 |
| abstract_inverted_index.on | 127 |
| abstract_inverted_index.so | 81 |
| abstract_inverted_index.to | 21, 35, 59, 114 |
| abstract_inverted_index.we | 63, 75 |
| abstract_inverted_index.Our | 149 |
| abstract_inverted_index.The | 99 |
| abstract_inverted_index.and | 50, 106, 141 |
| abstract_inverted_index.but | 147 |
| abstract_inverted_index.can | 159 |
| abstract_inverted_index.its | 57, 109 |
| abstract_inverted_index.key | 20 |
| abstract_inverted_index.the | 95, 112, 120 |
| abstract_inverted_index.two | 128 |
| abstract_inverted_index.yet | 24 |
| abstract_inverted_index.Deep | 14 |
| abstract_inverted_index.does | 80 |
| abstract_inverted_index.goal | 97, 155 |
| abstract_inverted_index.that | 79, 130 |
| abstract_inverted_index.then | 107 |
| abstract_inverted_index.this | 73 |
| abstract_inverted_index.when | 137, 143 |
| abstract_inverted_index.with | 12, 116 |
| abstract_inverted_index.MADRL | 158 |
| abstract_inverted_index.agent | 100 |
| abstract_inverted_index.among | 2 |
| abstract_inverted_index.first | 101 |
| abstract_inverted_index.learn | 36 |
| abstract_inverted_index.often | 55 |
| abstract_inverted_index.paper | 74 |
| abstract_inverted_index.tasks | 115, 129 |
| abstract_inverted_index.which | 52 |
| abstract_inverted_index.(GCRL) | 88 |
| abstract_inverted_index.agents | 4, 34, 68 |
| abstract_inverted_index.allows | 33 |
| abstract_inverted_index.bridge | 160 |
| abstract_inverted_index.latent | 96 |
| abstract_inverted_index.method | 78 |
| abstract_inverted_index.tasks, | 105 |
| abstract_inverted_index.trains | 102 |
| abstract_inverted_index.agents, | 89 |
| abstract_inverted_index.already | 66 |
| abstract_inverted_index.complex | 9, 60, 124 |
| abstract_inverted_index.current | 161 |
| abstract_inverted_index.helpful | 146 |
| abstract_inverted_index.involve | 131 |
| abstract_inverted_index.jointly | 47 |
| abstract_inverted_index.methods | 45, 162 |
| abstract_inverted_index.require | 46 |
| abstract_inverted_index.results | 150 |
| abstract_inverted_index.richer, | 164 |
| abstract_inverted_index.showing | 133 |
| abstract_inverted_index.suggest | 151 |
| abstract_inverted_index.tackled | 11 |
| abstract_inverted_index.towards | 163 |
| abstract_inverted_index.trained | 67 |
| abstract_inverted_index.without | 39, 69 |
| abstract_inverted_index.(MADRL). | 17 |
| abstract_inverted_index.However, | 42 |
| abstract_inverted_index.Instead, | 31 |
| abstract_inverted_index.Learning | 16, 87 |
| abstract_inverted_index.consider | 64 |
| abstract_inverted_index.evaluate | 119 |
| abstract_inverted_index.manually | 25 |
| abstract_inverted_index.reducing | 56 |
| abstract_inverted_index.required | 140 |
| abstract_inverted_index.treating | 90 |
| abstract_inverted_index.Effective | 0 |
| abstract_inverted_index.designing | 26 |
| abstract_inverted_index.extending | 65, 83 |
| abstract_inverted_index.introduce | 76 |
| abstract_inverted_index.knowledge | 110 |
| abstract_inverted_index.optional. | 148 |
| abstract_inverted_index.technique | 121 |
| abstract_inverted_index.training, | 54 |
| abstract_inverted_index.transfers | 108 |
| abstract_inverted_index.Comm-MADRL | 32, 44 |
| abstract_inverted_index.behaviour, | 51 |
| abstract_inverted_index.facilitate | 22 |
| abstract_inverted_index.meaningful | 37 |
| abstract_inverted_index.mechanisms | 28 |
| abstract_inverted_index.optimising | 48 |
| abstract_inverted_index.predefined | 40 |
| abstract_inverted_index.scenarios. | 167 |
| abstract_inverted_index.semantics. | 41 |
| abstract_inverted_index.Multi-agent | 13 |
| abstract_inverted_index.complicates | 53 |
| abstract_inverted_index.embeddings. | 98 |
| abstract_inverted_index.environment | 113 |
| abstract_inverted_index.implemented | 153 |
| abstract_inverted_index.intelligent | 3 |
| abstract_inverted_index.performance | 135 |
| abstract_inverted_index.pre-trained | 84 |
| abstract_inverted_index.significant | 134 |
| abstract_inverted_index.challenging, | 6 |
| abstract_inverted_index.conventional | 43 |
| abstract_inverted_index.coordination | 1, 138 |
| abstract_inverted_index.environment: | 125 |
| abstract_inverted_index.impractical. | 30 |
| abstract_inverted_index.improvements | 136 |
| abstract_inverted_index.inconclusive | 142 |
| abstract_inverted_index.particularly | 7 |
| abstract_inverted_index.Communication | 18 |
| abstract_inverted_index.MA-Minecraft, | 126 |
| abstract_inverted_index.Reinforcement | 15, 86 |
| abstract_inverted_index.applicability | 58 |
| abstract_inverted_index.capabilities. | 71 |
| abstract_inverted_index.communication | 27, 38, 49, 70, 91, 152 |
| abstract_inverted_index.coordination, | 23 |
| abstract_inverted_index.environments. | 61 |
| abstract_inverted_index.modifications | 93, 156 |
| abstract_inverted_index.Alternatively, | 62 |
| abstract_inverted_index.communication, | 132 |
| abstract_inverted_index.communication. | 117 |
| abstract_inverted_index.real-world-like | 165 |
| abstract_inverted_index.Goal-conditioned | 85 |
| abstract_inverted_index.communication-less | 104 |
| abstract_inverted_index.environments–often | 10 |
| cited_by_percentile_year | |
| countries_distinct_count | 1 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile.value | 0.52417754 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |