Adversarial Preference Learning for Robust LLM Alignment Article Swipe
Yuanfu Wang
,
Pengyu Wang
,
Chenyang Xi
,
Bo Tang
,
Junyi Zhu
,
Wenqiang Wei
,
Chen Chen
,
Chao Yang
,
Jingfeng Zhang
,
Chaochao Lu
,
Yijun Niu
,
Keming Mao
,
Zhiyu Li
,
Feiyu Xiong
,
Jie Hu
,
Mingchuan Yang
·
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.18653/v1/2025.findings-acl.1126
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.18653/v1/2025.findings-acl.1126
Related Topics
Concepts
Metadata
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.18653/v1/2025.findings-acl.1126
- https://aclanthology.org/2025.findings-acl.1126.pdf
- OA Status
- gold
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4412887825
All OpenAlex metadata
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4412887825Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.18653/v1/2025.findings-acl.1126Digital Object Identifier
- Title
-
Adversarial Preference Learning for Robust LLM AlignmentWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2025Year of publication
- Publication date
-
2025-01-01Full publication date if available
- Authors
-
Yuanfu Wang, Pengyu Wang, Chenyang Xi, Bo Tang, Junyi Zhu, Wenqiang Wei, Chen Chen, Chao Yang, Jingfeng Zhang, Chaochao Lu, Yijun Niu, Keming Mao, Zhiyu Li, Feiyu Xiong, Jie Hu, Mingchuan YangList of authors in order
- Landing page
-
https://doi.org/10.18653/v1/2025.findings-acl.1126Publisher landing page
- PDF URL
-
https://aclanthology.org/2025.findings-acl.1126.pdfDirect link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
goldOpen access status per OpenAlex
- OA URL
-
https://aclanthology.org/2025.findings-acl.1126.pdfDirect OA link when available
- Concepts
-
Adversarial system, Computer science, Preference, Artificial intelligence, Machine learning, Mathematics, StatisticsTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
0Total citation count in OpenAlex
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4412887825 |
|---|---|
| doi | https://doi.org/10.18653/v1/2025.findings-acl.1126 |
| ids.doi | https://doi.org/10.18653/v1/2025.findings-acl.1126 |
| ids.openalex | https://openalex.org/W4412887825 |
| fwci | 0.0 |
| type | article |
| title | Adversarial Preference Learning for Robust LLM Alignment |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | 21881 |
| biblio.first_page | 21865 |
| topics[0].id | https://openalex.org/T10601 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9527000188827515 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1707 |
| topics[0].subfield.display_name | Computer Vision and Pattern Recognition |
| topics[0].display_name | Handwritten Text Recognition Techniques |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C37736160 |
| concepts[0].level | 2 |
| concepts[0].score | 0.8147666454315186 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q1801315 |
| concepts[0].display_name | Adversarial system |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.666168749332428 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C2781249084 |
| concepts[2].level | 2 |
| concepts[2].score | 0.6152558326721191 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q908656 |
| concepts[2].display_name | Preference |
| concepts[3].id | https://openalex.org/C154945302 |
| concepts[3].level | 1 |
| concepts[3].score | 0.5461805462837219 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[3].display_name | Artificial intelligence |
| concepts[4].id | https://openalex.org/C119857082 |
| concepts[4].level | 1 |
| concepts[4].score | 0.44465896487236023 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q2539 |
| concepts[4].display_name | Machine learning |
| concepts[5].id | https://openalex.org/C33923547 |
| concepts[5].level | 0 |
| concepts[5].score | 0.12717613577842712 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[5].display_name | Mathematics |
| concepts[6].id | https://openalex.org/C105795698 |
| concepts[6].level | 1 |
| concepts[6].score | 0.09704729914665222 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q12483 |
| concepts[6].display_name | Statistics |
| keywords[0].id | https://openalex.org/keywords/adversarial-system |
| keywords[0].score | 0.8147666454315186 |
| keywords[0].display_name | Adversarial system |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.666168749332428 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/preference |
| keywords[2].score | 0.6152558326721191 |
| keywords[2].display_name | Preference |
| keywords[3].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[3].score | 0.5461805462837219 |
| keywords[3].display_name | Artificial intelligence |
| keywords[4].id | https://openalex.org/keywords/machine-learning |
| keywords[4].score | 0.44465896487236023 |
| keywords[4].display_name | Machine learning |
| keywords[5].id | https://openalex.org/keywords/mathematics |
| keywords[5].score | 0.12717613577842712 |
| keywords[5].display_name | Mathematics |
| keywords[6].id | https://openalex.org/keywords/statistics |
| keywords[6].score | 0.09704729914665222 |
| keywords[6].display_name | Statistics |
| language | en |
| locations[0].id | doi:10.18653/v1/2025.findings-acl.1126 |
| locations[0].is_oa | True |
| locations[0].source | |
| locations[0].license | cc-by |
| locations[0].pdf_url | https://aclanthology.org/2025.findings-acl.1126.pdf |
| locations[0].version | publishedVersion |
| locations[0].raw_type | proceedings-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Findings of the Association for Computational Linguistics: ACL 2025 |
| locations[0].landing_page_url | https://doi.org/10.18653/v1/2025.findings-acl.1126 |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5111099262 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Yuanfu Wang |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Yuanfu Wang |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5089195991 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-5768-0658 |
| authorships[1].author.display_name | Pengyu Wang |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Pengyu Wang |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5058308218 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-8211-2430 |
| authorships[2].author.display_name | Chenyang Xi |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Chenyang Xi |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5057816960 |
| authorships[3].author.orcid | https://orcid.org/0000-0001-7129-0250 |
| authorships[3].author.display_name | Bo Tang |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Bo Tang |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5054411848 |
| authorships[4].author.orcid | https://orcid.org/0009-0000-3584-9299 |
| authorships[4].author.display_name | Junyi Zhu |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Junyi Zhu |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5052103501 |
| authorships[5].author.orcid | https://orcid.org/0000-0003-2078-9056 |
| authorships[5].author.display_name | Wenqiang Wei |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Wenqiang Wei |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5100418433 |
| authorships[6].author.orcid | https://orcid.org/0000-0002-3525-9755 |
| authorships[6].author.display_name | Chen Chen |
| authorships[6].author_position | middle |
| authorships[6].raw_author_name | Chen Chen |
| authorships[6].is_corresponding | False |
| authorships[7].author.id | https://openalex.org/A5041105948 |
| authorships[7].author.orcid | https://orcid.org/0000-0003-0917-5256 |
| authorships[7].author.display_name | Chao Yang |
| authorships[7].author_position | middle |
| authorships[7].raw_author_name | Chao Yang |
| authorships[7].is_corresponding | False |
| authorships[8].author.id | https://openalex.org/A5100749003 |
| authorships[8].author.orcid | https://orcid.org/0000-0002-8206-6235 |
| authorships[8].author.display_name | Jingfeng Zhang |
| authorships[8].author_position | middle |
| authorships[8].raw_author_name | Jingfeng Zhang |
| authorships[8].is_corresponding | False |
| authorships[9].author.id | https://openalex.org/A5034013066 |
| authorships[9].author.orcid | |
| authorships[9].author.display_name | Chaochao Lu |
| authorships[9].author_position | middle |
| authorships[9].raw_author_name | Chaochao Lu |
| authorships[9].is_corresponding | False |
| authorships[10].author.id | https://openalex.org/A5070554756 |
| authorships[10].author.orcid | https://orcid.org/0009-0009-6458-6024 |
| authorships[10].author.display_name | Yijun Niu |
| authorships[10].author_position | middle |
| authorships[10].raw_author_name | Yijun Niu |
| authorships[10].is_corresponding | False |
| authorships[11].author.id | https://openalex.org/A5077441424 |
| authorships[11].author.orcid | https://orcid.org/0000-0003-1243-0123 |
| authorships[11].author.display_name | Keming Mao |
| authorships[11].author_position | middle |
| authorships[11].raw_author_name | Keming Mao |
| authorships[11].is_corresponding | False |
| authorships[12].author.id | https://openalex.org/A5113013369 |
| authorships[12].author.orcid | https://orcid.org/0009-0002-8787-8192 |
| authorships[12].author.display_name | Zhiyu Li |
| authorships[12].author_position | middle |
| authorships[12].raw_author_name | Zhiyu Li |
| authorships[12].is_corresponding | False |
| authorships[13].author.id | https://openalex.org/A5090294171 |
| authorships[13].author.orcid | https://orcid.org/0000-0002-1456-2202 |
| authorships[13].author.display_name | Feiyu Xiong |
| authorships[13].author_position | middle |
| authorships[13].raw_author_name | Feiyu Xiong |
| authorships[13].is_corresponding | False |
| authorships[14].author.id | https://openalex.org/A5035602823 |
| authorships[14].author.orcid | https://orcid.org/0000-0002-3893-1594 |
| authorships[14].author.display_name | Jie Hu |
| authorships[14].author_position | middle |
| authorships[14].raw_author_name | Jie Hu |
| authorships[14].is_corresponding | False |
| authorships[15].author.id | https://openalex.org/A5037042612 |
| authorships[15].author.orcid | https://orcid.org/0000-0003-1511-4265 |
| authorships[15].author.display_name | Mingchuan Yang |
| authorships[15].author_position | last |
| authorships[15].raw_author_name | Mingchuan Yang |
| authorships[15].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://aclanthology.org/2025.findings-acl.1126.pdf |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Adversarial Preference Learning for Robust LLM Alignment |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10601 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9527000188827515 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1707 |
| primary_topic.subfield.display_name | Computer Vision and Pattern Recognition |
| primary_topic.display_name | Handwritten Text Recognition Techniques |
| related_works | https://openalex.org/W2961085424, https://openalex.org/W4306674287, https://openalex.org/W4387369504, https://openalex.org/W4394896187, https://openalex.org/W3170094116, https://openalex.org/W4386462264, https://openalex.org/W3107602296, https://openalex.org/W4364306694, https://openalex.org/W4312192474, https://openalex.org/W4283697347 |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | doi:10.18653/v1/2025.findings-acl.1126 |
| best_oa_location.is_oa | True |
| best_oa_location.source | |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | https://aclanthology.org/2025.findings-acl.1126.pdf |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | proceedings-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Findings of the Association for Computational Linguistics: ACL 2025 |
| best_oa_location.landing_page_url | https://doi.org/10.18653/v1/2025.findings-acl.1126 |
| primary_location.id | doi:10.18653/v1/2025.findings-acl.1126 |
| primary_location.is_oa | True |
| primary_location.source | |
| primary_location.license | cc-by |
| primary_location.pdf_url | https://aclanthology.org/2025.findings-acl.1126.pdf |
| primary_location.version | publishedVersion |
| primary_location.raw_type | proceedings-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Findings of the Association for Computational Linguistics: ACL 2025 |
| primary_location.landing_page_url | https://doi.org/10.18653/v1/2025.findings-acl.1126 |
| publication_date | 2025-01-01 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index | |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 16 |
| citation_normalized_percentile.value | 0.32230376 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |