Contrastive Instruction Tuning Article Swipe
YOU?
·
· 2024
· Open Access
·
· DOI: https://doi.org/10.48550/arxiv.2402.11138
Instruction tuning has been used as a promising approach to improve the performance of large language models (LLMs) on unseen tasks. However, current LLMs exhibit limited robustness to unseen instructions, generating inconsistent outputs when the same instruction is phrased with slightly varied forms or language styles. This behavior indicates LLMs' lack of robustness to textual variations and generalizability to unseen instructions, potentially leading to trustworthiness issues. Accordingly, we propose Contrastive Instruction Tuning, which maximizes the similarity between the hidden representations of semantically equivalent instruction-instance pairs while minimizing the similarity between semantically different ones. To facilitate this approach, we augment the existing FLAN collection by paraphrasing task instructions. Experiments on the PromptBench benchmark show that CoIN consistently improves LLMs' robustness to unseen instructions with variations across character, word, sentence, and semantic levels by an average of +2.5% in accuracy. Code is available at https://github.com/luka-group/CoIN.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- http://arxiv.org/abs/2402.11138
- https://arxiv.org/pdf/2402.11138
- OA Status
- green
- Cited By
- 1
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4392011849
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4392011849Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.48550/arxiv.2402.11138Digital Object Identifier
- Title
-
Contrastive Instruction TuningWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2024Year of publication
- Publication date
-
2024-02-17Full publication date if available
- Authors
-
Tianyi Yan, Wang Fei, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao ChenList of authors in order
- Landing page
-
https://arxiv.org/abs/2402.11138Publisher landing page
- PDF URL
-
https://arxiv.org/pdf/2402.11138Direct link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://arxiv.org/pdf/2402.11138Direct OA link when available
- Concepts
-
Contrastive analysis, Linguistics, Computer science, Natural language processing, PhilosophyTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
1Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 1Per-year citation counts (last 5 years)
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4392011849 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2402.11138 |
| ids.doi | https://doi.org/10.48550/arxiv.2402.11138 |
| ids.openalex | https://openalex.org/W4392011849 |
| fwci | 0.0 |
| type | preprint |
| title | Contrastive Instruction Tuning |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T13412 |
| topics[0].field.id | https://openalex.org/fields/33 |
| topics[0].field.display_name | Social Sciences |
| topics[0].score | 0.08370000123977661 |
| topics[0].domain.id | https://openalex.org/domains/2 |
| topics[0].domain.display_name | Social Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/3304 |
| topics[0].subfield.display_name | Education |
| topics[0].display_name | Education and Technology Integration |
| topics[1].id | https://openalex.org/T10636 |
| topics[1].field.id | https://openalex.org/fields/32 |
| topics[1].field.display_name | Psychology |
| topics[1].score | 0.04399999976158142 |
| topics[1].domain.id | https://openalex.org/domains/2 |
| topics[1].domain.display_name | Social Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/3204 |
| topics[1].subfield.display_name | Developmental and Educational Psychology |
| topics[1].display_name | Innovative Teaching and Learning Methods |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C2777629044 |
| concepts[0].level | 2 |
| concepts[0].score | 0.5449432730674744 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q614959 |
| concepts[0].display_name | Contrastive analysis |
| concepts[1].id | https://openalex.org/C41895202 |
| concepts[1].level | 1 |
| concepts[1].score | 0.5203306674957275 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q8162 |
| concepts[1].display_name | Linguistics |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.4698801338672638 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C204321447 |
| concepts[3].level | 1 |
| concepts[3].score | 0.3613486886024475 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[3].display_name | Natural language processing |
| concepts[4].id | https://openalex.org/C138885662 |
| concepts[4].level | 0 |
| concepts[4].score | 0.11364084482192993 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q5891 |
| concepts[4].display_name | Philosophy |
| keywords[0].id | https://openalex.org/keywords/contrastive-analysis |
| keywords[0].score | 0.5449432730674744 |
| keywords[0].display_name | Contrastive analysis |
| keywords[1].id | https://openalex.org/keywords/linguistics |
| keywords[1].score | 0.5203306674957275 |
| keywords[1].display_name | Linguistics |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.4698801338672638 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/natural-language-processing |
| keywords[3].score | 0.3613486886024475 |
| keywords[3].display_name | Natural language processing |
| keywords[4].id | https://openalex.org/keywords/philosophy |
| keywords[4].score | 0.11364084482192993 |
| keywords[4].display_name | Philosophy |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2402.11138 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2402.11138 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2402.11138 |
| locations[1].id | doi:10.48550/arxiv.2402.11138 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2402.11138 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5035772408 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-2674-4134 |
| authorships[0].author.display_name | Tianyi Yan |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Yan, Tianyi |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5100417196 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-7864-6122 |
| authorships[1].author.display_name | Wang Fei |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Wang, Fei |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5040331608 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | James Y. Huang |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Huang, James Y. |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5080937886 |
| authorships[3].author.orcid | https://orcid.org/0000-0003-1199-885X |
| authorships[3].author.display_name | Wenxuan Zhou |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Zhou, Wenxuan |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5109670936 |
| authorships[4].author.orcid | https://orcid.org/0009-0004-5057-0463 |
| authorships[4].author.display_name | Fan Yin |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Yin, Fan |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5101715504 |
| authorships[5].author.orcid | https://orcid.org/0000-0003-4215-0886 |
| authorships[5].author.display_name | Aram Galstyan |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Galstyan, Aram |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5038386528 |
| authorships[6].author.orcid | https://orcid.org/0000-0002-3154-1139 |
| authorships[6].author.display_name | Wenpeng Yin |
| authorships[6].author_position | middle |
| authorships[6].raw_author_name | Yin, Wenpeng |
| authorships[6].is_corresponding | False |
| authorships[7].author.id | https://openalex.org/A5102861481 |
| authorships[7].author.orcid | https://orcid.org/0000-0003-0118-3147 |
| authorships[7].author.display_name | Muhao Chen |
| authorships[7].author_position | last |
| authorships[7].raw_author_name | Chen, Muhao |
| authorships[7].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2402.11138 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Contrastive Instruction Tuning |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T13412 |
| primary_topic.field.id | https://openalex.org/fields/33 |
| primary_topic.field.display_name | Social Sciences |
| primary_topic.score | 0.08370000123977661 |
| primary_topic.domain.id | https://openalex.org/domains/2 |
| primary_topic.domain.display_name | Social Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/3304 |
| primary_topic.subfield.display_name | Education |
| primary_topic.display_name | Education and Technology Integration |
| related_works | https://openalex.org/W2945865340, https://openalex.org/W4294495400, https://openalex.org/W4367176324, https://openalex.org/W2972340244, https://openalex.org/W1834428557, https://openalex.org/W3200487689, https://openalex.org/W2394697227, https://openalex.org/W1979243875, https://openalex.org/W3192129151, https://openalex.org/W2922545124 |
| cited_by_count | 1 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2402.11138 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2402.11138 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2402.11138 |
| primary_location.id | pmh:oai:arXiv.org:2402.11138 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2402.11138 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2402.11138 |
| publication_date | 2024-02-17 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 6 |
| abstract_inverted_index.To | 93 |
| abstract_inverted_index.an | 132 |
| abstract_inverted_index.as | 5 |
| abstract_inverted_index.at | 141 |
| abstract_inverted_index.by | 103, 131 |
| abstract_inverted_index.in | 136 |
| abstract_inverted_index.is | 37, 139 |
| abstract_inverted_index.of | 13, 51, 80, 134 |
| abstract_inverted_index.on | 18, 108 |
| abstract_inverted_index.or | 43 |
| abstract_inverted_index.to | 9, 27, 53, 58, 63, 119 |
| abstract_inverted_index.we | 67, 97 |
| abstract_inverted_index.and | 56, 128 |
| abstract_inverted_index.has | 2 |
| abstract_inverted_index.the | 11, 34, 74, 77, 87, 99, 109 |
| abstract_inverted_index.CoIN | 114 |
| abstract_inverted_index.Code | 138 |
| abstract_inverted_index.FLAN | 101 |
| abstract_inverted_index.LLMs | 23 |
| abstract_inverted_index.This | 46 |
| abstract_inverted_index.been | 3 |
| abstract_inverted_index.lack | 50 |
| abstract_inverted_index.same | 35 |
| abstract_inverted_index.show | 112 |
| abstract_inverted_index.task | 105 |
| abstract_inverted_index.that | 113 |
| abstract_inverted_index.this | 95 |
| abstract_inverted_index.used | 4 |
| abstract_inverted_index.when | 33 |
| abstract_inverted_index.with | 39, 122 |
| abstract_inverted_index.+2.5% | 135 |
| abstract_inverted_index.LLMs' | 49, 117 |
| abstract_inverted_index.forms | 42 |
| abstract_inverted_index.large | 14 |
| abstract_inverted_index.ones. | 92 |
| abstract_inverted_index.pairs | 84 |
| abstract_inverted_index.which | 72 |
| abstract_inverted_index.while | 85 |
| abstract_inverted_index.word, | 126 |
| abstract_inverted_index.(LLMs) | 17 |
| abstract_inverted_index.across | 124 |
| abstract_inverted_index.hidden | 78 |
| abstract_inverted_index.levels | 130 |
| abstract_inverted_index.models | 16 |
| abstract_inverted_index.tasks. | 20 |
| abstract_inverted_index.tuning | 1 |
| abstract_inverted_index.unseen | 19, 28, 59, 120 |
| abstract_inverted_index.varied | 41 |
| abstract_inverted_index.Tuning, | 71 |
| abstract_inverted_index.augment | 98 |
| abstract_inverted_index.average | 133 |
| abstract_inverted_index.between | 76, 89 |
| abstract_inverted_index.current | 22 |
| abstract_inverted_index.exhibit | 24 |
| abstract_inverted_index.improve | 10 |
| abstract_inverted_index.issues. | 65 |
| abstract_inverted_index.leading | 62 |
| abstract_inverted_index.limited | 25 |
| abstract_inverted_index.outputs | 32 |
| abstract_inverted_index.phrased | 38 |
| abstract_inverted_index.propose | 68 |
| abstract_inverted_index.styles. | 45 |
| abstract_inverted_index.textual | 54 |
| abstract_inverted_index.However, | 21 |
| abstract_inverted_index.approach | 8 |
| abstract_inverted_index.behavior | 47 |
| abstract_inverted_index.existing | 100 |
| abstract_inverted_index.improves | 116 |
| abstract_inverted_index.language | 15, 44 |
| abstract_inverted_index.semantic | 129 |
| abstract_inverted_index.slightly | 40 |
| abstract_inverted_index.accuracy. | 137 |
| abstract_inverted_index.approach, | 96 |
| abstract_inverted_index.available | 140 |
| abstract_inverted_index.benchmark | 111 |
| abstract_inverted_index.different | 91 |
| abstract_inverted_index.indicates | 48 |
| abstract_inverted_index.maximizes | 73 |
| abstract_inverted_index.promising | 7 |
| abstract_inverted_index.sentence, | 127 |
| abstract_inverted_index.character, | 125 |
| abstract_inverted_index.collection | 102 |
| abstract_inverted_index.equivalent | 82 |
| abstract_inverted_index.facilitate | 94 |
| abstract_inverted_index.generating | 30 |
| abstract_inverted_index.minimizing | 86 |
| abstract_inverted_index.robustness | 26, 52, 118 |
| abstract_inverted_index.similarity | 75, 88 |
| abstract_inverted_index.variations | 55, 123 |
| abstract_inverted_index.Contrastive | 69 |
| abstract_inverted_index.Experiments | 107 |
| abstract_inverted_index.Instruction | 0, 70 |
| abstract_inverted_index.PromptBench | 110 |
| abstract_inverted_index.instruction | 36 |
| abstract_inverted_index.performance | 12 |
| abstract_inverted_index.potentially | 61 |
| abstract_inverted_index.Accordingly, | 66 |
| abstract_inverted_index.consistently | 115 |
| abstract_inverted_index.inconsistent | 31 |
| abstract_inverted_index.instructions | 121 |
| abstract_inverted_index.paraphrasing | 104 |
| abstract_inverted_index.semantically | 81, 90 |
| abstract_inverted_index.instructions, | 29, 60 |
| abstract_inverted_index.instructions. | 106 |
| abstract_inverted_index.representations | 79 |
| abstract_inverted_index.trustworthiness | 64 |
| abstract_inverted_index.generalizability | 57 |
| abstract_inverted_index.instruction-instance | 83 |
| abstract_inverted_index.https://github.com/luka-group/CoIN. | 142 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 8 |
| citation_normalized_percentile |