DWUG EN: Diachronic Word Usage Graphs for English
2022 · Open Access · DOI: https://doi.org/10.5281/zenodo.7387261
This data collection contains diachronic Word Usage Graphs (WUGs) for English. A description of the data format, code to process the data, and further datasets are available on the WUGsite (https://www.ims.uni-stuttgart.de/data/wugs). See previous versions for additional test sets. More information on the provided data is given in the paper referenced below.
Version: 2.0.1, 30.11.2022. This version assigns noise uses the cluster label '-1' instead of removing them. Important: Version 2.0.0 extends previous versions with one more annotation round and new clusterings.
Reference: Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen, Haim Dubossarsky, Barbara McGillivray. 2021. DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. https://aclanthology.org/2021.emnlp-main.567/
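Because noise uses are kept under the cluster label '-1' rather than removed, downstream code usually has to separate them before counting sense clusters. The following is a minimal sketch of that step, not the dataset's own tooling. The usage identifiers, the `clusters` mapping, and the `split_noise` helper are hypothetical, and the labels are assumed to have already been parsed into integers (cast them first if they are read as strings such as '-1').

```python
from collections import Counter

# Label the description says is assigned to noise uses.
NOISE_LABEL = -1

# Hypothetical usage identifiers mapped to cluster labels (illustration only,
# not taken from the dataset files).
clusters = {
    "use_01": 0,
    "use_02": 0,
    "use_03": 1,
    "use_04": -1,
    "use_05": 1,
    "use_06": -1,
}


def split_noise(labels):
    """Return (clean, noise): labelled uses without noise, and sorted noise ids."""
    clean = {use: label for use, label in labels.items() if label != NOISE_LABEL}
    noise = sorted(use for use, label in labels.items() if label == NOISE_LABEL)
    return clean, noise


clean, noise = split_noise(clusters)
# Cluster sizes computed on the noise-free uses only.
sizes = Counter(clean.values())
print(noise)
print(dict(sizes))
```

Keeping the noise identifiers in a separate list, rather than discarding them, mirrors the choice the description attributes to version 2.0.1: the uses stay available for inspection while being excluded from cluster statistics.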
Related Topics
- Type
- dataset
- Language
- en
- Landing Page
- https://doi.org/10.5281/zenodo.7387261
- OA Status
- green
- Cited By
- 1
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4393649605
Raw OpenAlex JSON
- OpenAlex ID (canonical identifier for this work in OpenAlex)
- https://openalex.org/W4393649605
- DOI (Digital Object Identifier)
- https://doi.org/10.5281/zenodo.7387261
- Title (work title)
- DWUG EN: Diachronic Word Usage Graphs for English
- Type (OpenAlex work type)
- dataset
- Language (primary language)
- en
- Publication year
- 2022
- Publication date (full date, if available)
- 2022-11-30
- Authors (in order)
- Dominik Schlechtweg, Haim Dubossarsky, Simon Hengchen, Barbara McGillivray, Nina Tahmasebi
- Landing page (publisher landing page)
- https://doi.org/10.5281/zenodo.7387261
- Open access (whether a free full text is available)
- Yes
- OA status (per OpenAlex)
- green
- OA URL (direct OA link, when available)
- https://doi.org/10.5281/zenodo.7387261
- Concepts (top fields/topics attached by OpenAlex)
- Word (group theory), Linguistics, Computer science, Natural language processing, History, Philosophy
- Cited by (total citation count in OpenAlex)
- 1
- Citations by year (recent; per-year counts, last 5 years)
- 2025: 1
- Related works (count; other works algorithmically related by OpenAlex)
- 10
Full payload
| id | https://openalex.org/W4393649605 |
|---|---|
| doi | https://doi.org/10.5281/zenodo.7387261 |
| ids.openalex | https://openalex.org/W4393649605 |
| fwci | |
| type | dataset |
| title | DWUG EN: Diachronic Word Usage Graphs for English |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10181 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Natural Language Processing Techniques |
| topics[1].id | https://openalex.org/T10028 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9952999949455261 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Topic Modeling |
| topics[2].id | https://openalex.org/T13629 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9905999898910522 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1702 |
| topics[2].subfield.display_name | Artificial Intelligence |
| topics[2].display_name | Text Readability and Simplification |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C90805587 |
| concepts[0].level | 2 |
| concepts[0].score | 0.6946994662284851 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q10944557 |
| concepts[0].display_name | Word (group theory) |
| concepts[1].id | https://openalex.org/C41895202 |
| concepts[1].level | 1 |
| concepts[1].score | 0.5844476819038391 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q8162 |
| concepts[1].display_name | Linguistics |
| concepts[2].id | https://openalex.org/C41008148 |
| concepts[2].level | 0 |
| concepts[2].score | 0.4745294153690338 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[2].display_name | Computer science |
| concepts[3].id | https://openalex.org/C204321447 |
| concepts[3].level | 1 |
| concepts[3].score | 0.3480863869190216 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q30642 |
| concepts[3].display_name | Natural language processing |
| concepts[4].id | https://openalex.org/C95457728 |
| concepts[4].level | 0 |
| concepts[4].score | 0.3284671902656555 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q309 |
| concepts[4].display_name | History |
| concepts[5].id | https://openalex.org/C138885662 |
| concepts[5].level | 0 |
| concepts[5].score | 0.09811931848526001 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q5891 |
| concepts[5].display_name | Philosophy |
| keywords[0].id | https://openalex.org/keywords/word |
| keywords[0].score | 0.6946994662284851 |
| keywords[0].display_name | Word (group theory) |
| keywords[1].id | https://openalex.org/keywords/linguistics |
| keywords[1].score | 0.5844476819038391 |
| keywords[1].display_name | Linguistics |
| keywords[2].id | https://openalex.org/keywords/computer-science |
| keywords[2].score | 0.4745294153690338 |
| keywords[2].display_name | Computer science |
| keywords[3].id | https://openalex.org/keywords/natural-language-processing |
| keywords[3].score | 0.3480863869190216 |
| keywords[3].display_name | Natural language processing |
| keywords[4].id | https://openalex.org/keywords/history |
| keywords[4].score | 0.3284671902656555 |
| keywords[4].display_name | History |
| keywords[5].id | https://openalex.org/keywords/philosophy |
| keywords[5].score | 0.09811931848526001 |
| keywords[5].display_name | Philosophy |
| language | en |
| locations[0].id | pmh:oai:zenodo.org:7387261 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400562 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| locations[0].source.host_organization | https://openalex.org/I67311998 |
| locations[0].source.host_organization_name | European Organization for Nuclear Research |
| locations[0].source.host_organization_lineage | https://openalex.org/I67311998 |
| locations[0].license | |
| locations[0].pdf_url | |
| locations[0].version | submittedVersion |
| locations[0].raw_type | info:eu-repo/semantics/other |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | https://doi.org/10.5281/zenodo.7387261 |
| authorships[0].author.id | https://openalex.org/A5013366042 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-0685-2576 |
| authorships[0].author.display_name | Dominik Schlechtweg |
| authorships[0].countries | DE |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I100066346 |
| authorships[0].affiliations[0].raw_affiliation_string | University of Stuttgart |
| authorships[0].institutions[0].id | https://openalex.org/I100066346 |
| authorships[0].institutions[0].ror | https://ror.org/04vnq7t77 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I100066346 |
| authorships[0].institutions[0].country_code | DE |
| authorships[0].institutions[0].display_name | University of Stuttgart |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Dominik Schlechtweg |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | University of Stuttgart |
| authorships[1].author.id | https://openalex.org/A5013663289 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-2818-6113 |
| authorships[1].author.display_name | Haim Dubossarsky |
| authorships[1].countries | GB |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I241749 |
| authorships[1].affiliations[0].raw_affiliation_string | University of Cambridge |
| authorships[1].institutions[0].id | https://openalex.org/I241749 |
| authorships[1].institutions[0].ror | https://ror.org/013meh722 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I241749 |
| authorships[1].institutions[0].country_code | GB |
| authorships[1].institutions[0].display_name | University of Cambridge |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Haim Dubossarsky |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | University of Cambridge |
| authorships[2].author.id | https://openalex.org/A5006297499 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-8453-7221 |
| authorships[2].author.display_name | Simon Hengchen |
| authorships[2].countries | SE |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I881427289 |
| authorships[2].affiliations[0].raw_affiliation_string | University of Gothenburg |
| authorships[2].institutions[0].id | https://openalex.org/I881427289 |
| authorships[2].institutions[0].ror | https://ror.org/01tm6cn81 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I881427289 |
| authorships[2].institutions[0].country_code | SE |
| authorships[2].institutions[0].display_name | University of Gothenburg |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Simon Hengchen |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | University of Gothenburg |
| authorships[3].author.id | https://openalex.org/A5062737501 |
| authorships[3].author.orcid | https://orcid.org/0000-0003-3426-8200 |
| authorships[3].author.display_name | Barbara McGillivray |
| authorships[3].countries | GB |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I183935753, https://openalex.org/I4210128584 |
| authorships[3].affiliations[0].raw_affiliation_string | King's College London, The Alan Turing Institute |
| authorships[3].institutions[0].id | https://openalex.org/I183935753 |
| authorships[3].institutions[0].ror | https://ror.org/0220mzb33 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I124357947, https://openalex.org/I183935753 |
| authorships[3].institutions[0].country_code | GB |
| authorships[3].institutions[0].display_name | King's College London |
| authorships[3].institutions[1].id | https://openalex.org/I4210128584 |
| authorships[3].institutions[1].ror | https://ror.org/035dkdb55 |
| authorships[3].institutions[1].type | facility |
| authorships[3].institutions[1].lineage | https://openalex.org/I4210128584 |
| authorships[3].institutions[1].country_code | GB |
| authorships[3].institutions[1].display_name | The Alan Turing Institute |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Barbara McGillivray |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | King's College London, The Alan Turing Institute |
| authorships[4].author.id | https://openalex.org/A5003859694 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-1688-1845 |
| authorships[4].author.display_name | Nina Tahmasebi |
| authorships[4].countries | SE |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I881427289 |
| authorships[4].affiliations[0].raw_affiliation_string | University of Gothenburg |
| authorships[4].institutions[0].id | https://openalex.org/I881427289 |
| authorships[4].institutions[0].ror | https://ror.org/01tm6cn81 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I881427289 |
| authorships[4].institutions[0].country_code | SE |
| authorships[4].institutions[0].display_name | University of Gothenburg |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Nina Tahmasebi |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | University of Gothenburg |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.5281/zenodo.7387261 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2024-04-03T00:00:00 |
| display_name | DWUG EN: Diachronic Word Usage Graphs for English |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10181 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Natural Language Processing Techniques |
| related_works | https://openalex.org/W2748952813, https://openalex.org/W2390279801, https://openalex.org/W2358668433, https://openalex.org/W2376932109, https://openalex.org/W2001405890, https://openalex.org/W2382290278, https://openalex.org/W2478288626, https://openalex.org/W4391913857, https://openalex.org/W2350741829, https://openalex.org/W2296205523 |
| cited_by_count | 1 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 1 |
| best_oa_location.id | pmh:oai:zenodo.org:7387261 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400562 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| best_oa_location.source.host_organization | https://openalex.org/I67311998 |
| best_oa_location.source.host_organization_name | European Organization for Nuclear Research |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I67311998 |
| best_oa_location.license | |
| best_oa_location.pdf_url | |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | info:eu-repo/semantics/other |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | https://doi.org/10.5281/zenodo.7387261 |
| primary_location.id | pmh:oai:zenodo.org:7387261 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400562 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Zenodo (CERN European Organization for Nuclear Research) |
| primary_location.source.host_organization | https://openalex.org/I67311998 |
| primary_location.source.host_organization_name | European Organization for Nuclear Research |
| primary_location.source.host_organization_lineage | https://openalex.org/I67311998 |
| primary_location.license | |
| primary_location.pdf_url | |
| primary_location.version | submittedVersion |
| primary_location.raw_type | info:eu-repo/semantics/other |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | https://doi.org/10.5281/zenodo.7387261 |
| publication_date | 2022-11-30 |
| publication_year | 2022 |
| referenced_works_count | 0 |
| abstract_inverted_index.A | 91 |
| abstract_inverted_index.a | 12 |
| abstract_inverted_index.in | 44, 99, 110 |
| abstract_inverted_index.of | 14, 60, 94, 103 |
| abstract_inverted_index.on | 26, 40, 107 |
| abstract_inverted_index.to | 19 |
| abstract_inverted_index.and | 23, 74 |
| abstract_inverted_index.for | 9, 33 |
| abstract_inverted_index.new | 75 |
| abstract_inverted_index.one | 70 |
| abstract_inverted_index.the | 15, 21, 27, 41, 45, 55, 104 |
| abstract_inverted_index.2021 | 105 |
| abstract_inverted_index.Find | 11 |
| abstract_inverted_index.Four | 100 |
| abstract_inverted_index.Haim | 84 |
| abstract_inverted_index.Nina | 80 |
| abstract_inverted_index.Word | 5, 96 |
| abstract_inverted_index.code | 18 |
| abstract_inverted_index.data | 1, 16, 22, 43 |
| abstract_inverted_index.find | 37 |
| abstract_inverted_index.more | 38, 71 |
| abstract_inverted_index.uses | 54 |
| abstract_inverted_index.with | 69 |
| abstract_inverted_index.<a | 28, 89 |
| abstract_inverted_index.2.0.0 | 65 |
| abstract_inverted_index.2021. | 88 |
| abstract_inverted_index.Simon | 82 |
| abstract_inverted_index.Usage | 6, 97 |
| abstract_inverted_index.label | 57 |
| abstract_inverted_index.large | 92 |
| abstract_inverted_index.noise | 53 |
| abstract_inverted_index.paper | 46 |
| abstract_inverted_index.round | 73 |
| abstract_inverted_index.them. | 62 |
| abstract_inverted_index.(WUGs) | 8 |
| abstract_inverted_index.2.0.1, | 50 |
| abstract_inverted_index.Graphs | 7, 98 |
| abstract_inverted_index.Assigns | 52 |
| abstract_inverted_index.Barbara | 86 |
| abstract_inverted_index.Methods | 109 |
| abstract_inverted_index.Natural | 111 |
| abstract_inverted_index.Version | 64 |
| abstract_inverted_index.cluster | 56 |
| abstract_inverted_index.extends | 66 |
| abstract_inverted_index.format, | 17 |
| abstract_inverted_index.further | 24 |
| abstract_inverted_index.instead | 59 |
| abstract_inverted_index.process | 20 |
| abstract_inverted_index.English. | 10 |
| abstract_inverted_index.Language | 112 |
| abstract_inverted_index.Resource | 93 |
| abstract_inverted_index.contains | 3 |
| abstract_inverted_index.datasets | 25 |
| abstract_inverted_index.previous | 31, 67 |
| abstract_inverted_index.provided | 42 |
| abstract_inverted_index.removing | 61 |
| abstract_inverted_index.versions | 32, 68 |
| abstract_inverted_index.Empirical | 108 |
| abstract_inverted_index.Hengchen, | 83 |
| abstract_inverted_index.Conference | 106 |
| abstract_inverted_index.Diachronic | 95 |
| abstract_inverted_index.Tahmasebi, | 81 |
| abstract_inverted_index.additional | 34 |
| abstract_inverted_index.annotation | 72 |
| abstract_inverted_index.collection | 2 |
| abstract_inverted_index.diachronic | 4 |
| abstract_inverted_index.referenced | 47 |
| abstract_inverted_index.30.11.2022. | 51 |
| abstract_inverted_index.Proceedings | 102 |
| abstract_inverted_index.description | 13 |
| abstract_inverted_index.information | 39 |
| abstract_inverted_index.'-1' | 58 |
| abstract_inverted_index.<p>See | 30 |
| abstract_inverted_index.Dubossarsky, | 85 |
| abstract_inverted_index.McGillivray. | 87 |
| abstract_inverted_index.Schlechtweg, | 79 |
| abstract_inverted_index.<p>This | 0 |
| abstract_inverted_index.<p>Please | 36 |
| abstract_inverted_index.<p>Dominik | 78 |
| abstract_inverted_index.below.</p> | 48 |
| abstract_inverted_index.<p>Version: | 49 |
| abstract_inverted_index.testsets.</p> | 35 |
| abstract_inverted_index.Languages</a>. | 101 |
| abstract_inverted_index.Processing.</p> | 113 |
| abstract_inverted_index.clusterings.</p> | 76 |
| abstract_inverted_index.<em>Important</em>: | 63 |
| abstract_inverted_index.<p><strong>Reference</strong></p> | 77 |
| abstract_inverted_index.href="https://aclanthology.org/2021.emnlp-main.567/">DWUG: | 90 |
| abstract_inverted_index.href="https://www.ims.uni-stuttgart.de/data/wugs">WUGsite</a>.</p> | 29 |
| cited_by_percentile_year | |
| countries_distinct_count | 3 |
| institutions_distinct_count | 5 |
| citation_normalized_percentile | |
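The `abstract_inverted_index.*` rows in the payload store the abstract as a mapping from each token to the positions where it occurs, so the running text can be recovered by sorting tokens by position. The sketch below illustrates this under stated assumptions: the helper names `rebuild_abstract` and `strip_tags` are hypothetical, and `sample` is a short excerpt covering only positions 0 to 10 of the rows above, not the full index. Since several tokens carry HTML markup (for example `<p>This`), the sketch also removes tags with a simple regular expression, which is adequate for this markup but is not a general HTML parser.

```python
import re


def rebuild_abstract(inverted_index):
    """Rebuild running text from a token -> list-of-positions mapping."""
    by_position = {}
    for token, positions in inverted_index.items():
        for position in positions:
            by_position[position] = token
    # Join tokens in position order; gaps in the numbering are simply skipped.
    return " ".join(by_position[p] for p in sorted(by_position))


def strip_tags(text):
    """Remove simple HTML tags such as <p>, </p>, or <a href="...">."""
    return re.sub(r"<[^>]+>", "", text)


# Excerpt of the index (first sentence only), positions as listed in the table.
sample = {
    "<p>This": [0],
    "data": [1],
    "collection": [2],
    "contains": [3],
    "diachronic": [4],
    "Word": [5],
    "Usage": [6],
    "Graphs": [7],
    "(WUGs)": [8],
    "for": [9],
    "English.": [10],
}

raw = rebuild_abstract(sample)
clean_text = strip_tags(raw)
print(clean_text)
```

Applied to the full set of rows, the same two steps should yield the description shown at the top of this record, with link targets dropped along with the tags.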