Data Deidentification for Data Sharing in Educational and Psychological Research: Importance, Barriers, and Techniques
2024 · Open Access · DOI: https://doi.org/10.31234/osf.io/jgd9c
In this manuscript, we discuss the importance of data sharing in educational and psychological research, emphasizing the historical context of data sharing, the current open science movement, and the so-called replication crisis. We additionally explore the barriers to data sharing, particularly the fear of incorrectly deidentifying data or accidentally including private information. We then highlight the importance of deidentifying data for data sharing. Finally, we present specific techniques for data deidentification, namely non-perturbative and perturbative methods, and make recommendations for which techniques are relevant for specific types of variables. To assist readers in implementing the material from this study, we have additionally created an interactive tutorial as a Shiny web application, which is publicly available and free to use.
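The distinction the abstract draws between non-perturbative and perturbative methods can be illustrated with a minimal Python sketch. The abstract does not say which specific techniques the manuscript recommends, so the choices here (generalization, top-coding, and suppression as non-perturbative steps; additive Gaussian noise as a perturbative step) are assumptions taken from common statistical disclosure control practice. The records, field names, and thresholds are hypothetical.

```python
import random

# Hypothetical records; field names and values are illustrative only.
RECORDS = [
    {"id": "S001", "age": 12, "income": 41000, "score": 98.0},
    {"id": "S002", "age": 17, "income": 250000, "score": 71.5},
]

def generalize_age(age, width=5):
    """Non-perturbative: replace an exact age with a band such as '10-14'."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"

def top_code(value, cap):
    """Non-perturbative: collapse values above the cap to the cap itself."""
    return min(value, cap)

def add_noise(value, sd, rng):
    """Perturbative: alter the value with zero-mean Gaussian noise."""
    return value + rng.gauss(0.0, sd)

def deidentify(records, seed=0):
    """Apply the steps above; the direct identifier 'id' is suppressed."""
    rng = random.Random(seed)  # seeded so the sketch is repeatable
    return [
        {
            "age_band": generalize_age(r["age"]),
            "income": top_code(r["income"], 150000),
            "score": add_noise(r["score"], 2.0, rng),
        }
        for r in records
    ]

if __name__ == "__main__":
    for row in deidentify(RECORDS):
        print(row)
```

Non-perturbative steps reduce detail without making any value false, while the perturbative step changes values and so trades some accuracy for protection. Which step suits which variable is the question the manuscript's recommendations address.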
Related Topics
- Type: preprint
- Language: en
- Landing Page: https://doi.org/10.31234/osf.io/jgd9c
- OA Status: gold
- Cited By: 1
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4400817993
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4400817993 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.31234/osf.io/jgd9c (Digital Object Identifier)
- Title: Data Deidentification for Data Sharing in Educational and Psychological Research: Importance, Barriers, and Techniques (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2024 (year of publication)
- Publication date: 2024-07-19 (full publication date if available)
- Authors: Jeffrey Shero, Alexis Swanz, Allyson Hanson, Sara A. Hart, Jessica A. R. Logan (list of authors in order)
- Landing page: https://doi.org/10.31234/osf.io/jgd9c (publisher landing page)
- Open access: Yes (whether a free full text is available)
- OA status: gold (open access status per OpenAlex)
- OA URL: https://doi.org/10.31234/osf.io/jgd9c (direct OA link when available)
- Concepts: Data sharing, Psychological research, Research data, Psychology, Applied psychology, Data science, Computer science, Social psychology, Medicine, Data curation, Pathology, Alternative medicine (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 1 (total citation count in OpenAlex)
- Citations by year (recent): 2025: 1 (per-year citation counts, last 5 years)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4400817993 |
| doi | https://doi.org/10.31234/osf.io/jgd9c |
| ids.doi | https://doi.org/10.31234/osf.io/jgd9c |
| ids.openalex | https://openalex.org/W4400817993 |
| fwci | 0.9565227 |
| type | preprint |
| title | Data Deidentification for Data Sharing in Educational and Psychological Research: Importance, Barriers, and Techniques |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T11719 |
| topics[0].field.id | https://openalex.org/fields/18 |
| topics[0].field.display_name | Decision Sciences |
| topics[0].score | 0.9578999876976013 |
| topics[0].domain.id | https://openalex.org/domains/2 |
| topics[0].domain.display_name | Social Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1803 |
| topics[0].subfield.display_name | Management Science and Operations Research |
| topics[0].display_name | Data Quality and Management |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C2779965156 |
| concepts[0].level | 3 |
| concepts[0].score | 0.6276739835739136 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q5227350 |
| concepts[0].display_name | Data sharing |
| concepts[1].id | https://openalex.org/C188255311 |
| concepts[1].level | 2 |
| concepts[1].score | 0.5644633173942566 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q7256388 |
| concepts[1].display_name | Psychological research |
| concepts[2].id | https://openalex.org/C3020038283 |
| concepts[2].level | 3 |
| concepts[2].score | 0.5374338030815125 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q42848 |
| concepts[2].display_name | Research data |
| concepts[3].id | https://openalex.org/C15744967 |
| concepts[3].level | 0 |
| concepts[3].score | 0.5129311084747314 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q9418 |
| concepts[3].display_name | Psychology |
| concepts[4].id | https://openalex.org/C75630572 |
| concepts[4].level | 1 |
| concepts[4].score | 0.38458144664764404 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q538904 |
| concepts[4].display_name | Applied psychology |
| concepts[5].id | https://openalex.org/C2522767166 |
| concepts[5].level | 1 |
| concepts[5].score | 0.3517906665802002 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q2374463 |
| concepts[5].display_name | Data science |
| concepts[6].id | https://openalex.org/C41008148 |
| concepts[6].level | 0 |
| concepts[6].score | 0.33913689851760864 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[6].display_name | Computer science |
| concepts[7].id | https://openalex.org/C77805123 |
| concepts[7].level | 1 |
| concepts[7].score | 0.2259814441204071 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q161272 |
| concepts[7].display_name | Social psychology |
| concepts[8].id | https://openalex.org/C71924100 |
| concepts[8].level | 0 |
| concepts[8].score | 0.08928251266479492 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q11190 |
| concepts[8].display_name | Medicine |
| concepts[9].id | https://openalex.org/C91632574 |
| concepts[9].level | 2 |
| concepts[9].score | 0.05993208289146423 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q15088675 |
| concepts[9].display_name | Data curation |
| concepts[10].id | https://openalex.org/C142724271 |
| concepts[10].level | 1 |
| concepts[10].score | 0.0 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q7208 |
| concepts[10].display_name | Pathology |
| concepts[11].id | https://openalex.org/C204787440 |
| concepts[11].level | 2 |
| concepts[11].score | 0.0 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q188504 |
| concepts[11].display_name | Alternative medicine |
| keywords[0].id | https://openalex.org/keywords/data-sharing |
| keywords[0].score | 0.6276739835739136 |
| keywords[0].display_name | Data sharing |
| keywords[1].id | https://openalex.org/keywords/psychological-research |
| keywords[1].score | 0.5644633173942566 |
| keywords[1].display_name | Psychological research |
| keywords[2].id | https://openalex.org/keywords/research-data |
| keywords[2].score | 0.5374338030815125 |
| keywords[2].display_name | Research data |
| keywords[3].id | https://openalex.org/keywords/psychology |
| keywords[3].score | 0.5129311084747314 |
| keywords[3].display_name | Psychology |
| keywords[4].id | https://openalex.org/keywords/applied-psychology |
| keywords[4].score | 0.38458144664764404 |
| keywords[4].display_name | Applied psychology |
| keywords[5].id | https://openalex.org/keywords/data-science |
| keywords[5].score | 0.3517906665802002 |
| keywords[5].display_name | Data science |
| keywords[6].id | https://openalex.org/keywords/computer-science |
| keywords[6].score | 0.33913689851760864 |
| keywords[6].display_name | Computer science |
| keywords[7].id | https://openalex.org/keywords/social-psychology |
| keywords[7].score | 0.2259814441204071 |
| keywords[7].display_name | Social psychology |
| keywords[8].id | https://openalex.org/keywords/medicine |
| keywords[8].score | 0.08928251266479492 |
| keywords[8].display_name | Medicine |
| keywords[9].id | https://openalex.org/keywords/data-curation |
| keywords[9].score | 0.05993208289146423 |
| keywords[9].display_name | Data curation |
| language | en |
| locations[0].id | doi:10.31234/osf.io/jgd9c |
| locations[0].is_oa | True |
| locations[0].source | |
| locations[0].license | cc-by |
| locations[0].pdf_url | |
| locations[0].version | acceptedVersion |
| locations[0].raw_type | posted-content |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | https://doi.org/10.31234/osf.io/jgd9c |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5056949372 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-2180-8979 |
| authorships[0].author.display_name | Jeffrey Shero |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Jeffrey Shero |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5093797181 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Alexis Swanz |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Alexis Swanz |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5076437251 |
| authorships[2].author.orcid | https://orcid.org/0009-0000-0891-6038 |
| authorships[2].author.display_name | Allyson Hanson |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Allyson Hanson |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5018343901 |
| authorships[3].author.orcid | https://orcid.org/0000-0001-9793-0420 |
| authorships[3].author.display_name | Sara A. Hart |
| authorships[3].countries | US |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I103163165 |
| authorships[3].affiliations[0].raw_affiliation_string | Florida State University |
| authorships[3].institutions[0].id | https://openalex.org/I103163165 |
| authorships[3].institutions[0].ror | https://ror.org/05g3dte14 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I103163165 |
| authorships[3].institutions[0].country_code | US |
| authorships[3].institutions[0].display_name | Florida State University |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Sara Ann Hart |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Florida State University |
| authorships[4].author.id | https://openalex.org/A5058436364 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-3113-4346 |
| authorships[4].author.display_name | Jessica A. R. Logan |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Jessica A. R. Logan |
| authorships[4].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.31234/osf.io/jgd9c |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Data Deidentification for Data Sharing in Educational and Psychological Research: Importance, Barriers, and Techniques |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T11719 |
| primary_topic.field.id | https://openalex.org/fields/18 |
| primary_topic.field.display_name | Decision Sciences |
| primary_topic.score | 0.9578999876976013 |
| primary_topic.domain.id | https://openalex.org/domains/2 |
| primary_topic.domain.display_name | Social Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1803 |
| primary_topic.subfield.display_name | Management Science and Operations Research |
| primary_topic.display_name | Data Quality and Management |
| related_works | https://openalex.org/W3123872772, https://openalex.org/W2152419662, https://openalex.org/W1504819070, https://openalex.org/W1501592941, https://openalex.org/W3006280437, https://openalex.org/W3080859738, https://openalex.org/W3205543244, https://openalex.org/W1979070592, https://openalex.org/W2604788701, https://openalex.org/W2568748071 |
| cited_by_count | 1 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 1 |
| best_oa_location.id | doi:10.31234/osf.io/jgd9c |
| best_oa_location.is_oa | True |
| best_oa_location.source | |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | |
| best_oa_location.version | acceptedVersion |
| best_oa_location.raw_type | posted-content |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | https://doi.org/10.31234/osf.io/jgd9c |
| primary_location.id | doi:10.31234/osf.io/jgd9c |
| primary_location.is_oa | True |
| primary_location.source | |
| primary_location.license | cc-by |
| primary_location.pdf_url | |
| primary_location.version | acceptedVersion |
| primary_location.raw_type | posted-content |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | https://doi.org/10.31234/osf.io/jgd9c |
| publication_date | 2024-07-19 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 107 |
| abstract_inverted_index.In | 0 |
| abstract_inverted_index.To | 89 |
| abstract_inverted_index.We | 32, 52 |
| abstract_inverted_index.an | 103 |
| abstract_inverted_index.as | 106 |
| abstract_inverted_index.in | 10, 92 |
| abstract_inverted_index.is | 112 |
| abstract_inverted_index.of | 7, 19, 43, 57, 87 |
| abstract_inverted_index.or | 47 |
| abstract_inverted_index.to | 37, 117 |
| abstract_inverted_index.we | 3, 64, 99 |
| abstract_inverted_index.and | 12, 27, 73, 76, 115 |
| abstract_inverted_index.are | 82 |
| abstract_inverted_index.for | 60, 68, 79, 84 |
| abstract_inverted_index.the | 5, 16, 22, 28, 35, 41, 55, 94 |
| abstract_inverted_index.web | 109 |
| abstract_inverted_index.data | 8, 20, 38, 46, 59, 61, 69 |
| abstract_inverted_index.fear | 42 |
| abstract_inverted_index.free | 116 |
| abstract_inverted_index.from | 96 |
| abstract_inverted_index.have | 100 |
| abstract_inverted_index.make | 77 |
| abstract_inverted_index.open | 24 |
| abstract_inverted_index.then | 53 |
| abstract_inverted_index.this | 1, 97 |
| abstract_inverted_index.use. | 118 |
| abstract_inverted_index.Shiny | 108 |
| abstract_inverted_index.types | 86 |
| abstract_inverted_index.which | 80, 111 |
| abstract_inverted_index.assist | 90 |
| abstract_inverted_index.namely | 71 |
| abstract_inverted_index.study, | 98 |
| abstract_inverted_index.context | 18 |
| abstract_inverted_index.created | 102 |
| abstract_inverted_index.crisis. | 31 |
| abstract_inverted_index.current | 23 |
| abstract_inverted_index.discuss | 4 |
| abstract_inverted_index.explore | 34 |
| abstract_inverted_index.present | 65 |
| abstract_inverted_index.private | 50 |
| abstract_inverted_index.readers | 91 |
| abstract_inverted_index.science | 25 |
| abstract_inverted_index.sharing | 9 |
| abstract_inverted_index.Finally, | 63 |
| abstract_inverted_index.barriers | 36 |
| abstract_inverted_index.material | 95 |
| abstract_inverted_index.methods, | 75 |
| abstract_inverted_index.publicly | 113 |
| abstract_inverted_index.relevant | 83 |
| abstract_inverted_index.sharing, | 21, 39 |
| abstract_inverted_index.sharing. | 62 |
| abstract_inverted_index.specific | 66, 85 |
| abstract_inverted_index.tutorial | 105 |
| abstract_inverted_index.available | 114 |
| abstract_inverted_index.highlight | 54 |
| abstract_inverted_index.including | 49 |
| abstract_inverted_index.movement, | 26 |
| abstract_inverted_index.research, | 14 |
| abstract_inverted_index.so-called | 29 |
| abstract_inverted_index.historical | 17 |
| abstract_inverted_index.importance | 6, 56 |
| abstract_inverted_index.techniques | 67, 81 |
| abstract_inverted_index.variables. | 88 |
| abstract_inverted_index.educational | 11 |
| abstract_inverted_index.emphasizing | 15 |
| abstract_inverted_index.incorrectly | 44 |
| abstract_inverted_index.interactive | 104 |
| abstract_inverted_index.manuscript, | 2 |
| abstract_inverted_index.replication | 30 |
| abstract_inverted_index.accidentally | 48 |
| abstract_inverted_index.additionally | 33, 101 |
| abstract_inverted_index.application, | 110 |
| abstract_inverted_index.implementing | 93 |
| abstract_inverted_index.information. | 51 |
| abstract_inverted_index.particularly | 40 |
| abstract_inverted_index.perturbative | 74 |
| abstract_inverted_index.deidentifying | 45, 58 |
| abstract_inverted_index.psychological | 13 |
| abstract_inverted_index.recommendations | 78 |
| abstract_inverted_index.non-perturbative | 72 |
| abstract_inverted_index.deidentification, | 70 |
| cited_by_percentile_year.max | 95 |
| cited_by_percentile_year.min | 91 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 5 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/4 |
| sustainable_development_goals[0].score | 0.49000000953674316 |
| sustainable_development_goals[0].display_name | Quality Education |
| citation_normalized_percentile.value | 0.70980884 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |
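
The `abstract_inverted_index.*` rows in the payload above store the abstract as a map from each word to the zero-based positions where it occurs. A minimal Python sketch of how such an index can be turned back into running text follows. It assumes the index is available as a dict of word to list of integer positions; the `SAMPLE` dict is a hand-copied subset of the rows above, restricted to positions 0 through 9, and the function name is hypothetical.

```python
def rebuild_abstract(inverted_index):
    """Place every word at each of its positions, then join in position order."""
    by_position = {}
    for word, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = word
    return " ".join(by_position[pos] for pos in sorted(by_position))

# Subset of the payload rows, limited to the first ten positions.
SAMPLE = {
    "In": [0],
    "this": [1],
    "manuscript,": [2],
    "we": [3],
    "discuss": [4],
    "the": [5],
    "importance": [6],
    "of": [7],
    "data": [8],
    "sharing": [9],
}

if __name__ == "__main__":
    print(rebuild_abstract(SAMPLE))
    # prints: In this manuscript, we discuss the importance of data sharing
```

Applied to the full set of rows, the same procedure should reproduce the abstract shown at the top of this record, since punctuation is stored as part of each word token.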