Leveraging electronic health records for data science: common pitfalls and how to avoid them Article Swipe
Christopher Martin Sauer
,
Li-Ching Chen
,
Stephanie L Hyland
,
Armand R. J. Girbes
,
Paul Elbers
,
Leo Anthony Celi
·
YOU?
·
· 2022
· Open Access
·
· DOI: https://doi.org/10.1016/s2589-7500(22)00154-6
YOU?
·
· 2022
· Open Access
·
· DOI: https://doi.org/10.1016/s2589-7500(22)00154-6
Related Topics
Concepts
Overfitting
Data science
Health records
Computer science
Workflow
Robustness (evolution)
Sample (material)
Variable (mathematics)
Software deployment
Sample size determination
Data mining
Risk analysis (engineering)
Health care
Artificial intelligence
Medicine
Database
Statistics
Software engineering
Gene
Chemistry
Biochemistry
Mathematical analysis
Economic growth
Mathematics
Chromatography
Artificial neural network
Economics
Metadata
- Type
- review
- Language
- en
- Landing Page
- https://doi.org/10.1016/s2589-7500(22)00154-6
- OA Status
- gold
- Cited By
- 130
- References
- 78
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4296902923
All OpenAlex metadata
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4296902923Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1016/s2589-7500(22)00154-6Digital Object Identifier
- Title
-
Leveraging electronic health records for data science: common pitfalls and how to avoid themWork title
- Type
-
reviewOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2022Year of publication
- Publication date
-
2022-09-22Full publication date if available
- Authors
-
Christopher Martin Sauer, Li-Ching Chen, Stephanie L Hyland, Armand R. J. Girbes, Paul Elbers, Leo Anthony CeliList of authors in order
- Landing page
-
https://doi.org/10.1016/s2589-7500(22)00154-6Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
goldOpen access status per OpenAlex
- OA URL
-
https://doi.org/10.1016/s2589-7500(22)00154-6Direct OA link when available
- Concepts
-
Overfitting, Data science, Health records, Computer science, Workflow, Robustness (evolution), Sample (material), Variable (mathematics), Software deployment, Sample size determination, Data mining, Risk analysis (engineering), Health care, Artificial intelligence, Medicine, Database, Statistics, Software engineering, Gene, Chemistry, Biochemistry, Mathematical analysis, Economic growth, Mathematics, Chromatography, Artificial neural network, EconomicsTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
130Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 54, 2024: 50, 2023: 24, 2022: 2Per-year citation counts (last 5 years)
- References (count)
-
78Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4296902923 |
|---|---|
| doi | https://doi.org/10.1016/s2589-7500(22)00154-6 |
| ids.doi | https://doi.org/10.1016/s2589-7500(22)00154-6 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/36154811 |
| ids.openalex | https://openalex.org/W4296902923 |
| fwci | 25.45384664 |
| mesh[0].qualifier_ui | |
| mesh[0].descriptor_ui | D006801 |
| mesh[0].is_major_topic | False |
| mesh[0].qualifier_name | |
| mesh[0].descriptor_name | Humans |
| mesh[1].qualifier_ui | |
| mesh[1].descriptor_ui | D057286 |
| mesh[1].is_major_topic | True |
| mesh[1].qualifier_name | |
| mesh[1].descriptor_name | Electronic Health Records |
| mesh[2].qualifier_ui | |
| mesh[2].descriptor_ui | D000077488 |
| mesh[2].is_major_topic | True |
| mesh[2].qualifier_name | |
| mesh[2].descriptor_name | Data Science |
| mesh[3].qualifier_ui | |
| mesh[3].descriptor_ui | D003625 |
| mesh[3].is_major_topic | False |
| mesh[3].qualifier_name | |
| mesh[3].descriptor_name | Data Collection |
| mesh[4].qualifier_ui | |
| mesh[4].descriptor_ui | D012107 |
| mesh[4].is_major_topic | False |
| mesh[4].qualifier_name | |
| mesh[4].descriptor_name | Research Design |
| mesh[5].qualifier_ui | |
| mesh[5].descriptor_ui | D000085143 |
| mesh[5].is_major_topic | False |
| mesh[5].qualifier_name | |
| mesh[5].descriptor_name | Routinely Collected Health Data |
| mesh[6].qualifier_ui | |
| mesh[6].descriptor_ui | D006801 |
| mesh[6].is_major_topic | False |
| mesh[6].qualifier_name | |
| mesh[6].descriptor_name | Humans |
| mesh[7].qualifier_ui | |
| mesh[7].descriptor_ui | D057286 |
| mesh[7].is_major_topic | True |
| mesh[7].qualifier_name | |
| mesh[7].descriptor_name | Electronic Health Records |
| mesh[8].qualifier_ui | |
| mesh[8].descriptor_ui | D000077488 |
| mesh[8].is_major_topic | True |
| mesh[8].qualifier_name | |
| mesh[8].descriptor_name | Data Science |
| mesh[9].qualifier_ui | |
| mesh[9].descriptor_ui | D003625 |
| mesh[9].is_major_topic | False |
| mesh[9].qualifier_name | |
| mesh[9].descriptor_name | Data Collection |
| mesh[10].qualifier_ui | |
| mesh[10].descriptor_ui | D012107 |
| mesh[10].is_major_topic | False |
| mesh[10].qualifier_name | |
| mesh[10].descriptor_name | Research Design |
| mesh[11].qualifier_ui | |
| mesh[11].descriptor_ui | D000085143 |
| mesh[11].is_major_topic | False |
| mesh[11].qualifier_name | |
| mesh[11].descriptor_name | Routinely Collected Health Data |
| type | review |
| title | Leveraging electronic health records for data science: common pitfalls and how to avoid them |
| awards[0].id | https://openalex.org/G7597440964 |
| awards[0].funder_id | https://openalex.org/F4320337363 |
| awards[0].display_name | |
| awards[0].funder_award_id | EB017205 |
| awards[0].funder_display_name | National Institute of Biomedical Imaging and Bioengineering |
| biblio.issue | 12 |
| biblio.volume | 4 |
| biblio.last_page | e898 |
| biblio.first_page | e893 |
| topics[0].id | https://openalex.org/T13702 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9994000196456909 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Machine Learning in Healthcare |
| topics[1].id | https://openalex.org/T10845 |
| topics[1].field.id | https://openalex.org/fields/26 |
| topics[1].field.display_name | Mathematics |
| topics[1].score | 0.9807000160217285 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/2613 |
| topics[1].subfield.display_name | Statistics and Probability |
| topics[1].display_name | Advanced Causal Inference Techniques |
| topics[2].id | https://openalex.org/T10350 |
| topics[2].field.id | https://openalex.org/fields/36 |
| topics[2].field.display_name | Health Professions |
| topics[2].score | 0.9782999753952026 |
| topics[2].domain.id | https://openalex.org/domains/4 |
| topics[2].domain.display_name | Health Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/3605 |
| topics[2].subfield.display_name | Health Information Management |
| topics[2].display_name | Electronic Health Records Systems |
| funders[0].id | https://openalex.org/F4320306080 |
| funders[0].ror | https://ror.org/00k86s890 |
| funders[0].display_name | Foundation for the National Institutes of Health |
| funders[1].id | https://openalex.org/F4320332161 |
| funders[1].ror | https://ror.org/01cwqze88 |
| funders[1].display_name | National Institutes of Health |
| funders[2].id | https://openalex.org/F4320337363 |
| funders[2].ror | https://ror.org/00372qc85 |
| funders[2].display_name | National Institute of Biomedical Imaging and Bioengineering |
| is_xpac | False |
| apc_list.value | 5000 |
| apc_list.currency | USD |
| apc_list.value_usd | 5000 |
| apc_paid.value | 5000 |
| apc_paid.currency | USD |
| apc_paid.value_usd | 5000 |
| concepts[0].id | https://openalex.org/C22019652 |
| concepts[0].level | 3 |
| concepts[0].score | 0.7403736710548401 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q331309 |
| concepts[0].display_name | Overfitting |
| concepts[1].id | https://openalex.org/C2522767166 |
| concepts[1].level | 1 |
| concepts[1].score | 0.6945620775222778 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q2374463 |
| concepts[1].display_name | Data science |
| concepts[2].id | https://openalex.org/C3019952477 |
| concepts[2].level | 3 |
| concepts[2].score | 0.6884405612945557 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q1324077 |
| concepts[2].display_name | Health records |
| concepts[3].id | https://openalex.org/C41008148 |
| concepts[3].level | 0 |
| concepts[3].score | 0.6642923355102539 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[3].display_name | Computer science |
| concepts[4].id | https://openalex.org/C177212765 |
| concepts[4].level | 2 |
| concepts[4].score | 0.6582731008529663 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q627335 |
| concepts[4].display_name | Workflow |
| concepts[5].id | https://openalex.org/C63479239 |
| concepts[5].level | 3 |
| concepts[5].score | 0.5331541299819946 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q7353546 |
| concepts[5].display_name | Robustness (evolution) |
| concepts[6].id | https://openalex.org/C198531522 |
| concepts[6].level | 2 |
| concepts[6].score | 0.4942496120929718 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q485146 |
| concepts[6].display_name | Sample (material) |
| concepts[7].id | https://openalex.org/C182365436 |
| concepts[7].level | 2 |
| concepts[7].score | 0.4567781984806061 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q50701 |
| concepts[7].display_name | Variable (mathematics) |
| concepts[8].id | https://openalex.org/C105339364 |
| concepts[8].level | 2 |
| concepts[8].score | 0.44664132595062256 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q2297740 |
| concepts[8].display_name | Software deployment |
| concepts[9].id | https://openalex.org/C129848803 |
| concepts[9].level | 2 |
| concepts[9].score | 0.43781739473342896 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q2564360 |
| concepts[9].display_name | Sample size determination |
| concepts[10].id | https://openalex.org/C124101348 |
| concepts[10].level | 1 |
| concepts[10].score | 0.3750416934490204 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[10].display_name | Data mining |
| concepts[11].id | https://openalex.org/C112930515 |
| concepts[11].level | 1 |
| concepts[11].score | 0.33034777641296387 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q4389547 |
| concepts[11].display_name | Risk analysis (engineering) |
| concepts[12].id | https://openalex.org/C160735492 |
| concepts[12].level | 2 |
| concepts[12].score | 0.2338160276412964 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q31207 |
| concepts[12].display_name | Health care |
| concepts[13].id | https://openalex.org/C154945302 |
| concepts[13].level | 1 |
| concepts[13].score | 0.2293553352355957 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[13].display_name | Artificial intelligence |
| concepts[14].id | https://openalex.org/C71924100 |
| concepts[14].level | 0 |
| concepts[14].score | 0.19968584179878235 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q11190 |
| concepts[14].display_name | Medicine |
| concepts[15].id | https://openalex.org/C77088390 |
| concepts[15].level | 1 |
| concepts[15].score | 0.1228855550289154 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q8513 |
| concepts[15].display_name | Database |
| concepts[16].id | https://openalex.org/C105795698 |
| concepts[16].level | 1 |
| concepts[16].score | 0.11038902401924133 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q12483 |
| concepts[16].display_name | Statistics |
| concepts[17].id | https://openalex.org/C115903868 |
| concepts[17].level | 1 |
| concepts[17].score | 0.09509268403053284 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q80993 |
| concepts[17].display_name | Software engineering |
| concepts[18].id | https://openalex.org/C104317684 |
| concepts[18].level | 2 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[18].display_name | Gene |
| concepts[19].id | https://openalex.org/C185592680 |
| concepts[19].level | 0 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q2329 |
| concepts[19].display_name | Chemistry |
| concepts[20].id | https://openalex.org/C55493867 |
| concepts[20].level | 1 |
| concepts[20].score | 0.0 |
| concepts[20].wikidata | https://www.wikidata.org/wiki/Q7094 |
| concepts[20].display_name | Biochemistry |
| concepts[21].id | https://openalex.org/C134306372 |
| concepts[21].level | 1 |
| concepts[21].score | 0.0 |
| concepts[21].wikidata | https://www.wikidata.org/wiki/Q7754 |
| concepts[21].display_name | Mathematical analysis |
| concepts[22].id | https://openalex.org/C50522688 |
| concepts[22].level | 1 |
| concepts[22].score | 0.0 |
| concepts[22].wikidata | https://www.wikidata.org/wiki/Q189833 |
| concepts[22].display_name | Economic growth |
| concepts[23].id | https://openalex.org/C33923547 |
| concepts[23].level | 0 |
| concepts[23].score | 0.0 |
| concepts[23].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[23].display_name | Mathematics |
| concepts[24].id | https://openalex.org/C43617362 |
| concepts[24].level | 1 |
| concepts[24].score | 0.0 |
| concepts[24].wikidata | https://www.wikidata.org/wiki/Q170050 |
| concepts[24].display_name | Chromatography |
| concepts[25].id | https://openalex.org/C50644808 |
| concepts[25].level | 2 |
| concepts[25].score | 0.0 |
| concepts[25].wikidata | https://www.wikidata.org/wiki/Q192776 |
| concepts[25].display_name | Artificial neural network |
| concepts[26].id | https://openalex.org/C162324750 |
| concepts[26].level | 0 |
| concepts[26].score | 0.0 |
| concepts[26].wikidata | https://www.wikidata.org/wiki/Q8134 |
| concepts[26].display_name | Economics |
| keywords[0].id | https://openalex.org/keywords/overfitting |
| keywords[0].score | 0.7403736710548401 |
| keywords[0].display_name | Overfitting |
| keywords[1].id | https://openalex.org/keywords/data-science |
| keywords[1].score | 0.6945620775222778 |
| keywords[1].display_name | Data science |
| keywords[2].id | https://openalex.org/keywords/health-records |
| keywords[2].score | 0.6884405612945557 |
| keywords[2].display_name | Health records |
| keywords[3].id | https://openalex.org/keywords/computer-science |
| keywords[3].score | 0.6642923355102539 |
| keywords[3].display_name | Computer science |
| keywords[4].id | https://openalex.org/keywords/workflow |
| keywords[4].score | 0.6582731008529663 |
| keywords[4].display_name | Workflow |
| keywords[5].id | https://openalex.org/keywords/robustness |
| keywords[5].score | 0.5331541299819946 |
| keywords[5].display_name | Robustness (evolution) |
| keywords[6].id | https://openalex.org/keywords/sample |
| keywords[6].score | 0.4942496120929718 |
| keywords[6].display_name | Sample (material) |
| keywords[7].id | https://openalex.org/keywords/variable |
| keywords[7].score | 0.4567781984806061 |
| keywords[7].display_name | Variable (mathematics) |
| keywords[8].id | https://openalex.org/keywords/software-deployment |
| keywords[8].score | 0.44664132595062256 |
| keywords[8].display_name | Software deployment |
| keywords[9].id | https://openalex.org/keywords/sample-size-determination |
| keywords[9].score | 0.43781739473342896 |
| keywords[9].display_name | Sample size determination |
| keywords[10].id | https://openalex.org/keywords/data-mining |
| keywords[10].score | 0.3750416934490204 |
| keywords[10].display_name | Data mining |
| keywords[11].id | https://openalex.org/keywords/risk-analysis |
| keywords[11].score | 0.33034777641296387 |
| keywords[11].display_name | Risk analysis (engineering) |
| keywords[12].id | https://openalex.org/keywords/health-care |
| keywords[12].score | 0.2338160276412964 |
| keywords[12].display_name | Health care |
| keywords[13].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[13].score | 0.2293553352355957 |
| keywords[13].display_name | Artificial intelligence |
| keywords[14].id | https://openalex.org/keywords/medicine |
| keywords[14].score | 0.19968584179878235 |
| keywords[14].display_name | Medicine |
| keywords[15].id | https://openalex.org/keywords/database |
| keywords[15].score | 0.1228855550289154 |
| keywords[15].display_name | Database |
| keywords[16].id | https://openalex.org/keywords/statistics |
| keywords[16].score | 0.11038902401924133 |
| keywords[16].display_name | Statistics |
| keywords[17].id | https://openalex.org/keywords/software-engineering |
| keywords[17].score | 0.09509268403053284 |
| keywords[17].display_name | Software engineering |
| language | en |
| locations[0].id | doi:10.1016/s2589-7500(22)00154-6 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4210237014 |
| locations[0].source.issn | 2589-7500 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2589-7500 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | The Lancet Digital Health |
| locations[0].source.host_organization | https://openalex.org/P4310320990 |
| locations[0].source.host_organization_name | Elsevier BV |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310320990 |
| locations[0].source.host_organization_lineage_names | Elsevier BV |
| locations[0].license | cc-by |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | The Lancet Digital Health |
| locations[0].landing_page_url | https://doi.org/10.1016/s2589-7500(22)00154-6 |
| locations[1].id | pmid:36154811 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | The Lancet. Digital health |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/36154811 |
| locations[2].id | pmh:vumc:oai:pure.atira.dk:publications/d847a828-4c5b-4906-a7a2-6c51326627a5 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S4306401843 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | Data Archiving and Networked Services (DANS) |
| locations[2].source.host_organization | https://openalex.org/I1322597698 |
| locations[2].source.host_organization_name | Royal Netherlands Academy of Arts and Sciences |
| locations[2].source.host_organization_lineage | https://openalex.org/I1322597698 |
| locations[2].license | cc-by |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | info:eu-repo/semantics/other |
| locations[2].license_id | https://openalex.org/licenses/cc-by |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | The Lancet Digital Health, 4(12), e893 - e898. Elsevier Ltd |
| locations[2].landing_page_url | https://research.vumc.nl/en/publications/d847a828-4c5b-4906-a7a2-6c51326627a5 |
| locations[3].id | pmh:vumc:oai:pure.atira.dk:publications/ff8abf8a-0177-489d-9c06-a58b9592cedb |
| locations[3].is_oa | True |
| locations[3].source.id | https://openalex.org/S4306401843 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | False |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | Data Archiving and Networked Services (DANS) |
| locations[3].source.host_organization | https://openalex.org/I1322597698 |
| locations[3].source.host_organization_name | Royal Netherlands Academy of Arts and Sciences |
| locations[3].source.host_organization_lineage | https://openalex.org/I1322597698 |
| locations[3].license | other-oa |
| locations[3].pdf_url | |
| locations[3].version | submittedVersion |
| locations[3].raw_type | info:eu-repo/semantics/other |
| locations[3].license_id | https://openalex.org/licenses/other-oa |
| locations[3].is_accepted | False |
| locations[3].is_published | False |
| locations[3].raw_source_name | The Lancet Digital Health, 4(12), e893 - e898. Elsevier Ltd |
| locations[3].landing_page_url | https://research.vumc.nl/en/publications/ff8abf8a-0177-489d-9c06-a58b9592cedb |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5071187821 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-2388-5919 |
| authorships[0].author.display_name | Christopher Martin Sauer |
| authorships[0].countries | NL, US |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I63966007, https://openalex.org/I911458345 |
| authorships[0].affiliations[0].raw_affiliation_string | Laboratory for Critical Care Computational Intelligence, Department of Intensive Care Medicine, Amsterdam Medical Data Science, Amsterdam Cardiovascular Science, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, Location VUmc, Amsterdam, Netherlands; Laboratory for Computational Physiology, Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA. Electronic address: [email protected]. |
| authorships[0].institutions[0].id | https://openalex.org/I911458345 |
| authorships[0].institutions[0].ror | https://ror.org/00q6h8f30 |
| authorships[0].institutions[0].type | healthcare |
| authorships[0].institutions[0].lineage | https://openalex.org/I4210151833, https://openalex.org/I911458345 |
| authorships[0].institutions[0].country_code | NL |
| authorships[0].institutions[0].display_name | Amsterdam UMC Location Vrije Universiteit Amsterdam |
| authorships[0].institutions[1].id | https://openalex.org/I63966007 |
| authorships[0].institutions[1].ror | https://ror.org/042nb2s44 |
| authorships[0].institutions[1].type | education |
| authorships[0].institutions[1].lineage | https://openalex.org/I63966007 |
| authorships[0].institutions[1].country_code | US |
| authorships[0].institutions[1].display_name | Massachusetts Institute of Technology |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Christopher M Sauer |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Laboratory for Critical Care Computational Intelligence, Department of Intensive Care Medicine, Amsterdam Medical Data Science, Amsterdam Cardiovascular Science, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, Location VUmc, Amsterdam, Netherlands; Laboratory for Computational Physiology, Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA. Electronic address: [email protected]. |
| authorships[1].author.id | https://openalex.org/A5044574513 |
| authorships[1].author.orcid | https://orcid.org/0000-0003-4940-2791 |
| authorships[1].author.display_name | Li-Ching Chen |
| authorships[1].countries | TW |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I25846049 |
| authorships[1].affiliations[0].raw_affiliation_string | Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan. |
| authorships[1].institutions[0].id | https://openalex.org/I25846049 |
| authorships[1].institutions[0].ror | https://ror.org/00zdnkx70 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I25846049 |
| authorships[1].institutions[0].country_code | TW |
| authorships[1].institutions[0].display_name | National Tsing Hua University |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Li-Ching Chen |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan. |
| authorships[2].author.id | https://openalex.org/A5044911740 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | Stephanie L Hyland |
| authorships[2].countries | GB |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I4210164937 |
| authorships[2].affiliations[0].raw_affiliation_string | Microsoft Research, Cambridge, UK. |
| authorships[2].institutions[0].id | https://openalex.org/I4210164937 |
| authorships[2].institutions[0].ror | https://ror.org/05k87vq12 |
| authorships[2].institutions[0].type | company |
| authorships[2].institutions[0].lineage | https://openalex.org/I1290206253, https://openalex.org/I4210164937 |
| authorships[2].institutions[0].country_code | GB |
| authorships[2].institutions[0].display_name | Microsoft Research (United Kingdom) |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Stephanie L Hyland |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Microsoft Research, Cambridge, UK. |
| authorships[3].author.id | https://openalex.org/A5021970332 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-0711-0494 |
| authorships[3].author.display_name | Armand R. J. Girbes |
| authorships[3].countries | NL |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I4210151833, https://openalex.org/I911458345 |
| authorships[3].affiliations[0].raw_affiliation_string | Laboratory for Critical Care Computational Intelligence, Department of Intensive Care Medicine, Amsterdam Medical Data Science, Amsterdam Cardiovascular Science, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, Location VUmc, Amsterdam, Netherlands. |
| authorships[3].institutions[0].id | https://openalex.org/I911458345 |
| authorships[3].institutions[0].ror | https://ror.org/00q6h8f30 |
| authorships[3].institutions[0].type | healthcare |
| authorships[3].institutions[0].lineage | https://openalex.org/I4210151833, https://openalex.org/I911458345 |
| authorships[3].institutions[0].country_code | NL |
| authorships[3].institutions[0].display_name | Amsterdam UMC Location Vrije Universiteit Amsterdam |
| authorships[3].institutions[1].id | https://openalex.org/I4210151833 |
| authorships[3].institutions[1].ror | https://ror.org/05grdyy37 |
| authorships[3].institutions[1].type | healthcare |
| authorships[3].institutions[1].lineage | https://openalex.org/I4210151833 |
| authorships[3].institutions[1].country_code | NL |
| authorships[3].institutions[1].display_name | Amsterdam University Medical Centers |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Armand Girbes |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Laboratory for Critical Care Computational Intelligence, Department of Intensive Care Medicine, Amsterdam Medical Data Science, Amsterdam Cardiovascular Science, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, Location VUmc, Amsterdam, Netherlands. |
| authorships[4].author.id | https://openalex.org/A5060893723 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-0447-6893 |
| authorships[4].author.display_name | Paul Elbers |
| authorships[4].countries | NL |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I4210151833, https://openalex.org/I911458345 |
| authorships[4].affiliations[0].raw_affiliation_string | Laboratory for Critical Care Computational Intelligence, Department of Intensive Care Medicine, Amsterdam Medical Data Science, Amsterdam Cardiovascular Science, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, Location VUmc, Amsterdam, Netherlands. |
| authorships[4].institutions[0].id | https://openalex.org/I911458345 |
| authorships[4].institutions[0].ror | https://ror.org/00q6h8f30 |
| authorships[4].institutions[0].type | healthcare |
| authorships[4].institutions[0].lineage | https://openalex.org/I4210151833, https://openalex.org/I911458345 |
| authorships[4].institutions[0].country_code | NL |
| authorships[4].institutions[0].display_name | Amsterdam UMC Location Vrije Universiteit Amsterdam |
| authorships[4].institutions[1].id | https://openalex.org/I4210151833 |
| authorships[4].institutions[1].ror | https://ror.org/05grdyy37 |
| authorships[4].institutions[1].type | healthcare |
| authorships[4].institutions[1].lineage | https://openalex.org/I4210151833 |
| authorships[4].institutions[1].country_code | NL |
| authorships[4].institutions[1].display_name | Amsterdam University Medical Centers |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Paul Elbers |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | Laboratory for Critical Care Computational Intelligence, Department of Intensive Care Medicine, Amsterdam Medical Data Science, Amsterdam Cardiovascular Science, Amsterdam Institute for Infection and Immunity, Amsterdam UMC, Location VUmc, Amsterdam, Netherlands. |
| authorships[5].author.id | https://openalex.org/A5031401755 |
| authorships[5].author.orcid | https://orcid.org/0000-0001-6712-6626 |
| authorships[5].author.display_name | Leo Anthony Celi |
| authorships[5].countries | US |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I1316535847, https://openalex.org/I63966007 |
| authorships[5].affiliations[0].raw_affiliation_string | Laboratory for Computational Physiology, Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA; Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA. |
| authorships[5].institutions[0].id | https://openalex.org/I1316535847 |
| authorships[5].institutions[0].ror | https://ror.org/04drvxt59 |
| authorships[5].institutions[0].type | healthcare |
| authorships[5].institutions[0].lineage | https://openalex.org/I1316535847 |
| authorships[5].institutions[0].country_code | US |
| authorships[5].institutions[0].display_name | Beth Israel Deaconess Medical Center |
| authorships[5].institutions[1].id | https://openalex.org/I63966007 |
| authorships[5].institutions[1].ror | https://ror.org/042nb2s44 |
| authorships[5].institutions[1].type | education |
| authorships[5].institutions[1].lineage | https://openalex.org/I63966007 |
| authorships[5].institutions[1].country_code | US |
| authorships[5].institutions[1].display_name | Massachusetts Institute of Technology |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Leo A Celi |
| authorships[5].is_corresponding | False |
| authorships[5].raw_affiliation_strings | Laboratory for Computational Physiology, Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA; Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA. |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.1016/s2589-7500(22)00154-6 |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Leveraging electronic health records for data science: common pitfalls and how to avoid them |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T13702 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9994000196456909 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Machine Learning in Healthcare |
| related_works | https://openalex.org/W2609159405, https://openalex.org/W1165680166, https://openalex.org/W2465593037, https://openalex.org/W2591061639, https://openalex.org/W2385119568, https://openalex.org/W2027457585, https://openalex.org/W3196878821, https://openalex.org/W4221154678, https://openalex.org/W2107854016, https://openalex.org/W4296902923 |
| cited_by_count | 130 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 54 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 50 |
| counts_by_year[2].year | 2023 |
| counts_by_year[2].cited_by_count | 24 |
| counts_by_year[3].year | 2022 |
| counts_by_year[3].cited_by_count | 2 |
| locations_count | 4 |
| best_oa_location.id | doi:10.1016/s2589-7500(22)00154-6 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4210237014 |
| best_oa_location.source.issn | 2589-7500 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2589-7500 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | The Lancet Digital Health |
| best_oa_location.source.host_organization | https://openalex.org/P4310320990 |
| best_oa_location.source.host_organization_name | Elsevier BV |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310320990 |
| best_oa_location.source.host_organization_lineage_names | Elsevier BV |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | The Lancet Digital Health |
| best_oa_location.landing_page_url | https://doi.org/10.1016/s2589-7500(22)00154-6 |
| primary_location.id | doi:10.1016/s2589-7500(22)00154-6 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4210237014 |
| primary_location.source.issn | 2589-7500 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2589-7500 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | The Lancet Digital Health |
| primary_location.source.host_organization | https://openalex.org/P4310320990 |
| primary_location.source.host_organization_name | Elsevier BV |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310320990 |
| primary_location.source.host_organization_lineage_names | Elsevier BV |
| primary_location.license | cc-by |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | The Lancet Digital Health |
| primary_location.landing_page_url | https://doi.org/10.1016/s2589-7500(22)00154-6 |
| publication_date | 2022-09-22 |
| publication_year | 2022 |
| referenced_works | https://openalex.org/W3136574244, https://openalex.org/W3135685263, https://openalex.org/W3157618458, https://openalex.org/W2896893468, https://openalex.org/W2051153808, https://openalex.org/W2979728113, https://openalex.org/W2766333003, https://openalex.org/W2158939484, https://openalex.org/W2517388783, https://openalex.org/W6781213985, https://openalex.org/W2580732432, https://openalex.org/W3207434859, https://openalex.org/W3209214964, https://openalex.org/W2307997912, https://openalex.org/W3120416406, https://openalex.org/W2906295032, https://openalex.org/W2597505554, https://openalex.org/W3149972283, https://openalex.org/W6713088067, https://openalex.org/W3036236969, https://openalex.org/W2117699144, https://openalex.org/W1975344296, https://openalex.org/W2116439696, https://openalex.org/W2070338534, https://openalex.org/W3206775159, https://openalex.org/W3199873668, https://openalex.org/W2809895810, https://openalex.org/W2012639032, https://openalex.org/W2058053844, https://openalex.org/W3083461656, https://openalex.org/W2282181907, https://openalex.org/W2145577370, https://openalex.org/W2164990391, https://openalex.org/W2768146862, https://openalex.org/W3031328926, https://openalex.org/W3044002062, https://openalex.org/W2909528198, https://openalex.org/W3107716710, https://openalex.org/W2899765933, https://openalex.org/W2280404143, https://openalex.org/W2782117851, https://openalex.org/W3174786846, https://openalex.org/W2130300971, https://openalex.org/W2162031798, https://openalex.org/W6748816092, https://openalex.org/W2031392253, https://openalex.org/W2523378431, https://openalex.org/W2042693868, https://openalex.org/W3013076305, https://openalex.org/W3214405491, https://openalex.org/W6640610376, https://openalex.org/W2909650734, https://openalex.org/W6725839788, https://openalex.org/W2736287575, https://openalex.org/W3040775906, https://openalex.org/W6685812147, https://openalex.org/W6679609430, https://openalex.org/W2043464907, https://openalex.org/W2046141107, https://openalex.org/W2035846549, https://openalex.org/W6756923131, https://openalex.org/W2897831543, https://openalex.org/W2298338128, https://openalex.org/W2548421507, https://openalex.org/W3210241905, https://openalex.org/W3039902043, https://openalex.org/W4214864397, https://openalex.org/W3080627676, https://openalex.org/W3046836011, https://openalex.org/W3101973032, https://openalex.org/W2512716582, https://openalex.org/W2985962305, https://openalex.org/W2788104422, https://openalex.org/W3042009984, https://openalex.org/W1946283557, https://openalex.org/W4214935020, https://openalex.org/W2952428244, https://openalex.org/W2132927459 |
| referenced_works_count | 78 |
| abstract_inverted_index | |
| cited_by_percentile_year.max | 100 |
| cited_by_percentile_year.min | 94 |
| countries_distinct_count | 4 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile.value | 0.99489289 |
| citation_normalized_percentile.is_in_top_1_percent | True |
| citation_normalized_percentile.is_in_top_10_percent | True |