Introducing mCODEGPT as a zero-shot information extraction from clinical free text data tool for cancer research Article Swipe
Kai Zhang
,
Tongtong Huang
,
Bradley Malin
,
Travis Osterman
,
Qi Long
,
Xiaoqian Jiang
·
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.1038/s43856-025-01116-x
YOU?
·
· 2025
· Open Access
·
· DOI: https://doi.org/10.1038/s43856-025-01116-x
Related Topics
Concepts
No concepts available.
Metadata
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.1038/s43856-025-01116-x
- https://www.nature.com/articles/s43856-025-01116-x.pdf
- OA Status
- gold
- Cited By
- 1
- References
- 24
- OpenAlex ID
- https://openalex.org/W4415205077
All OpenAlex metadata
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4415205077Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1038/s43856-025-01116-xDigital Object Identifier
- Title
-
Introducing mCODEGPT as a zero-shot information extraction from clinical free text data tool for cancer researchWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2025Year of publication
- Publication date
-
2025-10-15Full publication date if available
- Authors
-
Kai Zhang, Tongtong Huang, Bradley Malin, Travis Osterman, Qi Long, Xiaoqian JiangList of authors in order
- Landing page
-
https://doi.org/10.1038/s43856-025-01116-xPublisher landing page
- PDF URL
-
https://www.nature.com/articles/s43856-025-01116-x.pdfDirect link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
goldOpen access status per OpenAlex
- OA URL
-
https://www.nature.com/articles/s43856-025-01116-x.pdfDirect OA link when available
- Cited by
-
1Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 1Per-year citation counts (last 5 years)
- References (count)
-
24Number of works referenced by this work
Full payload
| id | https://openalex.org/W4415205077 |
|---|---|
| doi | https://doi.org/10.1038/s43856-025-01116-x |
| ids.doi | https://doi.org/10.1038/s43856-025-01116-x |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/41093969 |
| ids.openalex | https://openalex.org/W4415205077 |
| fwci | 2.68294463 |
| type | article |
| title | Introducing mCODEGPT as a zero-shot information extraction from clinical free text data tool for cancer research |
| biblio.issue | 1 |
| biblio.volume | 5 |
| biblio.last_page | 422 |
| biblio.first_page | 422 |
| topics[0].id | https://openalex.org/T11710 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9994999766349792 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Biomedical Text Mining and Ontologies |
| topics[1].id | https://openalex.org/T10028 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9986000061035156 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Topic Modeling |
| topics[2].id | https://openalex.org/T11719 |
| topics[2].field.id | https://openalex.org/fields/18 |
| topics[2].field.display_name | Decision Sciences |
| topics[2].score | 0.9606999754905701 |
| topics[2].domain.id | https://openalex.org/domains/2 |
| topics[2].domain.display_name | Social Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1803 |
| topics[2].subfield.display_name | Management Science and Operations Research |
| topics[2].display_name | Data Quality and Management |
| is_xpac | False |
| apc_list.value | 2290 |
| apc_list.currency | GBP |
| apc_list.value_usd | 2808 |
| apc_paid.value | 2290 |
| apc_paid.currency | GBP |
| apc_paid.value_usd | 2808 |
| language | en |
| locations[0].id | doi:10.1038/s43856-025-01116-x |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4210167893 |
| locations[0].source.issn | 2730-664X |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2730-664X |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | Communications Medicine |
| locations[0].source.host_organization | https://openalex.org/P4310319908 |
| locations[0].source.host_organization_name | Nature Portfolio |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310319908, https://openalex.org/P4310319965 |
| locations[0].source.host_organization_lineage_names | Nature Portfolio, Springer Nature |
| locations[0].license | cc-by-nc-nd |
| locations[0].pdf_url | https://www.nature.com/articles/s43856-025-01116-x.pdf |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by-nc-nd |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Communications Medicine |
| locations[0].landing_page_url | https://doi.org/10.1038/s43856-025-01116-x |
| locations[1].id | pmid:41093969 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | Communications medicine |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/41093969 |
| locations[2].id | pmh:oai:doaj.org/article:bb6dd3dd5d4e408fa90f26736c5580c9 |
| locations[2].is_oa | False |
| locations[2].source.id | https://openalex.org/S4306401280 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | DOAJ (DOAJ: Directory of Open Access Journals) |
| locations[2].source.host_organization | |
| locations[2].source.host_organization_name | |
| locations[2].license | |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | article |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | Communications Medicine, Vol 5, Iss 1, Pp 1-14 (2025) |
| locations[2].landing_page_url | https://doaj.org/article/bb6dd3dd5d4e408fa90f26736c5580c9 |
| locations[3].id | pmh:oai:europepmc.org:11337540 |
| locations[3].is_oa | True |
| locations[3].source.id | https://openalex.org/S4306400806 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | False |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | Europe PMC (PubMed Central) |
| locations[3].source.host_organization | https://openalex.org/I1303153112 |
| locations[3].source.host_organization_name | European Bioinformatics Institute |
| locations[3].source.host_organization_lineage | https://openalex.org/I1303153112 |
| locations[3].license | other-oa |
| locations[3].pdf_url | |
| locations[3].version | submittedVersion |
| locations[3].raw_type | Text |
| locations[3].license_id | https://openalex.org/licenses/other-oa |
| locations[3].is_accepted | False |
| locations[3].is_published | False |
| locations[3].raw_source_name | |
| locations[3].landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/12528503 |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5023152324 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-4519-609X |
| authorships[0].author.display_name | Kai Zhang |
| authorships[0].countries | US |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I919571938 |
| authorships[0].affiliations[0].raw_affiliation_string | Department of Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA |
| authorships[0].institutions[0].id | https://openalex.org/I919571938 |
| authorships[0].institutions[0].ror | https://ror.org/03gds6c39 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I919571938 |
| authorships[0].institutions[0].country_code | US |
| authorships[0].institutions[0].display_name | The University of Texas Health Science Center at Houston |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Kai Zhang |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Department of Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA |
| authorships[1].author.id | https://openalex.org/A5029365550 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Tongtong Huang |
| authorships[1].countries | US |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I919571938 |
| authorships[1].affiliations[0].raw_affiliation_string | Department of Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA |
| authorships[1].institutions[0].id | https://openalex.org/I919571938 |
| authorships[1].institutions[0].ror | https://ror.org/03gds6c39 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I919571938 |
| authorships[1].institutions[0].country_code | US |
| authorships[1].institutions[0].display_name | The University of Texas Health Science Center at Houston |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Tongtong Huang |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Department of Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA |
| authorships[2].author.id | https://openalex.org/A5090647314 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-3040-5175 |
| authorships[2].author.display_name | Bradley Malin |
| authorships[2].countries | US |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I901861585 |
| authorships[2].affiliations[0].raw_affiliation_string | Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA |
| authorships[2].institutions[0].id | https://openalex.org/I901861585 |
| authorships[2].institutions[0].ror | https://ror.org/05dq2gs74 |
| authorships[2].institutions[0].type | healthcare |
| authorships[2].institutions[0].lineage | https://openalex.org/I4210162197, https://openalex.org/I901861585 |
| authorships[2].institutions[0].country_code | US |
| authorships[2].institutions[0].display_name | Vanderbilt University Medical Center |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Bradley A. Malin |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA |
| authorships[3].author.id | https://openalex.org/A5049126384 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-2841-8121 |
| authorships[3].author.display_name | Travis Osterman |
| authorships[3].countries | US |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I901861585 |
| authorships[3].affiliations[0].raw_affiliation_string | Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA |
| authorships[3].institutions[0].id | https://openalex.org/I901861585 |
| authorships[3].institutions[0].ror | https://ror.org/05dq2gs74 |
| authorships[3].institutions[0].type | healthcare |
| authorships[3].institutions[0].lineage | https://openalex.org/I4210162197, https://openalex.org/I901861585 |
| authorships[3].institutions[0].country_code | US |
| authorships[3].institutions[0].display_name | Vanderbilt University Medical Center |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Travis Osterman |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA |
| authorships[4].author.id | https://openalex.org/A5002149616 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-0660-5230 |
| authorships[4].author.display_name | Qi Long |
| authorships[4].countries | US |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I79576946 |
| authorships[4].affiliations[0].raw_affiliation_string | Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA |
| authorships[4].institutions[0].id | https://openalex.org/I79576946 |
| authorships[4].institutions[0].ror | https://ror.org/00b30xv10 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I79576946 |
| authorships[4].institutions[0].country_code | US |
| authorships[4].institutions[0].display_name | University of Pennsylvania |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Qi Long |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA |
| authorships[5].author.id | https://openalex.org/A5055458864 |
| authorships[5].author.orcid | https://orcid.org/0000-0001-9933-2205 |
| authorships[5].author.display_name | Xiaoqian Jiang |
| authorships[5].countries | US |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I919571938 |
| authorships[5].affiliations[0].raw_affiliation_string | Department of Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA |
| authorships[5].institutions[0].id | https://openalex.org/I919571938 |
| authorships[5].institutions[0].ror | https://ror.org/03gds6c39 |
| authorships[5].institutions[0].type | education |
| authorships[5].institutions[0].lineage | https://openalex.org/I919571938 |
| authorships[5].institutions[0].country_code | US |
| authorships[5].institutions[0].display_name | The University of Texas Health Science Center at Houston |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Xiaoqian Jiang |
| authorships[5].is_corresponding | False |
| authorships[5].raw_affiliation_strings | Department of Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, USA |
| has_content.pdf | True |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://www.nature.com/articles/s43856-025-01116-x.pdf |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-16T00:00:00 |
| display_name | Introducing mCODEGPT as a zero-shot information extraction from clinical free text data tool for cancer research |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T11710 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9994999766349792 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Biomedical Text Mining and Ontologies |
| cited_by_count | 1 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| locations_count | 4 |
| best_oa_location.id | doi:10.1038/s43856-025-01116-x |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4210167893 |
| best_oa_location.source.issn | 2730-664X |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2730-664X |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | Communications Medicine |
| best_oa_location.source.host_organization | https://openalex.org/P4310319908 |
| best_oa_location.source.host_organization_name | Nature Portfolio |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310319908, https://openalex.org/P4310319965 |
| best_oa_location.source.host_organization_lineage_names | Nature Portfolio, Springer Nature |
| best_oa_location.license | cc-by-nc-nd |
| best_oa_location.pdf_url | https://www.nature.com/articles/s43856-025-01116-x.pdf |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by-nc-nd |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Communications Medicine |
| best_oa_location.landing_page_url | https://doi.org/10.1038/s43856-025-01116-x |
| primary_location.id | doi:10.1038/s43856-025-01116-x |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4210167893 |
| primary_location.source.issn | 2730-664X |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2730-664X |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | Communications Medicine |
| primary_location.source.host_organization | https://openalex.org/P4310319908 |
| primary_location.source.host_organization_name | Nature Portfolio |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310319908, https://openalex.org/P4310319965 |
| primary_location.source.host_organization_lineage_names | Nature Portfolio, Springer Nature |
| primary_location.license | cc-by-nc-nd |
| primary_location.pdf_url | https://www.nature.com/articles/s43856-025-01116-x.pdf |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by-nc-nd |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Communications Medicine |
| primary_location.landing_page_url | https://doi.org/10.1038/s43856-025-01116-x |
| publication_date | 2025-10-15 |
| publication_year | 2025 |
| referenced_works | https://openalex.org/W1986349546, https://openalex.org/W2042210471, https://openalex.org/W3181361218, https://openalex.org/W2004910511, https://openalex.org/W2175004776, https://openalex.org/W2913389685, https://openalex.org/W4251372957, https://openalex.org/W2123442489, https://openalex.org/W2117243442, https://openalex.org/W2037987185, https://openalex.org/W2159583324, https://openalex.org/W4393094733, https://openalex.org/W4403488282, https://openalex.org/W4405427397, https://openalex.org/W4297253404, https://openalex.org/W2911489562, https://openalex.org/W2970771982, https://openalex.org/W4389487200, https://openalex.org/W4392619039, https://openalex.org/W2059300459, https://openalex.org/W3097870063, https://openalex.org/W2979826702, https://openalex.org/W7075891447, https://openalex.org/W6949647045 |
| referenced_works_count | 24 |
| abstract_inverted_index | |
| cited_by_percentile_year.max | 95 |
| cited_by_percentile_year.min | 91 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile.value | 0.86067863 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |