Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB data Article Swipe
YOU?
·
· 2023
· Open Access
·
· DOI: https://doi.org/10.1038/s41597-023-02101-6
More than 61,000 proteins have up-to-date correspondence between their amino acid sequence (UniProtKB) and their 3D structures (PDB), enabled by the Structure Integration with Function, Taxonomy and Sequences (SIFTS) resource. SIFTS incorporates residue-level annotations from many other biological resources. SIFTS data is available in various formats like XML, CSV and TSV format or also accessible via the PDBe REST API but always maintained separately from the structure data (PDBx/mmCIF file) in the PDB archive. Here, we extended the wwPDB PDBx/mmCIF data dictionary with additional categories to accommodate SIFTS data and added the UniProtKB, Pfam, SCOP2, and CATH residue-level annotations directly into the PDBx/mmCIF files from the PDB archive. With the integrated UniProtKB annotations, these files now provide consistent numbering of residues in different PDB entries allowing easy comparison of structure models. The extended dictionary yields a more consistent, standardised metadata description without altering the core PDB information. This development enables up-to-date cross-reference information at the residue level resulting in better data interoperability, supporting improved data analysis and visualisation.
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.1038/s41597-023-02101-6
- https://www.nature.com/articles/s41597-023-02101-6.pdf
- OA Status
- gold
- Cited By
- 9
- References
- 81
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4365137095
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4365137095Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1038/s41597-023-02101-6Digital Object Identifier
- Title
-
Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB dataWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2023Year of publication
- Publication date
-
2023-04-12Full publication date if available
- Authors
-
Preeti Choudhary, Stephen Anyango, John M. Berrisford, James Tolchard, Mihály Váradi, Sameer VelankarList of authors in order
- Landing page
-
https://doi.org/10.1038/s41597-023-02101-6Publisher landing page
- PDF URL
-
https://www.nature.com/articles/s41597-023-02101-6.pdfDirect link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
goldOpen access status per OpenAlex
- OA URL
-
https://www.nature.com/articles/s41597-023-02101-6.pdfDirect OA link when available
- Concepts
-
UniProt, Protein Data Bank (RCSB PDB), Metadata, Computer science, Database, Information retrieval, XML, Interoperability, World Wide Web, Biology, Gene, BiochemistryTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
9Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 2, 2024: 4, 2023: 3Per-year citation counts (last 5 years)
- References (count)
-
81Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4365137095 |
|---|---|
| doi | https://doi.org/10.1038/s41597-023-02101-6 |
| ids.doi | https://doi.org/10.1038/s41597-023-02101-6 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/37045837 |
| ids.openalex | https://openalex.org/W4365137095 |
| fwci | 1.20613585 |
| type | article |
| title | Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB data |
| awards[0].id | https://openalex.org/G6993346261 |
| awards[0].funder_id | https://openalex.org/F4320306076 |
| awards[0].funder_award_id | DBI-2019297, PI: S.K. Burley) |
| awards[0].funder_display_name | National Science Foundation |
| awards[1].id | https://openalex.org/G4232685941 |
| awards[1].funder_id | https://openalex.org/F4320332265 |
| awards[1].funder_award_id | DBI-2019297, PI: S.K. Burley |
| awards[1].funder_display_name | National Science Board |
| awards[2].id | https://openalex.org/G4537705097 |
| awards[2].funder_id | https://openalex.org/F4320306076 |
| awards[2].funder_award_id | DBI-2019297, PI: S.K. Burley |
| awards[2].funder_display_name | National Science Foundation |
| awards[3].id | https://openalex.org/G754905133 |
| awards[3].funder_id | https://openalex.org/F4320334629 |
| awards[3].funder_award_id | BB/V004247/1, PI:Sameer Velankar |
| awards[3].funder_display_name | Biotechnology and Biological Sciences Research Council |
| biblio.issue | 1 |
| biblio.volume | 10 |
| biblio.last_page | 204 |
| biblio.first_page | 204 |
| grants[0].funder | https://openalex.org/F4320306076 |
| grants[0].award_id | DBI-2019297, PI: S.K. Burley |
| grants[0].funder_display_name | National Science Foundation |
| grants[1].funder | https://openalex.org/F4320306076 |
| grants[1].award_id | DBI-2019297, PI: S.K. Burley |
| grants[1].funder_display_name | National Science Foundation |
| grants[2].funder | https://openalex.org/F4320306076 |
| grants[2].award_id | DBI-2019297, PI: S.K. Burley |
| grants[2].funder_display_name | National Science Foundation |
| grants[3].funder | https://openalex.org/F4320306076 |
| grants[3].award_id | DBI-2019297, PI: S.K. Burley |
| grants[3].funder_display_name | National Science Foundation |
| grants[4].funder | https://openalex.org/F4320306076 |
| grants[4].award_id | DBI-2019297, PI: S.K. Burley) |
| grants[4].funder_display_name | National Science Foundation |
| grants[5].funder | https://openalex.org/F4320313288 |
| grants[5].award_id | |
| grants[5].funder_display_name | European Bioinformatics Institute |
| grants[6].funder | https://openalex.org/F4320332265 |
| grants[6].award_id | DBI-2019297, PI: S.K. Burley |
| grants[6].funder_display_name | National Science Board |
| grants[7].funder | https://openalex.org/F4320334629 |
| grants[7].award_id | BB/V004247/1, PI:Sameer Velankar |
| grants[7].funder_display_name | Biotechnology and Biological Sciences Research Council |
| grants[8].funder | https://openalex.org/F4320334629 |
| grants[8].award_id | BB/V004247/1, PI:Sameer Velankar |
| grants[8].funder_display_name | Biotechnology and Biological Sciences Research Council |
| grants[9].funder | https://openalex.org/F4320334629 |
| grants[9].award_id | BB/V004247/1, PI:Sameer Velankar |
| grants[9].funder_display_name | Biotechnology and Biological Sciences Research Council |
| grants[10].funder | https://openalex.org/F4320334629 |
| grants[10].award_id | BB/V004247/1, PI:Sameer Velankar |
| grants[10].funder_display_name | Biotechnology and Biological Sciences Research Council |
| grants[11].funder | https://openalex.org/F4320334629 |
| grants[11].award_id | BB/V004247/1, PI:Sameer Velankar |
| grants[11].funder_display_name | Biotechnology and Biological Sciences Research Council |
| grants[12].funder | https://openalex.org/F4320334629 |
| grants[12].award_id | BB/V004247/1, PI:Sameer Velankar |
| grants[12].funder_display_name | Biotechnology and Biological Sciences Research Council |
| topics[0].id | https://openalex.org/T11162 |
| topics[0].field.id | https://openalex.org/fields/25 |
| topics[0].field.display_name | Materials Science |
| topics[0].score | 0.9994000196456909 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/2505 |
| topics[0].subfield.display_name | Materials Chemistry |
| topics[0].display_name | Enzyme Structure and Function |
| topics[1].id | https://openalex.org/T10015 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.9991000294685364 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1312 |
| topics[1].subfield.display_name | Molecular Biology |
| topics[1].display_name | Genomics and Phylogenetic Studies |
| topics[2].id | https://openalex.org/T10521 |
| topics[2].field.id | https://openalex.org/fields/13 |
| topics[2].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[2].score | 0.9983000159263611 |
| topics[2].domain.id | https://openalex.org/domains/1 |
| topics[2].domain.display_name | Life Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1312 |
| topics[2].subfield.display_name | Molecular Biology |
| topics[2].display_name | RNA and protein synthesis mechanisms |
| funders[0].id | https://openalex.org/F4320306076 |
| funders[0].ror | https://ror.org/021nxhr62 |
| funders[0].display_name | National Science Foundation |
| funders[1].id | https://openalex.org/F4320313288 |
| funders[1].ror | https://ror.org/02catss52 |
| funders[1].display_name | European Bioinformatics Institute |
| funders[2].id | https://openalex.org/F4320332265 |
| funders[2].ror | https://ror.org/00apvva27 |
| funders[2].display_name | National Science Board |
| funders[3].id | https://openalex.org/F4320334629 |
| funders[3].ror | https://ror.org/00cwqg982 |
| funders[3].display_name | Biotechnology and Biological Sciences Research Council |
| is_xpac | False |
| apc_list.value | 1990 |
| apc_list.currency | USD |
| apc_list.value_usd | 1990 |
| apc_paid.value | 1990 |
| apc_paid.currency | USD |
| apc_paid.value_usd | 1990 |
| concepts[0].id | https://openalex.org/C202264299 |
| concepts[0].level | 3 |
| concepts[0].score | 0.8985824584960938 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q905695 |
| concepts[0].display_name | UniProt |
| concepts[1].id | https://openalex.org/C65556437 |
| concepts[1].level | 2 |
| concepts[1].score | 0.8746500015258789 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q766195 |
| concepts[1].display_name | Protein Data Bank (RCSB PDB) |
| concepts[2].id | https://openalex.org/C93518851 |
| concepts[2].level | 2 |
| concepts[2].score | 0.6989517211914062 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q180160 |
| concepts[2].display_name | Metadata |
| concepts[3].id | https://openalex.org/C41008148 |
| concepts[3].level | 0 |
| concepts[3].score | 0.6649366617202759 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[3].display_name | Computer science |
| concepts[4].id | https://openalex.org/C77088390 |
| concepts[4].level | 1 |
| concepts[4].score | 0.5818549394607544 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q8513 |
| concepts[4].display_name | Database |
| concepts[5].id | https://openalex.org/C23123220 |
| concepts[5].level | 1 |
| concepts[5].score | 0.4715084135532379 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q816826 |
| concepts[5].display_name | Information retrieval |
| concepts[6].id | https://openalex.org/C8797682 |
| concepts[6].level | 2 |
| concepts[6].score | 0.4616924524307251 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q2115 |
| concepts[6].display_name | XML |
| concepts[7].id | https://openalex.org/C20136886 |
| concepts[7].level | 2 |
| concepts[7].score | 0.4472900629043579 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q749647 |
| concepts[7].display_name | Interoperability |
| concepts[8].id | https://openalex.org/C136764020 |
| concepts[8].level | 1 |
| concepts[8].score | 0.26500818133354187 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q466 |
| concepts[8].display_name | World Wide Web |
| concepts[9].id | https://openalex.org/C86803240 |
| concepts[9].level | 0 |
| concepts[9].score | 0.1471470594406128 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[9].display_name | Biology |
| concepts[10].id | https://openalex.org/C104317684 |
| concepts[10].level | 2 |
| concepts[10].score | 0.0 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[10].display_name | Gene |
| concepts[11].id | https://openalex.org/C55493867 |
| concepts[11].level | 1 |
| concepts[11].score | 0.0 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q7094 |
| concepts[11].display_name | Biochemistry |
| keywords[0].id | https://openalex.org/keywords/uniprot |
| keywords[0].score | 0.8985824584960938 |
| keywords[0].display_name | UniProt |
| keywords[1].id | https://openalex.org/keywords/protein-data-bank |
| keywords[1].score | 0.8746500015258789 |
| keywords[1].display_name | Protein Data Bank (RCSB PDB) |
| keywords[2].id | https://openalex.org/keywords/metadata |
| keywords[2].score | 0.6989517211914062 |
| keywords[2].display_name | Metadata |
| keywords[3].id | https://openalex.org/keywords/computer-science |
| keywords[3].score | 0.6649366617202759 |
| keywords[3].display_name | Computer science |
| keywords[4].id | https://openalex.org/keywords/database |
| keywords[4].score | 0.5818549394607544 |
| keywords[4].display_name | Database |
| keywords[5].id | https://openalex.org/keywords/information-retrieval |
| keywords[5].score | 0.4715084135532379 |
| keywords[5].display_name | Information retrieval |
| keywords[6].id | https://openalex.org/keywords/xml |
| keywords[6].score | 0.4616924524307251 |
| keywords[6].display_name | XML |
| keywords[7].id | https://openalex.org/keywords/interoperability |
| keywords[7].score | 0.4472900629043579 |
| keywords[7].display_name | Interoperability |
| keywords[8].id | https://openalex.org/keywords/world-wide-web |
| keywords[8].score | 0.26500818133354187 |
| keywords[8].display_name | World Wide Web |
| keywords[9].id | https://openalex.org/keywords/biology |
| keywords[9].score | 0.1471470594406128 |
| keywords[9].display_name | Biology |
| language | en |
| locations[0].id | doi:10.1038/s41597-023-02101-6 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S2607323502 |
| locations[0].source.issn | 2052-4463 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2052-4463 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | Scientific Data |
| locations[0].source.host_organization | https://openalex.org/P4310319908 |
| locations[0].source.host_organization_name | Nature Portfolio |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310319908 |
| locations[0].license | cc-by |
| locations[0].pdf_url | https://www.nature.com/articles/s41597-023-02101-6.pdf |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Scientific Data |
| locations[0].landing_page_url | https://doi.org/10.1038/s41597-023-02101-6 |
| locations[1].id | pmid:37045837 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | Scientific data |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/37045837 |
| locations[2].id | pmh:oai:pubmedcentral.nih.gov:10097656 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S2764455111 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | PubMed Central |
| locations[2].source.host_organization | https://openalex.org/I1299303238 |
| locations[2].source.host_organization_name | National Institutes of Health |
| locations[2].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[2].license | cc-by |
| locations[2].pdf_url | https://pmc.ncbi.nlm.nih.gov/articles/PMC10097656/pdf/41597_2023_Article_2101.pdf |
| locations[2].version | submittedVersion |
| locations[2].raw_type | Text |
| locations[2].license_id | https://openalex.org/licenses/cc-by |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | Sci Data |
| locations[2].landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/10097656 |
| locations[3].id | pmh:oai:doaj.org/article:f525fdcb809742669e918419cf913c02 |
| locations[3].is_oa | False |
| locations[3].source.id | https://openalex.org/S4306401280 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | False |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | DOAJ (DOAJ: Directory of Open Access Journals) |
| locations[3].source.host_organization | |
| locations[3].source.host_organization_name | |
| locations[3].source.host_organization_lineage | |
| locations[3].license | |
| locations[3].pdf_url | |
| locations[3].version | submittedVersion |
| locations[3].raw_type | article |
| locations[3].license_id | |
| locations[3].is_accepted | False |
| locations[3].is_published | False |
| locations[3].raw_source_name | Scientific Data, Vol 10, Iss 1, Pp 1-13 (2023) |
| locations[3].landing_page_url | https://doaj.org/article/f525fdcb809742669e918419cf913c02 |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5039912173 |
| authorships[0].author.orcid | https://orcid.org/0000-0003-2340-3278 |
| authorships[0].author.display_name | Preeti Choudhary |
| authorships[0].countries | GB |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I1303153112 |
| authorships[0].affiliations[0].raw_affiliation_string | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[0].institutions[0].id | https://openalex.org/I1303153112 |
| authorships[0].institutions[0].ror | https://ror.org/02catss52 |
| authorships[0].institutions[0].type | facility |
| authorships[0].institutions[0].lineage | https://openalex.org/I1303153112, https://openalex.org/I4210138560 |
| authorships[0].institutions[0].country_code | GB |
| authorships[0].institutions[0].display_name | European Bioinformatics Institute |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Preeti Choudhary |
| authorships[0].is_corresponding | True |
| authorships[0].raw_affiliation_strings | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[1].author.id | https://openalex.org/A5057704514 |
| authorships[1].author.orcid | https://orcid.org/0000-0003-4838-443X |
| authorships[1].author.display_name | Stephen Anyango |
| authorships[1].countries | GB |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I1303153112 |
| authorships[1].affiliations[0].raw_affiliation_string | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[1].institutions[0].id | https://openalex.org/I1303153112 |
| authorships[1].institutions[0].ror | https://ror.org/02catss52 |
| authorships[1].institutions[0].type | facility |
| authorships[1].institutions[0].lineage | https://openalex.org/I1303153112, https://openalex.org/I4210138560 |
| authorships[1].institutions[0].country_code | GB |
| authorships[1].institutions[0].display_name | European Bioinformatics Institute |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Stephen Anyango |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[2].author.id | https://openalex.org/A5087973270 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-4442-5871 |
| authorships[2].author.display_name | John M. Berrisford |
| authorships[2].countries | GB |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I1303153112 |
| authorships[2].affiliations[0].raw_affiliation_string | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[2].institutions[0].id | https://openalex.org/I1303153112 |
| authorships[2].institutions[0].ror | https://ror.org/02catss52 |
| authorships[2].institutions[0].type | facility |
| authorships[2].institutions[0].lineage | https://openalex.org/I1303153112, https://openalex.org/I4210138560 |
| authorships[2].institutions[0].country_code | GB |
| authorships[2].institutions[0].display_name | European Bioinformatics Institute |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | John Berrisford |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[3].author.id | https://openalex.org/A5062308894 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-5779-4935 |
| authorships[3].author.display_name | James Tolchard |
| authorships[3].countries | GB |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I1303153112 |
| authorships[3].affiliations[0].raw_affiliation_string | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[3].institutions[0].id | https://openalex.org/I1303153112 |
| authorships[3].institutions[0].ror | https://ror.org/02catss52 |
| authorships[3].institutions[0].type | facility |
| authorships[3].institutions[0].lineage | https://openalex.org/I1303153112, https://openalex.org/I4210138560 |
| authorships[3].institutions[0].country_code | GB |
| authorships[3].institutions[0].display_name | European Bioinformatics Institute |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | James Tolchard |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[4].author.id | https://openalex.org/A5054254768 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-3687-0839 |
| authorships[4].author.display_name | Mihály Váradi |
| authorships[4].countries | GB |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I1303153112 |
| authorships[4].affiliations[0].raw_affiliation_string | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[4].institutions[0].id | https://openalex.org/I1303153112 |
| authorships[4].institutions[0].ror | https://ror.org/02catss52 |
| authorships[4].institutions[0].type | facility |
| authorships[4].institutions[0].lineage | https://openalex.org/I1303153112, https://openalex.org/I4210138560 |
| authorships[4].institutions[0].country_code | GB |
| authorships[4].institutions[0].display_name | European Bioinformatics Institute |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Mihaly Varadi |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[5].author.id | https://openalex.org/A5042460017 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-8439-5964 |
| authorships[5].author.display_name | Sameer Velankar |
| authorships[5].countries | GB |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I1303153112 |
| authorships[5].affiliations[0].raw_affiliation_string | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| authorships[5].institutions[0].id | https://openalex.org/I1303153112 |
| authorships[5].institutions[0].ror | https://ror.org/02catss52 |
| authorships[5].institutions[0].type | facility |
| authorships[5].institutions[0].lineage | https://openalex.org/I1303153112, https://openalex.org/I4210138560 |
| authorships[5].institutions[0].country_code | GB |
| authorships[5].institutions[0].display_name | European Bioinformatics Institute |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Sameer Velankar |
| authorships[5].is_corresponding | False |
| authorships[5].raw_affiliation_strings | Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://www.nature.com/articles/s41597-023-02101-6.pdf |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB data |
| has_fulltext | True |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T11162 |
| primary_topic.field.id | https://openalex.org/fields/25 |
| primary_topic.field.display_name | Materials Science |
| primary_topic.score | 0.9994000196456909 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/2505 |
| primary_topic.subfield.display_name | Materials Chemistry |
| primary_topic.display_name | Enzyme Structure and Function |
| related_works | https://openalex.org/W2162388472, https://openalex.org/W2765199194, https://openalex.org/W3132544722, https://openalex.org/W3181892255, https://openalex.org/W4232154957, https://openalex.org/W2103435205, https://openalex.org/W2022209941, https://openalex.org/W4292058683, https://openalex.org/W4205671017, https://openalex.org/W4307362031 |
| cited_by_count | 9 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 2 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 4 |
| counts_by_year[2].year | 2023 |
| counts_by_year[2].cited_by_count | 3 |
| locations_count | 4 |
| best_oa_location.id | doi:10.1038/s41597-023-02101-6 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S2607323502 |
| best_oa_location.source.issn | 2052-4463 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2052-4463 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | Scientific Data |
| best_oa_location.source.host_organization | https://openalex.org/P4310319908 |
| best_oa_location.source.host_organization_name | Nature Portfolio |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310319908 |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | https://www.nature.com/articles/s41597-023-02101-6.pdf |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Scientific Data |
| best_oa_location.landing_page_url | https://doi.org/10.1038/s41597-023-02101-6 |
| primary_location.id | doi:10.1038/s41597-023-02101-6 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S2607323502 |
| primary_location.source.issn | 2052-4463 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2052-4463 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | Scientific Data |
| primary_location.source.host_organization | https://openalex.org/P4310319908 |
| primary_location.source.host_organization_name | Nature Portfolio |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310319908 |
| primary_location.license | cc-by |
| primary_location.pdf_url | https://www.nature.com/articles/s41597-023-02101-6.pdf |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Scientific Data |
| primary_location.landing_page_url | https://doi.org/10.1038/s41597-023-02101-6 |
| publication_date | 2023-04-12 |
| publication_year | 2023 |
| referenced_works | https://openalex.org/W2898210859, https://openalex.org/W3112376646, https://openalex.org/W2154715418, https://openalex.org/W2040548339, https://openalex.org/W2038019215, https://openalex.org/W2091326717, https://openalex.org/W2786092929, https://openalex.org/W3181892255, https://openalex.org/W2010840375, https://openalex.org/W3118036451, https://openalex.org/W2982977147, https://openalex.org/W2900674118, https://openalex.org/W2170463736, https://openalex.org/W3095583226, https://openalex.org/W3094967361, https://openalex.org/W2128653811, https://openalex.org/W3106745904, https://openalex.org/W2104295259, https://openalex.org/W2103017472, https://openalex.org/W3112600195, https://openalex.org/W4205247356, https://openalex.org/W3048090444, https://openalex.org/W4247001379, https://openalex.org/W3214131546, https://openalex.org/W4200140556, https://openalex.org/W3206116958, https://openalex.org/W1517955486, https://openalex.org/W3110645309, https://openalex.org/W3165513106, https://openalex.org/W4224303521, https://openalex.org/W4250708691, https://openalex.org/W2804822363, https://openalex.org/W3211795435, https://openalex.org/W3177828909, https://openalex.org/W1533059251, https://openalex.org/W2793317762, https://openalex.org/W1910105166, https://openalex.org/W2016685368, https://openalex.org/W2158332589, https://openalex.org/W2774257091, https://openalex.org/W2972475246, https://openalex.org/W2029370147, https://openalex.org/W2050888629, https://openalex.org/W3022748398, https://openalex.org/W3014423874, https://openalex.org/W3166476894, https://openalex.org/W2599616891, https://openalex.org/W4281758613, https://openalex.org/W2187603966, https://openalex.org/W3206696710, https://openalex.org/W2931205426, https://openalex.org/W2582236637, https://openalex.org/W1199870172, https://openalex.org/W2750527680, https://openalex.org/W2984761660, https://openalex.org/W2224056471, https://openalex.org/W2944493837, https://openalex.org/W3009307274, https://openalex.org/W3028516693, https://openalex.org/W2898789672, https://openalex.org/W3164046276, https://openalex.org/W4310948626, https://openalex.org/W3157357317, https://openalex.org/W4313910025, https://openalex.org/W3186179742, https://openalex.org/W3159719254, https://openalex.org/W1987114890, https://openalex.org/W1965858496, https://openalex.org/W1980586346, https://openalex.org/W2153343124, https://openalex.org/W2145501578, https://openalex.org/W4286669440, https://openalex.org/W3093323969, https://openalex.org/W3217004381, https://openalex.org/W4321483489, https://openalex.org/W4229042017, https://openalex.org/W4220702678, https://openalex.org/W2096525273, https://openalex.org/W2763396749, https://openalex.org/W4205478563, https://openalex.org/W4226042196 |
| referenced_works_count | 81 |
| abstract_inverted_index.a | 136 |
| abstract_inverted_index.3D | 16 |
| abstract_inverted_index.at | 154 |
| abstract_inverted_index.by | 20 |
| abstract_inverted_index.in | 44, 71, 122, 159 |
| abstract_inverted_index.is | 42 |
| abstract_inverted_index.of | 120, 129 |
| abstract_inverted_index.or | 53 |
| abstract_inverted_index.to | 86 |
| abstract_inverted_index.we | 76 |
| abstract_inverted_index.API | 60 |
| abstract_inverted_index.CSV | 49 |
| abstract_inverted_index.PDB | 73, 107, 124, 146 |
| abstract_inverted_index.TSV | 51 |
| abstract_inverted_index.The | 132 |
| abstract_inverted_index.and | 14, 27, 50, 90, 96, 167 |
| abstract_inverted_index.but | 61 |
| abstract_inverted_index.now | 116 |
| abstract_inverted_index.the | 21, 57, 66, 72, 78, 92, 102, 106, 110, 144, 155 |
| abstract_inverted_index.via | 56 |
| abstract_inverted_index.CATH | 97 |
| abstract_inverted_index.More | 1 |
| abstract_inverted_index.PDBe | 58 |
| abstract_inverted_index.REST | 59 |
| abstract_inverted_index.This | 148 |
| abstract_inverted_index.With | 109 |
| abstract_inverted_index.XML, | 48 |
| abstract_inverted_index.acid | 11 |
| abstract_inverted_index.also | 54 |
| abstract_inverted_index.core | 145 |
| abstract_inverted_index.data | 41, 68, 81, 89, 161, 165 |
| abstract_inverted_index.easy | 127 |
| abstract_inverted_index.from | 35, 65, 105 |
| abstract_inverted_index.have | 5 |
| abstract_inverted_index.into | 101 |
| abstract_inverted_index.like | 47 |
| abstract_inverted_index.many | 36 |
| abstract_inverted_index.more | 137 |
| abstract_inverted_index.than | 2 |
| abstract_inverted_index.with | 24, 83 |
| abstract_inverted_index.Here, | 75 |
| abstract_inverted_index.Pfam, | 94 |
| abstract_inverted_index.SIFTS | 31, 40, 88 |
| abstract_inverted_index.added | 91 |
| abstract_inverted_index.amino | 10 |
| abstract_inverted_index.file) | 70 |
| abstract_inverted_index.files | 104, 115 |
| abstract_inverted_index.level | 157 |
| abstract_inverted_index.other | 37 |
| abstract_inverted_index.their | 9, 15 |
| abstract_inverted_index.these | 114 |
| abstract_inverted_index.wwPDB | 79 |
| abstract_inverted_index.(PDB), | 18 |
| abstract_inverted_index.61,000 | 3 |
| abstract_inverted_index.SCOP2, | 95 |
| abstract_inverted_index.always | 62 |
| abstract_inverted_index.better | 160 |
| abstract_inverted_index.format | 52 |
| abstract_inverted_index.yields | 135 |
| abstract_inverted_index.(SIFTS) | 29 |
| abstract_inverted_index.between | 8 |
| abstract_inverted_index.enabled | 19 |
| abstract_inverted_index.enables | 150 |
| abstract_inverted_index.entries | 125 |
| abstract_inverted_index.formats | 46 |
| abstract_inverted_index.models. | 131 |
| abstract_inverted_index.provide | 117 |
| abstract_inverted_index.residue | 156 |
| abstract_inverted_index.various | 45 |
| abstract_inverted_index.without | 142 |
| abstract_inverted_index.Abstract | 0 |
| abstract_inverted_index.Taxonomy | 26 |
| abstract_inverted_index.allowing | 126 |
| abstract_inverted_index.altering | 143 |
| abstract_inverted_index.analysis | 166 |
| abstract_inverted_index.archive. | 74, 108 |
| abstract_inverted_index.directly | 100 |
| abstract_inverted_index.extended | 77, 133 |
| abstract_inverted_index.improved | 164 |
| abstract_inverted_index.metadata | 140 |
| abstract_inverted_index.proteins | 4 |
| abstract_inverted_index.residues | 121 |
| abstract_inverted_index.sequence | 12 |
| abstract_inverted_index.Function, | 25 |
| abstract_inverted_index.Sequences | 28 |
| abstract_inverted_index.Structure | 22 |
| abstract_inverted_index.UniProtKB | 112 |
| abstract_inverted_index.available | 43 |
| abstract_inverted_index.different | 123 |
| abstract_inverted_index.numbering | 119 |
| abstract_inverted_index.resource. | 30 |
| abstract_inverted_index.resulting | 158 |
| abstract_inverted_index.structure | 67, 130 |
| abstract_inverted_index.PDBx/mmCIF | 80, 103 |
| abstract_inverted_index.UniProtKB, | 93 |
| abstract_inverted_index.accessible | 55 |
| abstract_inverted_index.additional | 84 |
| abstract_inverted_index.biological | 38 |
| abstract_inverted_index.categories | 85 |
| abstract_inverted_index.comparison | 128 |
| abstract_inverted_index.consistent | 118 |
| abstract_inverted_index.dictionary | 82, 134 |
| abstract_inverted_index.integrated | 111 |
| abstract_inverted_index.maintained | 63 |
| abstract_inverted_index.resources. | 39 |
| abstract_inverted_index.separately | 64 |
| abstract_inverted_index.structures | 17 |
| abstract_inverted_index.supporting | 163 |
| abstract_inverted_index.up-to-date | 6, 151 |
| abstract_inverted_index.(PDBx/mmCIF | 69 |
| abstract_inverted_index.(UniProtKB) | 13 |
| abstract_inverted_index.Integration | 23 |
| abstract_inverted_index.accommodate | 87 |
| abstract_inverted_index.annotations | 34, 99 |
| abstract_inverted_index.consistent, | 138 |
| abstract_inverted_index.description | 141 |
| abstract_inverted_index.development | 149 |
| abstract_inverted_index.information | 153 |
| abstract_inverted_index.annotations, | 113 |
| abstract_inverted_index.incorporates | 32 |
| abstract_inverted_index.information. | 147 |
| abstract_inverted_index.standardised | 139 |
| abstract_inverted_index.residue-level | 33, 98 |
| abstract_inverted_index.correspondence | 7 |
| abstract_inverted_index.visualisation. | 168 |
| abstract_inverted_index.cross-reference | 152 |
| abstract_inverted_index.interoperability, | 162 |
| cited_by_percentile_year.max | 98 |
| cited_by_percentile_year.min | 95 |
| corresponding_author_ids | https://openalex.org/A5039912173 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 6 |
| corresponding_institution_ids | https://openalex.org/I1303153112 |
| citation_normalized_percentile.value | 0.72435956 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |