STRPsearch: fast detection of structured tandem repeat proteins Article Swipe
YOU?
·
· 2024
· Open Access
·
· DOI: https://doi.org/10.1093/bioinformatics/btae690
Motivation Structured Tandem Repeats Proteins (STRPs) constitute a subclass of tandem repeats characterized by repetitive structural motifs. These proteins exhibit distinct secondary structures that form repetitive tertiary arrangements, often resulting in large molecular assemblies. Despite highly variable sequences, STRPs can perform important and diverse biological functions, maintaining a consistent structure with a variable number of repeat units. With the advent of protein structure prediction methods, millions of 3D models of proteins are now publicly available. However, automatic detection of STRPs remains challenging with current state-of-the-art tools due to their lack of accuracy and long execution times, hindering their application on large datasets. In most cases, manual curation remains the most accurate method for detecting and classifying STRPs, making it impracticable to annotate millions of structures. Results We introduce STRPsearch, a novel tool for the rapid identification, classification, and mapping of STRPs. Leveraging manually curated entries from RepeatsDB as the known conformational space of STRPs, STRPsearch uses the latest advances in structural alignment for a fast and accurate detection of repeated structural motifs in proteins, followed by an innovative approach to map units and insertions through the generation of TM-score profiles. STRPsearch is highly scalable, efficiently processing large datasets, and can be applied to both experimental structures and predicted models. In addition, it demonstrates superior performance compared to existing tools, offering researchers a reliable and comprehensive solution for STRP analysis across diverse proteomes. Availability and implementation STRPsearch is coded in Python. All scripts and associated documentation are available from: https://github.com/BioComputingUP/STRPsearch.
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.1093/bioinformatics/btae690
- OA Status
- gold
- Cited By
- 2
- References
- 23
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4404509638
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4404509638Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1093/bioinformatics/btae690Digital Object Identifier
- Title
-
STRPsearch: fast detection of structured tandem repeat proteinsWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2024Year of publication
- Publication date
-
2024-11-14Full publication date if available
- Authors
-
Soroush Mozaffari, Paula Nazarena Arrías, Damiano Clementel, Damiano Piovesan, Carlo Ferrari, Silvio C. E. Tosatto, Alexander Miguel MonzónList of authors in order
- Landing page
-
https://doi.org/10.1093/bioinformatics/btae690Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
goldOpen access status per OpenAlex
- OA URL
-
https://doi.org/10.1093/bioinformatics/btae690Direct OA link when available
- Concepts
-
Python (programming language), Computer science, Scripting language, Tandem repeat, Scalability, Documentation, Computational biology, Proteome, Structural bioinformatics, Identification (biology), Data mining, Artificial intelligence, Protein structure, Programming language, Bioinformatics, Biology, Database, Genetics, Genome, Gene, Biochemistry, BotanyTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
2Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 1, 2024: 1Per-year citation counts (last 5 years)
- References (count)
-
23Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4404509638 |
|---|---|
| doi | https://doi.org/10.1093/bioinformatics/btae690 |
| ids.doi | https://doi.org/10.1093/bioinformatics/btae690 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/39558588 |
| ids.openalex | https://openalex.org/W4404509638 |
| fwci | 0.96049445 |
| mesh[0].qualifier_ui | Q000737 |
| mesh[0].descriptor_ui | D011506 |
| mesh[0].is_major_topic | True |
| mesh[0].qualifier_name | chemistry |
| mesh[0].descriptor_name | Proteins |
| mesh[1].qualifier_ui | |
| mesh[1].descriptor_ui | D012984 |
| mesh[1].is_major_topic | True |
| mesh[1].qualifier_name | |
| mesh[1].descriptor_name | Software |
| mesh[2].qualifier_ui | |
| mesh[2].descriptor_ui | D030562 |
| mesh[2].is_major_topic | False |
| mesh[2].qualifier_name | |
| mesh[2].descriptor_name | Databases, Protein |
| mesh[3].qualifier_ui | |
| mesh[3].descriptor_ui | D020080 |
| mesh[3].is_major_topic | False |
| mesh[3].qualifier_name | |
| mesh[3].descriptor_name | Tandem Repeat Sequences |
| mesh[4].qualifier_ui | |
| mesh[4].descriptor_ui | D011487 |
| mesh[4].is_major_topic | False |
| mesh[4].qualifier_name | |
| mesh[4].descriptor_name | Protein Conformation |
| mesh[5].qualifier_ui | |
| mesh[5].descriptor_ui | D008958 |
| mesh[5].is_major_topic | False |
| mesh[5].qualifier_name | |
| mesh[5].descriptor_name | Models, Molecular |
| mesh[6].qualifier_ui | |
| mesh[6].descriptor_ui | D000465 |
| mesh[6].is_major_topic | False |
| mesh[6].qualifier_name | |
| mesh[6].descriptor_name | Algorithms |
| mesh[7].qualifier_ui | Q000737 |
| mesh[7].descriptor_ui | D011506 |
| mesh[7].is_major_topic | True |
| mesh[7].qualifier_name | chemistry |
| mesh[7].descriptor_name | Proteins |
| mesh[8].qualifier_ui | |
| mesh[8].descriptor_ui | D012984 |
| mesh[8].is_major_topic | True |
| mesh[8].qualifier_name | |
| mesh[8].descriptor_name | Software |
| mesh[9].qualifier_ui | |
| mesh[9].descriptor_ui | D030562 |
| mesh[9].is_major_topic | False |
| mesh[9].qualifier_name | |
| mesh[9].descriptor_name | Databases, Protein |
| mesh[10].qualifier_ui | |
| mesh[10].descriptor_ui | D020080 |
| mesh[10].is_major_topic | False |
| mesh[10].qualifier_name | |
| mesh[10].descriptor_name | Tandem Repeat Sequences |
| mesh[11].qualifier_ui | |
| mesh[11].descriptor_ui | D011487 |
| mesh[11].is_major_topic | False |
| mesh[11].qualifier_name | |
| mesh[11].descriptor_name | Protein Conformation |
| mesh[12].qualifier_ui | |
| mesh[12].descriptor_ui | D008958 |
| mesh[12].is_major_topic | False |
| mesh[12].qualifier_name | |
| mesh[12].descriptor_name | Models, Molecular |
| mesh[13].qualifier_ui | |
| mesh[13].descriptor_ui | D000465 |
| mesh[13].is_major_topic | False |
| mesh[13].qualifier_name | |
| mesh[13].descriptor_name | Algorithms |
| mesh[14].qualifier_ui | Q000737 |
| mesh[14].descriptor_ui | D011506 |
| mesh[14].is_major_topic | True |
| mesh[14].qualifier_name | chemistry |
| mesh[14].descriptor_name | Proteins |
| mesh[15].qualifier_ui | |
| mesh[15].descriptor_ui | D012984 |
| mesh[15].is_major_topic | True |
| mesh[15].qualifier_name | |
| mesh[15].descriptor_name | Software |
| mesh[16].qualifier_ui | |
| mesh[16].descriptor_ui | D030562 |
| mesh[16].is_major_topic | False |
| mesh[16].qualifier_name | |
| mesh[16].descriptor_name | Databases, Protein |
| mesh[17].qualifier_ui | |
| mesh[17].descriptor_ui | D020080 |
| mesh[17].is_major_topic | False |
| mesh[17].qualifier_name | |
| mesh[17].descriptor_name | Tandem Repeat Sequences |
| mesh[18].qualifier_ui | |
| mesh[18].descriptor_ui | D011487 |
| mesh[18].is_major_topic | False |
| mesh[18].qualifier_name | |
| mesh[18].descriptor_name | Protein Conformation |
| mesh[19].qualifier_ui | |
| mesh[19].descriptor_ui | D008958 |
| mesh[19].is_major_topic | False |
| mesh[19].qualifier_name | |
| mesh[19].descriptor_name | Models, Molecular |
| mesh[20].qualifier_ui | |
| mesh[20].descriptor_ui | D000465 |
| mesh[20].is_major_topic | False |
| mesh[20].qualifier_name | |
| mesh[20].descriptor_name | Algorithms |
| mesh[21].qualifier_ui | Q000737 |
| mesh[21].descriptor_ui | D011506 |
| mesh[21].is_major_topic | True |
| mesh[21].qualifier_name | chemistry |
| mesh[21].descriptor_name | Proteins |
| mesh[22].qualifier_ui | |
| mesh[22].descriptor_ui | D012984 |
| mesh[22].is_major_topic | True |
| mesh[22].qualifier_name | |
| mesh[22].descriptor_name | Software |
| mesh[23].qualifier_ui | |
| mesh[23].descriptor_ui | D030562 |
| mesh[23].is_major_topic | False |
| mesh[23].qualifier_name | |
| mesh[23].descriptor_name | Databases, Protein |
| mesh[24].qualifier_ui | |
| mesh[24].descriptor_ui | D020080 |
| mesh[24].is_major_topic | False |
| mesh[24].qualifier_name | |
| mesh[24].descriptor_name | Tandem Repeat Sequences |
| mesh[25].qualifier_ui | |
| mesh[25].descriptor_ui | D011487 |
| mesh[25].is_major_topic | False |
| mesh[25].qualifier_name | |
| mesh[25].descriptor_name | Protein Conformation |
| mesh[26].qualifier_ui | |
| mesh[26].descriptor_ui | D008958 |
| mesh[26].is_major_topic | False |
| mesh[26].qualifier_name | |
| mesh[26].descriptor_name | Models, Molecular |
| mesh[27].qualifier_ui | |
| mesh[27].descriptor_ui | D000465 |
| mesh[27].is_major_topic | False |
| mesh[27].qualifier_name | |
| mesh[27].descriptor_name | Algorithms |
| type | article |
| title | STRPsearch: fast detection of structured tandem repeat proteins |
| biblio.issue | 12 |
| biblio.volume | 40 |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10521 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9959999918937683 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | RNA and protein synthesis mechanisms |
| topics[1].id | https://openalex.org/T10015 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.9957000017166138 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1312 |
| topics[1].subfield.display_name | Molecular Biology |
| topics[1].display_name | Genomics and Phylogenetic Studies |
| topics[2].id | https://openalex.org/T10932 |
| topics[2].field.id | https://openalex.org/fields/13 |
| topics[2].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[2].score | 0.9865000247955322 |
| topics[2].domain.id | https://openalex.org/domains/1 |
| topics[2].domain.display_name | Life Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1312 |
| topics[2].subfield.display_name | Molecular Biology |
| topics[2].display_name | Microbial Metabolic Engineering and Bioproduction |
| is_xpac | False |
| apc_list.value | 3618 |
| apc_list.currency | USD |
| apc_list.value_usd | 3618 |
| apc_paid.value | 3618 |
| apc_paid.currency | USD |
| apc_paid.value_usd | 3618 |
| concepts[0].id | https://openalex.org/C519991488 |
| concepts[0].level | 2 |
| concepts[0].score | 0.7895341515541077 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q28865 |
| concepts[0].display_name | Python (programming language) |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.7855837345123291 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C61423126 |
| concepts[2].level | 2 |
| concepts[2].score | 0.7244215607643127 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q187432 |
| concepts[2].display_name | Scripting language |
| concepts[3].id | https://openalex.org/C27149982 |
| concepts[3].level | 4 |
| concepts[3].score | 0.6774925589561462 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q3456514 |
| concepts[3].display_name | Tandem repeat |
| concepts[4].id | https://openalex.org/C48044578 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5892376899719238 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q727490 |
| concepts[4].display_name | Scalability |
| concepts[5].id | https://openalex.org/C56666940 |
| concepts[5].level | 2 |
| concepts[5].score | 0.4865238070487976 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q788790 |
| concepts[5].display_name | Documentation |
| concepts[6].id | https://openalex.org/C70721500 |
| concepts[6].level | 1 |
| concepts[6].score | 0.4368392825126648 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q177005 |
| concepts[6].display_name | Computational biology |
| concepts[7].id | https://openalex.org/C104397665 |
| concepts[7].level | 2 |
| concepts[7].score | 0.41616278886795044 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q860947 |
| concepts[7].display_name | Proteome |
| concepts[8].id | https://openalex.org/C69131567 |
| concepts[8].level | 3 |
| concepts[8].score | 0.4153038263320923 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q2068215 |
| concepts[8].display_name | Structural bioinformatics |
| concepts[9].id | https://openalex.org/C116834253 |
| concepts[9].level | 2 |
| concepts[9].score | 0.411774605512619 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q2039217 |
| concepts[9].display_name | Identification (biology) |
| concepts[10].id | https://openalex.org/C124101348 |
| concepts[10].level | 1 |
| concepts[10].score | 0.3599773943424225 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[10].display_name | Data mining |
| concepts[11].id | https://openalex.org/C154945302 |
| concepts[11].level | 1 |
| concepts[11].score | 0.35386592149734497 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[11].display_name | Artificial intelligence |
| concepts[12].id | https://openalex.org/C47701112 |
| concepts[12].level | 2 |
| concepts[12].score | 0.34047192335128784 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q735188 |
| concepts[12].display_name | Protein structure |
| concepts[13].id | https://openalex.org/C199360897 |
| concepts[13].level | 1 |
| concepts[13].score | 0.2969589829444885 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[13].display_name | Programming language |
| concepts[14].id | https://openalex.org/C60644358 |
| concepts[14].level | 1 |
| concepts[14].score | 0.29167866706848145 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q128570 |
| concepts[14].display_name | Bioinformatics |
| concepts[15].id | https://openalex.org/C86803240 |
| concepts[15].level | 0 |
| concepts[15].score | 0.17743167281150818 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[15].display_name | Biology |
| concepts[16].id | https://openalex.org/C77088390 |
| concepts[16].level | 1 |
| concepts[16].score | 0.17104360461235046 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q8513 |
| concepts[16].display_name | Database |
| concepts[17].id | https://openalex.org/C54355233 |
| concepts[17].level | 1 |
| concepts[17].score | 0.104239821434021 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q7162 |
| concepts[17].display_name | Genetics |
| concepts[18].id | https://openalex.org/C141231307 |
| concepts[18].level | 3 |
| concepts[18].score | 0.08711668848991394 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q7020 |
| concepts[18].display_name | Genome |
| concepts[19].id | https://openalex.org/C104317684 |
| concepts[19].level | 2 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[19].display_name | Gene |
| concepts[20].id | https://openalex.org/C55493867 |
| concepts[20].level | 1 |
| concepts[20].score | 0.0 |
| concepts[20].wikidata | https://www.wikidata.org/wiki/Q7094 |
| concepts[20].display_name | Biochemistry |
| concepts[21].id | https://openalex.org/C59822182 |
| concepts[21].level | 1 |
| concepts[21].score | 0.0 |
| concepts[21].wikidata | https://www.wikidata.org/wiki/Q441 |
| concepts[21].display_name | Botany |
| keywords[0].id | https://openalex.org/keywords/python |
| keywords[0].score | 0.7895341515541077 |
| keywords[0].display_name | Python (programming language) |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.7855837345123291 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/scripting-language |
| keywords[2].score | 0.7244215607643127 |
| keywords[2].display_name | Scripting language |
| keywords[3].id | https://openalex.org/keywords/tandem-repeat |
| keywords[3].score | 0.6774925589561462 |
| keywords[3].display_name | Tandem repeat |
| keywords[4].id | https://openalex.org/keywords/scalability |
| keywords[4].score | 0.5892376899719238 |
| keywords[4].display_name | Scalability |
| keywords[5].id | https://openalex.org/keywords/documentation |
| keywords[5].score | 0.4865238070487976 |
| keywords[5].display_name | Documentation |
| keywords[6].id | https://openalex.org/keywords/computational-biology |
| keywords[6].score | 0.4368392825126648 |
| keywords[6].display_name | Computational biology |
| keywords[7].id | https://openalex.org/keywords/proteome |
| keywords[7].score | 0.41616278886795044 |
| keywords[7].display_name | Proteome |
| keywords[8].id | https://openalex.org/keywords/structural-bioinformatics |
| keywords[8].score | 0.4153038263320923 |
| keywords[8].display_name | Structural bioinformatics |
| keywords[9].id | https://openalex.org/keywords/identification |
| keywords[9].score | 0.411774605512619 |
| keywords[9].display_name | Identification (biology) |
| keywords[10].id | https://openalex.org/keywords/data-mining |
| keywords[10].score | 0.3599773943424225 |
| keywords[10].display_name | Data mining |
| keywords[11].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[11].score | 0.35386592149734497 |
| keywords[11].display_name | Artificial intelligence |
| keywords[12].id | https://openalex.org/keywords/protein-structure |
| keywords[12].score | 0.34047192335128784 |
| keywords[12].display_name | Protein structure |
| keywords[13].id | https://openalex.org/keywords/programming-language |
| keywords[13].score | 0.2969589829444885 |
| keywords[13].display_name | Programming language |
| keywords[14].id | https://openalex.org/keywords/bioinformatics |
| keywords[14].score | 0.29167866706848145 |
| keywords[14].display_name | Bioinformatics |
| keywords[15].id | https://openalex.org/keywords/biology |
| keywords[15].score | 0.17743167281150818 |
| keywords[15].display_name | Biology |
| keywords[16].id | https://openalex.org/keywords/database |
| keywords[16].score | 0.17104360461235046 |
| keywords[16].display_name | Database |
| keywords[17].id | https://openalex.org/keywords/genetics |
| keywords[17].score | 0.104239821434021 |
| keywords[17].display_name | Genetics |
| keywords[18].id | https://openalex.org/keywords/genome |
| keywords[18].score | 0.08711668848991394 |
| keywords[18].display_name | Genome |
| language | en |
| locations[0].id | doi:10.1093/bioinformatics/btae690 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S52395412 |
| locations[0].source.issn | 1367-4803, 1367-4811 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 1367-4803 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | Bioinformatics |
| locations[0].source.host_organization | https://openalex.org/P4310311648 |
| locations[0].source.host_organization_name | Oxford University Press |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310311648, https://openalex.org/P4310311647 |
| locations[0].source.host_organization_lineage_names | Oxford University Press, University of Oxford |
| locations[0].license | cc-by |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Bioinformatics |
| locations[0].landing_page_url | https://doi.org/10.1093/bioinformatics/btae690 |
| locations[1].id | pmid:39558588 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | Bioinformatics (Oxford, England) |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/39558588 |
| locations[2].id | pmh:oai:www.research.unipd.it:11577/3545472 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S4306402448 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | Padua Research Archive (University of Padua) |
| locations[2].source.host_organization | https://openalex.org/I138689650 |
| locations[2].source.host_organization_name | University of Padua |
| locations[2].source.host_organization_lineage | https://openalex.org/I138689650 |
| locations[2].license | other-oa |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | info:eu-repo/semantics/article |
| locations[2].license_id | https://openalex.org/licenses/other-oa |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | |
| locations[2].landing_page_url | https://hdl.handle.net/11577/3545472 |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5000237667 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Soroush Mozaffari |
| authorships[0].countries | IT |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I138689650 |
| authorships[0].affiliations[0].raw_affiliation_string | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[0].institutions[0].id | https://openalex.org/I138689650 |
| authorships[0].institutions[0].ror | https://ror.org/00240q980 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I138689650 |
| authorships[0].institutions[0].country_code | IT |
| authorships[0].institutions[0].display_name | University of Padua |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Soroush Mozaffari |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[1].author.id | https://openalex.org/A5009548693 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-5668-768X |
| authorships[1].author.display_name | Paula Nazarena Arrías |
| authorships[1].countries | IT, SE |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I138689650 |
| authorships[1].affiliations[0].raw_affiliation_string | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[1].affiliations[1].institution_ids | https://openalex.org/I86987016 |
| authorships[1].affiliations[1].raw_affiliation_string | Department of Protein Science, KTH Royal Institute of Technology, Stockholm, SE-10691, Sweden |
| authorships[1].institutions[0].id | https://openalex.org/I138689650 |
| authorships[1].institutions[0].ror | https://ror.org/00240q980 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I138689650 |
| authorships[1].institutions[0].country_code | IT |
| authorships[1].institutions[0].display_name | University of Padua |
| authorships[1].institutions[1].id | https://openalex.org/I86987016 |
| authorships[1].institutions[1].ror | https://ror.org/026vcq606 |
| authorships[1].institutions[1].type | education |
| authorships[1].institutions[1].lineage | https://openalex.org/I86987016 |
| authorships[1].institutions[1].country_code | SE |
| authorships[1].institutions[1].display_name | KTH Royal Institute of Technology |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Paula Nazarena Arrías |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy, Department of Protein Science, KTH Royal Institute of Technology, Stockholm, SE-10691, Sweden |
| authorships[2].author.id | https://openalex.org/A5055147027 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-7395-2921 |
| authorships[2].author.display_name | Damiano Clementel |
| authorships[2].countries | IT |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I138689650 |
| authorships[2].affiliations[0].raw_affiliation_string | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[2].institutions[0].id | https://openalex.org/I138689650 |
| authorships[2].institutions[0].ror | https://ror.org/00240q980 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I138689650 |
| authorships[2].institutions[0].country_code | IT |
| authorships[2].institutions[0].display_name | University of Padua |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Damiano Clementel |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[3].author.id | https://openalex.org/A5000882827 |
| authorships[3].author.orcid | https://orcid.org/0000-0001-8210-2390 |
| authorships[3].author.display_name | Damiano Piovesan |
| authorships[3].countries | IT |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I138689650 |
| authorships[3].affiliations[0].raw_affiliation_string | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[3].institutions[0].id | https://openalex.org/I138689650 |
| authorships[3].institutions[0].ror | https://ror.org/00240q980 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I138689650 |
| authorships[3].institutions[0].country_code | IT |
| authorships[3].institutions[0].display_name | University of Padua |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Damiano Piovesan |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[4].author.id | https://openalex.org/A5029176853 |
| authorships[4].author.orcid | https://orcid.org/0000-0003-3665-0446 |
| authorships[4].author.display_name | Carlo Ferrari |
| authorships[4].countries | IT |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I138689650 |
| authorships[4].affiliations[0].raw_affiliation_string | Department of Information Engineering, University of Padua, Padova, 35121, Italy |
| authorships[4].institutions[0].id | https://openalex.org/I138689650 |
| authorships[4].institutions[0].ror | https://ror.org/00240q980 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I138689650 |
| authorships[4].institutions[0].country_code | IT |
| authorships[4].institutions[0].display_name | University of Padua |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Carlo Ferrari |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | Department of Information Engineering, University of Padua, Padova, 35121, Italy |
| authorships[5].author.id | https://openalex.org/A5066560665 |
| authorships[5].author.orcid | https://orcid.org/0000-0003-4525-7793 |
| authorships[5].author.display_name | Silvio C. E. Tosatto |
| authorships[5].countries | IT |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I4210148743, https://openalex.org/I4210155236 |
| authorships[5].affiliations[0].raw_affiliation_string | Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), Bari, Italy |
| authorships[5].affiliations[1].institution_ids | https://openalex.org/I138689650 |
| authorships[5].affiliations[1].raw_affiliation_string | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy |
| authorships[5].institutions[0].id | https://openalex.org/I4210148743 |
| authorships[5].institutions[0].ror | https://ror.org/05nzf7q96 |
| authorships[5].institutions[0].type | facility |
| authorships[5].institutions[0].lineage | https://openalex.org/I4210148743, https://openalex.org/I4210155236 |
| authorships[5].institutions[0].country_code | IT |
| authorships[5].institutions[0].display_name | Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies |
| authorships[5].institutions[1].id | https://openalex.org/I4210155236 |
| authorships[5].institutions[1].ror | https://ror.org/04zaypm56 |
| authorships[5].institutions[1].type | nonprofit |
| authorships[5].institutions[1].lineage | https://openalex.org/I4210155236 |
| authorships[5].institutions[1].country_code | IT |
| authorships[5].institutions[1].display_name | National Research Council |
| authorships[5].institutions[2].id | https://openalex.org/I138689650 |
| authorships[5].institutions[2].ror | https://ror.org/00240q980 |
| authorships[5].institutions[2].type | education |
| authorships[5].institutions[2].lineage | https://openalex.org/I138689650 |
| authorships[5].institutions[2].country_code | IT |
| authorships[5].institutions[2].display_name | University of Padua |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Silvio C E Tosatto |
| authorships[5].is_corresponding | False |
| authorships[5].raw_affiliation_strings | Department of Biomedical Sciences, University of Padova, Padova, 35121, Italy, Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), Bari, Italy |
| authorships[6].author.id | https://openalex.org/A5061169523 |
| authorships[6].author.orcid | https://orcid.org/0000-0003-0362-8218 |
| authorships[6].author.display_name | Alexander Miguel Monzón |
| authorships[6].countries | IT |
| authorships[6].affiliations[0].institution_ids | https://openalex.org/I138689650 |
| authorships[6].affiliations[0].raw_affiliation_string | Department of Information Engineering, University of Padua, Padova, 35121, Italy |
| authorships[6].institutions[0].id | https://openalex.org/I138689650 |
| authorships[6].institutions[0].ror | https://ror.org/00240q980 |
| authorships[6].institutions[0].type | education |
| authorships[6].institutions[0].lineage | https://openalex.org/I138689650 |
| authorships[6].institutions[0].country_code | IT |
| authorships[6].institutions[0].display_name | University of Padua |
| authorships[6].author_position | last |
| authorships[6].raw_author_name | Alexander Miguel Monzon |
| authorships[6].is_corresponding | False |
| authorships[6].raw_affiliation_strings | Department of Information Engineering, University of Padua, Padova, 35121, Italy |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.1093/bioinformatics/btae690 |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | STRPsearch: fast detection of structured tandem repeat proteins |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-23T05:10:03.516525 |
| primary_topic.id | https://openalex.org/T10521 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9959999918937683 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | RNA and protein synthesis mechanisms |
| related_works | https://openalex.org/W3023169329, https://openalex.org/W4389470870, https://openalex.org/W2782165897, https://openalex.org/W1582950852, https://openalex.org/W4300438041, https://openalex.org/W2054104202, https://openalex.org/W2139703748, https://openalex.org/W2761254753, https://openalex.org/W1024825291, https://openalex.org/W2188981919 |
| cited_by_count | 2 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 1 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 1 |
| locations_count | 3 |
| best_oa_location.id | doi:10.1093/bioinformatics/btae690 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S52395412 |
| best_oa_location.source.issn | 1367-4803, 1367-4811 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 1367-4803 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | Bioinformatics |
| best_oa_location.source.host_organization | https://openalex.org/P4310311648 |
| best_oa_location.source.host_organization_name | Oxford University Press |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310311648, https://openalex.org/P4310311647 |
| best_oa_location.source.host_organization_lineage_names | Oxford University Press, University of Oxford |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | Bioinformatics |
| best_oa_location.landing_page_url | https://doi.org/10.1093/bioinformatics/btae690 |
| primary_location.id | doi:10.1093/bioinformatics/btae690 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S52395412 |
| primary_location.source.issn | 1367-4803, 1367-4811 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 1367-4803 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | Bioinformatics |
| primary_location.source.host_organization | https://openalex.org/P4310311648 |
| primary_location.source.host_organization_name | Oxford University Press |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310311648, https://openalex.org/P4310311647 |
| primary_location.source.host_organization_lineage_names | Oxford University Press, University of Oxford |
| primary_location.license | cc-by |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Bioinformatics |
| primary_location.landing_page_url | https://doi.org/10.1093/bioinformatics/btae690 |
| publication_date | 2024-11-14 |
| publication_year | 2024 |
| referenced_works | https://openalex.org/W2055043387, https://openalex.org/W4384522661, https://openalex.org/W3186179742, https://openalex.org/W2199134476, https://openalex.org/W4403922164, https://openalex.org/W2114604503, https://openalex.org/W3173398312, https://openalex.org/W3015654545, https://openalex.org/W2143370123, https://openalex.org/W1867763642, https://openalex.org/W2092018396, https://openalex.org/W2800676748, https://openalex.org/W2007595404, https://openalex.org/W3177828909, https://openalex.org/W2054681398, https://openalex.org/W2779592733, https://openalex.org/W3132261088, https://openalex.org/W2070793039, https://openalex.org/W4386258637, https://openalex.org/W4375858802, https://openalex.org/W3211795435, https://openalex.org/W4362606335, https://openalex.org/W2102245393 |
| referenced_works_count | 23 |
| abstract_inverted_index.a | 8, 48, 52, 130, 164, 222 |
| abstract_inverted_index.3D | 68 |
| abstract_inverted_index.In | 103, 210 |
| abstract_inverted_index.We | 127 |
| abstract_inverted_index.an | 177 |
| abstract_inverted_index.as | 148 |
| abstract_inverted_index.be | 201 |
| abstract_inverted_index.by | 14, 176 |
| abstract_inverted_index.in | 31, 160, 173, 239 |
| abstract_inverted_index.is | 192, 237 |
| abstract_inverted_index.it | 119, 212 |
| abstract_inverted_index.of | 10, 55, 61, 67, 70, 79, 91, 124, 140, 153, 169, 188 |
| abstract_inverted_index.on | 100 |
| abstract_inverted_index.to | 88, 121, 180, 203, 217 |
| abstract_inverted_index.All | 241 |
| abstract_inverted_index.and | 43, 93, 115, 138, 166, 183, 199, 207, 224, 234, 243 |
| abstract_inverted_index.are | 72, 246 |
| abstract_inverted_index.can | 40, 200 |
| abstract_inverted_index.due | 87 |
| abstract_inverted_index.for | 113, 133, 163, 227 |
| abstract_inverted_index.map | 181 |
| abstract_inverted_index.now | 73 |
| abstract_inverted_index.the | 59, 109, 134, 149, 157, 186 |
| abstract_inverted_index.STRP | 228 |
| abstract_inverted_index.With | 58 |
| abstract_inverted_index.both | 204 |
| abstract_inverted_index.fast | 165 |
| abstract_inverted_index.form | 25 |
| abstract_inverted_index.from | 146 |
| abstract_inverted_index.lack | 90 |
| abstract_inverted_index.long | 94 |
| abstract_inverted_index.most | 104, 110 |
| abstract_inverted_index.that | 24 |
| abstract_inverted_index.tool | 132 |
| abstract_inverted_index.uses | 156 |
| abstract_inverted_index.with | 51, 83 |
| abstract_inverted_index.STRPs | 39, 80 |
| abstract_inverted_index.These | 18 |
| abstract_inverted_index.coded | 238 |
| abstract_inverted_index.from: | 248 |
| abstract_inverted_index.known | 150 |
| abstract_inverted_index.large | 32, 101, 197 |
| abstract_inverted_index.novel | 131 |
| abstract_inverted_index.often | 29 |
| abstract_inverted_index.rapid | 135 |
| abstract_inverted_index.space | 152 |
| abstract_inverted_index.their | 89, 98 |
| abstract_inverted_index.tools | 86 |
| abstract_inverted_index.units | 182 |
| abstract_inverted_index.STRPs, | 117, 154 |
| abstract_inverted_index.STRPs. | 141 |
| abstract_inverted_index.Tandem | 3 |
| abstract_inverted_index.across | 230 |
| abstract_inverted_index.advent | 60 |
| abstract_inverted_index.cases, | 105 |
| abstract_inverted_index.highly | 36, 193 |
| abstract_inverted_index.latest | 158 |
| abstract_inverted_index.making | 118 |
| abstract_inverted_index.manual | 106 |
| abstract_inverted_index.method | 112 |
| abstract_inverted_index.models | 69 |
| abstract_inverted_index.motifs | 172 |
| abstract_inverted_index.number | 54 |
| abstract_inverted_index.repeat | 56 |
| abstract_inverted_index.tandem | 11 |
| abstract_inverted_index.times, | 96 |
| abstract_inverted_index.tools, | 219 |
| abstract_inverted_index.units. | 57 |
| abstract_inverted_index.(STRPs) | 6 |
| abstract_inverted_index.Despite | 35 |
| abstract_inverted_index.Python. | 240 |
| abstract_inverted_index.Repeats | 4 |
| abstract_inverted_index.Results | 126 |
| abstract_inverted_index.applied | 202 |
| abstract_inverted_index.curated | 144 |
| abstract_inverted_index.current | 84 |
| abstract_inverted_index.diverse | 44, 231 |
| abstract_inverted_index.entries | 145 |
| abstract_inverted_index.exhibit | 20 |
| abstract_inverted_index.mapping | 139 |
| abstract_inverted_index.models. | 209 |
| abstract_inverted_index.motifs. | 17 |
| abstract_inverted_index.perform | 41 |
| abstract_inverted_index.protein | 62 |
| abstract_inverted_index.remains | 81, 108 |
| abstract_inverted_index.repeats | 12 |
| abstract_inverted_index.scripts | 242 |
| abstract_inverted_index.through | 185 |
| abstract_inverted_index.Abstract | 0 |
| abstract_inverted_index.However, | 76 |
| abstract_inverted_index.Proteins | 5 |
| abstract_inverted_index.TM-score | 189 |
| abstract_inverted_index.accuracy | 92 |
| abstract_inverted_index.accurate | 111, 167 |
| abstract_inverted_index.advances | 159 |
| abstract_inverted_index.analysis | 229 |
| abstract_inverted_index.annotate | 122 |
| abstract_inverted_index.approach | 179 |
| abstract_inverted_index.compared | 216 |
| abstract_inverted_index.curation | 107 |
| abstract_inverted_index.distinct | 21 |
| abstract_inverted_index.existing | 218 |
| abstract_inverted_index.followed | 175 |
| abstract_inverted_index.manually | 143 |
| abstract_inverted_index.methods, | 65 |
| abstract_inverted_index.millions | 66, 123 |
| abstract_inverted_index.offering | 220 |
| abstract_inverted_index.proteins | 19, 71 |
| abstract_inverted_index.publicly | 74 |
| abstract_inverted_index.reliable | 223 |
| abstract_inverted_index.repeated | 170 |
| abstract_inverted_index.solution | 226 |
| abstract_inverted_index.subclass | 9 |
| abstract_inverted_index.superior | 214 |
| abstract_inverted_index.tertiary | 27 |
| abstract_inverted_index.variable | 37, 53 |
| abstract_inverted_index.RepeatsDB | 147 |
| abstract_inverted_index.addition, | 211 |
| abstract_inverted_index.alignment | 162 |
| abstract_inverted_index.automatic | 77 |
| abstract_inverted_index.available | 247 |
| abstract_inverted_index.datasets, | 198 |
| abstract_inverted_index.datasets. | 102 |
| abstract_inverted_index.detecting | 114 |
| abstract_inverted_index.detection | 78, 168 |
| abstract_inverted_index.execution | 95 |
| abstract_inverted_index.hindering | 97 |
| abstract_inverted_index.important | 42 |
| abstract_inverted_index.introduce | 128 |
| abstract_inverted_index.molecular | 33 |
| abstract_inverted_index.predicted | 208 |
| abstract_inverted_index.profiles. | 190 |
| abstract_inverted_index.proteins, | 174 |
| abstract_inverted_index.resulting | 30 |
| abstract_inverted_index.scalable, | 194 |
| abstract_inverted_index.secondary | 22 |
| abstract_inverted_index.structure | 50, 63 |
| abstract_inverted_index.Leveraging | 142 |
| abstract_inverted_index.Motivation | 1 |
| abstract_inverted_index.STRPsearch | 155, 191, 236 |
| abstract_inverted_index.Structured | 2 |
| abstract_inverted_index.associated | 244 |
| abstract_inverted_index.available. | 75 |
| abstract_inverted_index.biological | 45 |
| abstract_inverted_index.consistent | 49 |
| abstract_inverted_index.constitute | 7 |
| abstract_inverted_index.functions, | 46 |
| abstract_inverted_index.generation | 187 |
| abstract_inverted_index.innovative | 178 |
| abstract_inverted_index.insertions | 184 |
| abstract_inverted_index.prediction | 64 |
| abstract_inverted_index.processing | 196 |
| abstract_inverted_index.proteomes. | 232 |
| abstract_inverted_index.repetitive | 15, 26 |
| abstract_inverted_index.sequences, | 38 |
| abstract_inverted_index.structural | 16, 161, 171 |
| abstract_inverted_index.structures | 23, 206 |
| abstract_inverted_index.STRPsearch, | 129 |
| abstract_inverted_index.application | 99 |
| abstract_inverted_index.assemblies. | 34 |
| abstract_inverted_index.challenging | 82 |
| abstract_inverted_index.classifying | 116 |
| abstract_inverted_index.efficiently | 195 |
| abstract_inverted_index.maintaining | 47 |
| abstract_inverted_index.performance | 215 |
| abstract_inverted_index.researchers | 221 |
| abstract_inverted_index.structures. | 125 |
| abstract_inverted_index.Availability | 233 |
| abstract_inverted_index.demonstrates | 213 |
| abstract_inverted_index.experimental | 205 |
| abstract_inverted_index.arrangements, | 28 |
| abstract_inverted_index.characterized | 13 |
| abstract_inverted_index.comprehensive | 225 |
| abstract_inverted_index.documentation | 245 |
| abstract_inverted_index.impracticable | 120 |
| abstract_inverted_index.conformational | 151 |
| abstract_inverted_index.implementation | 235 |
| abstract_inverted_index.classification, | 137 |
| abstract_inverted_index.identification, | 136 |
| abstract_inverted_index.state-of-the-art | 85 |
| abstract_inverted_index.https://github.com/BioComputingUP/STRPsearch. | 249 |
| cited_by_percentile_year.max | 95 |
| cited_by_percentile_year.min | 90 |
| countries_distinct_count | 2 |
| institutions_distinct_count | 7 |
| citation_normalized_percentile.value | 0.7137146 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |