Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
2024 · Open Access · DOI: https://doi.org/10.48550/arxiv.2401.05111
The zero-shot text-to-speech (TTS) method, based on speaker embeddings extracted from reference speech using self-supervised learning (SSL) speech representations, can reproduce speaker characteristics very accurately. However, this approach suffers from degradation in speech synthesis quality when the reference speech contains noise. In this paper, we propose a noise-robust zero-shot TTS method. We incorporated adapters into the SSL model, which we fine-tuned with the TTS model using noisy reference speech. In addition, to further improve performance, we adopted a speech enhancement (SE) front-end. With these improvements, our proposed SSL-based zero-shot TTS achieved high-quality speech synthesis with noisy reference speech. Through the objective and subjective evaluations, we confirmed that the proposed method is highly robust to noise in reference speech, and effectively works in combination with SE.
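The abstract does not describe the adapter design, so the following is only a minimal sketch of one common form, a residual bottleneck adapter, written in plain Python. The class name `BottleneckAdapter`, the ReLU nonlinearity, the dimensions, and the zero initialization of the up-projection are all assumptions made for illustration, not the authors' implementation.

```python
import random


class BottleneckAdapter:
    """Hypothetical residual bottleneck adapter: x + up(relu(down(x)))."""

    def __init__(self, dim, bottleneck, seed=0):
        rng = random.Random(seed)
        # Down-projection (bottleneck x dim): small random weights (assumed init).
        self.down = [
            [rng.gauss(0.0, 0.02) for _ in range(dim)] for _ in range(bottleneck)
        ]
        # Up-projection (dim x bottleneck): zeros, so the adapter starts
        # as an identity mapping (assumed init).
        self.up = [[0.0] * bottleneck for _ in range(dim)]

    def __call__(self, x):
        # Project the feature vector down and apply ReLU.
        hidden = [
            max(0.0, sum(w * v for w, v in zip(row, x))) for row in self.down
        ]
        # Project back up to the original dimension.
        delta = [sum(w * h for w, h in zip(row, hidden)) for row in self.up]
        # Residual connection: add the adapter output to the input.
        return [v + d for v, d in zip(x, delta)]


# One frame of a (made-up) 4-dimensional SSL feature vector.
frame = [1.0, -2.0, 0.5, 3.0]
adapter = BottleneckAdapter(dim=4, bottleneck=2)
adapted = adapter(frame)
```

In this sketch the zero-initialized up-projection means the adapter initially returns its input unchanged, so inserting it would leave pretrained features intact until fine-tuning updates the adapter weights.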
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2401.05111, https://arxiv.org/pdf/2401.05111
- OA Status: green
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4390832512
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4390832512 (Canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2401.05111 (Digital Object Identifier)
- Title: Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters (Work title)
- Type: preprint (OpenAlex work type)
- Language: en (Primary language)
- Publication year: 2024 (Year of publication)
- Publication date: 2024-01-10 (Full publication date if available)
- Authors: Ken‐ichi Fujita, Hiroshi Satō, Takanori Ashihara, Hiroki Kanagawa, Marc Delcroix, Takafumi Moriya, Yusuke Ijima (List of authors in order)
- Landing page: https://arxiv.org/abs/2401.05111 (Publisher landing page)
- PDF URL: https://arxiv.org/pdf/2401.05111 (Direct link to full text PDF)
- Open access: Yes (Whether a free full text is available)
- OA status: green (Open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2401.05111 (Direct OA link when available)
- Concepts: Computer science, Speech recognition, Noise (video), Speech synthesis, Speech enhancement, Zero (linguistics), Representation (politics), Voice activity detection, Quality (philosophy), Artificial intelligence, Speech processing, Noise reduction, Image (mathematics), Physics, Law, Linguistics, Politics, Quantum mechanics, Political science, Philosophy (Top concepts (fields/topics) attached by OpenAlex)
- Cited by: 0 (Total citation count in OpenAlex)
- Related works (count): 10 (Other works algorithmically related by OpenAlex)
Full payload
| id | https://openalex.org/W4390832512 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2401.05111 |
| ids.doi | https://doi.org/10.48550/arxiv.2401.05111 |
| ids.openalex | https://openalex.org/W4390832512 |
| fwci | |
| type | preprint |
| title | Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10201 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9994000196456909 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Speech Recognition and Synthesis |
| topics[1].id | https://openalex.org/T10860 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9937999844551086 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1711 |
| topics[1].subfield.display_name | Signal Processing |
| topics[1].display_name | Speech and Audio Processing |
| topics[2].id | https://openalex.org/T11309 |
| topics[2].field.id | https://openalex.org/fields/17 |
| topics[2].field.display_name | Computer Science |
| topics[2].score | 0.9616000056266785 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1711 |
| topics[2].subfield.display_name | Signal Processing |
| topics[2].display_name | Music and Audio Processing |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.7643293142318726 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C28490314 |
| concepts[1].level | 1 |
| concepts[1].score | 0.7227286696434021 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q189436 |
| concepts[1].display_name | Speech recognition |
| concepts[2].id | https://openalex.org/C99498987 |
| concepts[2].level | 3 |
| concepts[2].score | 0.5974411368370056 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q2210247 |
| concepts[2].display_name | Noise (video) |
| concepts[3].id | https://openalex.org/C14999030 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5898690819740295 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q16346 |
| concepts[3].display_name | Speech synthesis |
| concepts[4].id | https://openalex.org/C2776182073 |
| concepts[4].level | 3 |
| concepts[4].score | 0.5682386755943298 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q7575395 |
| concepts[4].display_name | Speech enhancement |
| concepts[5].id | https://openalex.org/C2780813799 |
| concepts[5].level | 2 |
| concepts[5].score | 0.5209020972251892 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q3274237 |
| concepts[5].display_name | Zero (linguistics) |
| concepts[6].id | https://openalex.org/C2776359362 |
| concepts[6].level | 3 |
| concepts[6].score | 0.45417001843452454 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q2145286 |
| concepts[6].display_name | Representation (politics) |
| concepts[7].id | https://openalex.org/C204201278 |
| concepts[7].level | 3 |
| concepts[7].score | 0.4494924545288086 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q1332614 |
| concepts[7].display_name | Voice activity detection |
| concepts[8].id | https://openalex.org/C2779530757 |
| concepts[8].level | 2 |
| concepts[8].score | 0.4307863712310791 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q1207505 |
| concepts[8].display_name | Quality (philosophy) |
| concepts[9].id | https://openalex.org/C154945302 |
| concepts[9].level | 1 |
| concepts[9].score | 0.39397984743118286 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[9].display_name | Artificial intelligence |
| concepts[10].id | https://openalex.org/C61328038 |
| concepts[10].level | 2 |
| concepts[10].score | 0.3758704662322998 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q3358061 |
| concepts[10].display_name | Speech processing |
| concepts[11].id | https://openalex.org/C163294075 |
| concepts[11].level | 2 |
| concepts[11].score | 0.2666015028953552 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q581861 |
| concepts[11].display_name | Noise reduction |
| concepts[12].id | https://openalex.org/C115961682 |
| concepts[12].level | 2 |
| concepts[12].score | 0.12678000330924988 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q860623 |
| concepts[12].display_name | Image (mathematics) |
| concepts[13].id | https://openalex.org/C121332964 |
| concepts[13].level | 0 |
| concepts[13].score | 0.05296427011489868 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q413 |
| concepts[13].display_name | Physics |
| concepts[14].id | https://openalex.org/C199539241 |
| concepts[14].level | 1 |
| concepts[14].score | 0.0 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q7748 |
| concepts[14].display_name | Law |
| concepts[15].id | https://openalex.org/C41895202 |
| concepts[15].level | 1 |
| concepts[15].score | 0.0 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q8162 |
| concepts[15].display_name | Linguistics |
| concepts[16].id | https://openalex.org/C94625758 |
| concepts[16].level | 2 |
| concepts[16].score | 0.0 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q7163 |
| concepts[16].display_name | Politics |
| concepts[17].id | https://openalex.org/C62520636 |
| concepts[17].level | 1 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q944 |
| concepts[17].display_name | Quantum mechanics |
| concepts[18].id | https://openalex.org/C17744445 |
| concepts[18].level | 0 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q36442 |
| concepts[18].display_name | Political science |
| concepts[19].id | https://openalex.org/C138885662 |
| concepts[19].level | 0 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q5891 |
| concepts[19].display_name | Philosophy |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.7643293142318726 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/speech-recognition |
| keywords[1].score | 0.7227286696434021 |
| keywords[1].display_name | Speech recognition |
| keywords[2].id | https://openalex.org/keywords/noise |
| keywords[2].score | 0.5974411368370056 |
| keywords[2].display_name | Noise (video) |
| keywords[3].id | https://openalex.org/keywords/speech-synthesis |
| keywords[3].score | 0.5898690819740295 |
| keywords[3].display_name | Speech synthesis |
| keywords[4].id | https://openalex.org/keywords/speech-enhancement |
| keywords[4].score | 0.5682386755943298 |
| keywords[4].display_name | Speech enhancement |
| keywords[5].id | https://openalex.org/keywords/zero |
| keywords[5].score | 0.5209020972251892 |
| keywords[5].display_name | Zero (linguistics) |
| keywords[6].id | https://openalex.org/keywords/representation |
| keywords[6].score | 0.45417001843452454 |
| keywords[6].display_name | Representation (politics) |
| keywords[7].id | https://openalex.org/keywords/voice-activity-detection |
| keywords[7].score | 0.4494924545288086 |
| keywords[7].display_name | Voice activity detection |
| keywords[8].id | https://openalex.org/keywords/quality |
| keywords[8].score | 0.4307863712310791 |
| keywords[8].display_name | Quality (philosophy) |
| keywords[9].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[9].score | 0.39397984743118286 |
| keywords[9].display_name | Artificial intelligence |
| keywords[10].id | https://openalex.org/keywords/speech-processing |
| keywords[10].score | 0.3758704662322998 |
| keywords[10].display_name | Speech processing |
| keywords[11].id | https://openalex.org/keywords/noise-reduction |
| keywords[11].score | 0.2666015028953552 |
| keywords[11].display_name | Noise reduction |
| keywords[12].id | https://openalex.org/keywords/image |
| keywords[12].score | 0.12678000330924988 |
| keywords[12].display_name | Image (mathematics) |
| keywords[13].id | https://openalex.org/keywords/physics |
| keywords[13].score | 0.05296427011489868 |
| keywords[13].display_name | Physics |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2401.05111 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2401.05111 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2401.05111 |
| locations[1].id | doi:10.48550/arxiv.2401.05111 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2401.05111 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5101528279 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-1575-6690 |
| authorships[0].author.display_name | Ken‐ichi Fujita |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Fujita, Kenichi |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5049917170 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-5899-1340 |
| authorships[1].author.display_name | Hiroshi Satō |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Sato, Hiroshi |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5033975068 |
| authorships[2].author.orcid | https://orcid.org/0009-0003-4322-4127 |
| authorships[2].author.display_name | Takanori Ashihara |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Ashihara, Takanori |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5076754533 |
| authorships[3].author.orcid | |
| authorships[3].author.display_name | Hiroki Kanagawa |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Kanagawa, Hiroki |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5023868166 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-5175-7834 |
| authorships[4].author.display_name | Marc Delcroix |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Delcroix, Marc |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5087290011 |
| authorships[5].author.orcid | https://orcid.org/0000-0003-1942-7250 |
| authorships[5].author.display_name | Takafumi Moriya |
| authorships[5].author_position | middle |
| authorships[5].raw_author_name | Moriya, Takafumi |
| authorships[5].is_corresponding | False |
| authorships[6].author.id | https://openalex.org/A5068604686 |
| authorships[6].author.orcid | |
| authorships[6].author.display_name | Yusuke Ijima |
| authorships[6].author_position | last |
| authorships[6].raw_author_name | Ijima, Yusuke |
| authorships[6].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2401.05111 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2024-01-13T00:00:00 |
| display_name | Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters |
| has_fulltext | True |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10201 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9994000196456909 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Speech Recognition and Synthesis |
| related_works | https://openalex.org/W2120771489, https://openalex.org/W2051376034, https://openalex.org/W2294333436, https://openalex.org/W2955597484, https://openalex.org/W2373767407, https://openalex.org/W2653598178, https://openalex.org/W3110551121, https://openalex.org/W2747006289, https://openalex.org/W2550171623, https://openalex.org/W596245619 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2401.05111 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2401.05111 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2401.05111 |
| primary_location.id | pmh:oai:arXiv.org:2401.05111 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2401.05111 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2401.05111 |
| publication_date | 2024-01-10 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 46, 77 |
| abstract_inverted_index.In | 41, 69 |
| abstract_inverted_index.We | 51 |
| abstract_inverted_index.in | 31, 115, 121 |
| abstract_inverted_index.is | 110 |
| abstract_inverted_index.on | 6 |
| abstract_inverted_index.to | 71, 113 |
| abstract_inverted_index.we | 44, 59, 75, 104 |
| abstract_inverted_index.SE. | 124 |
| abstract_inverted_index.SSL | 56 |
| abstract_inverted_index.TTS | 49, 63, 89 |
| abstract_inverted_index.The | 0 |
| abstract_inverted_index.and | 101, 118 |
| abstract_inverted_index.can | 19 |
| abstract_inverted_index.our | 85 |
| abstract_inverted_index.the | 36, 55, 62, 99, 107 |
| abstract_inverted_index.(SE) | 80 |
| abstract_inverted_index.With | 82 |
| abstract_inverted_index.from | 10, 29 |
| abstract_inverted_index.into | 54 |
| abstract_inverted_index.that | 106 |
| abstract_inverted_index.this | 26, 42 |
| abstract_inverted_index.very | 23 |
| abstract_inverted_index.when | 35 |
| abstract_inverted_index.with | 61, 94, 123 |
| abstract_inverted_index.(SSL) | 16 |
| abstract_inverted_index.(TTS) | 3 |
| abstract_inverted_index.based | 5 |
| abstract_inverted_index.model | 64 |
| abstract_inverted_index.noise | 114 |
| abstract_inverted_index.noisy | 66, 95 |
| abstract_inverted_index.these | 83 |
| abstract_inverted_index.using | 13, 65 |
| abstract_inverted_index.which | 58 |
| abstract_inverted_index.works | 120 |
| abstract_inverted_index.highly | 111 |
| abstract_inverted_index.method | 109 |
| abstract_inverted_index.model, | 57 |
| abstract_inverted_index.noise. | 40 |
| abstract_inverted_index.paper, | 43 |
| abstract_inverted_index.robust | 112 |
| abstract_inverted_index.speech | 12, 17, 32, 38, 78, 92 |
| abstract_inverted_index.Through | 98 |
| abstract_inverted_index.adopted | 76 |
| abstract_inverted_index.further | 72 |
| abstract_inverted_index.improve | 73 |
| abstract_inverted_index.method, | 4 |
| abstract_inverted_index.method. | 50 |
| abstract_inverted_index.propose | 45 |
| abstract_inverted_index.quality | 34 |
| abstract_inverted_index.speaker | 7, 21 |
| abstract_inverted_index.speech, | 117 |
| abstract_inverted_index.speech. | 68, 97 |
| abstract_inverted_index.suffers | 28 |
| abstract_inverted_index.However, | 25 |
| abstract_inverted_index.achieved | 90 |
| abstract_inverted_index.adapters | 53 |
| abstract_inverted_index.approach | 27 |
| abstract_inverted_index.contains | 39 |
| abstract_inverted_index.learning | 15 |
| abstract_inverted_index.proposed | 86, 108 |
| abstract_inverted_index.SSL-based | 87 |
| abstract_inverted_index.addition, | 70 |
| abstract_inverted_index.confirmed | 105 |
| abstract_inverted_index.extracted | 9 |
| abstract_inverted_index.objective | 100 |
| abstract_inverted_index.reference | 11, 37, 67, 96, 116 |
| abstract_inverted_index.reproduce | 20 |
| abstract_inverted_index.synthesis | 33, 93 |
| abstract_inverted_index.zero-shot | 1, 48, 88 |
| abstract_inverted_index.embeddings | 8 |
| abstract_inverted_index.fine-tuned | 60 |
| abstract_inverted_index.front-end. | 81 |
| abstract_inverted_index.subjective | 102 |
| abstract_inverted_index.accurately. | 24 |
| abstract_inverted_index.combination | 122 |
| abstract_inverted_index.degradation | 30 |
| abstract_inverted_index.effectively | 119 |
| abstract_inverted_index.enhancement | 79 |
| abstract_inverted_index.evaluations, | 103 |
| abstract_inverted_index.high-quality | 91 |
| abstract_inverted_index.incorporated | 52 |
| abstract_inverted_index.noise-robust | 47 |
| abstract_inverted_index.performance, | 74 |
| abstract_inverted_index.improvements, | 84 |
| abstract_inverted_index.text-to-speech | 2 |
| abstract_inverted_index.characteristics | 22 |
| abstract_inverted_index.self-supervised | 14 |
| abstract_inverted_index.representations, | 18 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 7 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/16 |
| sustainable_development_goals[0].score | 0.5699999928474426 |
| sustainable_development_goals[0].display_name | Peace, Justice and strong institutions |
| citation_normalized_percentile | |
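
The `abstract_inverted_index.*` rows in the payload map each abstract token to the word positions where it occurs. As a rough sketch of how such rows could be turned back into running text, the Python below parses rows in the flattened `| key | value |` form used in this table and re-orders the tokens by position. The helper names `parse_rows` and `rebuild_abstract` and the `limit` parameter are hypothetical and chosen for illustration; they are not part of any OpenAlex client.

```python
PREFIX = "abstract_inverted_index."


def parse_rows(rows):
    """Parse '| abstract_inverted_index.<word> | p1, p2 |' rows into a dict."""
    index = {}
    for row in rows:
        cells = [c.strip() for c in row.strip().strip("|").split("|")]
        if len(cells) != 2 or not cells[0].startswith(PREFIX):
            continue  # skip rows that are not inverted-index entries
        key, value = cells
        index[key[len(PREFIX):]] = [int(p) for p in value.split(",")]
    return index


def rebuild_abstract(index, limit=None):
    """Order words by position; keep only positions below `limit` if given."""
    positioned = []
    for word, positions in index.items():
        for pos in positions:
            if limit is None or pos < limit:
                positioned.append((pos, word))
    positioned.sort()
    return " ".join(word for _, word in positioned)


# A few rows copied from the table above (not the full index).
rows = [
    "| abstract_inverted_index.The | 0 |",
    "| abstract_inverted_index.zero-shot | 1, 48, 88 |",
    "| abstract_inverted_index.text-to-speech | 2 |",
    "| abstract_inverted_index.(TTS) | 3 |",
    "| abstract_inverted_index.method, | 4 |",
    "| abstract_inverted_index.based | 5 |",
]
index = parse_rows(rows)
opening = rebuild_abstract(index, limit=6)
print(opening)
```

Because only six rows are supplied here, `limit=6` restricts the output to the positions those rows fully cover; with the complete index the `limit` argument can be omitted.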