Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data
2024 · Open Access · DOI: https://doi.org/10.48550/arxiv.2407.04270
This paper introduces CocoNut-Humoresque, an open-source large-scale speech likability corpus that includes speech segments and their per-listener likability scores. Evaluating voice likability is essential to designing preferable voices for speech systems, such as dialogue or announcement systems. In this study, we let 885 listeners rate 1800 speech segments of a wide range of speakers regarding their likability. When constructing the corpus, we also collected the multiple speaker attributes: genders, ages, and favorite YouTube videos. Therefore, the corpus enables the large-scale statistical analysis of voice likability regarding both speaker and listener factors. This paper describes the construction methodology and preliminary data analysis to reveal the gender and age biases in voice likability. In addition, the relationship between the likability and two acoustic features, the fundamental frequencies and the x-vectors of given utterances, is also investigated.
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2407.04270, https://arxiv.org/pdf/2407.04270
- OA Status: green
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4400434635
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4400434635 (Canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2407.04270 (Digital Object Identifier)
- Title: Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data (Work title)
- Type: preprint (OpenAlex work type)
- Language: en (Primary language)
- Publication year: 2024 (Year of publication)
- Publication date: 2024-07-05 (Full publication date if available)
- Authors: Hitoshi Suda, Aya Watanabe, Shinnosuke Takamichi (List of authors in order)
- Landing page: https://arxiv.org/abs/2407.04270 (Publisher landing page)
- PDF URL: https://arxiv.org/pdf/2407.04270 (Direct link to full text PDF)
- Open access: Yes (Whether a free full text is available)
- OA status: green (Open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2407.04270 (Direct OA link when available)
- Concepts: Scale (ratio), Business, Chemistry, Geography, Cartography (Top concepts (fields/topics) attached by OpenAlex)
- Cited by: 0 (Total citation count in OpenAlex)
- Related works (count): 10 (Other works algorithmically related by OpenAlex)
Full payload
| id | https://openalex.org/W4400434635 |
|---|---|
| doi | https://doi.org/10.48550/arxiv.2407.04270 |
| ids.doi | https://doi.org/10.48550/arxiv.2407.04270 |
| ids.openalex | https://openalex.org/W4400434635 |
| fwci | 0.0 |
| type | preprint |
| title | Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T12031 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.8442000150680542 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Speech and dialogue systems |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C2778755073 |
| concepts[0].level | 2 |
| concepts[0].score | 0.5990396738052368 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q10858537 |
| concepts[0].display_name | Scale (ratio) |
| concepts[1].id | https://openalex.org/C144133560 |
| concepts[1].level | 0 |
| concepts[1].score | 0.3436460494995117 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q4830453 |
| concepts[1].display_name | Business |
| concepts[2].id | https://openalex.org/C185592680 |
| concepts[2].level | 0 |
| concepts[2].score | 0.33428865671157837 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q2329 |
| concepts[2].display_name | Chemistry |
| concepts[3].id | https://openalex.org/C205649164 |
| concepts[3].level | 0 |
| concepts[3].score | 0.13155999779701233 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1071 |
| concepts[3].display_name | Geography |
| concepts[4].id | https://openalex.org/C58640448 |
| concepts[4].level | 1 |
| concepts[4].score | 0.08488243818283081 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q42515 |
| concepts[4].display_name | Cartography |
| keywords[0].id | https://openalex.org/keywords/scale |
| keywords[0].score | 0.5990396738052368 |
| keywords[0].display_name | Scale (ratio) |
| keywords[1].id | https://openalex.org/keywords/business |
| keywords[1].score | 0.3436460494995117 |
| keywords[1].display_name | Business |
| keywords[2].id | https://openalex.org/keywords/chemistry |
| keywords[2].score | 0.33428865671157837 |
| keywords[2].display_name | Chemistry |
| keywords[3].id | https://openalex.org/keywords/geography |
| keywords[3].score | 0.13155999779701233 |
| keywords[3].display_name | Geography |
| keywords[4].id | https://openalex.org/keywords/cartography |
| keywords[4].score | 0.08488243818283081 |
| keywords[4].display_name | Cartography |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2407.04270 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | cc-by-sa |
| locations[0].pdf_url | https://arxiv.org/pdf/2407.04270 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | https://openalex.org/licenses/cc-by-sa |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2407.04270 |
| locations[1].id | doi:10.48550/arxiv.2407.04270 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2407.04270 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5067873216 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Hitoshi Suda |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Suda, Hitoshi |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5008391815 |
| authorships[1].author.orcid | https://orcid.org/0000-0003-3123-489X |
| authorships[1].author.display_name | Aya Watanabe |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Watanabe, Aya |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5013050263 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-0520-7847 |
| authorships[2].author.display_name | Shinnosuke Takamichi |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Takamichi, Shinnosuke |
| authorships[2].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2407.04270 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Who Finds This Voice Attractive? A Large-Scale Experiment Using In-the-Wild Data |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T12031 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.8442000150680542 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Speech and dialogue systems |
| related_works | https://openalex.org/W4387497383, https://openalex.org/W2948807893, https://openalex.org/W2778153218, https://openalex.org/W2748952813, https://openalex.org/W1531601525, https://openalex.org/W4391375266, https://openalex.org/W2078814861, https://openalex.org/W2527526854, https://openalex.org/W1976181487, https://openalex.org/W1986764834 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2407.04270 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | cc-by-sa |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2407.04270 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by-sa |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2407.04270 |
| primary_location.id | pmh:oai:arXiv.org:2407.04270 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | cc-by-sa |
| primary_location.pdf_url | https://arxiv.org/pdf/2407.04270 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | https://openalex.org/licenses/cc-by-sa |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2407.04270 |
| publication_date | 2024-07-05 |
| publication_year | 2024 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 49 |
| abstract_inverted_index.In | 37, 111 |
| abstract_inverted_index.an | 4 |
| abstract_inverted_index.as | 32 |
| abstract_inverted_index.in | 108 |
| abstract_inverted_index.is | 22, 131 |
| abstract_inverted_index.of | 48, 52, 82, 128 |
| abstract_inverted_index.or | 34 |
| abstract_inverted_index.to | 24, 101 |
| abstract_inverted_index.we | 40, 61 |
| abstract_inverted_index.885 | 42 |
| abstract_inverted_index.age | 106 |
| abstract_inverted_index.and | 14, 70, 88, 97, 105, 118, 125 |
| abstract_inverted_index.for | 28 |
| abstract_inverted_index.let | 41 |
| abstract_inverted_index.the | 59, 64, 75, 78, 94, 103, 113, 116, 122, 126 |
| abstract_inverted_index.two | 119 |
| abstract_inverted_index.1800 | 45 |
| abstract_inverted_index.This | 0, 91 |
| abstract_inverted_index.When | 57 |
| abstract_inverted_index.also | 62, 132 |
| abstract_inverted_index.both | 86 |
| abstract_inverted_index.data | 99 |
| abstract_inverted_index.rate | 44 |
| abstract_inverted_index.such | 31 |
| abstract_inverted_index.that | 10 |
| abstract_inverted_index.this | 38 |
| abstract_inverted_index.wide | 50 |
| abstract_inverted_index.ages, | 69 |
| abstract_inverted_index.given | 129 |
| abstract_inverted_index.paper | 1, 92 |
| abstract_inverted_index.range | 51 |
| abstract_inverted_index.their | 15, 55 |
| abstract_inverted_index.voice | 20, 83, 109 |
| abstract_inverted_index.biases | 107 |
| abstract_inverted_index.corpus | 9, 76 |
| abstract_inverted_index.gender | 104 |
| abstract_inverted_index.reveal | 102 |
| abstract_inverted_index.speech | 7, 12, 29, 46 |
| abstract_inverted_index.study, | 39 |
| abstract_inverted_index.voices | 27 |
| abstract_inverted_index.YouTube | 72 |
| abstract_inverted_index.between | 115 |
| abstract_inverted_index.corpus, | 60 |
| abstract_inverted_index.enables | 77 |
| abstract_inverted_index.scores. | 18 |
| abstract_inverted_index.speaker | 66, 87 |
| abstract_inverted_index.videos. | 73 |
| abstract_inverted_index.acoustic | 120 |
| abstract_inverted_index.analysis | 81, 100 |
| abstract_inverted_index.dialogue | 33 |
| abstract_inverted_index.factors. | 90 |
| abstract_inverted_index.favorite | 71 |
| abstract_inverted_index.genders, | 68 |
| abstract_inverted_index.includes | 11 |
| abstract_inverted_index.listener | 89 |
| abstract_inverted_index.multiple | 65 |
| abstract_inverted_index.segments | 13, 47 |
| abstract_inverted_index.speakers | 53 |
| abstract_inverted_index.systems, | 30 |
| abstract_inverted_index.systems. | 36 |
| abstract_inverted_index.addition, | 112 |
| abstract_inverted_index.collected | 63 |
| abstract_inverted_index.describes | 93 |
| abstract_inverted_index.designing | 25 |
| abstract_inverted_index.essential | 23 |
| abstract_inverted_index.features, | 121 |
| abstract_inverted_index.listeners | 43 |
| abstract_inverted_index.regarding | 54, 85 |
| abstract_inverted_index.x-vectors | 127 |
| abstract_inverted_index.Evaluating | 19 |
| abstract_inverted_index.Therefore, | 74 |
| abstract_inverted_index.introduces | 2 |
| abstract_inverted_index.likability | 8, 17, 21, 84, 117 |
| abstract_inverted_index.preferable | 26 |
| abstract_inverted_index.attributes: | 67 |
| abstract_inverted_index.frequencies | 124 |
| abstract_inverted_index.fundamental | 123 |
| abstract_inverted_index.large-scale | 6, 79 |
| abstract_inverted_index.likability. | 56, 110 |
| abstract_inverted_index.methodology | 96 |
| abstract_inverted_index.open-source | 5 |
| abstract_inverted_index.preliminary | 98 |
| abstract_inverted_index.statistical | 80 |
| abstract_inverted_index.utterances, | 130 |
| abstract_inverted_index.announcement | 35 |
| abstract_inverted_index.constructing | 58 |
| abstract_inverted_index.construction | 95 |
| abstract_inverted_index.per-listener | 16 |
| abstract_inverted_index.relationship | 114 |
| abstract_inverted_index.investigated. | 133 |
| abstract_inverted_index.CocoNut-Humoresque, | 3 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 3 |
| citation_normalized_percentile |
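
The `abstract_inverted_index.*` rows above store the abstract as a map from each token to the word positions where it occurs. A minimal Python sketch of how such an index can be turned back into running text follows. The function name `rebuild_abstract` and the `sample` dictionary are illustrative assumptions, not part of OpenAlex; the sample holds only the first ten tokens from the table, each with its first listed position.

```python
def rebuild_abstract(inverted_index: dict[str, list[int]]) -> str:
    """Rebuild text from a token -> positions map (hypothetical helper)."""
    by_position: dict[int, str] = {}
    for token, positions in inverted_index.items():
        for pos in positions:
            # Each word position is expected to hold exactly one token.
            by_position[pos] = token
    # Emit tokens in ascending position order, separated by single spaces.
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Truncated sample: first ten tokens of the table, first position of each.
sample = {
    "This": [0],
    "paper": [1],
    "introduces": [2],
    "CocoNut-Humoresque,": [3],
    "an": [4],
    "open-source": [5],
    "large-scale": [6],
    "speech": [7],
    "likability": [8],
    "corpus": [9],
}

opening = rebuild_abstract(sample)
print(opening)
```

Feeding the function the complete set of token and position pairs from the table should give back the abstract text, with punctuation attached to tokens exactly as it is stored in the index.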