Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2509.16480
Accurate pitch estimation is essential for numerous speech processing applications, yet it remains challenging in high-distortion environments. This paper proposes a pitch estimation method that delivers robust estimates in challenging noise environments. Our approach computes the Normalized Average Magnitude Difference Function (NAMDF), transforms it into a likelihood function, and generates probabilistic pitch states for frames at each sample shift. To enhance noise robustness, we aggregate likelihood values across integer multiples of the pitch period and across neighboring frames. Furthermore, we introduce a simple yet effective continuity constraint in the Viterbi algorithm to refine pitch selection among multiple candidates. Experimental results show that our method consistently achieves lower Gross Pitch Error (GPE) and Voicing Decision Error (VDE) than existing methods across various SNR levels, in both noisy and reverberant conditions.
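The first two steps of the abstract (a NAMDF turned into a likelihood, then aggregated over integer multiples of the pitch period) can be illustrated with a small sketch. This is not the authors' implementation: the normalization by the frame's mean magnitude, the mapping `exp(-sharpness * d)`, the parameter names `n_multiples` and `sharpness`, and the synthetic test frame are all assumptions made for illustration. Aggregation across neighboring frames and the Viterbi continuity constraint are left out.

```python
import numpy as np


def namdf(frame, min_lag, max_lag):
    """Average magnitude difference per lag, scaled by the frame's mean magnitude.

    The scaling is an assumed normalization, not necessarily the paper's.
    """
    frame = np.asarray(frame, dtype=float)
    if min_lag < 1 or len(frame) <= max_lag:
        raise ValueError("need 1 <= min_lag and len(frame) > max_lag")
    lags = np.arange(min_lag, max_lag + 1)
    scale = 2.0 * np.mean(np.abs(frame)) + 1e-12  # avoids division by zero
    diffs = [np.mean(np.abs(frame[lag:] - frame[:-lag])) for lag in lags]
    return lags, np.array(diffs) / scale


def harmonic_likelihood(frame, min_lag, max_lag, n_multiples=3, sharpness=5.0):
    """Average an assumed likelihood exp(-sharpness * NAMDF) over lags T, 2T, ..."""
    # Compute the NAMDF far enough out that every multiple k*T is available.
    _, d = namdf(frame, min_lag, n_multiples * max_lag)
    likelihood = np.exp(-sharpness * d)  # low difference -> high likelihood
    candidates = np.arange(min_lag, max_lag + 1)
    total = np.zeros(len(candidates))
    for k in range(1, n_multiples + 1):
        total += likelihood[k * candidates - min_lag]  # array index of lag k*T
    return candidates, total / n_multiples


# Synthetic voiced frame: 200 Hz tone plus one harmonic and mild noise at 8 kHz.
fs = 8000
n = np.arange(512)
rng = np.random.default_rng(0)
frame = (np.sin(2 * np.pi * 200 * n / fs)
         + 0.5 * np.sin(2 * np.pi * 400 * n / fs)
         + 0.05 * rng.standard_normal(n.size))

candidates, score = harmonic_likelihood(frame, min_lag=20, max_lag=70)
best_lag = int(candidates[np.argmax(score)])
f0_estimate = fs / best_lag
```

On this synthetic frame the aggregated score is expected to peak at a lag of 40 samples, i.e. 200 Hz. The lag range is deliberately limited to 20..70 so that the double period (80 samples) is not a candidate; on a periodic signal, multiples of the true period score equally well under this aggregation, which is the kind of ambiguity a temporal continuity constraint is meant to resolve.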
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2509.16480, https://arxiv.org/pdf/2509.16480
- OA Status: green
- OpenAlex ID: https://openalex.org/W4415251858
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4415251858 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2509.16480 (Digital Object Identifier)
- Title: Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2025 (year of publication)
- Publication date: 2025-09-20 (full publication date if available)
- Authors: Anup K. Singh, Kris Demuynck (list of authors in order)
- Landing page: https://arxiv.org/abs/2509.16480 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2509.16480 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2509.16480 (direct OA link when available)
- Cited by: 0 (total citation count in OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4415251858 |
| doi | https://doi.org/10.48550/arxiv.2509.16480 |
| ids.doi | https://doi.org/10.48550/arxiv.2509.16480 |
| ids.openalex | https://openalex.org/W4415251858 |
| fwci | |
| type | preprint |
| title | Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10860 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.9979000091552734 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1711 |
| topics[0].subfield.display_name | Signal Processing |
| topics[0].display_name | Speech and Audio Processing |
| topics[1].id | https://openalex.org/T10822 |
| topics[1].field.id | https://openalex.org/fields/22 |
| topics[1].field.display_name | Engineering |
| topics[1].score | 0.9908999800682068 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/2204 |
| topics[1].subfield.display_name | Biomedical Engineering |
| topics[1].display_name | Acoustic Wave Phenomena Research |
| topics[2].id | https://openalex.org/T11008 |
| topics[2].field.id | https://openalex.org/fields/22 |
| topics[2].field.display_name | Engineering |
| topics[2].score | 0.98089998960495 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/2202 |
| topics[2].subfield.display_name | Aerospace Engineering |
| topics[2].display_name | Aerodynamics and Acoustics in Jet Flows |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2509.16480 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2509.16480 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2509.16480 |
| locations[1].id | doi:10.48550/arxiv.2509.16480 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2509.16480 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5101680403 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-3653-1618 |
| authorships[0].author.display_name | Anup K. Singh |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Singh, Anup |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5046536366 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-8525-7160 |
| authorships[1].author.display_name | Kris Demuynck |
| authorships[1].author_position | last |
| authorships[1].raw_author_name | Demuynck, Kris |
| authorships[1].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2509.16480 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-16T00:00:00 |
| display_name | Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10860 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.9979000091552734 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1711 |
| primary_topic.subfield.display_name | Signal Processing |
| primary_topic.display_name | Speech and Audio Processing |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2509.16480 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2509.16480 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2509.16480 |
| primary_location.id | pmh:oai:arXiv.org:2509.16480 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2509.16480 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2509.16480 |
| publication_date | 2025-09-20 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 20, 47, 82 |
| abstract_inverted_index.To | 61 |
| abstract_inverted_index.at | 57 |
| abstract_inverted_index.in | 14, 30, 88, 124 |
| abstract_inverted_index.is | 3 |
| abstract_inverted_index.it | 11, 45 |
| abstract_inverted_index.of | 72 |
| abstract_inverted_index.to | 92 |
| abstract_inverted_index.we | 65, 80 |
| abstract_inverted_index.Our | 34 |
| abstract_inverted_index.SNR | 119 |
| abstract_inverted_index.and | 50, 76, 112, 127 |
| abstract_inverted_index.for | 5, 55 |
| abstract_inverted_index.our | 103 |
| abstract_inverted_index.the | 37, 73, 89 |
| abstract_inverted_index.yet | 10, 84 |
| abstract_inverted_index.This | 17 |
| abstract_inverted_index.both | 125 |
| abstract_inverted_index.each | 58 |
| abstract_inverted_index.into | 46 |
| abstract_inverted_index.show | 101 |
| abstract_inverted_index.that | 25, 102 |
| abstract_inverted_index.(GPE) | 111 |
| abstract_inverted_index.(VDE) | 116 |
| abstract_inverted_index.Error | 110, 115 |
| abstract_inverted_index.Gross | 108 |
| abstract_inverted_index.Pitch | 109 |
| abstract_inverted_index.among | 96 |
| abstract_inverted_index.lower | 107 |
| abstract_inverted_index.noise | 32, 63 |
| abstract_inverted_index.noisy | 126 |
| abstract_inverted_index.paper | 18 |
| abstract_inverted_index.pitch | 1, 22, 28, 53, 74, 94 |
| abstract_inverted_index.across | 69, 117 |
| abstract_inverted_index.frames | 56 |
| abstract_inverted_index.method | 24, 104 |
| abstract_inverted_index.period | 75 |
| abstract_inverted_index.refine | 93 |
| abstract_inverted_index.robust | 21, 27 |
| abstract_inverted_index.sample | 59 |
| abstract_inverted_index.shift. | 60 |
| abstract_inverted_index.simple | 83 |
| abstract_inverted_index.speech | 7 |
| abstract_inverted_index.states | 54 |
| abstract_inverted_index.values | 68 |
| abstract_inverted_index.Average | 39 |
| abstract_inverted_index.Viterbi | 90 |
| abstract_inverted_index.Voicing | 113 |
| abstract_inverted_index.enhance | 62 |
| abstract_inverted_index.frames. | 78 |
| abstract_inverted_index.integer | 70 |
| abstract_inverted_index.levels, | 120 |
| abstract_inverted_index.methods | 123 |
| abstract_inverted_index.remains | 12 |
| abstract_inverted_index.results | 100 |
| abstract_inverted_index.various | 118 |
| abstract_inverted_index.(NAMDF), | 43 |
| abstract_inverted_index.Accurate | 0 |
| abstract_inverted_index.Decision | 114 |
| abstract_inverted_index.Function | 42 |
| abstract_inverted_index.achieves | 106 |
| abstract_inverted_index.approach | 35 |
| abstract_inverted_index.computes | 36 |
| abstract_inverted_index.delivers | 26 |
| abstract_inverted_index.existing | 122 |
| abstract_inverted_index.multiple | 97 |
| abstract_inverted_index.numerous | 6 |
| abstract_inverted_index.proposes | 19 |
| abstract_inverted_index.Magnitude | 40 |
| abstract_inverted_index.aggregate | 66 |
| abstract_inverted_index.algorithm | 91 |
| abstract_inverted_index.effective | 85 |
| abstract_inverted_index.essential | 4 |
| abstract_inverted_index.estimates | 29 |
| abstract_inverted_index.function, | 49 |
| abstract_inverted_index.generates | 51 |
| abstract_inverted_index.introduce | 81 |
| abstract_inverted_index.multiples | 71 |
| abstract_inverted_index.selection | 95 |
| abstract_inverted_index.Difference | 41 |
| abstract_inverted_index.Normalized | 38 |
| abstract_inverted_index.constraint | 87 |
| abstract_inverted_index.continuity | 86 |
| abstract_inverted_index.estimation | 2, 23 |
| abstract_inverted_index.likelihood | 48, 67 |
| abstract_inverted_index.processing | 8 |
| abstract_inverted_index.transforms | 44 |
| abstract_inverted_index.candidates. | 98 |
| abstract_inverted_index.challenging | 13, 31 |
| abstract_inverted_index.conditions. | 129 |
| abstract_inverted_index.neighboring | 77 |
| abstract_inverted_index.reverberant | 128 |
| abstract_inverted_index.robustness, | 64 |
| abstract_inverted_index.Experimental | 99 |
| abstract_inverted_index.Furthermore, | 79 |
| abstract_inverted_index.consistently | 105 |
| abstract_inverted_index.applications, | 9 |
| abstract_inverted_index.environments. | 16, 33 |
| abstract_inverted_index.outperforming | 121 |
| abstract_inverted_index.probabilistic | 52 |
| abstract_inverted_index.high-distortion | 15 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 2 |
| citation_normalized_percentile | |
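
The `abstract_inverted_index.*` rows in the payload above store the abstract as a map from each word to the positions where it occurs. A minimal sketch of how such rows could be turned back into running text follows; the helper names `parse_rows` and `rebuild_text` are hypothetical, and the parser assumes the two-column pipe-table layout used on this page rather than the raw JSON.

```python
PREFIX = "abstract_inverted_index."


def parse_rows(rows):
    """Collect word -> positions from two-cell table rows with the index prefix."""
    index = {}
    for row in rows:
        cells = [c.strip() for c in row.strip().strip("|").split("|")]
        if len(cells) == 2 and cells[0].startswith(PREFIX) and cells[1]:
            word = cells[0][len(PREFIX):]
            index[word] = [int(p) for p in cells[1].split(",")]
    return index


def rebuild_text(index):
    """Place each word at its positions and join them in position order."""
    by_position = {}
    for word, positions in index.items():
        for pos in positions:
            by_position[pos] = word
    return " ".join(by_position[pos] for pos in sorted(by_position))


# Four rows copied from the payload table (positions 6 to 9 of the abstract).
rows = [
    "| abstract_inverted_index.speech | 7 |",
    "| abstract_inverted_index.numerous | 6 |",
    "| abstract_inverted_index.applications, | 9 |",
    "| abstract_inverted_index.processing | 8 |",
]
fragment = rebuild_text(parse_rows(rows))
print(fragment)  # numerous speech processing applications,
```

Feeding all `abstract_inverted_index.*` rows through the same two functions would be expected to reproduce the abstract word for word, since every position from 0 to 129 appears exactly once in the table.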