Automatic Pronunciation Evaluation and Feedback Generation System based on Resource-efficient factorized TDNN and Phoneme Error Pattern
2025 · Open Access
· DOI: https://doi.org/10.35444/ijana.2025.17102
In this paper we describe an automatic pronunciation error detection and feedback generation system for non-native second language (L2) learners. The system combines a deep acoustic model based on a factorized TDNN (TDNN-F) with a language model that incorporates a phoneme error model. Because the language model is built from learners' phoneme error patterns, the system can give learners useful feedback. The deep acoustic model consists of a TDNN-F with grouped fully-connected layers and a shuffle operation. This network architecture maintains recognition accuracy comparable to a traditional TDNN at lower cost. The system also evaluates the pronunciation proficiency of an utterance at the word level and the phoneme level, based on confidence from a Minimum Bayesian Risk decoder, and generates feedback from the phone error model of L2 learners. Because it is built on a resource-efficient deep acoustic architecture, the system can be deployed on resource-limited mobile devices.
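The abstract does not give the layer details, so the following is only a minimal sketch of what a grouped fully-connected layer followed by a shuffle operation could look like. It assumes a ShuffleNet-style channel shuffle and uses hypothetical helper names (`channel_shuffle`, `grouped_linear`, `grouped_params`) and hypothetical dimensions; none of these come from the paper.

```python
from typing import List


def channel_shuffle(features: List[float], groups: int) -> List[float]:
    """Interleave channels across groups (assumed ShuffleNet-style permutation)."""
    n = len(features)
    if n % groups != 0:
        raise ValueError("feature dimension must be divisible by groups")
    per_group = n // groups
    # View the vector as (groups, per_group), transpose, then flatten.
    return [features[g * per_group + c]
            for c in range(per_group)
            for g in range(groups)]


def grouped_linear(features: List[float],
                   weights: List[List[List[float]]]) -> List[float]:
    """Apply one independent weight matrix per group.

    weights[g] is a list of output rows; each row has as many entries
    as there are input channels in group g.
    """
    groups = len(weights)
    per_group = len(features) // groups
    out: List[float] = []
    for g, matrix in enumerate(weights):
        chunk = features[g * per_group:(g + 1) * per_group]
        out.extend(sum(w * x for w, x in zip(row, chunk)) for row in matrix)
    return out


def dense_params(d_in: int, d_out: int) -> int:
    """Weight count of an ordinary fully-connected layer (bias ignored)."""
    return d_in * d_out


def grouped_params(d_in: int, d_out: int, groups: int) -> int:
    """Weight count when the layer is split into independent groups."""
    return groups * (d_in // groups) * (d_out // groups)


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0]
    # Two groups, each with a 2x2 weight matrix (illustrative values).
    w = [[[1.0, 0.0], [0.0, 1.0]], [[2.0, 0.0], [0.0, 2.0]]]
    y = channel_shuffle(grouped_linear(x, w), groups=2)
    print(y)
    print(dense_params(1024, 1024), grouped_params(1024, 1024, 4))
```

Splitting a layer into g groups divides its weight count by g, which is the usual source of the savings. The shuffle lets the next grouped layer see channels from every group, so information is not confined to one group across the stack.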
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.35444/ijana.2025.17102
- OA Status
- diamond
- References
- 33
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4412356061
Raw OpenAlex JSON
- OpenAlex ID (canonical identifier for this work in OpenAlex)
- https://openalex.org/W4412356061
- DOI (Digital Object Identifier)
- https://doi.org/10.35444/ijana.2025.17102
- Title (work title)
- Automatic Pronunciation Evaluation and Feedback Generation System based on Resource-efficient factorized TDNN and Phoneme Error Pattern
- Type (OpenAlex work type)
- article
- Language (primary language)
- en
- Publication year (year of publication)
- 2025
- Publication date (full publication date if available)
- 2025-01-01
- Authors (list of authors in order)
- Il Song Han, Hyok Kwak, O Chung-Hyok, Kiroong Choe, Chol-Nam Om
- Landing page (publisher landing page)
- https://doi.org/10.35444/ijana.2025.17102
- PDF URL (direct link to full text PDF)
- https://doi.org/10.35444/ijana.2025.17102
- Open access (whether a free full text is available)
- Yes
- OA status (open access status per OpenAlex)
- diamond
- OA URL (direct OA link when available)
- https://doi.org/10.35444/ijana.2025.17102
- Concepts (top concepts, i.e. fields/topics, attached by OpenAlex)
- Computer science, Pronunciation, Speech recognition, Resource (disambiguation), Artificial intelligence, Pattern recognition (psychology), Computer network, Philosophy, Linguistics
- Cited by (total citation count in OpenAlex)
- 0
- References (count) (number of works referenced by this work)
- 33
- Related works (count) (other works algorithmically related by OpenAlex)
- 10
Full payload
| id | https://openalex.org/W4412356061 |
|---|---|
| doi | https://doi.org/10.35444/ijana.2025.17102 |
| ids.doi | https://doi.org/10.35444/ijana.2025.17102 |
| ids.openalex | https://openalex.org/W4412356061 |
| fwci | 0.0 |
| type | article |
| title | Automatic Pronunciation Evaluation and Feedback Generation System based on Resource-efficient factorized TDNN and Phoneme Error Pattern |
| biblio.issue | 01 |
| biblio.volume | 17 |
| biblio.last_page | 6727 |
| biblio.first_page | 6719 |
| topics[0].id | https://openalex.org/T10201 |
| topics[0].field.id | https://openalex.org/fields/17 |
| topics[0].field.display_name | Computer Science |
| topics[0].score | 0.925599992275238 |
| topics[0].domain.id | https://openalex.org/domains/3 |
| topics[0].domain.display_name | Physical Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1702 |
| topics[0].subfield.display_name | Artificial Intelligence |
| topics[0].display_name | Speech Recognition and Synthesis |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.9181289076805115 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C2780844864 |
| concepts[1].level | 2 |
| concepts[1].score | 0.7986266613006592 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q184377 |
| concepts[1].display_name | Pronunciation |
| concepts[2].id | https://openalex.org/C28490314 |
| concepts[2].level | 1 |
| concepts[2].score | 0.6443203687667847 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q189436 |
| concepts[2].display_name | Speech recognition |
| concepts[3].id | https://openalex.org/C206345919 |
| concepts[3].level | 2 |
| concepts[3].score | 0.4298584461212158 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q20380951 |
| concepts[3].display_name | Resource (disambiguation) |
| concepts[4].id | https://openalex.org/C154945302 |
| concepts[4].level | 1 |
| concepts[4].score | 0.41431185603141785 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[4].display_name | Artificial intelligence |
| concepts[5].id | https://openalex.org/C153180895 |
| concepts[5].level | 2 |
| concepts[5].score | 0.38028034567832947 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q7148389 |
| concepts[5].display_name | Pattern recognition (psychology) |
| concepts[6].id | https://openalex.org/C31258907 |
| concepts[6].level | 1 |
| concepts[6].score | 0.07064729928970337 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q1301371 |
| concepts[6].display_name | Computer network |
| concepts[7].id | https://openalex.org/C138885662 |
| concepts[7].level | 0 |
| concepts[7].score | 0.0 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q5891 |
| concepts[7].display_name | Philosophy |
| concepts[8].id | https://openalex.org/C41895202 |
| concepts[8].level | 1 |
| concepts[8].score | 0.0 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q8162 |
| concepts[8].display_name | Linguistics |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.9181289076805115 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/pronunciation |
| keywords[1].score | 0.7986266613006592 |
| keywords[1].display_name | Pronunciation |
| keywords[2].id | https://openalex.org/keywords/speech-recognition |
| keywords[2].score | 0.6443203687667847 |
| keywords[2].display_name | Speech recognition |
| keywords[3].id | https://openalex.org/keywords/resource |
| keywords[3].score | 0.4298584461212158 |
| keywords[3].display_name | Resource (disambiguation) |
| keywords[4].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[4].score | 0.41431185603141785 |
| keywords[4].display_name | Artificial intelligence |
| keywords[5].id | https://openalex.org/keywords/pattern-recognition |
| keywords[5].score | 0.38028034567832947 |
| keywords[5].display_name | Pattern recognition (psychology) |
| keywords[6].id | https://openalex.org/keywords/computer-network |
| keywords[6].score | 0.07064729928970337 |
| keywords[6].display_name | Computer network |
| language | en |
| locations[0].id | doi:10.35444/ijana.2025.17102 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4210211723 |
| locations[0].source.issn | 0975-0282, 0975-0290 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 0975-0282 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | International Journal of Advanced Networking and Applications |
| locations[0].source.host_organization | |
| locations[0].source.host_organization_name | |
| locations[0].license | |
| locations[0].pdf_url | https://doi.org/10.35444/ijana.2025.17102 |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | International Journal of Advanced Networking and Applications |
| locations[0].landing_page_url | https://doi.org/10.35444/ijana.2025.17102 |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5110503476 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Il Song Han |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | None Il Han |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5114064729 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Hyok Kwak |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | None Hyok Kwak |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5039988876 |
| authorships[2].author.orcid | |
| authorships[2].author.display_name | O Chung-Hyok |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | None Chung-Hyok O |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5088522352 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-6084-2539 |
| authorships[3].author.display_name | Kiroong Choe |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | None Kang-Song Choe |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5027466476 |
| authorships[4].author.orcid | |
| authorships[4].author.display_name | Chol-Nam Om |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | None Chol-Nam Om |
| authorships[4].is_corresponding | False |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://doi.org/10.35444/ijana.2025.17102 |
| open_access.oa_status | diamond |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Automatic Pronunciation Evaluation and Feedback Generation System based on Resource-efficient factorized TDNN and Phoneme Error Pattern |
| has_fulltext | True |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10201 |
| primary_topic.field.id | https://openalex.org/fields/17 |
| primary_topic.field.display_name | Computer Science |
| primary_topic.score | 0.925599992275238 |
| primary_topic.domain.id | https://openalex.org/domains/3 |
| primary_topic.domain.display_name | Physical Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1702 |
| primary_topic.subfield.display_name | Artificial Intelligence |
| primary_topic.display_name | Speech Recognition and Synthesis |
| related_works | https://openalex.org/W2183593636, https://openalex.org/W2350724007, https://openalex.org/W2355751417, https://openalex.org/W2423284978, https://openalex.org/W2083922162, https://openalex.org/W2000075989, https://openalex.org/W3006486861, https://openalex.org/W4220683390, https://openalex.org/W2033914206, https://openalex.org/W2042327336 |
| cited_by_count | 0 |
| locations_count | 1 |
| best_oa_location.id | doi:10.35444/ijana.2025.17102 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4210211723 |
| best_oa_location.source.issn | 0975-0282, 0975-0290 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 0975-0282 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | International Journal of Advanced Networking and Applications |
| best_oa_location.source.host_organization | |
| best_oa_location.source.host_organization_name | |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://doi.org/10.35444/ijana.2025.17102 |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | International Journal of Advanced Networking and Applications |
| best_oa_location.landing_page_url | https://doi.org/10.35444/ijana.2025.17102 |
| primary_location.id | doi:10.35444/ijana.2025.17102 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4210211723 |
| primary_location.source.issn | 0975-0282, 0975-0290 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 0975-0282 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | International Journal of Advanced Networking and Applications |
| primary_location.source.host_organization | |
| primary_location.source.host_organization_name | |
| primary_location.license | |
| primary_location.pdf_url | https://doi.org/10.35444/ijana.2025.17102 |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | International Journal of Advanced Networking and Applications |
| primary_location.landing_page_url | https://doi.org/10.35444/ijana.2025.17102 |
| publication_date | 2025-01-01 |
| publication_year | 2025 |
| referenced_works | https://openalex.org/W1524577339, https://openalex.org/W2126930309, https://openalex.org/W2560263812, https://openalex.org/W2515576076, https://openalex.org/W2251492169, https://openalex.org/W2091359957, https://openalex.org/W2139008940, https://openalex.org/W2295209184, https://openalex.org/W6680263073, https://openalex.org/W2112375231, https://openalex.org/W296713244, https://openalex.org/W105558132, https://openalex.org/W1496430420, https://openalex.org/W2134124280, https://openalex.org/W2132049498, https://openalex.org/W2008834750, https://openalex.org/W2155048744, https://openalex.org/W12857174, https://openalex.org/W2398741870, https://openalex.org/W2184045248, https://openalex.org/W2402640444, https://openalex.org/W2091856355, https://openalex.org/W2770743368, https://openalex.org/W2777448780, https://openalex.org/W2143612262, https://openalex.org/W2077155272, https://openalex.org/W6631362777, https://openalex.org/W1993721840, https://openalex.org/W2117671523, https://openalex.org/W2294543795, https://openalex.org/W2402146185, https://openalex.org/W2888867175, https://openalex.org/W2724359148 |
| referenced_works_count | 33 |
| abstract_inverted_index.a | 46 |
| abstract_inverted_index.In | 0 |
| abstract_inverted_index.L2 | 108 |
| abstract_inverted_index.an | 5 |
| abstract_inverted_index.be | 119 |
| abstract_inverted_index.by | 19 |
| abstract_inverted_index.in | 86, 121 |
| abstract_inverted_index.is | 101 |
| abstract_inverted_index.of | 55, 84, 107 |
| abstract_inverted_index.on | 25, 93, 103, 113 |
| abstract_inverted_index.we | 3 |
| abstract_inverted_index.Our | 35 |
| abstract_inverted_index.and | 10, 28, 44, 61, 73, 89 |
| abstract_inverted_index.can | 118 |
| abstract_inverted_index.for | 14, 49 |
| abstract_inverted_index.it. | 77 |
| abstract_inverted_index.our | 79 |
| abstract_inverted_index.Deep | 51 |
| abstract_inverted_index.Risk | 98 |
| abstract_inverted_index.TDNN | 27, 72 |
| abstract_inverted_index.This | 64, 110 |
| abstract_inverted_index.deep | 21, 115 |
| abstract_inverted_index.from | 95 |
| abstract_inverted_index.less | 75 |
| abstract_inverted_index.like | 70 |
| abstract_inverted_index.then | 76 |
| abstract_inverted_index.this | 1 |
| abstract_inverted_index.with | 31, 57 |
| abstract_inverted_index.word | 87 |
| abstract_inverted_index.Also, | 78 |
| abstract_inverted_index.based | 24, 92, 112 |
| abstract_inverted_index.costs | 74 |
| abstract_inverted_index.error | 8, 33, 42, 105 |
| abstract_inverted_index.gives | 45 |
| abstract_inverted_index.level | 88, 91 |
| abstract_inverted_index.model | 23, 30, 39, 53, 106 |
| abstract_inverted_index.paper | 2 |
| abstract_inverted_index.phone | 104 |
| abstract_inverted_index.using | 20 |
| abstract_inverted_index.TDNN-F | 56 |
| abstract_inverted_index.builds | 37 |
| abstract_inverted_index.layers | 60 |
| abstract_inverted_index.mobile | 123 |
| abstract_inverted_index.model. | 34 |
| abstract_inverted_index.second | 16 |
| abstract_inverted_index.system | 13, 36, 80, 111 |
| abstract_inverted_index.useful | 47 |
| abstract_inverted_index.Minimum | 96 |
| abstract_inverted_index.grouped | 58 |
| abstract_inverted_index.network | 65 |
| abstract_inverted_index.phoneme | 32, 41, 90 |
| abstract_inverted_index.shuffle | 62 |
| abstract_inverted_index.Bayesian | 97 |
| abstract_inverted_index.accuracy | 69 |
| abstract_inverted_index.acoustic | 22, 52, 116 |
| abstract_inverted_index.consists | 54 |
| abstract_inverted_index.decoder, | 99 |
| abstract_inverted_index.deployed | 120 |
| abstract_inverted_index.describe | 4 |
| abstract_inverted_index.devices. | 124 |
| abstract_inverted_index.feedback | 11, 48, 100 |
| abstract_inverted_index.language | 17, 29, 38 |
| abstract_inverted_index.learners | 18 |
| abstract_inverted_index.patterns | 43 |
| abstract_inverted_index.automatic | 6 |
| abstract_inverted_index.detection | 9 |
| abstract_inverted_index.evaluates | 81 |
| abstract_inverted_index.generated | 102 |
| abstract_inverted_index.learners. | 50, 109 |
| abstract_inverted_index.maintains | 67 |
| abstract_inverted_index.utterance | 85 |
| abstract_inverted_index.confidence | 94 |
| abstract_inverted_index.factorized | 26 |
| abstract_inverted_index.generation | 12 |
| abstract_inverted_index.non-native | 15 |
| abstract_inverted_index.operation. | 63 |
| abstract_inverted_index.considering | 40 |
| abstract_inverted_index.proficiency | 83 |
| abstract_inverted_index.recognition | 68 |
| abstract_inverted_index.traditional | 71 |
| abstract_inverted_index.architecture | 66, 117 |
| abstract_inverted_index.pronunciation | 7, 82 |
| abstract_inverted_index.fully-connected | 59 |
| abstract_inverted_index.resource-limited | 122 |
| abstract_inverted_index.resource-efficient | 114 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 5 |
| citation_normalized_percentile.value | 0.12336179 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |
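
The `abstract_inverted_index.*` rows in the payload above map each word of the abstract to the zero-based positions where it occurs. As a minimal sketch, assuming the rows have already been parsed into a dictionary of word to position list, the abstract text can be rebuilt as follows. The function name `rebuild_abstract` and the sample dictionary are illustrative, not part of any OpenAlex client library.

```python
from typing import Dict, List


def rebuild_abstract(inverted_index: Dict[str, List[int]]) -> str:
    """Rebuild running text from a word -> positions mapping."""
    by_position: Dict[int, str] = {}
    for word, positions in inverted_index.items():
        for pos in positions:
            by_position[pos] = word
    # Join the words in ascending position order.
    return " ".join(by_position[pos] for pos in sorted(by_position))


if __name__ == "__main__":
    # First words of the abstract, copied from the payload rows above.
    sample = {
        "In": [0],
        "this": [1],
        "paper": [2],
        "we": [3],
        "describe": [4],
        "an": [5],
    }
    print(rebuild_abstract(sample))
```

Words that occur more than once, such as `and` or `model`, carry several positions, so the loop writes the same word into each of its slots.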