MyJournals Home  

RSS FeedsSensors, Vol. 20, Pages 1809: End-to-End Automatic Pronunciation Error Detection Based on Improved Hybrid CTC/Attention Architecture (Sensors)

 
 

25 march 2020 22:00:07

 
Sensors, Vol. 20, Pages 1809: End-to-End Automatic Pronunciation Error Detection Based on Improved Hybrid CTC/Attention Architecture (Sensors)
 


Advanced automatic pronunciation error detection (APED) algorithms are usually based on state-of-the-art automatic speech recognition (ASR) techniques. With the development of deep learning technology, end-to-end ASR technology has gradually matured and achieved positive practical results, which provides us with a new opportunity to update the APED algorithm. We first constructed an end-to-end ASR system based on the hybrid connectionist temporal classification and attention (CTC/attention) architecture. An adaptive parameter was used to enhance the complementarity of the connectionist temporal classification (CTC) model and the attention-based seq2seq model, further improving the performance of the ASR system. After this, the improved ASR system was used in the APED task of Mandarin, and good results were obtained. This new APED method makes force alignment and segmentation unnecessary, and it does not require multiple complex models, such as an acoustic model or a language model. It is convenient and straightforward, and will be a suitable general solution for L1-independent computer-assisted pronunciation training (CAPT). Furthermore, we find that find that in regards to accuracy metrics, our proposed system based on the improved hybrid CTC/attention architecture is close to the state-of-the-art ASR system based on the deep neural network–deep neural network (DNN–DNN) architecture, and has a stronger effect on the F-measure metrics, which are especially suitable for the requirements of the APED task.


 
170 viewsCategory: Chemistry, Physics
 
[ASAP] Direct Measurement of Radical-Catalyzed C6H6 Formation from Acetylene and Validation of Theoretical Rate Coefficients for C2H3 + C2H2 and C4H5 + C2H2 Reactions (Journal of Physical Chemistry A)
Sensors, Vol. 20, Pages 1831: An Effective Sensor Deployment Scheme that Ensures Multilevel Coverage of Wireless Sensor Networks with Uncertain Properties (Sensors)
 
 
blog comments powered by Disqus


MyJournals.org
The latest issues of all your favorite science journals on one page

Username:
Password:

Register | Retrieve

Search:

Physics


Copyright © 2008 - 2024 Indigonet Services B.V.. Contact: Tim Hulsen. Read here our privacy notice.
Other websites of Indigonet Services B.V.: Nieuws Vacatures News Tweets Nachrichten