International Journal of Advance Computational Engineering and Networking (IJACEN)
  Journal Paper


Paper Title :
SEOP: Speech Enhancement System for Punjabi Language

Author : Jaspreet Kaur Sandhu, Amitoj Singh, Munish Kumar

Article Citation : Jaspreet Kaur Sandhu, Amitoj Singh, Munish Kumar (2022), "SEOP: Speech Enhancement System for Punjabi Language", International Journal of Advance Computational Engineering and Networking (IJACEN), pp. 49-53, Volume-10, Issue-7

Abstract : This paper presents a method for Punjabi speech enhancement that combines a Bidirectional Long Short-Term Memory (BLSTM) network with a Kalman Filter (KF) to improve Punjabi speech quality. We implemented the Speech Enhancement System of Punjabi (SEOP), in which we trained two separate BLSTMs. One BLSTM learns to map acoustic features to the magnitude of clean speech, while the other learns to map acoustic features to Line Spectrum Frequencies (LSFs). The estimated clean speech is then reconstructed, and the LSFs are converted to Linear Prediction Coefficients (LPCs) for use in the KF. Experiments are carried out on acoustic and tonal features. The acoustic features include Linear Prediction Coefficients (LPC), Gammatone Frequency Cepstral Coefficients (GFCC), Mel-Frequency Cepstral Coefficients (MFCC), and Bark Frequency Cepstral Coefficients (BFCC). The experiments on acoustic and tonal features of noisy speech demonstrated the effectiveness of BLSTM with KF. The feature combinations MFCC+pitch, MFCC+BFCC+pitch, and MFCC+GFCC+BFCC+pitch achieve the best Word Error Rates (WER) of 22.97%, 22.11%, and 17.40%, respectively.

Keywords - Speech enhancement, Punjabi, Deep learning, Kalman filter, Bidirectional Long Short-Term Memory (BLSTM), Tonal features.
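
The abstract gives no implementation details for the Kalman filtering stage, so the following is only a minimal illustrative sketch, not the authors' implementation. It assumes the LPCs are already available as predictor coefficients `a` under the convention s[n] ≈ Σ a[k]·s[n-1-k], and it treats speech as an autoregressive process observed in additive white noise. The function name `kalman_enhance` and the parameters `q` (excitation variance) and `r` (noise variance) are hypothetical; the BLSTM stages and the LSF-to-LPC conversion are not shown.

```python
import numpy as np

def kalman_enhance(noisy, a, q, r):
    """Kalman-filter a noisy signal using an AR(p) speech model.

    noisy : 1-D array of noisy samples, y[n] = s[n] + v[n]
    a     : predictor coefficients, s[n] ~ sum_k a[k] * s[n-1-k] (assumed convention)
    q     : variance of the AR excitation (process noise), assumed known
    r     : variance of the additive observation noise, assumed known
    """
    a = np.asarray(a, dtype=float)
    p = len(a)

    # Companion-form transition matrix for the state [s[n], s[n-1], ..., s[n-p+1]].
    F = np.zeros((p, p))
    F[0, :] = a
    F[1:, :-1] = np.eye(p - 1)

    # Only the newest sample is observed and driven by the excitation.
    H = np.zeros(p)
    H[0] = 1.0
    Q = np.zeros((p, p))
    Q[0, 0] = q

    x = np.zeros(p)   # state estimate
    P = np.eye(p)     # state error covariance
    out = np.empty(len(noisy))

    for n, y in enumerate(noisy):
        # Prediction step.
        x = F @ x
        P = F @ P @ F.T + Q

        # Update step with the scalar observation y.
        innovation_var = H @ P @ H + r
        gain = (P @ H) / innovation_var
        x = x + gain * (y - H @ x)
        P = P - np.outer(gain, H @ P)

        out[n] = x[0]  # filtered estimate of the current clean sample
    return out
```

In a frame-based system one would typically call such a routine once per frame with that frame's coefficients; how SEOP handles framing and noise variance estimation is not stated in the abstract.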

Type : Research paper

Published : Volume-10,Issue-7


DOIONLINE NO - IJACEN-IRAJ-DOIONLINE-18839

Copyright: © Institute of Research and Journals

Published on 2022-10-10
   
   