TY - CHAP AU - Mehra, P. AU - Kant, Verma S. TI - Comparing Classifiers for Recognizing the Emotions by extracting the Spectral Features of Speech Using Machine Learning PB - Institute of Electrical and Electronics Engineers (IEEE) SN - 9781665474924 T3 - Proceedings - IEEE International Conference on Device Intelligence, Computing and Communication Technologies, DICCT 2023 PY - 2023 SP - 387 EP - 391 PG - 5 DO - 10.1109/DICCT56244.2023.10110282 UR - https://m2.mtmt.hu/api/publication/34543490 ID - 34543490 N1 - Export Date: 30 January 2024 Correspondence Address: Mehra, P.; Uttarakhand Technical University, Uttarakhand, India; email: pramodmehra11@gmail.com LA - English DB - MTMT ER - TY - JOUR AU - Bhatia, S. AU - Devi, A. AU - Alsuwailem, R.I. AU - Mashat, A. TI - Convolutional Neural Network Based Real Time Arabic Speech Recognition to Arabic Braille for Hearing and Visually Impaired JF - FRONTIERS IN PUBLIC HEALTH J2 - FRONT PUBLIC HEALTH VL - 10 PY - 2022 SN - 2296-2565 DO - 10.3389/fpubh.2022.898355 UR - https://m2.mtmt.hu/api/publication/33634015 ID - 33634015 N1 - Department of Information Systems, College of Computer Sciences and Information Technology, King Faisal University, Al Hasa, Saudi Arabia Research Head, AP3 Solutions, Chennai, India Faculty of Computing and Information Technology, King Abdulaziz University, Rabigh, Saudi Arabia Export Date: 10 February 2023 Correspondence Address: Bhatia, S.; Department of Information Systems, Saudi Arabia; email: sbhatia@kfu.edu.sa Correspondence Address: Devi, A.; Research Head, India; email: research_head@ap3-solutions.com LA - English DB - MTMT ER - TY - JOUR AU - Mehra, P. AU - Jain, P. TI - ERIL: An Algorithm for Emotion Recognition from Indian Languages Using Machine Learning JF - WIRELESS PERSONAL COMMUNICATIONS J2 - WIRELESS PERS COMMUN VL - 126 PY - 2022 IS - 3 SP - 2557 EP - 2577 PG - 21 SN - 0929-6212 DO - 10.1007/s11277-022-09829-1 UR - https://m2.mtmt.hu/api/publication/33634014 ID - 33634014 N1 - Export Date: 10 February 2023 CODEN: WPCOF Correspondence Address: Mehra, P.; Uttarakhand Technical University, Uttarakhand, India; email: pramodmehra11@gmail.com LA - English DB - MTMT ER - TY - JOUR AU - Mehra, Pramod AU - Verma, Shashi Kant TI - BERIS: An mBERT-based Emotion Recognition Algorithm from Indian Speech JF - ACM Transactions on Asian and Low-Resource Language Information Processing J2 - ACM T ASIAN LOW-RESO VL - 21 PY - 2022 IS - 5 SP - 1 EP - 19 PG - 19 SN - 2375-4699 DO - 10.1145/3517195 UR - https://m2.mtmt.hu/api/publication/33634027 ID - 33634027 AB - Emotions, the building blocks of the human intellect, play a vital role in Artificial Intelligence (AI). For a robust AI-based machine, it is important that the machine understands human emotions. COVID-19 has introduced the world to no-touch intelligent systems. With an influx of users, it is critical to create devices that can communicate in a local dialect. A multilingual system is required in countries like India, which has a large population and a diverse range of languages. Given the importance of multilingual emotion recognition, this research introduces BERIS, an Indian language emotion detection system. From the Indian sound recording, BERIS estimates both acoustic and textual characteristics. To extract the textual features, we used Multilingual Bidirectional Encoder Representations from Transformers. For acoustics, BERIS computes the Mel Frequency Cepstral Coefficients and Linear Prediction coefficients, and Pitch. The features extracted are merged in a linear array. Since the dialogues are of varied lengths, the data are normalized to have arrays of equal length. Finally, we split the data into training and validated set to construct a predictive model. The model can predict emotions from the new input. On all the datasets presented, quantitative and qualitative evaluations show that the proposed algorithm outperforms state-of-the-art approaches. LA - English DB - MTMT ER - TY - CHAP AU - Choudhary, H. AU - Sadhya, D. AU - Patel, V. ED - IEEE, , TI - Automatic Speaker Verification using Gammatone Frequency Cepstral Coefficients T2 - 2021 8th International Conference on Signal Processing and Integrated Networks (SPIN) PB - IEEE CY - Piscataway (NJ) SN - 9781665435642 PY - 2021 SP - 424 EP - 428 PG - 5 DO - 10.1109/SPIN52536.2021.9566150 UR - https://m2.mtmt.hu/api/publication/33634018 ID - 33634018 N1 - Export Date: 10 February 2023 LA - English DB - MTMT ER - TY - JOUR AU - Malik, M. AU - Malik, M.K. AU - Mehmood, K. AU - Makhdoom, I. TI - Automatic speech recognition: a survey JF - MULTIMEDIA TOOLS AND APPLICATIONS: AN INTERNATIONAL JOURNAL J2 - MULTIMED TOOLS APPL VL - 80 PY - 2021 IS - 6 SP - 9411 EP - 9457 PG - 47 SN - 1380-7501 DO - 10.1007/s11042-020-10073-7 UR - https://m2.mtmt.hu/api/publication/33634017 ID - 33634017 N1 - Cited By :51 Export Date: 10 February 2023 CODEN: MTAPF Correspondence Address: Malik, M.; Punjab University College of Information Technology (PUCIT)Pakistan; email: mishaimmalik30@gmail.com LA - English DB - MTMT ER - TY - GEN AU - Puneet, Bawa AU - Virender, Kadyan AU - Vaibhav, Kumar AU - Ghanshyam, Raghuwanshi TI - Spectral-warping based noise-robust enhanced children ASR system PY - 2021 DO - 10.21203/rs.3.rs-976955/v1 UR - https://m2.mtmt.hu/api/publication/33634055 ID - 33634055 LA - English DB - MTMT ER - TY - JOUR AU - Raj, P.P. TI - Real-time pre-processing for improved feature extraction of noisy speech JF - INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY J2 - INT J SPEECH TECH VL - 24 PY - 2021 IS - 3 SP - 715 EP - 728 PG - 14 SN - 1381-2416 DO - 10.1007/s10772-021-09835-x UR - https://m2.mtmt.hu/api/publication/33634016 ID - 33634016 N1 - Export Date: 10 February 2023 CODEN: ISTEF Correspondence Address: Raj, P.P.; Department of Electronics & Communication Engineering, India; email: ppraj.nit@gmail.com LA - English DB - MTMT ER - TY - JOUR AU - Mohsen, Sadeghi AU - Hossein, Marvi AU - Ali, Reza Ahmadyfard TI - A New and Efficient Feature Extraction Method for Robust Speech Recognition Based on Fractional Fourier Transform and Differential Evolution Optimizer JF - JOURNAL OF MODELING IN ENGINEERING J2 - J MODEL ENG VL - 18 PY - 2020 IS - 64 SP - 85 EP - 96 PG - 12 SN - 2008-4854 DO - 10.22075/JME.2020.19267.1821 UR - https://m2.mtmt.hu/api/publication/33634105 ID - 33634105 LA - Persian DB - MTMT ER - TY - JOUR AU - Ouisaadane, A. AU - Safi, S. AU - Frikel, M. TI - Arabic digits speech recognition and speaker identification in noisy environment using a hybrid model of VQ and GMM JF - TELKOMNIKA J2 - TELKOMNIKA VL - 18 PY - 2020 IS - 4 SP - 2193 EP - 2204 PG - 12 SN - 1693-6930 DO - 10.12928/TELKOMNIKA.V18I4.14215 UR - https://m2.mtmt.hu/api/publication/33634019 ID - 33634019 N1 - Department of Mathematics and Computer Science, Polydisciplinary Faculty, Sultan Moulay Slimane University, Morocco ENSICAEN School, LAC Laboratory Caen-Normandie University, France Cited By :2 Export Date: 10 February 2023 Correspondence Address: Ouisaadane, A.; Department of Mathematics and Computer Science, Morocco; email: Abdelkbir.wiss@gmail.com LA - English DB - MTMT ER - TY - THES AU - Christian, Dayan Arcos Gordillo TI - Realce e Reconhecimento de Voz Contínua em Ambientes Adversos PY - 2018 SP - 179 UR - https://m2.mtmt.hu/api/publication/33634181 ID - 33634181 LA - Portuguese DB - MTMT ER - TY - THES AU - Nan, Phyu Phyu Hsan TI - A STUDY ON ISOLATED-WORD MYANMAR SPEECH RECOGNITION VIA ARTIFICIAL NEURAL NETWORKS PY - 2018 SP - 77 UR - https://m2.mtmt.hu/api/publication/33634195 ID - 33634195 LA - English DB - MTMT ER - TY - JOUR AU - Chenchah, Farah AU - Lachiri, Zied TI - A bio-inspired emotion recognition system under real-life conditions JF - APPLIED ACOUSTICS J2 - APPL ACOUST VL - 115 PY - 2017 SP - 6 EP - 14 PG - 9 SN - 0003-682X DO - 10.1016/j.apacoust.2016.06.020 UR - https://m2.mtmt.hu/api/publication/27393749 ID - 27393749 N1 - Cited By :10 Export Date: 10 February 2023 CODEN: AACOB Correspondence Address: Chenchah, F.; LR-SITI Laboratory, Tunisia; email: farahchenchah@yahoo.fr LA - English DB - MTMT ER - TY - JOUR AU - Pardede, Hilman Ferdinandus TI - Teknik Normalisasi Fitur Secara Adaptif untuk Sistem Pengenalan Ucapan Tahan Terhadap Gema JF - Inkom. Jurnal Informatika, Sistem Kendali, dan Komputer J2 - Inkom. Jurnal Informatika, Sistem Kendali, dan Komputer VL - 10 PY - 2017 IS - 2 SP - 47 EP - 56 SN - 2302-6146 UR - https://m2.mtmt.hu/api/publication/27393752 ID - 27393752 LA - English DB - MTMT ER - TY - CHAP AU - Kaur, Arshpreet AU - Singh, Amitoj AU - Kadyan, Virender TI - Correlative consideration concerning feature extraction techniques for speech recognition—A review PB - IEEE SN - 150901277X PB - IEEE PY - 2016 SP - 1 EP - 4 PG - 4 UR - https://m2.mtmt.hu/api/publication/27393751 ID - 27393751 LA - English DB - MTMT ER - TY - JOUR AU - Kim, Chanwoo AU - Stern, Richard M. TI - Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition JF - IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING J2 - IEEE-ACM T AUDIO SPE VL - 24 PY - 2016 IS - 7 SP - 1315 EP - 1329 PG - 15 SN - 2329-9290 DO - 10.1109/TASLP.2016.2545928 UR - https://m2.mtmt.hu/api/publication/31993097 ID - 31993097 N1 - Cited By :184 Export Date: 10 February 2023 LA - English DB - MTMT ER - TY - JOUR AU - Eringis, Deividas AU - Tamulevicius, Gintautas TI - Modified Filterbank Analysis Features for Speech Recognition JF - BALTIC JOURNAL OF MODERN COMPUTING J2 - BJMC VL - 3 PY - 2015 IS - 1 SP - 29 SN - 2255-8942 UR - https://m2.mtmt.hu/api/publication/27393748 ID - 27393748 LA - English DB - MTMT ER - TY - THES AU - MAXIMILIANO, EPIFANIO ASÍS LÓPEZ TI - CREACIÓN DE SOFTWARE DE RECONOCIMIENTO DE VOZ PARA ESCRITURA DE EXPRESIONES ALGEBRAICAS Y SU NIVEL DE EFICIENCIA CON EL ESTÁNDAR WER-2015 PY - 2015 SP - 200 UR - https://m2.mtmt.hu/api/publication/33634211 ID - 33634211 LA - Spanish DB - MTMT ER - TY - JOUR AU - Cutajar, Michelle AU - Gatt, Edward AU - Grech, Ivan AU - Casha, Owen AU - Micallef, Joseph TI - Comparative study of automatic speech recognition techniques JF - IET SIGNAL PROCESSING J2 - IET SIGNAL PROCESS VL - 7 PY - 2013 IS - 1 SP - 25 EP - 46 PG - 22 SN - 1751-9675 DO - 10.1049/iet-spr.2012.0151 UR - https://m2.mtmt.hu/api/publication/27393744 ID - 27393744 N1 - Cited By :72 Export Date: 10 February 2023 LA - English DB - MTMT ER - TY - CHAP AU - Anand, Anu V AU - Devi, P Shobana AU - Stephen, Jose AU - Bhadran, VK TI - Malayalam Speech Recognition system and its application for visually impaired people PB - IEEE SN - 1467322725 PB - IEEE PY - 2012 SP - 619 EP - 624 PG - 6 UR - https://m2.mtmt.hu/api/publication/27393747 ID - 27393747 LA - English DB - MTMT ER - TY - THES AU - Andersstuen, Runar AU - Marcussen, Christoffer Jun TI - TaleTUC: Automatic Speech Recognition for a Bus Route Information System PY - 2012 UR - https://m2.mtmt.hu/api/publication/27393750 ID - 27393750 N1 - PBInstituttfordatateknikkoginformasjonsvitenskap LA - English DB - MTMT ER - TY - CHAP AU - Kim, Chanwoo AU - Stern, Richard M TI - Power-normalized cepstral coefficients (PNCC) for robust speech recognition T2 - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 PB - IEEE CY - Piscataway (NJ) SN - 9781467300469 PB - IEEE PY - 2012 SP - 4101 EP - 4104 PG - 4 DO - 10.1109/ICASSP.2012.6288820 UR - https://m2.mtmt.hu/api/publication/27393743 ID - 27393743 LA - English DB - MTMT ER - TY - CHAP AU - Sárosi, G. AU - Mozsolics, T. AU - Tarján, B. AU - Balog, A. AU - Mihajlik, P. AU - Fegyó, T. ED - Anna, Esposito ED - Alessandro, Vinciarelli ED - Vicsi, Klára ED - Cathrine, Pelachaud ED - Anton, Nijolt TI - Recognition of multiple language voice navigation queries in traffic situations T2 - Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues VL - 6800 LNCS PB - Springer Netherlands CY - Berlin CY - Heidelberg SN - 9783642257759 T3 - Lecture Notes in Computer Science, ISSN 0302-9743 ; 6800. PY - 2011 SP - 199 EP - 213 PG - 15 DO - 10.1007/978-3-642-25775-9_20 UR - https://m2.mtmt.hu/api/publication/33634022 ID - 33634022 N1 - Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, Hungary THINKTech Research Center Nonprofit LLC, Hungary Aitia International Inc., Hungary Cited By :2 Export Date: 10 February 2023 Correspondence Address: Sárosi, G.; Department of Telecommunications and Media Informatics, Hungary; email: sarosi@tmit.bme.hu LA - English DB - MTMT ER -