mtmt
Magyar Tudományos Művek Tára
XML
JSON
Átlépés a keresőbe
In English
25th Interspeech Conference (Interspeech 2024)
Lapidot, Itshak [szerk.]
;
Gannot, Sharon [szerk.]
Angol nyelvű Konferenciakötet (Könyv) Tudományos
Megjelent: International Speech Communication Association (ISCA), Dublin, Írország
2024
Konferencia:
25th Annual Conference of the International Speech Communication Association, INTERSPEECH 2024 2024-09-01 [Kos, Görögország]
Sorozatok:
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2308-457X 1990-9772
Azonosítók
MTMT: 35090316
DOI:
10.21437/Interspeech.2024
Fejezetek
Tanner J. et al. Exploring the Anatomy of Articulation Rate in Spontaneous English Speech: Relationships between Utterance Length Effects and Social Factors. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024)
Roesler Oliver et al. Towards Scalable Remote Assessment of Mild Cognitive Impairment Via Multimodal Dialog. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 1-5
Xue Wei et al. Towards a better understanding of receptive multilingualism: listening conditions and priming effects. (2024) Megjelent: INTERSPEECH 2024 pp. 12-16
Arya Arunav et al. Exploiting Wavelet Scattering Transform for an Unsupervised Speaker Diarization in Deep Neural Network Framework. (2024) Megjelent: Interspeech 2024 pp. 47-51
Kumar Sahil et al. Vision Transformer Segmentation for Visual Bird Sound Denoising. (2024) Megjelent: INTERSPEECH 2024 pp. 122-126
Yang Yudong et al. Optical Flow Guided Tongue Trajectory Generation for Diffusion-based Acoustic to Articulatory Inversion. (2024) Megjelent: INTERSPEECH 2024 pp. 417-421
Jain Rishi et al. Multimodal Segmentation for Vocal Tract Modeling. (2024) Megjelent: INTERSPEECH 2024 pp. 422-426
Pistol Tillmann et al. Echoes of Implicit Bias Exploring Aesthetics and Social Meanings of Swiss German Dialect Features. (2024) Megjelent: INTERSPEECH 2024 pp. 447-451
Lee Seonwoo et al. Automatic Assessment of Speech Production Skills for Children with Cochlear Implants Using Wav2Vec2.0 Acoustic Embeddings. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 862-866
Fagniart Sophie et al. Production of fricative consonants in French-speaking children with cochlear implants and typical hearing: acoustic and phonological analyses. (2024) Megjelent: INTERSPEECH 2024 pp. 877-881
Xiong Yan et al. Improving Speech-Based Dysarthria Detection using Multi-task Learning with Gradient Projection. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 902-906
Arias-Vergara Tomas et al. Contrastive Learning Approach for Assessment of Phonological Precision in Patients with Tongue Cancer Using MRI Data. (2024) Megjelent: INTERSPEECH 2024 pp. 927-931
Gosztolya Gábor et al. Automatic Longitudinal Investigation of Multiple Sclerosis Subjects. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 942-946
Gosztolya Gábor et al. Combining Acoustic Feature Sets for Detecting Mild Cognitive Impairment in the Interspeech'24 TAUKADIAL Challenge. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 957-961
Duan Junwen et al. Pre-trained Feature Fusion and Matching for Mild Cognitive Impairment Detection. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 962-966
Favaro Anna et al. Leveraging Universal Speech Representations for Detecting and Assessing the Severity of Mild Cognitive Impairment Across Languages. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 972-976
Perez-Toro Paulailluirea et al. Multilingual Speech and Language Analysis for the Assessment of Mild Cognitive Impairment: Outcomes from the Taukadial Challenge. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 982-986
Niebuhr Oliver et al. How rhythm metrics are linked to produced and perceived speaker charisma. (2024) Megjelent: INTERSPEECH 2024 pp. 1065-1069
Han Kunmei. Modelling Lexical Characteristics of the Healthy Aging Population: A Corpus-Based Study. (2024) Megjelent: INTERSPEECH 2024 pp. 1090-1094
Gerczuki Maurice et al. Exploring Gender-Specific Speech Patterns in Automatic Suicide Risk Assessment. (2024) Megjelent: INTERSPEECH 2024 pp. 1095-1099
Mihajlik Peter et al. On Disfluency and Non-lexical Sound Labeling for End-to-end Automatic Speech Recognition. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 1270-1274
Triantafyllopoulos Andreas et al. Sustained Vowels for Pre- vs Post-Treatment COPD Classification. (2024) Megjelent: INTERSPEECH 2024 pp. 1410-1414
Ahn Emily P. et al. The Use of Phone Categories and Cross-Language Modeling for Phone Alignment of Panara. (2024) Megjelent: INTERSPEECH 2024 pp. 1505-1509
Rousso Rotem et al. Tradition or Innovation: A Comparison of Modern ASR Methods for Forced Alignment. (2024) Megjelent: INTERSPEECH 2024 pp. 1525-1529
Weise Tobias et al. Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 1545-1549
Oura Anna et al. Preprocessing for acoustic-to-articulatory inversion using real-time MRI movies of Japanese speech. (2024) Megjelent: INTERSPEECH 2024 pp. 1550-1554
Weirich Melanie et al. Gender and age based f
0
-variation in the German Plapper Corpus. (2024) Megjelent: INTERSPEECH 2024 pp. 1565-1569
Makishima Naoki et al. SOMSRED: Sequential Output Modeling for Joint Multi-talker Overlapped Speech Recognition and Speaker Diarization. (2024) Megjelent: INTERSPEECH 2024 pp. 1660-1664
Botelho Catarina et al. Macro-descriptors for Alzheimer's disease detection using large language models. (2024) Megjelent: INTERSPEECH 2024 pp. 1975-1979
Braun Franziska et al. Infusing Acoustic Pause Context into Text-Based Dementia Assessment. (2024) Megjelent: INTERSPEECH 2024 pp. 1980-1984
Spiesberger Anika A. et al. "So...my child..." - How Child ADHD Influences theWay Parents Talk. (2024) Megjelent: INTERSPEECH 2024 pp. 2010-2014
Schade Leonie et al. Understanding ünderstanding": Presenting a richly annotated multimodal corpus of dyadic interaction. (2024) Megjelent: INTERSPEECH 2024 pp. 2040-2041
Kealey Jacob et al. Unsupervised Improved MVDR Beamforming for Sound Enhancement. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 2175-2179
Bentum Martijn et al. The Processing of Stress in End-to-End Automatic Speech Recognition Models. (2024) Megjelent: INTERSPEECH 2024 pp. 2350-2354
Kalabakov Stefan et al. A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 2455-2459
Shah Neil et al. Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models. (2024) Megjelent: INTERSPEECH 2024 pp. 2470-2474
Zheng Xiuwen et al. Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 2485-2489
Gosztolya Gábor et al. Wav2vec 2.0 Embeddings Are No Swiss Army Knife -- A Case Study for Multiple Sclerosis. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 2499-2503
Getman Yaroslav et al. Exploring adaptation techniques of large speech foundation models for low-resource ASR: a case study on Northern Sámi. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 2539-2543
Giroud Jeremy et al. Behavioral evidence for higher speech rate convergence following natural than artificial time altered speech. (2024) Megjelent: INTERSPEECH 2024 pp. 2610-2614
Fathan Abderrahim et al. On the impact of several regularization techniques on label noise robustness of self-supervised speaker verification systems. (2024) Megjelent: INTERSPEECH 2024 pp. 2670-2674
Chen Shihao et al. LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance. (2024) Megjelent: INTERSPEECH 2024 pp. 2770-2774
Kurihara Kiyoshi et al. Enhancing Japanese Text-to-Speech Accuracy with a Novel Combination Transformer-BERT-based G2P: Integrating Pronunciation Dictionaries and Accent Sandhi. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 2790-2794
Flynn Robert et al. Self-Train Before You Transcribe. (2024) Megjelent: INTERSPEECH 2024 pp. 2840-2844
Lee Jae-Hong et al. Online Subloop Search via Uncertainty Quantization for Efficient Test-Time Adaptation. (2024) Megjelent: INTERSPEECH 2024 pp. 2880-2884
Talkar Tanya et al. Detection of Cognitive Impairment And Alzheimer's Disease Using a Speech- and Language-Based Protocol. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 3025-3029
Woszczyk Dominika et al. Prosody-Driven Privacy-Preserving Dementia Detection. (2024) Megjelent: INTERSPEECH 2024 pp. 3035-3039
Hu Yiying et al. Key Acoustic Cues for the Realization of Metrical Prominence in Tone Languages: A Cross-Dialect Study. (2024) Megjelent: INTERSPEECH 2024 pp. 3130-3134
Suhas B. N. et al. Speaking of Health: Leveraging Large Language Models to assess Exercise Motivation and Behavior of Rehabilitation Patients. (2024) Megjelent: INTERSPEECH 2024 pp. 3155-3159
Chen Yafeng et al. ERes2NetV2: Boosting Short-Duration Speaker Verification Performance with Computational Efficiency. (2024) Megjelent: INTERSPEECH 2024 pp. 3245-3249
Elie Benjamin et al. A data-driven model of acoustic speech intelligibility for optimization-based models of speech production. (2024) Megjelent: INTERSPEECH 2024 pp. 3610-3614
Tulchynska Kira et al. Prosodic marking of syntactic boundaries in Khoekhoe. (2024) Megjelent: INTERSPEECH 2024 pp. 3684-3688
Triantafyllopoulos Andreas et al. Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition. (2024) Megjelent: INTERSPEECH 2024 pp. 3729-3733
O'Mahony Johannah et al. "Well", what can you do with messy data? Exploring the prosody and pragmatic function of the discourse marker "well" with found data and speech synthesis. (2024) Megjelent: INTERSPEECH 2024 pp. 4084-4088
Chen Peikun et al. Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study. (2024) Megjelent: INTERSPEECH 2024 pp. 4468-4472
Ling Tongtao et al. A Small and Fast BERT for Chinese Medical Punctuation Restoration. (2024) Megjelent: 25th Interspeech Conference (Interspeech 2024) pp. 4533-4537
Lin Yi-Cheng et al. Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition. (2024) Megjelent: INTERSPEECH 2024 pp. 4633-4637
Selvakumar Anith et al. Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification. (2024) Megjelent: INTERSPEECH 2024 pp. 4728-4732
Zhong Jiafeng et al. Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism. (2024) Megjelent: INTERSPEECH 2024 pp. 4838-4842
Li Jialu et al. Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis. (2024) Megjelent: INTERSPEECH 2024 pp. 5163-5167
Hivatkozás stílusok:
IEEE
ACM
APA
Chicago
Harvard
CSL
Másolás
Nyomtatás
2026-01-20 19:06
×
Lista exportálása irodalomjegyzékként
Hivatkozás stílusok:
IEEE
ACM
APA
Chicago
Harvard
Nyomtatás
Másolás