Speech and Computer, Kartoniert / Broschiert
Speech and Computer
- 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part II
(soweit verfügbar beim Lieferanten)
- Herausgeber:
- Alexey Karpov, Gábor Gosztolya
- Verlag:
- Springer, 10/2025
- Einband:
- Kartoniert / Broschiert
- Sprache:
- Englisch
- ISBN-13:
- 9783032079589
- Artikelnummer:
- 12441851
- Umfang:
- 372 Seiten
- Gewicht:
- 563 g
- Maße:
- 235 x 155 mm
- Stärke:
- 21 mm
- Erscheinungstermin:
- 13.10.2025
- Hinweis
-
Achtung: Artikel ist nicht in deutscher Sprache!
Weitere Ausgaben von Speech and Computer |
Preis |
|---|---|
| Buch, Kartoniert / Broschiert, Englisch | EUR 116,09* |
| Buch, Kartoniert / Broschiert, Paperback, Englisch | EUR 72,27* |
| Buch, Kartoniert / Broschiert, Paperback, Englisch | EUR 72,27* |
Klappentext
.- Automatic Speech Recognition. .- In-Domain SSL Pre-Training and Streaming ASR: Application to Air Traffic Control Communications. .- Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise. .- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features. .- Enhancing Speech Recognition through Text-to-Speech and Voice Conversion Augmentation. .- Best Data is more Supervised Data - Even for Hungarian ASR. .- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-based Models. .- Speech Processing for Under-Resourced Languages. .- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec using Features from Speech Self-Supervised Learning Models. .- Modeling Intra-Word Code-Switching for Karelian ASR. .- Improving Whisper-based Serbian ASR using Synthetic Speech. .- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR. .- Whistler Identification in Whistled Spanish (Silbo): A Case Study. .- Digital Speech Processing. .- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion based on the Pink Trombone. .- CrossMP-SENet: Transformer-based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. .- Adaptive Singing Voice Enhancement for Live Stages. .- Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus. .- Natural Language Processing. .- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification. .- Enhancing Retrieval Performance via LLM Hard-Negative Filtering. .- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models. .- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian. .- Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram. .- Multimodal Systems. .- Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings. .- Phonetic and Visual Characteristics of Cognitive Load. .- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study. .- Saudi Sign Language Translation Using T5.