
Overview
- Nominated as an outstanding thesis
- by Technische Universität München, Germany
- Describes the details and
- architecture of openSMILE - the number 1 open-source toolkit in speech emotion
- analytics and computational paralinguistics
- Reports on extensive automatic classification results for over ten public speech and music databases
- Includes supplementary material: sn.pub/extras
Part of the book series: Springer Theses (Springer Theses)
Access this book
Tax calculation will be finalised at checkout
Other ways to access
About this book
This book reports on an outstanding thesis that has significantly advanced the state-of-the-art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source, audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions and reports on the evaluation of the framework developed and the acoustic parameter sets that were selected. It is not only intended as a manual for openSMILE users, but also and primarily as a guide and source of inspiration for students and scientists involved in the design of speech and music analysis methods that can robustly handle real-life conditions.
Similar content being viewed by others
Keywords
Table of contents (7 chapters)
Authors and Affiliations
Bibliographic Information
Book Title: Real-time Speech and Music Classification by Large Audio Feature Space Extraction
Authors: Florian Eyben
Series Title: Springer Theses
DOI: https://doi.org/10.1007/978-3-319-27299-3
Publisher: Springer Cham
eBook Packages: Engineering, Engineering (R0)
Copyright Information: Springer International Publishing Switzerland 2016
Hardcover ISBN: 978-3-319-27298-6Published: 06 January 2016
Softcover ISBN: 978-3-319-80111-7Published: 30 March 2018
eBook ISBN: 978-3-319-27299-3Published: 24 December 2015
Series ISSN: 2190-5053
Series E-ISSN: 2190-5061
Edition Number: 1
Number of Pages: XXXVIII, 298
Number of Illustrations: 2 b/w illustrations, 39 illustrations in colour
Topics: Signal, Image and Speech Processing, User Interfaces and Human Computer Interaction, Engineering Acoustics, Computational Linguistics