Perez-Castanos et al., 2019 - Google Patents

Cnn depth analysis with different channel inputs for acoustic scene classification

Perez-Castanos et al., 2019

Document ID: 15910129181848170542
Author: Perez-Castanos S; Naranjo-Alcazar J; Zuccarello P; Cobos M; Ferri F
Publication year: 2019
Publication venue: arXiv preprint arXiv:1906.04591

External Links

Cited by

Snippet

Acoustic scene classification (ASC) has been approached in the last years using deep learning techniques such as convolutional neural networks or recurrent neural networks. Many state-of-the-art solutions are based on image classification frameworks and, as such, a …

Continue reading at arxiv.org (PDF) (other versions)

238000004458 analytical method 0 title abstract description 6

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6256—Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4671—Extracting features based on salient regional features, e.g. Scale Invariant Feature Transform [SIFT] keypoints
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content

Similar Documents

Publication	Publication Date	Title
Valenti et al.	2017	A convolutional neural network approach for acoustic scene classification
Han et al.	2017	Convolutional Neural Networks with Binaural Representations and Background Subtraction for Acoustic Scene Classification.
Isa et al.	2022	Optimizing the hyperparameter tuning of YOLOv5 for underwater detection
Tom et al.	2018	End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention.
Perez-Castanos et al.	2019	Cnn depth analysis with different channel inputs for acoustic scene classification
Li et al.	2020	Sound event detection via dilated convolutional recurrent neural networks
Phan et al.	2016	Robust audio event recognition with 1-max pooling convolutional neural networks
Ford et al.	2019	A Deep Residual Network for Large-Scale Acoustic Scene Analysis.
CN110600054B (en)	2021-09-21	Sound scene classification method based on network model fusion
CN112528920A (en)	2021-03-19	Pet image emotion recognition method based on depth residual error network
Parekh et al.	2019	Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision
Wang et al.	2019	Hybrid constant-Q transform based CNN ensemble for acoustic scene classification
Shoba et al.	2018	Image processing techniques for segments grouping in monaural speech separation
Wang et al.	2017	Audio event detection and classification using extended R-FCN approach
Wang et al.	2020	Acoustic scene classification with spectrogram processing strategies
Wang et al.	2020	A novel underground pipeline surveillance system based on hybrid acoustic features
Naranjo-Alcazar et al.	2019	On the performance of residual block design alternatives in convolutional neural networks for end-to-end audio classification
Nguyen et al.	2019	Acoustic scene classification with mismatched recording devices using mixture of experts layer
Liu et al.	2019	The system for acoustic scene classification using resnet
Aryal et al.	2023	Frequency-based CNN and attention module for acoustic scene classification
Chatterjee et al.	2022	Learning audio-visual dynamics using scene graphs for audio source separation
Tang et al.	2023	Differential treatment for time and frequency dimensions in mel-spectrograms: An efficient 3D Spectrogram network for underwater acoustic target classification
Seresht et al.	2022	Environmental sound classification with low-complexity convolutional neural network empowered by sparse salient region pooling
Kalinli et al.	2009	Saliency-driven unstructured acoustic scene classification using latent perceptual indexing
Zhou et al.	2018	An investigation of transfer learning mechanism for acoustic scene classification