Perez-Castanos et al., 2019 - Google Patents
Cnn depth analysis with different channel inputs for acoustic scene classificationPerez-Castanos et al., 2019
View PDF- Document ID
- 15910129181848170542
- Author
- Perez-Castanos S
- Naranjo-Alcazar J
- Zuccarello P
- Cobos M
- Ferri F
- Publication year
- Publication venue
- arXiv preprint arXiv:1906.04591
External Links
Snippet
Acoustic scene classification (ASC) has been approached in the last years using deep learning techniques such as convolutional neural networks or recurrent neural networks. Many state-of-the-art solutions are based on image classification frameworks and, as such, a …
- 238000004458 analytical method 0 title abstract description 6
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6256—Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4671—Extracting features based on salient regional features, e.g. Scale Invariant Feature Transform [SIFT] keypoints
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Valenti et al. | A convolutional neural network approach for acoustic scene classification | |
Han et al. | Convolutional Neural Networks with Binaural Representations and Background Subtraction for Acoustic Scene Classification. | |
Isa et al. | Optimizing the hyperparameter tuning of YOLOv5 for underwater detection | |
Tom et al. | End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention. | |
Perez-Castanos et al. | Cnn depth analysis with different channel inputs for acoustic scene classification | |
Li et al. | Sound event detection via dilated convolutional recurrent neural networks | |
Phan et al. | Robust audio event recognition with 1-max pooling convolutional neural networks | |
Ford et al. | A Deep Residual Network for Large-Scale Acoustic Scene Analysis. | |
CN110600054B (en) | Sound scene classification method based on network model fusion | |
CN112528920A (en) | Pet image emotion recognition method based on depth residual error network | |
Parekh et al. | Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision | |
Wang et al. | Hybrid constant-Q transform based CNN ensemble for acoustic scene classification | |
Shoba et al. | Image processing techniques for segments grouping in monaural speech separation | |
Wang et al. | Audio event detection and classification using extended R-FCN approach | |
Wang et al. | Acoustic scene classification with spectrogram processing strategies | |
Wang et al. | A novel underground pipeline surveillance system based on hybrid acoustic features | |
Naranjo-Alcazar et al. | On the performance of residual block design alternatives in convolutional neural networks for end-to-end audio classification | |
Nguyen et al. | Acoustic scene classification with mismatched recording devices using mixture of experts layer | |
Liu et al. | The system for acoustic scene classification using resnet | |
Aryal et al. | Frequency-based CNN and attention module for acoustic scene classification | |
Chatterjee et al. | Learning audio-visual dynamics using scene graphs for audio source separation | |
Tang et al. | Differential treatment for time and frequency dimensions in mel-spectrograms: An efficient 3D Spectrogram network for underwater acoustic target classification | |
Seresht et al. | Environmental sound classification with low-complexity convolutional neural network empowered by sparse salient region pooling | |
Kalinli et al. | Saliency-driven unstructured acoustic scene classification using latent perceptual indexing | |
Zhou et al. | An investigation of transfer learning mechanism for acoustic scene classification |