
Perez-Castanos et al., 2019

CNN depth analysis with different channel inputs for acoustic scene classification

Document ID
15910129181848170542
Author
Perez-Castanos S
Naranjo-Alcazar J
Zuccarello P
Cobos M
Ferri F
Publication year
2019
Publication venue
arXiv preprint arXiv:1906.04591

Snippet

Acoustic scene classification (ASC) has been approached in recent years using deep learning techniques such as convolutional neural networks or recurrent neural networks. Many state-of-the-art solutions are based on image classification frameworks and, as such, a …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06K9/6256 Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46 Extraction of features or characteristics of the image
    • G06K9/4671 Extracting features based on salient regional features, e.g. Scale Invariant Feature Transform [SIFT] keypoints
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6267 Classification techniques
    • G06K9/6268 Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624 Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content

Similar Documents

Publication    Title
Valenti et al. A convolutional neural network approach for acoustic scene classification
Han et al. Convolutional Neural Networks with Binaural Representations and Background Subtraction for Acoustic Scene Classification
Isa et al. Optimizing the hyperparameter tuning of YOLOv5 for underwater detection
Tom et al. End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention
Perez-Castanos et al. CNN depth analysis with different channel inputs for acoustic scene classification
Li et al. Sound event detection via dilated convolutional recurrent neural networks
Phan et al. Robust audio event recognition with 1-max pooling convolutional neural networks
Ford et al. A Deep Residual Network for Large-Scale Acoustic Scene Analysis
CN110600054B (en) Sound scene classification method based on network model fusion
CN112528920A (en) Pet image emotion recognition method based on depth residual error network
Parekh et al. Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision
Wang et al. Hybrid constant-Q transform based CNN ensemble for acoustic scene classification
Shoba et al. Image processing techniques for segments grouping in monaural speech separation
Wang et al. Audio event detection and classification using extended R-FCN approach
Wang et al. Acoustic scene classification with spectrogram processing strategies
Wang et al. A novel underground pipeline surveillance system based on hybrid acoustic features
Naranjo-Alcazar et al. On the performance of residual block design alternatives in convolutional neural networks for end-to-end audio classification
Nguyen et al. Acoustic scene classification with mismatched recording devices using mixture of experts layer
Liu et al. The system for acoustic scene classification using resnet
Aryal et al. Frequency-based CNN and attention module for acoustic scene classification
Chatterjee et al. Learning audio-visual dynamics using scene graphs for audio source separation
Tang et al. Differential treatment for time and frequency dimensions in mel-spectrograms: An efficient 3D Spectrogram network for underwater acoustic target classification
Seresht et al. Environmental sound classification with low-complexity convolutional neural network empowered by sparse salient region pooling
Kalinli et al. Saliency-driven unstructured acoustic scene classification using latent perceptual indexing
Zhou et al. An investigation of transfer learning mechanism for acoustic scene classification