Mehrez et al., 2018 - Google Patents

Understanding the performances of SMVP on multiprocessor platform

Document ID
2337960006929931258
Author
Mehrez I
Hamdi-Larbi O
Dufaud T
Emad N
Publication year
2018
Publication venue
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA)

Snippet

Abstract: Sparse Matrix Vector Product (SMVP) is an important kernel in many scientific applications. In this paper we study the performance of this kernel on a multiprocessor platform using four different compression formats (CSR, CSC, ELL and COO). Our aim is to …
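
For readers unfamiliar with the kernel named in the abstract, the following is a minimal sequential sketch of SMVP for two of the four formats mentioned (CSR and COO). It is not the authors' implementation and does not reproduce their multiprocessor setup; the function names, array names, and the 3x3 example matrix are assumptions chosen purely for illustration.

```python
# Illustrative sketch only: function names, array names, and the example
# matrix are assumptions, not taken from the paper.

def smvp_csr(values, col_idx, row_ptr, x):
    """Compute y = A * x with A stored in CSR (compressed sparse row)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        # Nonzeros of row i occupy positions row_ptr[i] .. row_ptr[i+1]-1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y


def smvp_coo(rows, cols, values, x, n_rows):
    """Compute y = A * x with A stored in COO (coordinate list)."""
    y = [0.0] * n_rows
    # Each nonzero carries its own (row, column) pair.
    for i, j, v in zip(rows, cols, values):
        y[i] += v * x[j]
    return y


# Example matrix (hypothetical):
# A = [[4, 0, 1],
#      [0, 3, 0],
#      [2, 0, 5]]
csr_values = [4.0, 1.0, 3.0, 2.0, 5.0]
csr_col_idx = [0, 2, 1, 0, 2]
csr_row_ptr = [0, 2, 3, 5]

coo_rows = [0, 0, 1, 2, 2]
coo_cols = [0, 2, 1, 0, 2]
coo_values = [4.0, 1.0, 3.0, 2.0, 5.0]

x = [1.0, 2.0, 3.0]

y_csr = smvp_csr(csr_values, csr_col_idx, csr_row_ptr, x)
y_coo = smvp_coo(coo_rows, coo_cols, coo_values, x, n_rows=3)
```

Both calls compute the same product; the formats differ only in how the nonzeros are indexed, which is the kind of difference that can affect memory access patterns and load balance on a parallel platform.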

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/16 Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286 Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386 Retrieval requests
    • G06F17/30424 Query processing
    • G06F17/30442 Query optimisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/14 Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • G06F17/141 Discrete Fourier transforms
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for programme control, e.g. control unit
    • G06F9/06 Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/11 Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • G06F17/12 Simultaneous equations, e.g. systems of linear equations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286 Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30312 Storage and indexing structures; Management thereof
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50 Computer-aided design
    • G06F17/5068 Physical circuit design, e.g. layout for integrated circuits or printed circuit boards
    • G06F17/5072 Floorplanning, e.g. partitioning, placement
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3074 Audio data retrieval
    • G06F17/30778 Audio database index structures and management thereof
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F12/00 Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02 Addressing or allocation; Relocation
    • G06F12/0207 Addressing or allocation; Relocation with multidimensional access, e.g. row/column, matrix
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F7/00 Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38 Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F2207/00 Indexing scheme relating to methods or arrangements for processing data by operating upon the order or content of the data handled
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/40 Transformations of program code
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00 Digital computers in general; Data processing equipment in general
    • G06F15/76 Architectures of general purpose stored programme computers

Similar Documents

Publication Publication Date Title
US10296556B2 (en) System and method for efficient sparse matrix processing
CA3090329C (en) Neural network accelerator
US20190266217A1 (en) Apparatus and method for matrix computation
Anderson et al. Communication-avoiding QR decomposition for GPUs
Ashari et al. An efficient two-dimensional blocking strategy for sparse matrix-vector multiplication on GPUs
KR102065672B1 (en) Apparatus and method for convolution operation
Zhang et al. Algorithm-hardware co-design of attention mechanism on FPGA devices
KR20190049593A (en) Method and apparatus for performing operations in convolutional neural network
US20190095790A1 (en) Method and apparatus for adapting parameters of neural network
US20230281271A1 (en) Distributing matrix multiplication processing among processing nodes
Sun et al. Optimizing SpMV for diagonal sparse matrices on GPU
EP3295300B1 (en) System and method for determining concurrency factors for dispatch size of parallel processor kernels
Lee et al. Flexible group-level pruning of deep neural networks for on-device machine learning
Zhang et al. Regularizing irregularity: bitmap-based and portable sparse matrix multiplication for graph data on GPUs
KR20240149907A (en) Adaptive tensor computation kernel for sparse neural networks
Abubaker et al. Spatiotemporal graph and hypergraph partitioning models for sparse matrix-vector multiplication on many-core architectures
Mehrez et al. Understanding the performances of SMVP on multiprocessor platform
Jiang et al. Characterizing and optimizing transformer inference on arm many-core processor
Mondal et al. A unified engine for accelerating GNN weighting/aggregation operations, with efficient load balancing and graph-specific caching
Arrigoni et al. Efficiently parallelizable Strassen-based multiplication of a matrix by its transpose
US11989257B2 (en) Assigning processing threads for matrix-matrix multiplication
Sun et al. CRSD: application specific auto-tuning of SpMV for diagonal sparse matrices
Mehrez et al. Machine learning for optimal compression format prediction on multiprocessor platform
Mehrez et al. Understanding the performances of sparse compression formats using data parallel programming model
Ibrahim et al. Improvement of data throughput in data-intensive cloud computing applications