Riesinger et al., 2018 - Google Patents

Non-standard pseudo random number generators revisited for GPUs

Document ID: 1454814394956253504
Author: Riesinger C; Neckel T; Rupp F
Publication year: 2018
Publication venue: Future Generation Computer Systems

Snippet

Pseudo random number generators are used intensively in many computational applications, e.g., the treatment of uncertainty quantification problems. For this reason, the right selection of such generators and their optimization for various hardware architectures is …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/30—Arrangements for executing machine-instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units
- G06F9/3887—Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units controlled by a single instruction, e.g. SIMD
- G06F9/32—Address formation of the next instruction, e.g. incrementing the instruction counter, jump
- G06F9/34—Addressing or accessing the instruction operand or the result; Formation of operand address; Addressing modes
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/44—Encoding
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/58—Random or pseudo-random number generators
- G06F7/60—Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers
- G06F7/72—Methods or arrangements for performing computations using a digital non-denominational number representation, i.e. number representation without radix; Computing devices using combinations of denominational and non-denominational quantity representations, e.g. using difunction pulse trains, STEELE computers, phase computers using residue arithmetic
- G06F7/724—Finite field arithmetic
- G06F7/726—Inversion; Reciprocal calculation; Division of elements of a finite field
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F17/50—Computer-aided design
- G06F2207/00—Indexing scheme relating to methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F2207/72—Indexing scheme relating to groups G06F7/72 - G06F7/729
- G06F2207/7219—Countermeasures against side channel or fault attacks
- G06F11/00—Error detection; Error correction; Monitoring
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/06—Ray-tracing
- G06T11/00—2D [Two Dimensional] image generation

Similar Documents

Publication	Publication Date	Title
Dakkak et al.	2019	Accelerating reduction and scan using tensor core units
Saito et al.	2013	Variants of Mersenne twister suitable for graphic processors
Páll et al.	2013	A flexible algorithm for calculating pair interactions on SIMD architectures
US8756264B2 (en)	2014-06-17	Parallel pseudorandom number generation
Maruyama et al.	2014	Optimizing stencil computations for NVIDIA Kepler GPUs
Satish et al.	2012	Can traditional programming bridge the ninja performance gap for parallel computing applications?
Li et al.	2011	Strassen's matrix multiplication on GPUs
US20170344514A1 (en)	2017-11-30	System and method for speeding up general matrix-matrix multiplication on the gpu
Peng et al.	2020	GLU3.0: Fast GPU-based parallel sparse LU factorization for circuit simulation
US10067910B2 (en)	2018-09-04	System and method for GPU maximum register count optimization applied to general matrix-matrix multiplication
Mertmann et al.	2011	Fine-sorting one-dimensional particle-in-cell algorithm with Monte-Carlo collisions on a graphics processing unit
Riesinger et al.	2018	Non-standard pseudo random number generators revisited for GPUs
Sørensen et al.	2014	Multicore performance of block algebraic iterative reconstruction methods
Kannan	2013	Efficient sparse matrix multiple-vector multiplication using a bitmapped format
Haidar et al.	2015	Towards batched linear solvers on accelerated hardware platforms
László et al.	2012	Analysis of a gpu based cnn implementation
Strnad et al.	2016	Parallel construction of classification trees on a GPU
US9804826B2 (en)	2017-10-31	Parallelization of random number generators
Nagasaka et al.	2014	Cache-aware sparse matrix formats for Kepler GPU
Dobravec et al.	2017	Comparing CPU and GPU implementations of a simple matrix multiplication algorithm
Wang et al.	2016	A fast tridiagonal solver for Intel MIC architecture
Pershin et al.	2019	Performance limits study of stencil codes on modern GPGPUs
Chen et al.	2016	OpenCL-based erasure coding on heterogeneous architectures
Chen et al.	2009	High performance median filtering using commodity graphics hardware
Kaya	2019	Parallel algorithms for computing sparse matrix permanents