[go: up one dir, main page]

O'hallaron et al., 1998 - Google Patents

Architectural implications of a family of irregular applications

O'hallaron et al., 1998

View PDF
Document ID
18144875666966984766
Author
O'hallaron D
Shewchuk J
Gross T
Publication year
Publication venue
Proceedings 1998 Fourth International Symposium on High-Performance Computer Architecture

External Links

Snippet

Irregular applications based on sparse matrices are at the core of many important scientific computations. Since the importance of such applications is likely to increase in the future, high-performance parallel and distributed systems must provide adequate support for such …
Continue reading at apps.dtic.mil (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
    • G06F15/163Interprocessor communication
    • G06F15/173Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
    • G06F15/17356Indirect interconnection networks
    • G06F15/17368Indirect interconnection networks non hierarchical topologies
    • G06F15/17381Two dimensional, e.g. mesh, torus
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5066Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3457Performance evaluation by simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • G06F17/5009Computer-aided design using simulation
    • G06F17/5022Logic simulation, e.g. for logic circuit operation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored programme computers
    • G06F15/80Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored programme computers
    • G06F15/78Architectures of general purpose stored programme computers comprising a single central processing unit
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F1/00Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
    • G06F1/04Generating or distributing clock signals or signals derived directly therefrom
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/885Monitoring specific for caches
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled

Similar Documents

Publication Publication Date Title
Gustafson et al. Development of parallel methods for a 1024-processor hypercube
Hillis et al. The CM-5 Connection Machine: a scalable supercomputer.
US5072371A (en) Method for simultaneous overlapped communications between neighboring processors in a multiple
O'hallaron Spark98: Sparse matrix kernels for shared memory and message passing systems
Shan et al. Message passing and shared address space parallelism on an SMP cluster
Keyes et al. Prospects for CFD on petaflops systems
O'hallaron et al. Architectural implications of a family of irregular applications
Villarreal et al. Distributed three-dimensional finite-difference modeling of wave propagation in acoustic media
Smith Mixed mode MPI/OpenMP programming
Messina et al. Benchmarking advanced architecture computers
Kaushik et al. On the interaction of architecture and algorithm in the domain-based parallelization of an unstructured-grid incompressible flow code
Tipparaju et al. Exploiting non-blocking remote memory access communication in scientific benchmarks
Fahey et al. Early evaluation of the Cray XD1
O’Hallaron et al. Properties of a family of parallel finite element simulations
Christara et al. An efficient transposition algorithm for distributed memory computers
Hoe et al. A personal supercomputer for climate research
Titov et al. Application of an integral approach to the parallel algorithm of 3D wave fields simulation in generalized coordinates
Killough et al. Parallelization of a General-Purpose Reservoir Simulator
Rauber et al. Modelling the runtime of scientific programs on parallel computers
McBryan A comparison of the Intel iPSC/860 and the Suprenum-1 parallel computers
Chen et al. Implementing parallel conjugate gradient on the earth multithreaded architecture
Plaskacz Parallel finite–element analysis via message passing
Shea et al. The IBM Victor V256 partitionable multiprocessor
Worley et al. A study of application sensitivity to variation in message‐passing latency and bandwidth
Gropp et al. Analyzing the parallel scalability of an implicit unstructured mesh CFD code