O'hallaron et al., 1998 - Google Patents
Architectural implications of a family of irregular applicationsO'hallaron et al., 1998
View PDF- Document ID
- 18144875666966984766
- Author
- O'hallaron D
- Shewchuk J
- Gross T
- Publication year
- Publication venue
- Proceedings 1998 Fourth International Symposium on High-Performance Computer Architecture
External Links
Snippet
Irregular applications based on sparse matrices are at the core of many important scientific computations. Since the importance of such applications is likely to increase in the future, high-performance parallel and distributed systems must provide adequate support for such …
- 230000001788 irregular 0 title abstract description 18
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
- G06F15/17356—Indirect interconnection networks
- G06F15/17368—Indirect interconnection networks non hierarchical topologies
- G06F15/17381—Two dimensional, e.g. mesh, torus
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5066—Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3457—Performance evaluation by simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
- G06F17/5022—Logic simulation, e.g. for logic circuit operation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/80—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/78—Architectures of general purpose stored programme computers comprising a single central processing unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
- G06F1/04—Generating or distributing clock signals or signals derived directly therefrom
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G06F2201/885—Monitoring specific for caches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Gustafson et al. | Development of parallel methods for a 1024-processor hypercube | |
| Hillis et al. | The CM-5 Connection Machine: a scalable supercomputer. | |
| US5072371A (en) | Method for simultaneous overlapped communications between neighboring processors in a multiple | |
| O'hallaron | Spark98: Sparse matrix kernels for shared memory and message passing systems | |
| Shan et al. | Message passing and shared address space parallelism on an SMP cluster | |
| Keyes et al. | Prospects for CFD on petaflops systems | |
| O'hallaron et al. | Architectural implications of a family of irregular applications | |
| Villarreal et al. | Distributed three-dimensional finite-difference modeling of wave propagation in acoustic media | |
| Smith | Mixed mode MPI/OpenMP programming | |
| Messina et al. | Benchmarking advanced architecture computers | |
| Kaushik et al. | On the interaction of architecture and algorithm in the domain-based parallelization of an unstructured-grid incompressible flow code | |
| Tipparaju et al. | Exploiting non-blocking remote memory access communication in scientific benchmarks | |
| Fahey et al. | Early evaluation of the Cray XD1 | |
| O’Hallaron et al. | Properties of a family of parallel finite element simulations | |
| Christara et al. | An efficient transposition algorithm for distributed memory computers | |
| Hoe et al. | A personal supercomputer for climate research | |
| Titov et al. | Application of an integral approach to the parallel algorithm of 3D wave fields simulation in generalized coordinates | |
| Killough et al. | Parallelization of a General-Purpose Reservoir Simulator | |
| Rauber et al. | Modelling the runtime of scientific programs on parallel computers | |
| McBryan | A comparison of the Intel iPSC/860 and the Suprenum-1 parallel computers | |
| Chen et al. | Implementing parallel conjugate gradient on the earth multithreaded architecture | |
| Plaskacz | Parallel finite–element analysis via message passing | |
| Shea et al. | The IBM Victor V256 partitionable multiprocessor | |
| Worley et al. | A study of application sensitivity to variation in message‐passing latency and bandwidth | |
| Gropp et al. | Analyzing the parallel scalability of an implicit unstructured mesh CFD code |