Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2010
Recent activities of major chip manufacturers, such as Intel, AMD, IBM and NVIDIA, make it more evident than ever that future designs of microprocessors and large HPC systems will be hybrid/heterogeneous in nature, relying on the integration (in varying proportions) of two major types of components:
2008 IEEE International Symposium on Parallel and Distributed Processing
Evaluation and tuning of the Level 3 CUBLAS for graphics processors2008 •
Queue
GPUs2008 •
A gamer wanders through a virtual world rendered in near- cinematic detail. Seconds later, the screen fills with a 3D explosion, the result of unseen enemies hiding in physically accurate shadows. Disappointed, the user exits the game and returns to a computer desktop that exhibits the stylish 3D look-and-feel of a modern window manager. Both of these visual experiences require hundreds of gigaflops of computing performance, a demand met by the GPU (graphics processing unit) present in every consumer PC.
2008 •
The unique architecture of the heterogeneous multi-core Cell processor offers great potential for high performance computing. It offers features such as high memory bandwidth using DMA, user managed local stores and SIMD architecture. In this paper, we present strategies for leveraging these features to develop a high performance BLAS library. We propose techniques to partition and distribute data across SPEs for handling DMA efficiently. We show that suitable pre-processing of data leads to significant performance improvements when the data is unaligned. In addition, we use a combination of two kernels – a specialized high performance kernel for the more frequently occurring cases and a generic kernel for handling boundary cases – to obtain better performance. Using these techniques for double precision, we obtain up to 70–80% of peak performance for different memory bandwidth bound level 1 and 2 routines and up to 80–90% for computation bound level 3 routines.
Language Policy and Political Issues in Education
Decolonization and bilingual intercultur (1)2017 •
Tourism & Management Studies
Community Based Tourism: A Global South Perspective2024 •
EDEBİYAT FAKÜLTESİ 30. YIL ARMAĞAN KİTABI
Bizans Sanatında Pseudo-Kûfi Süsleme2024 •
XIV Jornadas de Investigación y Tercer Encuentro de Investigadores en Psicología del Mercosur
El Curso Introductorio a La Carrera en La U.N.L.P. Como Dispositivo De Formación2007 •
Decreto Protocollo n. 004/24/SG/CEAST del 15 marzo 2024.
La Comunità di Gesù in Angola è stata Eretta Canonicamente in Associazione Pubblica di Fedeli della Chiesa Cattolica da parte della Conferência Episcopal de Angola e São Tomé, CEASTWalailak Journal of Science and Technology (WJST)
Non-negative Solutions of the Nonlinear Diophantine Equation (8^n)^x + p^y=z^2 for Some Prime Number p2021 •
Journal of Revenue & Pricing Management
Performance Monitor: The opportunity costs of revenue management2004 •
Editora Científica Digital eBooks
Nematoide Das Galhas Associado a Cultura Da Goiabeira Em Limoeiro Do Norte, Ceará2022 •
PsycEXTRA Dataset
A Model-Based Team Decision-Making and Performance Assessment Instrument: Development and Evaluation. Volumes 1 and 22002 •
OTKA Kutatási …
Externális tényezők a tudományban és a tudásban= External Factors in Science and Knowledge2006 •
Vidyodaya Journal of Science
Isolation, and characterization of Cladosporium alboflavescens for Acetaminophen biodegradation2023 •
International Journal of Innovative Research in Computer Science and Technology (IJIRCST)
A Review on the Detection and Classification of Glaucoma Disease Based on Transfer Learning2024 •
2019 •
2023 •