- Leibniz-Rechenzentrum der Bayerischen Akademie der Wissenschaften
Boltzmannstr. 1
D-85748 Garching bei Muenchen
Volker Weinberg
Leibniz Supercomputing Centre, High Performance Systems Division, Department Member
- Volker Weinberg studied physics at the Ludwig Maximilian University of Munich and later worked at the research centre... moreVolker Weinberg studied physics at the Ludwig Maximilian University of Munich and later worked at the research centre DESY. He received his PhD from the Free University of Berlin for his studies in the field of lattice QCD. Since 2008 he is working in the HPC group at the Leibniz Supercomputing Centre and is responsible for education and training at LRZ, new programming languages and accelerators. As a lecturer he has provided many courses about basic and advanced OpenMP and MPI programming, Intel manycore programming and optimization, GPU programming using OpenACC/OpenMP and Mathematica. Within PRACE, the Partnership for Advanced Computing in Europe, he is coordinating PRACE training at LRZ and is currently also involved as leader of the workpackage WP5 "HPC Planning and Commissioning". He is a member of the PRACE-6IP Technical Board and LRZ representative in the OpenMP Architecture Review Board (ARB). Furthermore, he is leading the workpackage "Education and Training" within GCS, the Gauss Centre for Supercomputing, which combines the three national supercomputing centres High Performance Computing Center Stuttgart (HLRS), Jülich Supercomputing Centre (JSC), and Leibniz Supercomputing Centre (LRZ) into Germany’s foremost supercomputing institution. Volker is one of 4 elected spokespersons of the scientific employees of the Bavarian Academy of Sciences and Humanities.edit
Research Interests:
Research Interests:
Research Interests:
This book presents the latest findings and ongoing research in the field of environmental informatics. It addresses a wide range of cross-cutting activities, such as efficient computing, virtual reality, disruption management, big data,... more
This book presents the latest findings and ongoing research in the field of environmental informatics. It addresses a wide range of cross-cutting activities, such as efficient computing, virtual reality, disruption management, big data, open science and the internet of things, and showcases how these green information & communication technologies (ICT) can be used to effectively address environmental and societal challenges. Presenting a selection of extended contributions to the 32nd edition of the International Conference EnviroInfo 2018, at the Leibniz Supercomputing Centre in Garching near Munich, it is essential reading for anyone looking to expand their expertise in the area.
This book presents the latest findings and ongoing research in the field of environmental informatics. It addresses a wide range of cross-cutting activities such as recent advances in environmental informatics, Internet of Things... more
This book presents the latest findings and ongoing research in the field of environmental informatics. It addresses a wide range of cross-cutting activities such as recent advances in environmental informatics, Internet of Things technologies, challenges in ICT-technologies, disaster management, energy aware software-engineering and development, environmental health informatics, environmental information systems, machine learning, open science and sustainable mobility. The book contains selected short and work in progress papers of the 32nd International Conference EnviroInfo 2018 at the Leibniz Supercomputing Centre of the Bavarian Academy of Sciences and Humanities (LRZ) in Garching near Munich.
This Best Practice Guide written from scratch provides information about Intel's Haswell/Broadwell architecture in order to enable programmers to achieve good performance of their applications. The guide covers a wide range of topics from... more
This Best Practice Guide written from scratch provides information about Intel's Haswell/Broadwell architecture in order to enable programmers to achieve good performance of their applications. The guide covers a wide range of topics from the description and comparison of the hardware of the Haswell/Broadwell processor, through information about the compiler usage as well as information about porting programs up to tools and strategies how to analyse and improve the performance of applications. With the introduction of extra vector instructions with these processors (fused multiply add etc.) stronger focus is placed on vectorisation. In the tuning context several of the tuning tools provided by Intel are demonstrated and examples are given on how to use command line tools to collect data in batch mode. Furthermore, the guide provides information about the following European Intel Haswell/Broadwell based European systems: Hazel Hen @ HLRS, Germany, Minotauro @ BSC, Spain, Salomon @ IT4Innovations, Czech Republic, SuperMUC Phase 2 @ LRZ, Germany.
Research Interests:
This Best Practice Guide describes general purpose computation on Graphics Processing Units (GPUs). GPUs were originally developed for computer gaming and other graphical tasks, but for many years have been exploited for general purpose... more
This Best Practice Guide describes general purpose computation on Graphics Processing Units (GPUs). GPUs were originally developed for computer gaming and other graphical tasks, but for many years have been exploited for general purpose computing across a number of areas. They offer advantages over traditional CPUs because they have greater computational capability and use high-bandwidth memory systems (with memory bandwidth being the main bottleneck for many scientific applications). The guide includes information on how to get started with programming GPUs, which cannot be used in isolation but only as "accelerators" in conjunction with CPUs, and how to get good performance. Focus is given to NVIDIA GPUs, which are most widespread today. The GPGPU Best Practice Guide is based on the PRACE-2IP GPGPU Best Practice Mini-Guide. The following new topics were added: GPU architecture especially highlighting new features of NVIDIA Pascal, OpenCL Programming, OpenMP 4.x Offloading. Several existing sections have been updated to reflect recent changes.
Research Interests:
This best practice guide provides information about SuperMUC in order to enable users of the system to achieve good performance of their applications. The guide covers a wide range of topics from the detailed description of the hardware... more
This best practice guide provides information about SuperMUC in order to enable users of the system to achieve good performance of their applications. The guide covers a wide range of topics from the detailed description of the hardware through information about the basic production environment including how to login and the accounting procedure as well as information about porting and submitting jobs, up to tools and strategies on how to analyze and improve the performance of applications. The guide includes contributions from LRZ, Nikos Anastopoulos (NTUA/GRNET) and Petri Nikunen (CSC).
Research Interests:
As code optimisation techniques are getting more and more important in HPC, LRZ @ GCS as one of the six European PRACE Advanced Training Centres (PATC) has extended its curriculum by a new PATC course " HPC code optimisation workshop " ,... more
As code optimisation techniques are getting more and more important in HPC, LRZ @ GCS as one of the six European PRACE Advanced Training Centres (PATC) has extended its curriculum by a new PATC course " HPC code optimisation workshop " , which took place at LRZ on May 4, 2017 for the first time.
Research Interests:
Most HPC systems are clusters of shared memory nodes. These SMP nodes can be small multi-core CPUs up to large many-core CPUs. Parallel programming may combine the distributed memory parallelisation on the node interconnect (e.g., using... more
Most HPC systems are clusters of shared memory nodes. These SMP nodes can be small multi-core CPUs up to large many-core CPUs. Parallel programming may combine the distributed memory parallelisation on the node interconnect (e.g., using MPI) with the shared memory parallelisation inside of each node (e.g., using OpenMP or MPI-3.0 shared memory). As such hybrid programming techniques are getting more and more important in HPC, GCS as one of the 6 European PRACE Advanced Training Centres (PATC) has extended its curriculum by a new PATC course on hybrid programming techniques, which took place at LRZ on January 14, 2016 for the first time.
Research Interests:
Since the standards of parallel programming languages are becoming more and more complex and extensive, it can be hard to stay up to date with recent developments. LRZ has thus invited leading HPC experts to give updates of recent... more
Since the standards of parallel programming languages are becoming more and more complex and extensive, it can be hard to stay up to date with recent developments. LRZ has thus invited leading HPC experts to give updates of recent advances in parallel programming languages during a 1-day workshop on June 8 2015 at LRZ.
Research Interests:
We discuss a detailed weak scaling analysis of GEM, a 3D MPI-parallelised gyrofluid code used in theoretical plasma physics at the Max Planck Institute of Plasma Physics, IPP at Garching b. M\"unchen, Germany. Within a PRACE Preparatory... more
We discuss a detailed weak scaling analysis of GEM, a 3D MPI-parallelised gyrofluid code used in theoretical plasma physics at the Max Planck Institute of Plasma Physics, IPP at Garching b. M\"unchen, Germany. Within a PRACE Preparatory Access Project various versions of the code have been analysed on the HPC systems SuperMUC at LRZ and JUQUEEN at J\"ulich Supercomputing Centre (JSC) to improve the parallel scalability of the application. The diagnostic tool Scalasca has been used to filter out suboptimal routines. The code uses the electromagnetic gyrofluid model which is a superset of magnetohydrodynamic and drift-Alfv\'en microturbulance and also includes several relevant kinetic processes. GEM can be used with different geometries depending on the targeted use case, and has been proven to show good scalability when the computational domain is distributed amongst two dimensions. Such a distribution allows grids with sufficient size to describe small scale tokamak devices. In order to enable simulation of very large tokamaks (such as the next generation nuclear fusion device ITER in Cadarache, France) the third dimension has been parallelised and weak scaling has been achieved for significantly larger grids.
Intel Array Building Blocks is a high-level data-parallel programming environment designed to produce scalable and portable results on existing and upcoming multi- and many-core platforms. We have chosen several mathematical kernels - a... more
Intel Array Building Blocks is a high-level data-parallel programming environment designed to produce scalable and portable results on existing and upcoming multi- and many-core platforms. We have chosen several mathematical kernels - a dense matrix-matrix multiplication, a sparse matrix-vector multiplication, a 1-D complex FFT and a conjugate gradients solver - as synthetic benchmarks and representatives of scientific codes and ported them to ArBB. This whitepaper describes the ArBB ports and presents performance and scaling measurements on the Westmere-EX based system SuperMIG at LRZ in comparison with OpenMP and MKL.
Research Interests:
Research Interests:
These are the proceedings of the workshop on " Lattice QCD, Chiral Perturbation Theory and Hadron Phenomenology " held at the European Centre for Theoretical Studies in Nuclear Physics and Related Areas from October 2 to 6, 2006. The... more
These are the proceedings of the workshop on " Lattice QCD, Chiral Perturbation Theory and Hadron Phenomenology " held at the European Centre for Theoretical Studies in Nuclear Physics and Related Areas from October 2 to 6, 2006. The workshop concentrated on bringing together researchers working in lattice QCD and chiral perturbation theory with the aim of improving our understanding how hadron properties can be calculated and analyzed from first principles. Included are a short contribution per talk.
Research Interests:
Research Interests:
Research Interests:
High Performance Computing (HPC) is experiencing vast amount of changes in the road towards Exascale computing capability. These changes stretch throughout different levels: from technology and architectures to use cases. In order to... more
High Performance Computing (HPC) is experiencing vast amount of changes in the road towards Exascale computing capability. These changes stretch throughout different levels: from technology and architectures to use cases. In order to attain the best performing HPC system, it is imperative that the underlying technology and architecture match the requirements of the current and emerging applications.
This document aims to provide an overview of these requirements by assessing the needs of user communities and of HPC centres in terms of technologies and architectures for next generation HPC systems evolving towards Exascale. For this purpose, surveys have been conducted among recently started Centres of Excellences (CoEs) in Europe for collecting the requirements from HPC user communities. A different survey has been distributed to all PRACE Tier-0/Tier-1 HPC sites to understand how these requirements differ from the current state of the art, to determine the requirements of HPC centres, and possibly motivate related prototyping efforts.
This document aims to provide an overview of these requirements by assessing the needs of user communities and of HPC centres in terms of technologies and architectures for next generation HPC systems evolving towards Exascale. For this purpose, surveys have been conducted among recently started Centres of Excellences (CoEs) in Europe for collecting the requirements from HPC user communities. A different survey has been distributed to all PRACE Tier-0/Tier-1 HPC sites to understand how these requirements differ from the current state of the art, to determine the requirements of HPC centres, and possibly motivate related prototyping efforts.
The Training Work Package (WP4) of the PRACE Fifth Implementation Phase (PRACE-5IP) project is responsible for the training activities of PRACE. In addition to the extensive programme of face-to-face training delivered by the PRACE... more
The Training Work Package (WP4) of the PRACE Fifth Implementation Phase (PRACE-5IP) project is responsible for the training activities of PRACE. In addition to the extensive programme of face-to-face training delivered by the PRACE Advanced Training Centres (PATCs) and PRACE Training Centres (PTCs), the training activities also include the On-demand training events, a series of online training offerings such as the Massive Open Online Courses (MOOCs), the training code repository (CodeVault) and the training portal. This deliverable is a report of all the PRACE-5IP training activities during the initial 15 months of the project.
PRACE-5IP continued the training activities of PRACE-4IP starting from February 2017. Between February 2017 and April 2018, the six PATCs delivered 103 courses, 290 course-days with 2,396 participants, while the four PTCs delivered 12 courses, 21 course-days with 351 participants. The feedback responses received have been overwhelmingly positive (8.5/10 average overall rating for PATC courses).
Similarly, the PRACE Seasonal Schools in PRACE-5IP have been carefully selected via a formal selection process. The Seasonal Schools scheduled for this reporting period, namely the Autumn School 2017 in Gdańsk, Poland and the Winter Seasonal School 2018 in Bratislava, Slovak Republic, attracted 39 attendees in total, and obtained high overall ratings.
In addition, WP4 continued the collaboration with the Centres of Excellence (CoEs) where a number of On-demand events were organised. Finally, WP4 co-organised the 2017 International Summer School on HPC Challenges in Computational Sciences in Boulder, Colorado, United States of America, from 25-30June 2017.
All forms of face-to-face and online training are complementary. They contribute to the success of the PRACE training programme, and should be continued in future PRACE HPC training activities. In accordance with the PRACE-5IP goals, the majority of training events also address industrial users, and motivate them by showing the importance and benefits of HPC for the practice.
PRACE-5IP continued the training activities of PRACE-4IP starting from February 2017. Between February 2017 and April 2018, the six PATCs delivered 103 courses, 290 course-days with 2,396 participants, while the four PTCs delivered 12 courses, 21 course-days with 351 participants. The feedback responses received have been overwhelmingly positive (8.5/10 average overall rating for PATC courses).
Similarly, the PRACE Seasonal Schools in PRACE-5IP have been carefully selected via a formal selection process. The Seasonal Schools scheduled for this reporting period, namely the Autumn School 2017 in Gdańsk, Poland and the Winter Seasonal School 2018 in Bratislava, Slovak Republic, attracted 39 attendees in total, and obtained high overall ratings.
In addition, WP4 continued the collaboration with the Centres of Excellence (CoEs) where a number of On-demand events were organised. Finally, WP4 co-organised the 2017 International Summer School on HPC Challenges in Computational Sciences in Boulder, Colorado, United States of America, from 25-30June 2017.
All forms of face-to-face and online training are complementary. They contribute to the success of the PRACE training programme, and should be continued in future PRACE HPC training activities. In accordance with the PRACE-5IP goals, the majority of training events also address industrial users, and motivate them by showing the importance and benefits of HPC for the practice.
The Work Package 7 ‘Application Enabling and Support’ provides applications enabling support for HPC applications codes that are important for European academic and/or industrial researchers to ensure that these applications can... more
The Work Package 7 ‘Application Enabling and Support’ provides applications enabling support for HPC applications codes that are important for European academic and/or industrial researchers to ensure that these applications can effectively exploit current and future PRACE systems. Applications are selected for enabling via calls such as PRACE Preparatory Access (for Tier-0 or Tier-1), or SHAPE (SME HPC Adoption Programme in Europe). This applications enabling activity uses the most promising tools, algorithms and standards for optimisation and parallel scaling that have recently been developed through research and experience in PRACE and other projects. Through the applications-enabling work, the Work Package 7 develops specific expertise on most – if not all – of the architectures which make up the European HPC system. Technical results obtained within this Work Package have been disseminated though many technical whitepapers freely available on the PRACE RI web site. In addition to this enabling work on existing systems, the Work Package also progresses the technical work needed to ensure that key applications are able to use future PRACE Exascale systems, and investigate the tools, languages and libraries needed to exploit future PRACE Exascale systems. One of the main objectives of the Work Package is to support European HPC research communities through the provision of Best Practice Guides, benchmarks, and example parallel codes. The successful series of Best Practice Guides has been initiated in PRACE-1IP and has been continuously extended since then. Topics for these Best Practice Guides include: optimal porting of applications (e.g., choice of numerical libraries and compiler options); architecture specific optimisation and scaling techniques; optimal system environment (e.g., tuneable system parameters, job placement and optimised system libraries); debugging tools, performance analysis tools and programming environment. This report describes the process which led to the Best Practice Guides and the structure of the guides.
Research Interests:
Task T7.1 “Enabling Applications Codes for PRACE Systems” in Work Package 7 (WP7) of PRACE-4IP aims to provide application enabling support for the HPC applications which are important for the European researchers and small and medium... more
Task T7.1 “Enabling Applications Codes for PRACE Systems” in Work Package 7 (WP7) of PRACE-4IP aims to provide application enabling support for the HPC applications which are important for the European researchers and small and medium enterprises to ensure the applications can effectively exploit HPC systems.
Research Interests:
Talk about HPC training within GCS, the Gauss Centre for Supercomputing, which combines the three national supercomputing centres High Performance Computing Center Stuttgart (HLRS), Jülich Supercomputing Centre (JSC), and Leibniz... more
Talk about HPC training within GCS, the Gauss Centre for Supercomputing, which combines the three national supercomputing centres High Performance Computing Center Stuttgart (HLRS), Jülich Supercomputing Centre (JSC), and Leibniz Supercomputing Centre (LRZ) into Germany’s foremost supercomputing institution.
Talk about PRACE training events in Europe.
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
Research Interests:
These are the proceedings of the workshop on ``Lattice QCD, Chiral Perturbation Theory and Hadron Phenomenology'' held at the European Centre for Theoretical Studies in Nuclear Physics and Related Areas from October 2 to 6, 2006. The... more
These are the proceedings of the workshop on ``Lattice QCD, Chiral Perturbation Theory and Hadron Phenomenology'' held at the European Centre for Theoretical Studies in Nuclear Physics and Related Areas from October 2 to 6, 2006. The workshop concentrated on bringing together researchers working in lattice QCD and chiral perturbation theory with the aim of improving our understanding how hadron properties can be calculated and analyzed from first principles. Included are a short contribution per talk.