Challenges on Porting Lattice Boltzmann Method on Accelerators: NVIDIA Graphic Processing Units and Intel Xeon Phi

Claudio Schepke (Federal University of Pampa, Brazil), João V. F. Lima (Federal University of Santa Maria, Brazil), and Matheus S. Serpa (Federal University of Rio Grande do Sul, Brazil)
Copyright: © 2018 | Pages: 24
DOI: 10.4018/978-1-5225-4760-0.ch002

Abstract

Currently, NVIDIA GPUs and Intel Xeon Phi accelerators are alternative computational architectures for high performance. This chapter investigates the performance impact of these architectures on the lattice Boltzmann method, an iterative method that simulates fluid flows using discrete representations and can be applied to a large number of flow simulations through simple operation rules. The experiments consider a three-dimensional version of the method with 19 discrete propagation directions (D3Q19). The performance evaluation compares three modern GPUs, the K20M, K80, and Titan X, and two Xeon Phi architectures, Knights Corner (KNC) and Knights Landing (KNL). The Titan X provides the fastest execution time of all hardware considered, and the results show that GPUs offer better processing times for this application. Among the Xeon Phi architectures, a KNL cache-mode implementation presents the best results, and the newer Xeon Phi (KNL) is two times faster than the previous model (KNC).

Introduction

High performance computing has driven a scientific revolution. Using computers, problems that could not be solved, or that demanded too much time to solve, became tractable for the scientific community. The evolution of computer architectures improved computational power, widening the range of problems that could be addressed. Integrated circuits, pipelines, higher operating frequencies, out-of-order execution, and branch prediction are among the key technologies introduced up to the end of the 20th century. More recently, concern about energy consumption has been growing, with the goal of achieving exascale computation in a sustainable way. However, the aforementioned technologies alone do not make exascale computing achievable, due to the high energy cost of increasing frequency and pipeline depth, and because instruction-level parallelism is already exploited close to its limits.

To address these limits, multicore and accelerator architectures have been introduced in recent years. Their main feature is the presence of several processing cores operating concurrently, which requires the application to be divided into several tasks that communicate with each other. Concerning the use of accelerators in HPC architectures, their main characteristic is the presence of different environments in the same system, each with an architecture specialized for a type of task. A typical HPC system is composed of a general-purpose processor, responsible for managing the system, and several accelerators that perform the computation of certain kinds of tasks.

The use of accelerators poses several challenges for HPC. Applications must be coded considering the particularities and constraints of each environment, as well as their distinct architectural characteristics. In the memory hierarchy, for example, the presence of several cache levels, some shared and others private, and the choice between centralized and distributed memory banks introduce non-uniform access times that impact performance. In addition, the number of functional units in an accelerator may vary between hardware versions, and the instruction set itself may not be the same. All these aspects influence application performance and must be considered in the application code.

This chapter covers recent challenges of parallel programming for the Lattice Boltzmann Method (LBM) (Schepke, Maillard, & Navaux, 2009). LBM is currently a backbone method for simulating fluid flow through porous media and has been extensively applied to soil filtration and fuel cells over the last five years. It is an iterative numerical method to model and simulate fluid dynamics, in which space, time, and velocity are discrete. The method enables the computational modeling of a large variety of problems, including multi-component fluids, in one or more phases, with irregular boundary conditions and in complex geometries (Valero-Lara, 2014; Valero-Lara & Jansson, 2016). LBM has been used to simulate blood vessels, the flow of oil with water emulsions in porous rocks, and turbulent flows (Nita, Itu, Suciu, & Suciu, 2013; Obrecht, Kuznik, Tourancheau, & Roux, 2011).
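For background, the single-relaxation-time (BGK) form of the lattice Boltzmann update, commonly used with the D3Q19 stencil studied in this chapter, combines a local collision with a streaming step. The formula below is standard LBM background, not reproduced from the chapter:

```latex
% BGK lattice Boltzmann update: each distribution f_i relaxes locally toward
% equilibrium f_i^eq, then streams one lattice step along its velocity c_i.
f_i(\mathbf{x} + \mathbf{c}_i \Delta t,\; t + \Delta t)
  = f_i(\mathbf{x}, t)
  - \frac{1}{\tau}\left[ f_i(\mathbf{x}, t) - f_i^{\mathrm{eq}}(\mathbf{x}, t) \right],
\qquad i = 0, \dots, 18 \ \text{(D3Q19)}
```

The macroscopic density and velocity are recovered as moments of the distributions, $\rho = \sum_i f_i$ and $\rho\mathbf{u} = \sum_i \mathbf{c}_i f_i$, which is what makes each lattice site's update depend only on local and nearest-neighbor data.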

LBM is a numerical approach for simulating fluid flows that benefits from being applicable to specific flow conditions, naturally discrete, and easily parallelized. In terms of development, the fluid flow model is discrete from the start, so the domain representation does not need to be discretized afterwards. This simplifies coding, because method and algorithm coincide. Finally, because the operations of the method are local, each lattice element can be computed in parallel, so a parallel version of the algorithm should be straightforward.
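To make the locality argument concrete, the collide-and-stream structure can be sketched in plain C for a minimal one-dimensional D1Q3 lattice. This is an illustrative reduction of the chapter's D3Q19 setup, not the authors' code; `lbm_step`, the lattice size `N`, the relaxation time `tau`, and the periodic boundary are all assumptions of this sketch. The collision loop reads and writes only one site at a time, which is exactly why each lattice element can be updated in parallel on a GPU or Xeon Phi:

```c
#define N 16   /* lattice sites (illustrative size) */
#define Q 3    /* D1Q3 discrete velocities: rest, +1, -1 */

static const int    c[Q] = { 0, 1, -1 };
static const double w[Q] = { 2.0/3.0, 1.0/6.0, 1.0/6.0 };

/* One BGK collide-and-stream step on a periodic 1-D lattice.
 * Collision is purely local per site; streaming is a neighbor shift. */
void lbm_step(double f[Q][N], double tau)
{
    double tmp[Q][N];

    for (int x = 0; x < N; x++) {
        /* macroscopic moments at this site only */
        double rho = 0.0, mom = 0.0;
        for (int i = 0; i < Q; i++) { rho += f[i][x]; mom += c[i] * f[i][x]; }
        double u = mom / rho;

        /* relax each distribution toward its local equilibrium */
        for (int i = 0; i < Q; i++) {
            double cu  = c[i] * u;
            double feq = w[i] * rho * (1.0 + 3.0*cu + 4.5*cu*cu - 1.5*u*u);
            tmp[i][x]  = f[i][x] - (f[i][x] - feq) / tau;
        }
    }

    /* streaming: propagate each distribution one step along c[i] */
    for (int i = 0; i < Q; i++)
        for (int x = 0; x < N; x++)
            f[i][(x + c[i] + N) % N] = tmp[i][x];
}
```

In the D3Q19 case the same pattern applies with 19 directions over a 3-D grid; the outer loop over sites is the part mapped to CUDA threads or OpenMP threads in the architectures the chapter compares.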

Computational methods such as LBM must be continuously ported to the newest HPC hardware available to remain competitive. Parallel programming strategies are applied to the operations over each lattice element and its neighbor elements. To execute parallel simulations, state-of-the-art HPC architectures are employed, producing accurate results faster with each generation. The software must evolve to support the features of each design to keep performance scaling, and it is important to understand the impact of the software on each architecture in order to improve performance.
