Stochastic Approximation-Based Transport Profiling for Big Data Movement Over Dedicated Connections

Daqing Yun, Chase Q. Wu

Source Title: Stochastic Methods for Estimation and Problem Solving in Engineering

DOI: 10.4018/978-1-5225-5045-7.ch005

Abstract

High-performance networks featuring advance bandwidth reservation have been developed and deployed to support big data transfer in extreme-scale scientific applications. The performance of such big data transfer largely depends on the transport protocols being used. For a given protocol in a given network environment, different parameter settings may lead to different performance, and oftentimes the default settings do not yield the best performance. It is, however, impractical to conduct an exhaustive search of the large parameter space of transport protocols for a set of suitable parameter values. This chapter proposes a stochastic approximation-based transport profiler, called FastProf, to quickly determine the optimal operational zone of a protocol over dedicated connections. The proposed method is evaluated using both emulations based on real-life measurements and experiments over physical connections. The results show that FastProf significantly reduces the profiling overhead while achieving a level of transport performance comparable to that of the exhaustive search-based approach.

Introduction

Extreme-scale scientific applications are generating colossal amounts of data, now frequently termed “big data”, which must be transferred over long geographical distances for distributed processing and analysis. Such big data transfer requires stable, high-speed network connections, which, unfortunately, are not readily available in traditional shared IP networks such as the Internet. High-performance networks (HPNs) such as ESnet (ESnet, 2017), Internet2 (Internet2, 2017), and Google’s B4 (Jain et al., 2013), which provide on-demand dedicated high-speed connections realized by technologies such as MPLS (Andersson & Swallow, 2003) and OpenFlow (McKeown et al., 2008), have emerged to support these data- and network-intensive applications. More recently, significant progress has been made in several aspects of big data movement, including the deployment of 100Gbps networks with future 1Tbps capacity, the increase in end-host capabilities with multiple cores and buses, and the use of parallel file systems. However, these technology advancements and infrastructure investments have not led to a corresponding improvement in big data transfer performance, especially at the application level. Maximizing data transfer performance over such dedicated connections remains challenging, mainly because: i) the optimal operational zones of transport protocols are affected by the complex configurations and dynamics of the network segments, the end hosts, and the protocol itself; ii) different parameter settings may lead to very different performance, and oftentimes the default parameter setting does not yield the best performance; and iii) due to the lack of accurate models for high-performance transport protocols such as UDT (Gu & Grossman, 2007), a widely used protocol in the HPN community (UDT, 2017a), and the complex dynamics of network environments, it is generally very difficult to model and derive the optimal parameter values analytically.

For a given data transfer protocol, a careful selection of parameter values may yield a significant performance improvement over its default settings. As a motivating example, Figure 1 shows the instantaneous UDT throughput with different block sizes over a local back-to-back 10Gbps connection, where a simple adjustment of the block size improves performance by a factor of three on average.

Figure 1. Instantaneous throughput measurements of UDT over a 10Gbps back-to-back connection with different block sizes

Transport profiling, which sweeps through combinations of parameter settings such as socket options, application-specific parameters, and protocol-specific configurations, enables users to determine the “best” set of parameter values for optimal data transfer performance. Several bandwidth estimation tools exist, such as ESnet iperf3 (Iperf3, 2017), which uses continuous data transfer to estimate the achievable throughput along an end-to-end network path and provides users with various functions and options for tuning TCP, UDP, and SCTP. The Transport Profiler Generator (TPG) (Yun et al., 2015) complements ESnet iperf3 by providing throughput estimation for UDT, and additionally supports parallel data streams over multiple NIC-to-NIC connections. A survey of bandwidth estimation tools can be found in Prasad et al. (2003). These tools could be used to conduct exhaustive transport profiling to optimize big data transfer performance. However, such an exhaustive profiling method is prohibitively time-consuming when the control parameter space is large, which is the case in most big data transfer scenarios. In general, users may not be in favor of performing transport profiling if the profiling overhead is comparable to the time needed for their actual data transfer.
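To make the cost of exhaustive profiling concrete, the following minimal Python sketch sweeps every combination in a small parameter grid and counts the measurements required. Everything in it is an assumption made for illustration: the three parameters and their value ranges, the 10-second duration per measurement, and the `synthetic_throughput` function, which merely stands in for a real throughput measurement (for example, a run of iperf3 or TPG). It is not the profiling procedure used in the chapter.

```python
import itertools
import math

# Hypothetical control parameters; names and ranges are illustrative only.
PARAM_SPACE = {
    "block_size_bytes": [2**k for k in range(10, 24)],   # 1 KiB .. 8 MiB, 14 values
    "buffer_size_bytes": [2**k for k in range(16, 30)],  # 64 KiB .. 512 MiB, 14 values
    "parallel_streams": list(range(1, 11)),              # 1 .. 10 streams
}

SECONDS_PER_RUN = 10  # assumed duration of a single throughput measurement


def synthetic_throughput(block_size_bytes, buffer_size_bytes, parallel_streams):
    """Made-up, single-peaked throughput surface in Gbps (not measured data)."""
    d_block = math.log2(block_size_bytes) - 20    # peak at 1 MiB blocks
    d_buffer = math.log2(buffer_size_bytes) - 26  # peak at 64 MiB buffers
    d_streams = parallel_streams - 4              # peak at 4 streams
    return 10.0 * math.exp(
        -0.05 * d_block**2 - 0.03 * d_buffer**2 - 0.10 * d_streams**2
    )


def exhaustive_profile(space, measure):
    """Measure every parameter combination and return the best one found."""
    names = list(space)
    best_setting, best_value, runs = None, float("-inf"), 0
    for values in itertools.product(*(space[name] for name in names)):
        setting = dict(zip(names, values))
        value = measure(**setting)
        runs += 1
        if value > best_value:
            best_setting, best_value = setting, value
    return best_setting, best_value, runs


if __name__ == "__main__":
    setting, value, runs = exhaustive_profile(PARAM_SPACE, synthetic_throughput)
    hours = runs * SECONDS_PER_RUN / 3600
    print(f"best setting: {setting} ({value:.2f} Gbps)")
    print(f"measurements: {runs}, estimated profiling time: {hours:.1f} h")
```

Under these assumed numbers, three parameters already require 14 × 14 × 10 = 1,960 measurements, or roughly 5.4 hours of profiling at 10 seconds per run. Each additional parameter multiplies the number of measurements by the number of values it can take, which is why an exhaustive sweep quickly becomes comparable to, or longer than, the data transfer it is meant to tune.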
