A Transformer-Based Model for Multi-Track Music Generation

Cong Jin, Tao Wang, Shouxun Liu, Yun Tie, Jianguang Li, Xiaobing Li, Simon Lui
DOI: 10.4018/IJMDEM.2020070103

Abstract

Most current works are still limited to melody generation, which covers the pitch, rhythm, and duration of each note and the pauses between notes. This paper proposes a Transformer-based model, abbreviated as the MTMG model, to generate multi-track music comprising piano, guitar, and drum tracks. The proposed MTMG model is mainly an innovation and improvement on the Transformer architecture. First, the model obtains three target sequences through pairwise learning in a learning network. Then, conditioned on these three target sequences, GPT is applied to predict and generate three closely related instrument-track sequences. Finally, the three generated instrument tracks are fused to obtain multi-track music pieces containing piano, guitar, and drum. To verify the effectiveness of the proposed model, comparative subjective and objective evaluations are conducted. The encouraging performance of the proposed model over other state-of-the-art models demonstrates its superiority in musical representation.
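The abstract describes a three-stage pipeline: pairwise learning of target sequences, GPT-based generation of each instrument track, and fusion into a multi-track piece. The following is only a hypothetical sketch of that flow; every function here is a placeholder introduced for illustration and is not code or an API from the paper.

```python
# Hypothetical outline of the three-stage MTMG pipeline described in the abstract.
# pairwise_learning, gpt_generate, and fuse_tracks are placeholder stubs, not the paper's code.
from typing import List, Tuple

Track = List[int]  # a toy token sequence standing in for one instrument track

def pairwise_learning(seed: Track) -> Tuple[Track, Track, Track]:
    """Placeholder: produce three mutually consistent target sequences (piano, guitar, drum)."""
    return seed, seed, seed

def gpt_generate(target: Track) -> Track:
    """Placeholder: a GPT-style decoder would autoregressively extend the target sequence."""
    return target

def fuse_tracks(piano: Track, guitar: Track, drum: Track) -> List[Track]:
    """Placeholder: align and merge the three generated tracks into one multi-track piece."""
    return [piano, guitar, drum]

# Toy usage with MIDI-like pitch tokens.
piano_t, guitar_t, drum_t = pairwise_learning([60, 62, 64])
piece = fuse_tracks(gpt_generate(piano_t), gpt_generate(guitar_t), gpt_generate(drum_t))
```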
Article Preview

Background

Similar to most sequence-to-sequence (Seq2Seq) models (Sutskever et al., 2014), the Transformer uses an encoder-decoder structure. However, earlier models usually used a recurrent neural network (such as an LSTM) in the encoder and decoder. The disadvantages of this network structure are the problem of long-term dependence and the inability to compute in parallel. To improve the efficiency of parallel computing and capture long-term dependencies, the Transformer abandons recurrent generation and uses the self-attention mechanism to build a fully connected network structure, thereby implementing an architecture based entirely on the feed-forward attention mechanism. In this context, OpenAI's GPT pre-training model came into being (Radford et al., 2018). GPT trains language models in a generative manner, using only the decoder structure of the Transformer rather than the complete Transformer to build its network. With the wide application of GPT, OpenAI proposed GPT-2, which has a larger training data set and can perform diverse tasks without supervision on the basis of GPT. The structure of the GPT-2 model is still the same as GPT, and its core idea is that unsupervised pre-trained models can be used for supervised tasks (Radford et al., 2019). Similar to these pre-training models, a weakly-supervised deep hashing method has been proposed that uses weakly-supervised information (Li et al., 2020). BERT (Bidirectional Encoder Representations from Transformers) also modifies the pre-training objective on the basis of GPT and pre-trains with a larger model and more data to obtain the best results at present.
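To make the decoder-only, causal self-attention that GPT inherits from the Transformer concrete, here is a minimal NumPy sketch. It is purely illustrative and not taken from the paper or from any GPT implementation; matrix names and dimensions are assumptions.

```python
# Minimal sketch of causal (masked) scaled dot-product self-attention,
# the core operation of a GPT-style decoder block. Illustrative only.
import numpy as np

def causal_self_attention(x, w_q, w_k, w_v):
    """x: (seq_len, d_model); w_q, w_k, w_v: (d_model, d_k) projection matrices."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v              # project tokens to queries/keys/values
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)                  # scaled dot-product attention scores
    mask = np.triu(np.ones_like(scores), k=1)        # upper triangle marks future positions
    scores = np.where(mask == 1, -1e9, scores)       # causal mask: block attention to the future
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
    return weights @ v                               # weighted sum of value vectors

# Example: 5 tokens with a 16-dimensional embedding projected to 8-dimensional heads.
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 16))
w_q, w_k, w_v = (rng.normal(size=(16, 8)) for _ in range(3))
out = causal_self_attention(x, w_q, w_k, w_v)        # shape (5, 8)
```

The causal mask is what distinguishes the decoder-style attention GPT uses from the encoder's bidirectional attention: each position may only attend to itself and earlier positions, which is what makes autoregressive generation possible.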
