Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Vanishing Gradient Problem

Handbook of Research on Applied Data Science and Artificial Intelligence in Business and Industry
With addition of more layers to the neural network the loss function gradient becomes smaller approaching to almost zero which makes it harder to train the network.
Published in Chapter:
Question Answering Chatbot Using Memory Networks
Riddhi Patel (Charotar University of Science and Technology, India), Parth Patel (Charotar University of Science and Technology, India), Chintan M. Bhatt (Charotar University of Science and Technology, India), and Mrugendrasinh L. Rahevar (Charotar University of Science and Technology, India)
DOI: 10.4018/978-1-7998-6985-6.ch019
Abstract
Manipulation of a large amount of data becomes a very tedious task. Hence, the authors took the approach of memory networks for the implementation of the chatbot. Traditionally, the LSTM model was used to implement chatbots and QA systems. But the LSTM failed to store relevant information when given a longer information set. On the contrary, the memory networks have an additional memory component with it. This can help in storing long information for further use which is greatly advantageous for the QA and chatbot systems as compared to LSTM. The authors trained and tested their model over Facebook's bAbi dataset which consists of several tasks and has questions regarding each task to retrieve the accuracy of the model. On the pedestal of that dataset, they have presented the accuracy for every task in their study with memory networks.
Full Text Chapter Download: US $37.50 Add to Cart
More Results
Deep Convolutional Neural Network for Object Classification: Under Constrained and Unconstrained Environments
This problem arises while training the network, the error between the output of the network and the target could vanish while it flows back to the input layer.
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR