Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Off-Policy Learning

Encyclopedia of Business Analytics and Optimization
the value assigned to a given state (or state-action pair) is a function of the immediate reward and of the maximum rewards received in the subsequent states during the episode.
Published in Chapter:
Reinforcement Learning for Business Modeling
Fernando S. Oliveira (ESSEC Business School, Singapore)
Copyright: © 2014 |Pages: 10
DOI: 10.4018/978-1-4666-5202-6.ch181
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR