How active is active learning: value function method vs an approximation method

IRIS

In a previous paper Amman and Tucci (2018) compare the two dominant approaches for solving models with optimal experimentation (also called active learning), i.e. the value function and the approximation method. By using the same model and dataset as in Beck and Wieland (2002), they find that the approximation method produces solutions close to those generated by the value function approach and identify some elements of the model specifications which affect the difference between the two solutions. They conclude that differences are small when the effects of learning are limited. However the dataset used in the experiment describes a situation where the controller is dealing with a nonstationary process and there is no penalty on the control. The goal of this paper is to see if their conclusions hold in the more commonly studied case of a controller facing a stationary process and a positive penalty on the control.

Amman, H.M., Tucci, M.P. (2018). How active is active learning: value function method vs an approximation method. QUADERNI DEL DIPARTIMENTO DI ECONOMIA POLITICA, 788, 1-25.

How active is active learning: value function method vs an approximation method

Hans M. Amman;Marco P. Tucci^{Writing – Original Draft Preparation}

2018-01-01

Abstract

In a previous paper Amman and Tucci (2018) compare the two dominant approaches for solving models with optimal experimentation (also called active learning), i.e. the value function and the approximation method. By using the same model and dataset as in Beck and Wieland (2002), they find that the approximation method produces solutions close to those generated by the value function approach and identify some elements of the model specifications which affect the difference between the two solutions. They conclude that differences are small when the effects of learning are limited. However the dataset used in the experiment describes a situation where the controller is dealing with a nonstationary process and there is no penalty on the control. The goal of this paper is to see if their conclusions hold in the more commonly studied case of a controller facing a stationary process and a positive penalty on the control.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2018
			
	Rivista su cui è pubblicata l'opera
	
				QUADERNI DEL DIPARTIMENTO DI ECONOMIA POLITICA
			
	Citazione
	
				Amman, H.M., Tucci, M.P. (2018). How active is active learning: value function method vs an approximation method. QUADERNI DEL DIPARTIMENTO DI ECONOMIA POLITICA, 788, 1-25.
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
DEPS 788 Oct. 18.pdf accesso aperto Tipologia: PDF editoriale Licenza: PUBBLICO - Pubblico con Copyright Dimensione 454.91 kB Formato Adobe PDF Visualizza/Apri	454.91 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1061962