Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

Calabuig, J. M.; Falciani, H.; Sánchez Pérez, Enrique Alfonso

doi:10.1016/j.neucom.2020.02.052


Files in this item

Name: CalabuigFalcianiS ...

Size: 851.8Kb

Format: PDF

Description: Author's version.

Name: 1-s2.0-S092523122 ...

Size: 1.672Mb

Format: PDF

Description: Publisher's version

dc.contributor.author	Calabuig, J. M.	es_ES
dc.contributor.author	Falciani, H.	es_ES
dc.contributor.author	Sánchez Pérez, Enrique Alfonso	es_ES
dc.date.accessioned	2021-09-16T03:31:51Z
dc.date.available	2021-09-16T03:31:51Z
dc.date.issued	2020-07-20	es_ES
dc.identifier.issn	0925-2312	es_ES
dc.identifier.uri	http://hdl.handle.net/10251/172597
dc.description.abstract	[EN] We consider a quasi-metric topological structure for the construction of a new reinforcement learning model in the framework of financial markets. It is based on a Lipschitz-type extension of reward functions defined on metric spaces. Specifically, the McShane and Whitney extensions are considered for a reward function defined by the total evaluation of the benefits produced by the investment decision at a given time. We define the metric as a linear combination of a Euclidean distance and an angular metric component. All information about the evolution of the system from the beginning of the time interval is used to support the extension of the reward function, and this data set is additionally enriched with artificially produced states. Thus, the main novelty of our method is the way we produce more states, which we call "dreams", to enrich learning. Using some known states of the dynamical system that represents the evolution of the financial market, we use our technique to simulate new states by interpolating real states and introducing some random variables. These new states are used to feed a learning algorithm designed to improve the investment strategy by following a typical reinforcement learning scheme. © 2020 Elsevier B.V. All rights reserved.	es_ES
dc.description.sponsorship	This work was supported by the Ministerio de Ciencia, Innovación y Universidades, Agencia Estatal de Investigación, and FEDER (Spain) (grant number MTM2016-77054-C2-1-P).	es_ES
dc.language	English	es_ES
dc.publisher	Elsevier	es_ES
dc.relation.ispartof	Neurocomputing	es_ES
dc.rights	All rights reserved	es_ES
dc.subject	Pseudo-metric	es_ES
dc.subject	Reinforcement learning	es_ES
dc.subject	Lipschitz extension	es_ES
dc.subject	Mathematical economics	es_ES
dc.subject	Financial market	es_ES
dc.subject	Model	es_ES
dc.subject.classification	APPLIED MATHEMATICS	es_ES
dc.title	Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets	es_ES
dc.type	Article	es_ES
dc.identifier.doi	10.1016/j.neucom.2020.02.052	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MINECO//MTM2016-77054-C2-1-P/ES/ANALISIS NO LINEAL, INTEGRACION VECTORIAL Y APLICACIONES EN CIENCIAS DE LA INFORMACION/	es_ES
dc.rights.accessRights	Open	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Matemática Aplicada - Departament de Matemàtica Aplicada	es_ES
dc.description.bibliographicCitation	Calabuig, JM.; Falciani, H.; Sánchez Pérez, EA. (2020). Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets. Neurocomputing. 398:172-184. https://doi.org/10.1016/j.neucom.2020.02.052	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	https://doi.org/10.1016/j.neucom.2020.02.052	es_ES
dc.description.upvformatpinicio	172	es_ES
dc.description.upvformatpfin	184	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	398	es_ES
dc.relation.pasarela	S\424071	es_ES
dc.contributor.funder	European Regional Development Fund	es_ES
dc.contributor.funder	MINISTERIO DE ECONOMÍA Y COMPETITIVIDAD	es_ES
dc.description.references	Aliprantis, C., & Burkinshaw, O. (2003). Locally Solid Riesz Spaces with Applications to Economics. Mathematical Surveys and Monographs. doi:10.1090/surv/105	es_ES
dc.description.references	Almahdi, S., & Yang, S. Y. (2017). An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown. Expert Systems with Applications, 87, 267-279. doi:10.1016/j.eswa.2017.06.023	es_ES
dc.description.references	Aronsson, G. (1967). Extension of functions satisfying lipschitz conditions. Arkiv för Matematik, 6(6), 551-561. doi:10.1007/bf02591928	es_ES
dc.description.references	Bekiros, S. D. (2010). Heterogeneous trading strategies with adaptive fuzzy Actor–Critic reinforcement learning: A behavioral approach. Journal of Economic Dynamics and Control, 34(6), 1153-1170. doi:10.1016/j.jedc.2010.01.015	es_ES
dc.description.references	Bekiros, S. D. (2015). Heuristic learning in intraday trading under uncertainty. Journal of Empirical Finance, 30, 34-49. doi:10.1016/j.jempfin.2014.11.002	es_ES
dc.description.references	Bertoluzzo, F., & Corazza, M. (2012). Testing Different Reinforcement Learning Configurations for Financial Trading: Introduction and Applications. Procedia Economics and Finance, 3, 68-77. doi:10.1016/s2212-5671(12)00122-0	es_ES
dc.description.references	Cavalcante, R. C., Brasileiro, R. C., Souza, V. L. F., Nobrega, J. P., & Oliveira, A. L. I. (2016). Computational Intelligence and Financial Markets: A Survey and Future Directions. Expert Systems with Applications, 55, 194-211. doi:10.1016/j.eswa.2016.02.006	es_ES
dc.description.references	Chen, Y., & Hao, Y. (2017). A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Systems with Applications, 80, 340-355. doi:10.1016/j.eswa.2017.02.044	es_ES
dc.description.references	Chong, E., Han, C., & Park, F. C. (2017). Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies. Expert Systems with Applications, 83, 187-205. doi:10.1016/j.eswa.2017.04.030	es_ES
dc.description.references	Das, S. P., & Padhy, S. (2015). A novel hybrid model using teaching–learning-based optimization and a support vector machine for commodity futures index forecasting. International Journal of Machine Learning and Cybernetics, 9(1), 97-111. doi:10.1007/s13042-015-0359-0	es_ES
dc.description.references	Defoort, M., Polyakov, A., Demesure, G., Djemai, M., & Veluvolu, K. (2015). Leader‐follower fixed‐time consensus for multi‐agent systems with unknown non‐linear inherent dynamics. IET Control Theory & Applications, 9(14), 2165-2170. doi:10.1049/iet-cta.2014.1301	es_ES
dc.description.references	Dempster, M. A. H., & Leemans, V. (2006). An automated FX trading system using adaptive reinforcement learning. Expert Systems with Applications, 30(3), 543-552. doi:10.1016/j.eswa.2005.10.012	es_ES
dc.description.references	Deng, Y., Bao, F., Kong, Y., Ren, Z., & Dai, Q. (2017). Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Transactions on Neural Networks and Learning Systems, 28(3), 653-664. doi:10.1109/tnnls.2016.2522401	es_ES
dc.description.references	Dong, M., Yang, X., Wu, Y., & Xue, J. H. (2018). Metric learning via maximizing the Lipschitz margin ratio. arXiv:1802.03464, 1-12.	es_ES
dc.description.references	Driessens, K., Ramon, J., & Gärtner, T. (2006). Graph kernels and Gaussian processes for relational reinforcement learning. Machine Learning, 64(1-3), 91-119. doi:10.1007/s10994-006-8258-y	es_ES
dc.description.references	Dunis, C. L., Rosillo, R., de la Fuente, D., & Pino, R. (2012). Forecasting IBEX-35 moves using support vector machines. Neural Computing and Applications, 23(1), 229-236. doi:10.1007/s00521-012-0821-9	es_ES
dc.description.references	Fischer, T., & Krauss, C. (2018). Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research, 270(2), 654-669. doi:10.1016/j.ejor.2017.11.054	es_ES
dc.description.references	Gerlein, E. A., McGinnity, M., Belatreche, A., & Coleman, S. (2016). Evaluating machine learning classification for financial trading: An empirical approach. Expert Systems with Applications, 54, 193-207. doi:10.1016/j.eswa.2016.01.018	es_ES
dc.description.references	Gottlieb, L.-A., Kontorovich, A., & Krauthgamer, R. (2014). Efficient Classification for Metric Data. IEEE Transactions on Information Theory, 60(9), 5750-5759. doi:10.1109/tit.2014.2339840	es_ES
dc.description.references	Guo, X.-G., Wang, J. L., Liao, F., & Teo, R. S. H. (2016). Distributed adaptive control for vehicular platoon with unknown dead-zone inputs and velocity/acceleration disturbances. International Journal of Robust and Nonlinear Control, 27(16), 2961-2981. doi:10.1002/rnc.3720	es_ES
dc.description.references	Jeong, G., & Kim, H. Y. (2019). Improving financial trading decisions using deep Q-learning: Predicting the number of shares, action strategies, and transfer learning. Expert Systems with Applications, 117, 125-138. doi:10.1016/j.eswa.2018.09.036	es_ES
dc.description.references	Kearney, C., & Liu, S. (2014). Textual sentiment in finance: A survey of methods and models. International Review of Financial Analysis, 33, 171-185. doi:10.1016/j.irfa.2014.02.006	es_ES
dc.description.references	Lahmiri, S. (2016). A variational mode decompoisition approach for analysis and forecasting of economic and financial time series. Expert Systems with Applications, 55, 268-273. doi:10.1016/j.eswa.2016.02.025	es_ES
dc.description.references	Lee, T. K., Cho, J. H., Kwon, D. S., & Sohn, S. Y. (2019). Global stock market investment strategies based on financial network indicators using machine learning techniques. Expert Systems with Applications, 117, 228-242. doi:10.1016/j.eswa.2018.09.005	es_ES
dc.description.references	Lee, J. W., Park, J., O, J., Lee, J., & Hong, E. (2007). A Multiagent Approach to $Q$-Learning for Daily Stock Trading. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, 37(6), 864-877. doi:10.1109/tsmca.2007.904825	es_ES
dc.description.references	Li, Y., Jiang, W., Yang, L., & Wu, T. (2018). On neural networks and learning systems for business computing. Neurocomputing, 275, 1150-1159. doi:10.1016/j.neucom.2017.09.054	es_ES
dc.description.references	Liu, F., & Wang, J. (2012). Fluctuation prediction of stock market index by Legendre neural network with random time strength function. Neurocomputing, 83, 12-21. doi:10.1016/j.neucom.2011.09.033	es_ES
dc.description.references	Loughran, T., & McDonald, B. (2016). Textual Analysis in Accounting and Finance: A Survey. Journal of Accounting Research, 54(4), 1187-1230. doi:10.1111/1475-679x.12123	es_ES
dc.description.references	Lu, C.-J., Lee, T.-S., & Chiu, C.-C. (2009). Financial time series forecasting using independent component analysis and support vector regression. Decision Support Systems, 47(2), 115-125. doi:10.1016/j.dss.2009.02.001	es_ES
dc.description.references	Mahmoudi, N., Docherty, P., & Moscato, P. (2018). Deep neural networks understand investors better. Decision Support Systems, 112, 23-34. doi:10.1016/j.dss.2018.06.002	es_ES
dc.description.references	Maringer, D., & Ramtohul, T. (2011). Regime-switching recurrent reinforcement learning for investment decision making. Computational Management Science, 9(1), 89-107. doi:10.1007/s10287-011-0131-1	es_ES
dc.description.references	McShane, E. J. (1934). Extension of range of functions. Bulletin of the American Mathematical Society, 40(12), 837-842. doi:10.1090/s0002-9904-1934-05978-0	es_ES
dc.description.references	Milman, V. A. (1999). Absolutely minimal extensions of functions on metric spaces. Sbornik: Mathematics, 190(6), 859-885. doi:10.1070/sm1999v190n06abeh000409	es_ES
dc.description.references	Moghaddam, A. H., Moghaddam, M. H., & Esfandyari, M. (2016). Stock market index prediction using artificial neural network. Journal of Economics, Finance and Administrative Science, 21(41), 89-93. doi:10.1016/j.jefas.2016.07.002	es_ES
dc.description.references	Moody, J., & Saffell, M. (2001). Learning to trade via direct reinforcement. IEEE Transactions on Neural Networks, 12(4), 875-889. doi:10.1109/72.935097	es_ES
dc.description.references	Khadjeh Nassirtoussi, A., Aghabozorgi, S., Ying Wah, T., & Ngo, D. C. L. (2014). Text mining for market prediction: A systematic review. Expert Systems with Applications, 41(16), 7653-7670. doi:10.1016/j.eswa.2014.06.009	es_ES
dc.description.references	Park, H., Sim, M. K., & Choi, D. G. (2019). An intelligent financial portfolio trading strategy using deep Q-learning. arXiv:1907.03665, 1-39.	es_ES
dc.description.references	Patel, J., Shah, S., Thakkar, P., & Kotecha, K. (2015). Predicting stock market index using fusion of machine learning techniques. Expert Systems with Applications, 42(4), 2162-2172. doi:10.1016/j.eswa.2014.10.031	es_ES
dc.description.references	Pendharkar, P. C., & Cusatis, P. (2018). Trading financial indices with reinforcement learning agents. Expert Systems with Applications, 103, 1-13. doi:10.1016/j.eswa.2018.02.032	es_ES
dc.description.references	Romaguera, S., & Sanchis, M. (2000). Semi-Lipschitz Functions and Best Approximation in Quasi-Metric Spaces. Journal of Approximation Theory, 103(2), 292-301. doi:10.1006/jath.1999.3439	es_ES
dc.description.references	Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85-117. doi:10.1016/j.neunet.2014.09.003	es_ES
dc.description.references	Sezer, O. B., & Ozbayoglu, A. M. (2018). Algorithmic financial trading with deep convolutional neural networks: Time series to image conversion approach. Applied Soft Computing, 70, 525-538. doi:10.1016/j.asoc.2018.04.024	es_ES
dc.description.references	Tkáč, M., & Verner, R. (2016). Artificial neural networks in business: Two decades of research. Applied Soft Computing, 38, 788-804. doi:10.1016/j.asoc.2015.09.040	es_ES
dc.description.references	Ticknor, J. L. (2013). A Bayesian regularized artificial neural network for stock market forecasting. Expert Systems with Applications, 40(14), 5501-5506. doi:10.1016/j.eswa.2013.04.013	es_ES
dc.description.references	Wang, B., Huang, H., & Wang, X. (2012). A novel text mining approach to financial time series forecasting. Neurocomputing, 83, 136-145. doi:10.1016/j.neucom.2011.12.013	es_ES
dc.description.references	Wang, B., Huang, H., & Wang, X. (2011). A support vector machine based MSM model for financial short-term volatility forecasting. Neural Computing and Applications, 22(1), 21-28. doi:10.1007/s00521-011-0742-z	es_ES
dc.description.references	Xiao, G., Zhang, H., Luo, Y., & Qu, Q. (2017). General value iteration based reinforcement learning for solving optimal tracking control problem of continuous–time affine nonlinear systems. Neurocomputing, 245, 114-123. doi:10.1016/j.neucom.2017.03.038	es_ES
dc.description.references	Yeh, C.-Y., Huang, C.-W., & Lee, S.-J. (2011). A multiple-kernel support vector regression approach for stock market price forecasting. Expert Systems with Applications, 38(3), 2177-2186. doi:10.1016/j.eswa.2010.08.004	es_ES
dc.description.references	Zhang, X., Hu, Y., Xie, K., Zhang, W., Su, L., & Liu, M. (2015). An evolutionary trend reversion model for stock trading rule discovery. Knowledge-Based Systems, 79, 27-35. doi:10.1016/j.knosys.2014.08.010	es_ES
dc.description.references	Zhang, J., & Maringer, D. (2015). Using a Genetic Algorithm to Improve Recurrent Reinforcement Learning for Equity Trading. Computational Economics, 47(4), 551-567. doi:10.1007/s10614-015-9490-y	es_ES
dc.description.references	Zhiqiang, G., Huaiqing, W., & Quan, L. (2012). Financial time series forecasting using LPP and SVM optimized by PSO. Soft Computing, 17(5), 805-818. doi:10.1007/s00500-012-0953-y	es_ES
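
The abstract describes three ingredients: a metric built as a linear combination of a Euclidean distance and an angular component, McShane and Whitney extensions of a reward function known on a finite set of states, and new "dream" states obtained by interpolating real states with some randomness. The following Python sketch illustrates how these pieces could fit together. It is not the authors' implementation: the weights `alpha` and `beta`, the use of the arccosine angle as the angular component, the interpolation rule in `dream`, the sample data, and all function names are assumptions made for illustration, since the record does not give the exact definitions.

```python
import math
import random


def distance(x, y, alpha=1.0, beta=1.0):
    """Assumed metric: alpha * Euclidean distance + beta * angle between x and y."""
    if x == y:
        return 0.0
    norm_x, norm_y = math.hypot(*x), math.hypot(*y)
    if norm_x == 0.0 or norm_y == 0.0:
        raise ValueError("the angular component is undefined for the zero vector")
    cos = sum(a * b for a, b in zip(x, y)) / (norm_x * norm_y)
    angle = math.acos(max(-1.0, min(1.0, cos)))  # clamp to absorb rounding error
    return alpha * math.dist(x, y) + beta * angle


def lipschitz_constant(states, rewards):
    """Smallest L with |r_i - r_j| <= L * d(s_i, s_j) on the known (distinct) states."""
    n = len(states)
    return max(
        abs(rewards[i] - rewards[j]) / distance(states[i], states[j])
        for i in range(n)
        for j in range(i + 1, n)
    )


def mcshane(x, states, rewards, L):
    """Lower extension: sup over known states of r - L * d(x, s)."""
    return max(r - L * distance(x, s) for s, r in zip(states, rewards))


def whitney(x, states, rewards, L):
    """Upper extension: inf over known states of r + L * d(x, s)."""
    return min(r + L * distance(x, s) for s, r in zip(states, rewards))


def dream(s1, s2, rng, noise=0.01):
    """Hypothetical dream rule: random convex combination of two states plus noise."""
    t = rng.random()
    return tuple(t * a + (1 - t) * b + rng.gauss(0.0, noise) for a, b in zip(s1, s2))


# Made-up example data: three known states with observed rewards.
rng = random.Random(0)
states = [(1.0, 0.5), (1.2, 0.4), (0.9, 0.8)]
rewards = [0.3, -0.1, 0.5]

L = lipschitz_constant(states, rewards)
new_state = dream(states[0], states[1], rng)
lower = mcshane(new_state, states, rewards, L)
upper = whitney(new_state, states, rewards, L)
estimate = 0.5 * (lower + upper)  # one simple way to pick a value between the bounds
```

Both extensions agree with the observed rewards on the known states and keep the same Lipschitz constant, so any Lipschitz-preserving estimate of the reward at a dreamed state lies between `lower` and `upper`. Taking the midpoint is only one possible choice, and the naming convention for which formula is "McShane" and which is "Whitney" varies across the literature.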

This item appears in the following collection(s)

Articles, conferences, monographs [48360]


RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia
