- -

Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Calabuig, J. M. es_ES
dc.contributor.author Falciani, H. es_ES
dc.contributor.author Sánchez Pérez, Enrique Alfonso es_ES
dc.date.accessioned 2021-09-16T03:31:51Z
dc.date.available 2021-09-16T03:31:51Z
dc.date.issued 2020-07-20 es_ES
dc.identifier.issn 0925-2312 es_ES
dc.identifier.uri http://hdl.handle.net/10251/172597
dc.description.abstract [EN] We consider a quasi-metric topological structure for the construction of a new reinforcement learning model in the framework of financial markets. It is based on a Lipschitz type extension of reward functions defined in metric spaces. Specifically, the McShane and Whitney extensions are considered for a reward function which is defined by the total evaluation of the benefits produced by the investment decision at a given time. We define the metric as a linear combination of a Euclidean distance and an angular metric component. All information about the evolution of the system from the beginning of the time interval is used to support the extension of the reward function, but in addition this data set is enriched by adding some artificially produced states. Thus, the main novelty of our method is the way we produce more states-which we call "dreams"-to enrich learning. Using some known states of the dynamical system that represents the evolution of the financial market, we use our technique to simulate new states by interpolating real states and introducing some random variables. These new states are used to feed a learning algorithm designed to improve the investment strategy by following a typical reinforcement learning scheme. (C) 2020 Elsevier B.V. All rights reserved. es_ES
dc.description.sponsorship This work was supported by the Ministerio de Ciencia, Innovacion y Universidades, Agencial Estatal de Investigaciones and FEDER (Spain) (grant number MTM2016-77054-C2-1-P.) es_ES
dc.language Inglés es_ES
dc.publisher Elsevier es_ES
dc.relation.ispartof Neurocomputing es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Pseudo-metric es_ES
dc.subject Reinforcement learning es_ES
dc.subject Lipschitz extension es_ES
dc.subject Mathematical economics es_ES
dc.subject Financial market es_ES
dc.subject Model es_ES
dc.subject.classification MATEMATICA APLICADA es_ES
dc.title Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1016/j.neucom.2020.02.052 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//MTM2016-77054-C2-1-P/ES/ANALISIS NO LINEAL, INTEGRACION VECTORIAL Y APLICACIONES EN CIENCIAS DE LA INFORMACION/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Matemática Aplicada - Departament de Matemàtica Aplicada es_ES
dc.description.bibliographicCitation Calabuig, JM.; Falciani, H.; Sánchez Pérez, EA. (2020). Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets. Neurocomputing. 398:172-184. https://doi.org/10.1016/j.neucom.2020.02.052 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1016/j.neucom.2020.02.052 es_ES
dc.description.upvformatpinicio 172 es_ES
dc.description.upvformatpfin 184 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 398 es_ES
dc.relation.pasarela S\424071 es_ES
dc.contributor.funder European Regional Development Fund es_ES
dc.contributor.funder MINISTERIO DE ECONOMÍA Y COMPETITIVIDAD es_ES
dc.description.references Aliprantis, C., & Burkinshaw, O. (2003). Locally Solid Riesz Spaces with Applications to Economics. Mathematical Surveys and Monographs. doi:10.1090/surv/105 es_ES
dc.description.references Almahdi, S., & Yang, S. Y. (2017). An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown. Expert Systems with Applications, 87, 267-279. doi:10.1016/j.eswa.2017.06.023 es_ES
dc.description.references Aronsson, G. (1967). Extension of functions satisfying lipschitz conditions. Arkiv för Matematik, 6(6), 551-561. doi:10.1007/bf02591928 es_ES
dc.description.references Bekiros, S. D. (2010). Heterogeneous trading strategies with adaptive fuzzy Actor–Critic reinforcement learning: A behavioral approach. Journal of Economic Dynamics and Control, 34(6), 1153-1170. doi:10.1016/j.jedc.2010.01.015 es_ES
dc.description.references Bekiros, S. D. (2015). Heuristic learning in intraday trading under uncertainty. Journal of Empirical Finance, 30, 34-49. doi:10.1016/j.jempfin.2014.11.002 es_ES
dc.description.references Bertoluzzo, F., & Corazza, M. (2012). Testing Different Reinforcement Learning Configurations for Financial Trading: Introduction and Applications. Procedia Economics and Finance, 3, 68-77. doi:10.1016/s2212-5671(12)00122-0 es_ES
dc.description.references Cavalcante, R. C., Brasileiro, R. C., Souza, V. L. F., Nobrega, J. P., & Oliveira, A. L. I. (2016). Computational Intelligence and Financial Markets: A Survey and Future Directions. Expert Systems with Applications, 55, 194-211. doi:10.1016/j.eswa.2016.02.006 es_ES
dc.description.references Chen, Y., & Hao, Y. (2017). A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Systems with Applications, 80, 340-355. doi:10.1016/j.eswa.2017.02.044 es_ES
dc.description.references Chong, E., Han, C., & Park, F. C. (2017). Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies. Expert Systems with Applications, 83, 187-205. doi:10.1016/j.eswa.2017.04.030 es_ES
dc.description.references Das, S. P., & Padhy, S. (2015). A novel hybrid model using teaching–learning-based optimization and a support vector machine for commodity futures index forecasting. International Journal of Machine Learning and Cybernetics, 9(1), 97-111. doi:10.1007/s13042-015-0359-0 es_ES
dc.description.references Defoort, M., Polyakov, A., Demesure, G., Djemai, M., & Veluvolu, K. (2015). Leader‐follower fixed‐time consensus for multi‐agent systems with unknown non‐linear inherent dynamics. IET Control Theory & Applications, 9(14), 2165-2170. doi:10.1049/iet-cta.2014.1301 es_ES
dc.description.references Dempster, M. A. H., & Leemans, V. (2006). An automated FX trading system using adaptive reinforcement learning. Expert Systems with Applications, 30(3), 543-552. doi:10.1016/j.eswa.2005.10.012 es_ES
dc.description.references Deng, Y., Bao, F., Kong, Y., Ren, Z., & Dai, Q. (2017). Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Transactions on Neural Networks and Learning Systems, 28(3), 653-664. doi:10.1109/tnnls.2016.2522401 es_ES
dc.description.references M. Dong, X. Yang, Y. Wu, J.H. Xue, Metric learning via maximizing the Lipschitz margin ratio, arXiv:1802.03464 (2018) 1–12. es_ES
dc.description.references Driessens, K., Ramon, J., & Gärtner, T. (2006). Graph kernels and Gaussian processes for relational reinforcement learning. Machine Learning, 64(1-3), 91-119. doi:10.1007/s10994-006-8258-y es_ES
dc.description.references Dunis, C. L., Rosillo, R., de la Fuente, D., & Pino, R. (2012). Forecasting IBEX-35 moves using support vector machines. Neural Computing and Applications, 23(1), 229-236. doi:10.1007/s00521-012-0821-9 es_ES
dc.description.references Fischer, T., & Krauss, C. (2018). Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research, 270(2), 654-669. doi:10.1016/j.ejor.2017.11.054 es_ES
dc.description.references Gerlein, E. A., McGinnity, M., Belatreche, A., & Coleman, S. (2016). Evaluating machine learning classification for financial trading: An empirical approach. Expert Systems with Applications, 54, 193-207. doi:10.1016/j.eswa.2016.01.018 es_ES
dc.description.references Gottlieb, L.-A., Kontorovich, A., & Krauthgamer, R. (2014). Efficient Classification for Metric Data. IEEE Transactions on Information Theory, 60(9), 5750-5759. doi:10.1109/tit.2014.2339840 es_ES
dc.description.references Guo, X.-G., Wang, J. L., Liao, F., & Teo, R. S. H. (2016). Distributed adaptive control for vehicular platoon with unknown dead-zone inputs and velocity/acceleration disturbances. International Journal of Robust and Nonlinear Control, 27(16), 2961-2981. doi:10.1002/rnc.3720 es_ES
dc.description.references Jeong, G., & Kim, H. Y. (2019). Improving financial trading decisions using deep Q-learning: Predicting the number of shares, action strategies, and transfer learning. Expert Systems with Applications, 117, 125-138. doi:10.1016/j.eswa.2018.09.036 es_ES
dc.description.references Kearney, C., & Liu, S. (2014). Textual sentiment in finance: A survey of methods and models. International Review of Financial Analysis, 33, 171-185. doi:10.1016/j.irfa.2014.02.006 es_ES
dc.description.references Lahmiri, S. (2016). A variational mode decompoisition approach for analysis and forecasting of economic and financial time series. Expert Systems with Applications, 55, 268-273. doi:10.1016/j.eswa.2016.02.025 es_ES
dc.description.references Lee, T. K., Cho, J. H., Kwon, D. S., & Sohn, S. Y. (2019). Global stock market investment strategies based on financial network indicators using machine learning techniques. Expert Systems with Applications, 117, 228-242. doi:10.1016/j.eswa.2018.09.005 es_ES
dc.description.references Lee, J. W., Park, J., O, J., Lee, J., & Hong, E. (2007). A Multiagent Approach to $Q$-Learning for Daily Stock Trading. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, 37(6), 864-877. doi:10.1109/tsmca.2007.904825 es_ES
dc.description.references Li, Y., Jiang, W., Yang, L., & Wu, T. (2018). On neural networks and learning systems for business computing. Neurocomputing, 275, 1150-1159. doi:10.1016/j.neucom.2017.09.054 es_ES
dc.description.references Liu, F., & Wang, J. (2012). Fluctuation prediction of stock market index by Legendre neural network with random time strength function. Neurocomputing, 83, 12-21. doi:10.1016/j.neucom.2011.09.033 es_ES
dc.description.references LOUGHRAN, T., & MCDONALD, B. (2016). Textual Analysis in Accounting and Finance: A Survey. Journal of Accounting Research, 54(4), 1187-1230. doi:10.1111/1475-679x.12123 es_ES
dc.description.references Lu, C.-J., Lee, T.-S., & Chiu, C.-C. (2009). Financial time series forecasting using independent component analysis and support vector regression. Decision Support Systems, 47(2), 115-125. doi:10.1016/j.dss.2009.02.001 es_ES
dc.description.references Mahmoudi, N., Docherty, P., & Moscato, P. (2018). Deep neural networks understand investors better. Decision Support Systems, 112, 23-34. doi:10.1016/j.dss.2018.06.002 es_ES
dc.description.references Maringer, D., & Ramtohul, T. (2011). Regime-switching recurrent reinforcement learning for investment decision making. Computational Management Science, 9(1), 89-107. doi:10.1007/s10287-011-0131-1 es_ES
dc.description.references McShane, E. J. (1934). Extension of range of functions. Bulletin of the American Mathematical Society, 40(12), 837-842. doi:10.1090/s0002-9904-1934-05978-0 es_ES
dc.description.references Milman, V. A. (1999). Absolutely minimal extensions of functions on metric spaces. Sbornik: Mathematics, 190(6), 859-885. doi:10.1070/sm1999v190n06abeh000409 es_ES
dc.description.references Moghaddam, A. H., Moghaddam, M. H., & Esfandyari, M. (2016). Stock market index prediction using artificial neural network. Journal of Economics, Finance and Administrative Science, 21(41), 89-93. doi:10.1016/j.jefas.2016.07.002 es_ES
dc.description.references Moody, J., & Saffell, M. (2001). Learning to trade via direct reinforcement. IEEE Transactions on Neural Networks, 12(4), 875-889. doi:10.1109/72.935097 es_ES
dc.description.references Khadjeh Nassirtoussi, A., Aghabozorgi, S., Ying Wah, T., & Ngo, D. C. L. (2014). Text mining for market prediction: A systematic review. Expert Systems with Applications, 41(16), 7653-7670. doi:10.1016/j.eswa.2014.06.009 es_ES
dc.description.references H. Park, M.K. Sim, D.G. Choi, An intelligent financial portfolio trading strategy using deep q-learning, arXiv:1907.03665 (2019) 1–39. es_ES
dc.description.references Patel, J., Shah, S., Thakkar, P., & Kotecha, K. (2015). Predicting stock market index using fusion of machine learning techniques. Expert Systems with Applications, 42(4), 2162-2172. doi:10.1016/j.eswa.2014.10.031 es_ES
dc.description.references Pendharkar, P. C., & Cusatis, P. (2018). Trading financial indices with reinforcement learning agents. Expert Systems with Applications, 103, 1-13. doi:10.1016/j.eswa.2018.02.032 es_ES
dc.description.references Romaguera, S., & Sanchis, M. (2000). Semi-Lipschitz Functions and Best Approximation in Quasi-Metric Spaces. Journal of Approximation Theory, 103(2), 292-301. doi:10.1006/jath.1999.3439 es_ES
dc.description.references Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85-117. doi:10.1016/j.neunet.2014.09.003 es_ES
dc.description.references Sezer, O. B., & Ozbayoglu, A. M. (2018). Algorithmic financial trading with deep convolutional neural networks: Time series to image conversion approach. Applied Soft Computing, 70, 525-538. doi:10.1016/j.asoc.2018.04.024 es_ES
dc.description.references Tkáč, M., & Verner, R. (2016). Artificial neural networks in business: Two decades of research. Applied Soft Computing, 38, 788-804. doi:10.1016/j.asoc.2015.09.040 es_ES
dc.description.references Ticknor, J. L. (2013). A Bayesian regularized artificial neural network for stock market forecasting. Expert Systems with Applications, 40(14), 5501-5506. doi:10.1016/j.eswa.2013.04.013 es_ES
dc.description.references Wang, B., Huang, H., & Wang, X. (2012). A novel text mining approach to financial time series forecasting. Neurocomputing, 83, 136-145. doi:10.1016/j.neucom.2011.12.013 es_ES
dc.description.references Wang, B., Huang, H., & Wang, X. (2011). A support vector machine based MSM model for financial short-term volatility forecasting. Neural Computing and Applications, 22(1), 21-28. doi:10.1007/s00521-011-0742-z es_ES
dc.description.references Xiao, G., Zhang, H., Luo, Y., & Qu, Q. (2017). General value iteration based reinforcement learning for solving optimal tracking control problem of continuous–time affine nonlinear systems. Neurocomputing, 245, 114-123. doi:10.1016/j.neucom.2017.03.038 es_ES
dc.description.references Yeh, C.-Y., Huang, C.-W., & Lee, S.-J. (2011). A multiple-kernel support vector regression approach for stock market price forecasting. Expert Systems with Applications, 38(3), 2177-2186. doi:10.1016/j.eswa.2010.08.004 es_ES
dc.description.references Zhang, X., Hu, Y., Xie, K., Zhang, W., Su, L., & Liu, M. (2015). An evolutionary trend reversion model for stock trading rule discovery. Knowledge-Based Systems, 79, 27-35. doi:10.1016/j.knosys.2014.08.010 es_ES
dc.description.references Zhang, J., & Maringer, D. (2015). Using a Genetic Algorithm to Improve Recurrent Reinforcement Learning for Equity Trading. Computational Economics, 47(4), 551-567. doi:10.1007/s10614-015-9490-y es_ES
dc.description.references Zhiqiang, G., Huaiqing, W., & Quan, L. (2012). Financial time series forecasting using LPP and SVM optimized by PSO. Soft Computing, 17(5), 805-818. doi:10.1007/s00500-012-0953-y es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem