Learning with con gurable operators and RL-based heuristics

Martínez Plumed, Fernando; Ferri Ramírez, César; Hernández Orallo, José; Ramírez Quintana, María José

doi:10.1007/978-3-642-37382-4_1

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Learning with con gurable operators and RL-based heuristics

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: NFCMP2012_LNCS (2).pdf

Tamaño: 712.3Kb

Formato: PDF

Descripción: Versión del Autor.

Abrir

Nombre: NFCMP2012-LNCS-bf ...

Tamaño: 415.7Kb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Martínez Plumed, Fernando	es_ES
dc.contributor.author	Ferri Ramírez, César	es_ES
dc.contributor.author	Hernández Orallo, José	es_ES
dc.contributor.author	Ramírez Quintana, María José	es_ES
dc.date.accessioned	2014-05-08T12:06:02Z
dc.date.issued	2013-10
dc.identifier.isbn	978-3-642-37381-7
dc.identifier.issn	0302-9743
dc.identifier.uri	http://hdl.handle.net/10251/37322
dc.description.abstract	In this paper, we push forward the idea of machine learning systems for which the operators can be modi ed and netuned for each problem. This allows us to propose a learning paradigm where users can write (or adapt) their operators, according to the problem, data representation and the way the information should be navigated. To achieve this goal, data instances, background knowledge, rules, programs and operators are all written in the same functional language, Erlang. Since changing operators a ect how the search space needs to be explored, heuristics are learnt as a result of a decision process based on reinforcement learning where each action is de ned as a choice of operator and rule. As a result, the architecture can be seen as a `system for writing machine learning systems' or to explore new operators.	es_ES
dc.description.sponsorship	This work was supported by the MEC projects CONSOLIDER-INGENIO 26706 and TIN 2010-21062-C02-02, GVA project PROMETEO/2008/051, and the REFRAME project granted by the European Coordinated Research on Long-term Challenges in Information and Communication Sciences & Technologies ERA-Net (CHIST-ERA), and funded by the Ministerio de Econom´ıa y Competitividad in Spain. Also, F. Mart´ınez-Plumed is supported by FPI-ME grant BES-2011-045099
dc.format.extent	16	es_ES
dc.language	Inglés	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	New Frontiers in Mining Complex Patterns	es_ES
dc.relation.ispartofseries	Lecture Notes in Computer Science;7765
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Machine learning operators	es_ES
dc.subject	Complex data	es_ES
dc.subject	Heuristics	es_ES
dc.subject	Inducting programming	es_ES
dc.subject	Reinforcement learning	es_ES
dc.subject	Erlang	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Learning with con gurable operators and RL-based heuristics	es_ES
dc.type	Capítulo de libro	es_ES
dc.identifier.doi	10.1007/978-3-642-37382-4_1
dc.relation.projectID	info:eu-repo/grantAgreement/Generalitat Valenciana//PROMETEO08%2F2008%2F051/ES/Advances on Agreement Technologies for Computational Entities (atforce)/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MEC/CONSOLIDER-INGENIO/26706	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2010-21062-C02-02/ES/SWEETLOGICS-UPV/
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//BES-2011-045099/ES/BES-2011-045099/
dc.rights.accessRights	Abierto	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Martínez Plumed, F.; Ferri Ramírez, C.; Hernández Orallo, J.; Ramírez Quintana, MJ. (2013). Learning with con gurable operators and RL-based heuristics. En New Frontiers in Mining Complex Patterns. Springer Verlag (Germany). 7765:1-16. https://doi.org/10.1007/978-3-642-37382-4_1	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.conferencename	First International Workshop, NFMCP 2012 Held in Conjunction with ECML/PKDD 2012	es_ES
dc.relation.conferencedate	September 24, 2012	es_ES
dc.relation.conferenceplace	Bristol, UK	es_ES
dc.relation.publisherversion	http://dx.doi.org/10.1007/978-3-642-37382-4_1	es_ES
dc.description.upvformatpinicio	1	es_ES
dc.description.upvformatpfin	16	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	7765	es_ES
dc.relation.senia	238785
dc.contributor.funder	Ministerio de Ciencia e Innovación
dc.contributor.funder	Generalitat Valenciana
dc.contributor.funder	Ministerio de Educación y Ciencia
dc.description.references	Armstrong, J.: A history of erlang. In: Proceedings of the Third ACM SIGPLAN Conf. on History of Programming Languages, HOPL III, pp. 1–26. ACM (2007)	es_ES
dc.description.references	Brazdil, P., Giraud-Carrier: Metalearning: Concepts and systems. In: Metalearning. Cognitive Technologies, pp. 1–10. Springer, Heidelberg (2009)	es_ES
dc.description.references	Daumé III, H., Langford, J.: Search-based structured prediction (2009)	es_ES
dc.description.references	Dietterich, T., Domingos, P., Getoor, L., Muggleton, S., Tadepalli, P.: Structured machine learning: the next ten years. Machine Learning 73, 3–23 (2008)	es_ES
dc.description.references	Dietterich, T.G., Lathrop, R., Lozano-Perez, T.: Solving the multiple-instance problem with axis-parallel rectangles. Artificial Intelligence 89, 31–71 (1997)	es_ES
dc.description.references	Džeroski, S.: Towards a general framework for data mining. In: Džeroski, S., Struyf, J. (eds.) KDID 2006. LNCS, vol. 4747, pp. 259–300. Springer, Heidelberg (2007)	es_ES
dc.description.references	Dzeroski, S., De Raedt, L., Driessens, K.: Relational reinforcement learning. Machine Learning 43, 7–52 (2001), 10.1023/A:1007694015589	es_ES
dc.description.references	Dzeroski, S., Lavrac, N. (eds.): Relational Data Mining. Springer (2001)	es_ES
dc.description.references	Estruch, V., Ferri, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Similarity functions for structured data. an application to decision trees. Inteligencia Artificial, Revista Iberoamericana de Inteligencia Artificial 10(29), 109–121 (2006)	es_ES
dc.description.references	Estruch, V., Ferri, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Web categorisation using distance-based decision trees. ENTCS 157(2), 35–40 (2006)	es_ES
dc.description.references	Estruch, V., Ferri, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Bridging the Gap between Distance and Generalisation. Computational Intelligence (2012)	es_ES
dc.description.references	Ferri-Ramírez, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Incremental learning of functional logic programs. In: Kuchen, H., Ueda, K. (eds.) FLOPS 2001. LNCS, vol. 2024, pp. 233–247. Springer, Heidelberg (2001)	es_ES
dc.description.references	Gärtner, T.: Kernels for Structured Data. PhD thesis, Universitat Bonn (2005)	es_ES
dc.description.references	Holland, J.H., Booker, L.B., Colombetti, M., Dorigo, M., Goldberg, D.E., Forrest, S., Riolo, R.L., Smith, R.E., Lanzi, P.L., Stolzmann, W., Wilson, S.W.: What is a learning classifier system? In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 1999. LNCS (LNAI), vol. 1813, pp. 3–32. Springer, Heidelberg (2000)	es_ES
dc.description.references	Holmes, J.H., Lanzi, P., Stolzmann, W.: Learning classifier systems: New models, successful applications. Information Processing Letters (2002)	es_ES
dc.description.references	Kitzelmann, E.: Inductive programming: A survey of program synthesis techniques. In: Schmid, U., Kitzelmann, E., Plasmeijer, R. (eds.) AAIP 2009. LNCS, vol. 5812, pp. 50–73. Springer, Heidelberg (2010)	es_ES
dc.description.references	Koller, D., Sahami, M.: Hierarchically classifying documents using very few words. In: Proceedings of the Fourteenth International Conference on Machine Learning, ICML 1997, pp. 170–178. Morgan Kaufmann Publishers Inc., San Francisco (1997)	es_ES
dc.description.references	Lafferty, J., McCallum, A.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML 2001, pp. 282–289 (2001)	es_ES
dc.description.references	Lloyd, J.W.: Knowledge representation, computation, and learning in higher-order logic (2001)	es_ES
dc.description.references	Maes, F., Denoyer, L., Gallinari, P.: Structured prediction with reinforcement learning. Machine Learning Journal 77(2-3), 271–301 (2009)	es_ES
dc.description.references	Martínez-Plumed, F., Estruch, V., Ferri, C., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Newton trees. In: Li, J. (ed.) AI 2010. LNCS, vol. 6464, pp. 174–183. Springer, Heidelberg (2010)	es_ES
dc.description.references	Muggleton, S.: Inverse entailment and Progol. New Generation Computing (1995)	es_ES
dc.description.references	Muggleton, S.H.: Inductive logic programming: Issues, results, and the challenge of learning language in logic. Artificial Intelligence 114(1-2), 283–296 (1999)	es_ES
dc.description.references	Plotkin, G.: A note on inductive generalization. Machine Intelligence 5 (1970)	es_ES
dc.description.references	Schmidhuber, J.: Optimal ordered problem solver. Maching Learning 54(3), 211–254 (2004)	es_ES
dc.description.references	Srinivasan, A.: The Aleph Manual (2004)	es_ES
dc.description.references	Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (1998)	es_ES
dc.description.references	Tadepalli, P., Givan, R., Driessens, K.: Relational reinforcement learning: An overview. In: Proc. of the Workshop on Relational Reinforcement Learning (2004)	es_ES
dc.description.references	Tamaddoni-Nezhad, A., Muggleton, S.: A genetic algorithms approach to ILP. In: Matwin, S., Sammut, C. (eds.) ILP 2002. LNCS (LNAI), vol. 2583, pp. 285–300. Springer, Heidelberg (2003)	es_ES
dc.description.references	Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: ICML (2004)	es_ES
dc.description.references	Wallace, C.S., Dowe, D.L.: Refinements of MDL and MML coding. Comput. J. 42(4), 330–337 (1999)	es_ES
dc.description.references	Watkins, C., Dayan, P.: Q-learning. Machine Learning 8, 279–292 (1992)	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [46836]

Mostrar el registro sencillo del ítem

Learning with con gurable operators and RL-based heuristics

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Learning with con gurable operators and RL-based heuristics

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)