Please use this identifier to cite or link to this item:
http://hdl.handle.net/10071/27772Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Gil, P. | - |
| dc.contributor.author | Nunes, L. | - |
| dc.date.accessioned | 2023-02-07T12:47:39Z | - |
| dc.date.available | 2023-02-07T12:47:39Z | - |
| dc.date.issued | 2013-01-01 | - |
| dc.identifier.citation | Gil, P., & Nunes, L. (2013). Hierarchical reinforcement learning using path clustering. In 2013 8th Iberian Conference on Information Systems and Technologies (CISTI), 6615769. IEEE. | - |
| dc.identifier.isbn | 978-989-98434-0-0 | - |
| dc.identifier.issn | 2166-0727 | - |
| dc.identifier.uri | http://hdl.handle.net/10071/27772 | - |
| dc.description.abstract | In this paper we intend to study the possibility to improve the performance of the Q-Learning algorithm, by automatically finding subgoals and making better use of the acquired knowledge. This research explores a method that allows an agent to gather information about sequences of states that lead to a goal, detect classes of common sequences and introduce the states at the end of these sequences as subgoals. We use the taxiproblem (a standard in Hierarchical Reinforcement Learning literature) and conclude that, even though this problem's scale is relatively small, in most of the cases subgoals do improve the learning speed, achieving relatively good results faster than standard Q-Learning. We propose a specific iteration interval as the most appropriate to insert subgoals in the learning process. We also found that early adoption of subgoals may lead to suboptimal learning. The extension to more challenging problems is an interesting subject for future work. | eng |
| dc.language.iso | eng | - |
| dc.publisher | IEEE | - |
| dc.relation.ispartof | 2013 8th Iberian Conference on Information Systems and Technologies (CISTI) | - |
| dc.rights | openAccess | - |
| dc.subject | Hierarchical reinforcement learning | eng |
| dc.subject | Q-learning | eng |
| dc.subject | Performance | eng |
| dc.subject | Subgoals | eng |
| dc.title | Hierarchical reinforcement learning using path clustering | eng |
| dc.type | conferenceObject | - |
| dc.event.title | 8th Iberian Conference on Information Systems and Technologies, CISTI 2013 | - |
| dc.event.type | Conferência | pt |
| dc.event.location | Lisboa | eng |
| dc.event.date | 2013 | - |
| dc.peerreviewed | yes | - |
| dc.date.updated | 2023-02-07T12:46:16Z | - |
| dc.description.version | info:eu-repo/semantics/acceptedVersion | - |
| dc.subject.fos | Domínio/Área Científica::Ciências Naturais::Ciências da Computação e da Informação | por |
| iscte.identifier.ciencia | https://ciencia.iscte-iul.pt/id/ci-pub-42667 | - |
| iscte.alternateIdentifiers.wos | WOS:WOS:000345737600070 | - |
| iscte.alternateIdentifiers.scopus | 2-s2.0-84887948781 | - |
| Appears in Collections: | IT-CRI - Comunicações a conferências internacionais | |
Files in This Item:
| File | Size | Format | |
|---|---|---|---|
| conferenceobject_42667.pdf | 647,53 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.












