<> <http://www.w3.org/2000/01/rdf-schema#comment> "The repository administrator has not yet configured an RDF license."^^<http://www.w3.org/2001/XMLSchema#string> .
<> <http://xmlns.com/foaf/0.1/primaryTopic> <https://discovery.ucl.ac.uk/id/eprint/10194559> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/AcademicArticle> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/Article> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/title> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/abstract> "In addressing control problems such as regulation and tracking through reinforcement learning (RL), it is often required to guarantee that the acquired policy meets essential performance and stability criteria such as a desired settling time and steady-state error before deployment. Motivated by this, we present a set of results and a systematic reward-shaping procedure that: 1) ensures the optimal policy generates trajectories that align with specified control requirements and 2) allows to assess whether any given policy satisfies them. We validate our approach through comprehensive numerical experiments conducted in two representative environments from OpenAI Gym: the Pendulum swing-up problem and the Lunar Lander. Utilizing both tabular and deep RL methods, our experiments consistently affirm the efficacy of our proposed framework, highlighting its effectiveness in ensuring policy adherence to the prescribed control requirements."^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/date> "2024-05-17" .
<https://discovery.ucl.ac.uk/id/document/1756062> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/Document> .
<https://discovery.ucl.ac.uk/id/org/ext-eef18fe0e3c5b50f1fdd38d83d36874a> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Organization> .
<https://discovery.ucl.ac.uk/id/org/ext-eef18fe0e3c5b50f1fdd38d83d36874a> <http://xmlns.com/foaf/0.1/name> "Institute of Electrical and Electronics Engineers"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/publisher> <https://discovery.ucl.ac.uk/id/org/ext-eef18fe0e3c5b50f1fdd38d83d36874a> .
<https://discovery.ucl.ac.uk/id/publication/ext-d92d2359ec65290e4c7a1f3624e4806a> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/Collection> .
<https://discovery.ucl.ac.uk/id/publication/ext-d92d2359ec65290e4c7a1f3624e4806a> <http://xmlns.com/foaf/0.1/name> "IEEE Transactions on Control Systems Technology"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/isPartOf> <https://discovery.ucl.ac.uk/id/publication/ext-d92d2359ec65290e4c7a1f3624e4806a> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/status> <http://purl.org/ontology/bibo/status/forthcoming> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/creator> <https://discovery.ucl.ac.uk/id/person/ext-0f4af34faffa058dd53780eafac96330> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/authorList> <https://discovery.ucl.ac.uk/id/eprint/10194559#authors> .
<https://discovery.ucl.ac.uk/id/eprint/10194559#authors> <http://www.w3.org/1999/02/22-rdf-syntax-ns#_1> <https://discovery.ucl.ac.uk/id/person/ext-0f4af34faffa058dd53780eafac96330> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/creator> <https://discovery.ucl.ac.uk/id/person/ext-51caa47d61e03bdbba0c130172507a5b> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/authorList> <https://discovery.ucl.ac.uk/id/eprint/10194559#authors> .
<https://discovery.ucl.ac.uk/id/eprint/10194559#authors> <http://www.w3.org/1999/02/22-rdf-syntax-ns#_2> <https://discovery.ucl.ac.uk/id/person/ext-51caa47d61e03bdbba0c130172507a5b> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/creator> <https://discovery.ucl.ac.uk/id/person/ext-6369a1ec190f6dbccc1a00e12aa51db2> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/authorList> <https://discovery.ucl.ac.uk/id/eprint/10194559#authors> .
<https://discovery.ucl.ac.uk/id/eprint/10194559#authors> <http://www.w3.org/1999/02/22-rdf-syntax-ns#_3> <https://discovery.ucl.ac.uk/id/person/ext-6369a1ec190f6dbccc1a00e12aa51db2> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/creator> <https://discovery.ucl.ac.uk/id/person/ext-7238c859388c6a46413916b704f82cc9> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/authorList> <https://discovery.ucl.ac.uk/id/eprint/10194559#authors> .
<https://discovery.ucl.ac.uk/id/eprint/10194559#authors> <http://www.w3.org/1999/02/22-rdf-syntax-ns#_4> <https://discovery.ucl.ac.uk/id/person/ext-7238c859388c6a46413916b704f82cc9> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/creator> <https://discovery.ucl.ac.uk/id/person/ext-215260b839c7d454ecbce75349b73388> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/ontology/bibo/authorList> <https://discovery.ucl.ac.uk/id/eprint/10194559#authors> .
<https://discovery.ucl.ac.uk/id/eprint/10194559#authors> <http://www.w3.org/1999/02/22-rdf-syntax-ns#_5> <https://discovery.ucl.ac.uk/id/person/ext-215260b839c7d454ecbce75349b73388> .
<https://discovery.ucl.ac.uk/id/person/ext-0f4af34faffa058dd53780eafac96330> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
<https://discovery.ucl.ac.uk/id/person/ext-0f4af34faffa058dd53780eafac96330> <http://xmlns.com/foaf/0.1/givenName> "Francesco"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-0f4af34faffa058dd53780eafac96330> <http://xmlns.com/foaf/0.1/familyName> "De Lellis"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-0f4af34faffa058dd53780eafac96330> <http://xmlns.com/foaf/0.1/name> "Francesco De Lellis"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-51caa47d61e03bdbba0c130172507a5b> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
<https://discovery.ucl.ac.uk/id/person/ext-51caa47d61e03bdbba0c130172507a5b> <http://xmlns.com/foaf/0.1/givenName> "Marco"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-51caa47d61e03bdbba0c130172507a5b> <http://xmlns.com/foaf/0.1/familyName> "Coraggio"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-51caa47d61e03bdbba0c130172507a5b> <http://xmlns.com/foaf/0.1/name> "Marco Coraggio"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-215260b839c7d454ecbce75349b73388> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
<https://discovery.ucl.ac.uk/id/person/ext-215260b839c7d454ecbce75349b73388> <http://xmlns.com/foaf/0.1/givenName> "Mario"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-215260b839c7d454ecbce75349b73388> <http://xmlns.com/foaf/0.1/familyName> "di Bernardo"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-215260b839c7d454ecbce75349b73388> <http://xmlns.com/foaf/0.1/name> "Mario di Bernardo"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-6369a1ec190f6dbccc1a00e12aa51db2> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
<https://discovery.ucl.ac.uk/id/person/ext-6369a1ec190f6dbccc1a00e12aa51db2> <http://xmlns.com/foaf/0.1/givenName> "Giovanni"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-6369a1ec190f6dbccc1a00e12aa51db2> <http://xmlns.com/foaf/0.1/familyName> "Russo"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-6369a1ec190f6dbccc1a00e12aa51db2> <http://xmlns.com/foaf/0.1/name> "Giovanni Russo"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-7238c859388c6a46413916b704f82cc9> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
<https://discovery.ucl.ac.uk/id/person/ext-7238c859388c6a46413916b704f82cc9> <http://xmlns.com/foaf/0.1/givenName> "Mirco"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-7238c859388c6a46413916b704f82cc9> <http://xmlns.com/foaf/0.1/familyName> "Musolesi"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-7238c859388c6a46413916b704f82cc9> <http://xmlns.com/foaf/0.1/name> "Mirco Musolesi"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/EPrint> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/ArticleEPrint> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://purl.org/dc/terms/isPartOf> <https://discovery.ucl.ac.uk/id/repository> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1756062> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1756062> <http://www.w3.org/2000/01/rdf-schema#label> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning (Text)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1756062> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10194559/2/Musolesi_Guaranteeing%20Control%20Requirements%20via%20Reward%20Shaping%20in%20Reinforcement%20Learning_AAM.pdf> .
<https://discovery.ucl.ac.uk/id/document/1756062> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10194559/2/Musolesi_Guaranteeing%20Control%20Requirements%20via%20Reward%20Shaping%20in%20Reinforcement%20Learning_AAM.pdf> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/2/Musolesi_Guaranteeing%20Control%20Requirements%20via%20Reward%20Shaping%20in%20Reinforcement%20Learning_AAM.pdf> <http://www.w3.org/2000/01/rdf-schema#label> "Musolesi_Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning_AAM.pdf"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1761145> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://www.w3.org/2000/01/rdf-schema#label> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://eprints.org/relation/islightboxThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10194559/3/lightbox.jpg> .
<https://discovery.ucl.ac.uk/id/document/1761145> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10194559/3/lightbox.jpg> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/3/lightbox.jpg> <http://www.w3.org/2000/01/rdf-schema#label> "lightbox.jpg"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1761146> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://www.w3.org/2000/01/rdf-schema#label> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://eprints.org/relation/ispreviewThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10194559/4/preview.jpg> .
<https://discovery.ucl.ac.uk/id/document/1761146> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10194559/4/preview.jpg> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/4/preview.jpg> <http://www.w3.org/2000/01/rdf-schema#label> "preview.jpg"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1761147> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://www.w3.org/2000/01/rdf-schema#label> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://eprints.org/relation/ismediumThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10194559/5/medium.jpg> .
<https://discovery.ucl.ac.uk/id/document/1761147> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10194559/5/medium.jpg> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/5/medium.jpg> <http://www.w3.org/2000/01/rdf-schema#label> "medium.jpg"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1761148> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://www.w3.org/2000/01/rdf-schema#label> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://eprints.org/relation/issmallThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10194559/6/small.jpg> .
<https://discovery.ucl.ac.uk/id/document/1761148> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10194559/6/small.jpg> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/6/small.jpg> <http://www.w3.org/2000/01/rdf-schema#label> "small.jpg"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1761608> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://www.w3.org/2000/01/rdf-schema#label> "Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://eprints.org/relation/isIndexCodesVersionOf> <https://discovery.ucl.ac.uk/id/document/1756062> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10194559/7/indexcodes.txt> .
<https://discovery.ucl.ac.uk/id/document/1761608> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10194559/7/indexcodes.txt> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/7/indexcodes.txt> <http://www.w3.org/2000/01/rdf-schema#label> "indexcodes.txt"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10194559> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <https://discovery.ucl.ac.uk/id/eprint/10194559/> .
<https://discovery.ucl.ac.uk/id/eprint/10194559/> <http://purl.org/dc/elements/1.1/title> "HTML Summary of #10194559 \n\nGuaranteeing Control Requirements via Reward Shaping in Reinforcement Learning\n\n" .
<https://discovery.ucl.ac.uk/id/eprint/10194559/> <http://purl.org/dc/elements/1.1/format> "text/html" .
<https://discovery.ucl.ac.uk/id/eprint/10194559/> <http://xmlns.com/foaf/0.1/primaryTopic> <https://discovery.ucl.ac.uk/id/eprint/10194559> .