CMA_MOMAB

Version 1.0.3 (2.44 MB) by Madalina Drugan

Source code for the article "Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits"

https://codeocean.com/capsule/a61267a5-0bfb-4d4c-8859-8f496dd5372e/code

117 Downloads

Updated 19 Jan 2019

Upper confidence bound (UCB) is a successful multiarmed bandit algorithm for regret minimization. The covariance matrix adaptation (CMA) for Pareto UCB (CMA-PUCB) algorithm considers stochastic reward vectors with correlated objectives. Using a variant of the Bernstein inequality for matrices, we upper bound the cumulative pseudoregret of pulling suboptimal arms for the CMA-PUCB algorithm by O(ln(nDK) ∑i (||Σi||²/Δi)), which is logarithmic in the number of arms K, objectives D, and samples n, where Δi is the regret of pulling the suboptimal arm i. For unknown covariance matrices between objectives Σi, we upper bound the approximation of the covariance matrix using the number of samples to O(n ln(nDK) + ln²(nDK) ∑i (1/Δi)). Simulations on a three-objective stochastic environment show the applicability of our method.
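
As a rough illustration of the Pareto UCB idea that the abstract builds on, the Python sketch below selects, in each round, uniformly at random among the arms whose optimistic reward vectors are not Pareto-dominated. It is not the MATLAB code shipped with this submission and it does not implement the covariance matrix adaptation: the exploration bonus is the generic UCB1 term sqrt(2 ln t / n_i), used here as a stand-in assumption, and all function names (`dominates`, `pareto_front`, `ucb_indices`, `run_pareto_ucb`) are hypothetical.

```python
import math
import random


def dominates(u, v):
    """Return True if reward vector u Pareto-dominates v (larger is better)."""
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))


def pareto_front(vectors):
    """Return the indices of the vectors that no other vector dominates."""
    return [
        i for i, u in enumerate(vectors)
        if not any(dominates(v, u) for j, v in enumerate(vectors) if j != i)
    ]


def ucb_indices(means, counts, t):
    """Add one exploration bonus per arm to every objective of its mean vector.

    Assumption: the generic UCB1 bonus sqrt(2 ln t / n_i). The
    covariance-dependent bonus of CMA-PUCB is not reproduced here.
    """
    return [
        [m + math.sqrt(2.0 * math.log(t) / n) for m in mu]
        for mu, n in zip(means, counts)
    ]


def run_pareto_ucb(pull, num_arms, num_objectives, horizon, rng=None):
    """Play `horizon` rounds; `pull(i)` returns a reward vector for arm i."""
    rng = rng or random.Random()
    counts = [0] * num_arms
    means = [[0.0] * num_objectives for _ in range(num_arms)]

    def update(i, reward):
        # Incremental update of the empirical mean vector of arm i.
        counts[i] += 1
        means[i] = [m + (r - m) / counts[i] for m, r in zip(means[i], reward)]

    for i in range(num_arms):          # initialisation: pull every arm once
        update(i, pull(i))
    for t in range(num_arms + 1, horizon + 1):
        front = pareto_front(ucb_indices(means, counts, t))
        i = rng.choice(front)          # uniform choice among non-dominated arms
        update(i, pull(i))
    return counts, means
```

For example, with three hypothetical arms whose mean reward vectors are (0.9, 0.4, 0.6), (0.4, 0.9, 0.6), and (0.1, 0.1, 0.1), the first two arms are Pareto optimal and the third is dominated by both. Calling `run_pareto_ucb(lambda i: true_means[i], 3, 3, 2000)` on such an environment should pull the third arm far less often than the other two, which is the qualitative behaviour a logarithmic regret bound describes.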

Cite as

Drugan, Madalina. “Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits.” IEEE Transactions on Neural Networks and Learning Systems, Institute of Electrical and Electronics Engineers (IEEE), 2019, pp. 1–10, doi:10.1109/tnnls.2018.2885123.

MATLAB Release Compatibility

Created with R2018b

Compatible with any release

Platform Compatibility

Windows macOS Linux

Versions that use the GitHub default branch cannot be downloaded

Version	Published	Release Notes
1.0.3	19 Jan 2019	Contains a Readme file
1.0.2	19 Jan 2019	Comparison with uniform sampling; improved cumulative regret plots
1.0.1	17 Jan 2019	A bug was detected; a plot file is present
1.0.0	1 Jan 2019	

To view or report issues in this GitHub add-on, visit the GitHub repository.

APA	Drugan, M. (2019). Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits. IEEE Transactions on Neural Networks and Learning Systems, 1–10. Institute of Electrical and Electronics Engineers (IEEE). Retrieved from https://doi.org/10.1109%2Ftnnls.2018.2885123
BibTeX	@article{Drugan_2019, doi = {10.1109/tnnls.2018.2885123}, url = {https://doi.org/10.1109%2Ftnnls.2018.2885123}, year = 2019, publisher = {Institute of Electrical and Electronics Engineers ({IEEE})}, pages = {1--10}, author = {Madalina Drugan}, title = {Covariance Matrix Adaptation for Multiobjective Multiarmed Bandits}, journal = {{IEEE} Transactions on Neural Networks and Learning Systems} }