arXiv:1302.6937 [cs.LG]

Online Convex Optimization Against Adversaries with Memory and Application to Statistical Arbitrage

Oren Anava, Elad Hazan, Shie Mannor

Published 2013-02-27, updated 2014-06-10 (Version 2)

The framework of online learning with memory naturally captures learning problems with temporal constraints, and was previously studied for the experts setting. In this work we extend the notion of learning with memory to the general Online Convex Optimization (OCO) framework, and present two algorithms that attain low regret. The first algorithm applies to Lipschitz continuous loss functions, obtaining optimal regret bounds for both convex and strongly convex losses. The second algorithm also attains the optimal regret bounds and applies more broadly to convex losses without requiring Lipschitz continuity, yet is more complicated to implement. We complement our theoretical results with an application to statistical arbitrage in finance: we devise algorithms for constructing mean-reverting portfolios.
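
To make the setting concrete, here is a minimal sketch of online learning with memory. It is not the authors' implementation. It assumes a scalar decision in [-1, 1], memory length 1, and a hypothetical loss f_t(x_{t-1}, x_t) = (x_t - z_t)^2 + (x_t - x_{t-1})^2. The learner runs plain projected online gradient descent on the unary loss f_t(x, x), a common reduction for Lipschitz losses with memory. All names (`project`, `memory_loss`, `ogd_with_memory`, `targets`, `eta`) are invented for this example.

```python
from typing import List, Sequence, Tuple


def project(x: float, lo: float = -1.0, hi: float = 1.0) -> float:
    """Euclidean projection of a scalar onto the interval [lo, hi]."""
    return max(lo, min(hi, x))


def memory_loss(x_prev: float, x_cur: float, z: float) -> float:
    """Hypothetical loss with memory 1: tracking error plus a movement penalty."""
    return (x_cur - z) ** 2 + (x_cur - x_prev) ** 2


def ogd_with_memory(
    targets: Sequence[float], eta: float, x0: float = 0.0
) -> Tuple[List[float], float]:
    """Projected online gradient descent on the unary loss f_t(x, x).

    Returns the sequence of decisions played and the total loss suffered,
    where the suffered loss depends on the current and previous decision.
    """
    x_prev = x_cur = project(x0)
    decisions: List[float] = []
    total_loss = 0.0
    for z in targets:
        # Play the current decision and pay the loss with memory.
        decisions.append(x_cur)
        total_loss += memory_loss(x_prev, x_cur, z)
        # Gradient of the unary loss f_t(x, x) = (x - z)^2 at x_cur;
        # the movement penalty vanishes when both arguments are equal.
        grad = 2.0 * (x_cur - z)
        x_prev, x_cur = x_cur, project(x_cur - eta * grad)
    return decisions, total_loss


if __name__ == "__main__":
    decisions, total = ogd_with_memory([0.5] * 200, eta=0.1)
    print(decisions[-1], total)
```

In this sketch a small step size `eta` keeps consecutive decisions close together, so the loss actually suffered stays close to the unary loss that the update optimizes.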

Related articles:
arXiv:2103.06473 [cs.LG] (Published 2021-03-11)
Multi-Task Federated Reinforcement Learning with Adversaries
arXiv:1502.04469 [cs.LG] (Published 2015-02-16)
Classification and its application to drug-target interaction prediction
arXiv:1506.03379 [cs.LG] (Published 2015-06-10)
The Online Discovery Problem and Its Application to Lifelong Reinforcement Learning