MLRG/fall09

From ResearchWiki

Revision as of 20:35, 15 September 2009 by Piyush (Talk | contribs)
Jump to: navigation, search

Contents

CS7941: Online Learning

Time: Fridays, 2-3:20

Location: MEB 3105, except as noted

Topic: Online learning

Participants

Schedule

Date Papers Presenter
Sep 04 Online Learning Survey by Avrim Blum Hal
Sep 11 Learning Quickly when Irrelevant Attributes Abound: A New Linear-threshold Algorithm by Nick Littlestone Sandeep and Shuying
Sep 18 The Weighted Majority Algorithm by Littlestone and Warmuth Jagadeesh
Sep 25 Efficient algorithms for the online decision problem by Adam Kalai and Santosh Vempala Seth
Oct 02 Online convex programming and generalized infinitesimal gradient ascent by M. Zinkevich Jiarong and Mike
Oct 23 Confidence-Weighted Linear Classification by Mark Dredze, Koby Crammer and Fernando Pereira Amit and Lalindra
Oct 30 Step Size-Adapted Online Support Vector Learning by Karatzoglou, Vishwanathan, Schraudolph, and Smola Ramesh
Nov 06 A New Perspective on an Old Perceptron Algorithm by Shai Shalev-Shwartz and Yoram Singer Youngjun and anyone who wants to be his team
Nov 13 Data-Driven Online to Batch Conversions by Ofer Dekel and Yoram Singer Ruihong and Adam
Nov 20 [some bandit paper] by [some bandit paper author(s)] - kindly leave the bandit papers for me :) Amrish & Avishek
Dec 04 Sparse Online Learning via Truncated Gradient by John Langford, Lihong Li and Tong Zhang Abhishek and Piyush
Dec 11 [xxx] by xxx

Possible Papers

  • Littlestone and Warmuth, The Weighted Majority Algorithm. Information and Computation 108(2):212-261, 1994.
  • Nicolo Cesa-Bianchi, Yoav Freund, David Haussler, David Helmbold, Robert Schapire, and Manfred Warmuth, How to use expert advice, Journal of the ACM, 44(3):427-485, May 1997.
  • Yoav Freund and Robert Schapire, Adaptive game playing using multiplicative weights, Games and Economic Behavior, 29:79-103, 1999.
  • Peter Auer, Nicolo Cesa-Bianchi, Yoav Freund, Robert Schapire: The Nonstochastic Multiarmed Bandit Problem, SIAM J. Comput. 32(1): 48-77 (2002).
  • Abie Flaxman, Adam Tauman Kalai, and Brendan McMahan. Online Convex Optimization in the Bandit Setting: Gradient Descent Without a Gradient. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 385-394, 2005.

Paper Summaries

Past Semesters

Personal tools