Publications

Journal Papers: 2002-2005

S. Mannor and R. Meir.On the Existence of Linear Weak Learners and Applications to Boosting. 2002. Machine Learning, 48(1-3):219–251.

S. Mannor and N. Shimkin.The empirical Bayes envelope and regret minimization in competitive Markov decision processes. 2003. Mathematics of Operations Research, 28(2):327–345.

S. Mannor and R. Meir and T. Zhang.Greedy Algorithms for Classification – Consistency, Convergence Rates, and Adaptivity. 2003. Journal of Machine Learning Research, 4:713-742.

S. Mannor and N. Shimkin.A Geometric Approach to Multi-Criterion Reinforcement Learning. 2004. Journal of Machine Learning Research, 5:325–360.

Y. Engel and S. Mannor and R. Meir.The Kernel Recursive Least Squares Algorithm. 2004. IEEE Transactions on Signal Processing, 52(8):2275-2285.

S. Mannor and J. N. Tsitsiklis.The sample complexity of exploration in the multi-armed bandit problem. 2004. JMLR, 5:623–648.

S. Mannor and J. N. Tsitsiklis.On the empirical state-action frequencies in Markov decision processes under general policies. 2005. Mathematics of Operations Research, 30(3):545.

P. de Boer and D. P. Kroese and S. Mannor and R. Y. Rubinstein.A Tutorial on the Cross-Entropy Method. 2005. Annals of Operations Research, 134(1):19-67.

Menache and S. Mannor and N. Shimkin.Basis function adaptation in temporal difference reinforcement learning.  Annals of Operations Research, 134:215–238.

R. Johari and S. Mannor and J. N. Tsitsiklis.Efficiency loss in a network resource allocation game: the case of elastic supply. 2005. IEEE Transactions on Automatic Control, 50(11):1712–1724.

 

Journal Papers: 2006-2009

R. Johari and S. Mannor and J. N. Tsitsiklis.A contract-based model for directed network formation. 2006. Games and Economic Behavior, 56(2):201-224.

E. Even-Dar and S. Mannor and Y. Mansour.Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems. 2006. Journal of Machine Learning Research, 7:1079-1105.

P. Cadotte and S. Mannor and H. Michalska and B. Boulet.Design of L1-Optimal Controllers with Robustness versus Performance Tradeoff. 2006. IEEE Transactions on Automatic Control, 51:868-873.

S. Sharifi Tehrani and W. J. Gross and S. Mannor.Stochastic Decoding of LDPC Codes. 2006. IEEE Comm. Lett., 10(10):716-718.

G. Theocharous and S. Mannor and N. Shah and P. Gandhi and B. Kveton and S. Siddiqi and C. Yu.Machine Learning for Adaptive Power Management. 2006. Intel Technology Journal, 10(4):299-311.

S. Mannor and D. Simester and P. Sun and J. N. Tsitsiklis.Bias and Variance Approximation in Value Function Estimates. 2007. Management Science, 53(2):308-322.

S. Mannor and J. S. Shamma and G. Arslan.Online calibrated forecasts: Memory efficiency versus universality for learning in games. 2007. Machine Learning, 67(1-2):77-115.

C. Caramanis and S. Mannor.An Inequality for Nearly Log-Concave Distributions With Applications to Learning. 2007. IEEE Transactions on Information Theory, 53(3):1043-1057.

Y. Yu and S. Mannor.Efficiency of Market-Based Resource Allocation among Many Participants. 2007. IEEE Journal on Selected Areas in Communications, 25(6):1244-1259.

S. Mannor and J. S. Shamma.Multi-agent learning for engineers. 2007. Artif. Intell., 171(7):417-422.

S. Mannor and N. Shimkin.Regret minimization in repeated matrix games with variable stage duration. 2008. Games and Economic Behavior, 63(1):227 – 258.

S. Sharifi Tehrani and S. Mannor and W. J. Gross.Fully Parallel Stochastic LDPC Decoders. 2008. IEEE Transactions on Signal Processing, 56(11):5692-5703.

G. Lugosi and S. Mannor and G. Stoltz.Strategies for Prediction Under Imperfect Monitoring. 2008. Mathematics of Operations Research, 33(3):513-528.

S. Mannor and J. N. Tsitsiklis.Approachability in repeated games: Computational aspects and a Stackelberg variant. 2009. Games and Economic Behavior, 66(1):315-325.

H. Xu and S. Mannor.A Kalman Filter Design Based on Performance/Robustness Tradeoff. 2009. IEEE Transactions on Automatic Control, 54(5):1171-1175.

E. Arcaute and R. Johari and S. Mannor.Network Formation: Bilateral Contracting and Myopic Dynamics. 2009. IEEE Transactions on Automatic Control, 54:1765 – 1778.

H. Xu and C. Caramanis and S. Mannor.Robustness and Regularization of Support Vector Machines. 2009. Journal of Machine Learning Research, 10(Jul):1485-1510.

S. Mannor and J. Tsitsiklis and J. Y. Yu.Online learning with sample path constraints. 2009. Journal of Machine Learning Research, 10(Mar):569–590.

J. Y. Yu and S. Mannor and N. Shimkin.Markov Decision Processes with Arbitrary Reward Processes. 2009. Mathematics of Operations Research, 34(3):737-757.

 

2010 and beyond

E. Delage and S. Mannor.Percentile Optimization for Markov Decision Processes with Parameter Uncertainty. 2010. Operations Research, 58(1):203-213.

S. Sharifi Tehrani and A. Naderi and G. Kamendje and S. Hemati and S. Mannor and W. J. Gross.Majority-based Tracking Forecast Memories for Stochastic LDPC Decoding. 2010. IEEE Transactions on Signal Processing, 58(9):4883 – 4896.

H. Xu and C. Caramanis and S. Mannor.Robust Regression and Lasso. 2010. IEEE Transactions on Information Theory, 56(7): 3561 – 3574.

S. Sharifi Tehrani and C. Winstead and W. J. Gross and S. Mannor and S. L. Howard and V. C. Gaudet .Relaxation Dynamics in Stochastic Iterative Decoders. 2010. IEEE Transactions on Signal Processing, 58(11): 5955 – 5961 .

Cushon, K. and Leroux, C. and Hemati, S. and Mannor, S. and Gross, W.J.A Min-Sum Iterative Decoder Based on Pulsewidth Message Encoding.  Circuits and Systems II: Express Briefs, IEEE Transactions on, 57(11):893 -897.

C. Leroux and S. Hemati and S. Mannor and W. Gross .Stochastic Chase Decoding of Reed-Solomon Codes. 2010. IEEE Communications Letters, 14(9): 863 – 865.

S. Mannor and G. Stoltz.A Geometric Proof of Calibration. 2010. Mathematics of Operations Research, 35(4):721–727.

Danak and S. Mannor .Efficient Bidding in Dynamic Grid Markets.  IEEE Trans. Parallel Distrib. Syst., 22(9):1483-1496.

S. Sharifi Tehrani and A. Naderi and G. Kamendje and S. Mannor and W. J. Gross.Tracking Forecast Memories for Stochastic Decoding. 2011. Signal Processing Systems, 63(1):117-127.

K. Jagannathan and S. Mannor and I. Menache and E. Modiano.A State Action Frequency Approach to Throughput Maximization over Uncertain Wireless Channels. 2011. Internet Mathematics. (In press)

Naderi, S. Mannor, M. Sawan, and W. Gross.Delayed Stochastic Decoding of LDPC Codes.  IEEE Trans. on Signal Processing. (In press)

Danak and S. Mannor.A Robust Learning Approach to Repeated Auctions with Monitoring and Entry Fees.  IEEE Trans. on Computational Intelligence and AI in games. (In press)

D. Vainsencher and S. Mannor and A. Bruckstein.The Sample Complexity of Dictionary Learning. 2011. Journal of Machine Learning Research. (In press)

H. Xu and S. Mannor.Robustness and Generalization. 2012. Machine Learning, 86(3):391-423.

H. Xu and C. Caramanis and S. Mannor.Sparse Algorithms are not Stable: A No-free-lunch Theorem. 2012. IEEE PAMI, 34(1):187-193.

K. Jagannathan and S. Mannor and I. Menache and E. Modiano.A State Action Frequency Approach to Throughput Maximization over Uncertain Wireless Channels. 2012. Internet Mathematics. (In press)

H. Xu and C. Caramanis and S. Mannor.A Distributional Interpretation of Robust Optimization. 2012. Mathematics of Operations Research. (In Press)

H. Xu and C. Caramanis and S. Mannor.Optimization under Probabilistic Envelope Constraints. 2012. Operations Research. (In Press)

H. Xu and S. Mannor.Distributionally Robust Markov Decision Processes. 2012. Mathematics of Operations Research. (In Press)

F. Leduc-Primeau and S. Hemati and S. Mannor and W. Gross.Dithered Belief Propagation Decoding. 2012. IEEE Transactions on Communications. (In Press)

 

Conference Papers 2000-2004

S. Mannor and R. Meir.Weak Learners and Improved Rates of Convergence in Boosting. 2000. in Neural Information Processing Systems (NIPS), pages 280-286.

Y. Engel and S. Mannor.Learning Embedded Maps of Markov Processes. 2001. in Proc. of the Eighteenth International Conference on Machine Learning.

S. Mannor and N. Shimkin.The Steering Approach for Multi-Criteria Reinforcement Learning. 2001. in Neural Information Processing Systems (NIPS), pages 1563-1570. (Full version: 04′ JMLR paper)

S. Mannor and R. Meir.Geometric Bounds for Generalization in Boosting. 2001. in COLT/EuroCOLT, pages 461-472. (Full version: 04′ MLJ paper)

S. Mannor and N. Shimkin.Adaptive Strategies and Regret Minimization in Arbitrarily Varying Markov Environments. 2001. in COLT/EuroCOLT, pages 128-142. (Full version: 03′ MOR paper)

Y. Engel and S. Mannor and R. Meir.Sparse Online Greedy Support Vector Regression. 2002. in 13th European Conference on Machine Learning.

Menache and S. Mannor and N. Shimkin.Q-Cut – Dynamic Discovery of Sub-goals in Reinforcement Learning.  in ECML, pages 295-306.

S. Mannor and R. Meir and T. Zhang.The Consistency of Greedy Algorithms for Classification. 2002. in COLT, pages 319-333. (Full version: 03′ JMLR paper)

E. Even-Dar and S. Mannor and Y. Mansour.PAC Bounds for Multi-armed Bandit and Markov Decision Processes. 2002. in Proceedings of the Conference on Learning Theory (COLT), pages 255–270. (Full version: 06′ JMLR paper)

Y. Engel and S. Mannor and R. Meir.Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning. 2003. in Proc. of the 20th International Conference on Machine Learning.

Y. Even-Dar and S. Mannor and Y. Mansour.Action Elimination and Stopping Conditions for Reinforcement Learning. 2003. in Proc. of the 20th International Conference on Machine Learning. (Full version: 06′ JMLR paper)

S. Mannor and J. N. Tsitsiklis.Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem. 2003. in Proceedings of the Conference on Learning Theory (COLT), pages 418-432. (Full version: 04′ JMLR paper)

S. Mannor and N. Shimkin.On-Line Learning with Imperfect Monitoring. 2003. in COLT, pages 552-566. (Superseded by a 09′ paper with Lugosi and Stoltz and then by 11′ COLT paper with Perchet and Stoltz)

S. Mannor and R. Y. Rubinstein and Y. Gat.The Cross Entropy Method for Fast Policy Search. 2003. in ICML, pages 512-519.

S. Mannor.Reinforcement Learning for Average Reward Zero-Sum Games. 2004. in COLT, pages 49-63.

C. Caramanis and S. Mannor.An Inequality for Nearly Log-Concave Distributions with Applications to Learning.  in COLT, pages 534-548. (Full version: 07′ IEEE Trans Information Theory)

S. Mannor and D. Simester and P. Sun and J.N. Tsitsiklis.Bias and variance in Value function estimation. 2004. in Proc. of the 21st International Conference on Machine Learning. (Full version: 07′ Management Science paper)

S. Mannor and I. Menache and A. Hoze and U. Klein.Dynamic abstraction in reinforcement learning via clustering. 2004. in ICML.

R. Johari and S. Mannor and J. N. Tsitsiklis .Efficiency loss in a resource allocation game: A single link in elastic supply. 2004. in CDC, pages 4679 – 4683. (Short version of IEEE Trans Aut Control 05′ paper)

S. Mannor and D. Peleg and R. Rubinstein.The Cross Entropy Method for Classification. 2005. in Proceedings of the 22nd international conference on Machine learning, pages 561-568.

Y. Engel and S. Mannor and R. Meir.Reinforcement Learning with Gaussian Processes. 2005. in Proc. of the 22nd International Conference on Machine Learning.

F. Li and S. Mannor and A. Lippman.Probabilistic Optimization for Energy-Efficient Broadcast in All-Wireless Networks. 2005. in Proceeding of the 39th Annual IEEE Conference on Information Sciences and Systems (CISS ‘2005).

 

Conference Papers 2006-2007

S. Mannor and N. Shimkin.Online Learning with Variable Stage Duration. 2006. in COLT, pages 408-422. (Short version of GEB 08′ paper)

S. Mannor and J. N. Tsitsiklis.Online Learning with Constraints. 2006. in COLT, pages 529-543. (Short version of JMLR 09′ paper)

P. W. Keller and S. Mannor and D. Precup.Automatic basis function construction for approximate dynamic programming and reinforcement learning. 2006. in ICML, pages 449-456.

J. Y. Yu and S. Mannor.Asymptotics of Efficiency Loss in Competitive Market Mechanisms. 2006. in INFOCOM. (Short version of IEEE JSAC 0′ paper)

H. Xu and S. Mannor.The Robustness-Performance Tradeoff in Markov Decision Processes. 2006. in NIPS, pages 1537-1544.

P. Cadotte and S. Mannor and H. Michalska and B. Boulet.Design of L1-Optimal Controllers with Flexible Disturbance Rejection Level. 2006. in IEEE ACC, pages 868-873. (Short version of the 06′ IEEE TAC paper)

E. Arcaute and R. Johari and S. Mannor.Network Formation: Bilateral Contracting and Myopic Dynamics. 2007. in WINE, pages 191-207. (Short version of the 09′ IEEE TAC paper)

S. Sharifi Tehrani and S. Mannor and W. J. Gross.An Area-Efficient FPGA-Based Architecture for Fully-Parallel Stochastic LDPC Decoding. 2007. in SiPS, pages 255-260.

F. Heidari and S. Mannor and L. Mason.Reinforcement Learning-Based Load Shared Sequential Routing. 2007. in Networking, pages 832-843.

S. Sharifi Tehrani and S. Mannor and W. J. Gross.Survey of Stochastic Computation on Factor Graphs. 2007. in ISMVL, pages 54.

E. Delage and S. Mannor.Percentile optimization in uncertain Markov decision processes with application to efficient exploration. 2007. in ICML, pages 225-232. (A short version of 10′ Operations Research paper)

Chatelain and S. Mannor and F. Gagnon and D. V. Plant.Non-Cooperative Design of Translucent Networks.  in GLOBECOM, pages 2348-2352.

G. Lugosi and S. Mannor and G. Stoltz.Strategies for Prediction Under Imperfect Monitoring. 2007. in COLT, pages 248-262. (A short version of 09′ MOR paper, supersedes 03′ COLT paper with N. Shimkin)

Yu and S. Mannor and G. Theocharous and A. Pfeffer.User Model and Utility Based Power Management.  in AAAI, pages 1918-1919.

Kveton and P. Gandhi and G. Theocharous and S. Mannor and B. Rosario and N. Shah.Adaptive Timeout Policies for Fast Fine-Grained Power Management.  in AAAI, pages 1795-1800.

Arcaute and E. Dallal and R. Johari and S. Mannor.Dynamics and stability in network formation games with bilateral contracts.  in CDC, pages 3435 – 3442.

Ernst and M. Glavic and G. B. Stan and S. Mannor and L. Wehenkel.The cross-entropy method for power system combinatorial optimization problems.  in IEEE Power Tech, pages 1290 – 1295.

H. Xu and S. Mannor.A Kalman Filter Design Based on the Performance/Robustness Tradeoff. 2007. in Proceedings of Forty-Fifth Allerton Conference on Communication, Control, and Computing, pages 59-63. (A short version of the IEEE TAC paper.)

 

Conference papers: 2008 – 2009

Arcaute and R. Johari and S. Mannor.Local Two-Stage Myopic Dynamics for Network Formation Games.  in WINE, pages 263-277.

M. Farahmand and M. Ghavamzadeh and C. Szepesvari and S. Mannor.Regularized Policy Iteration. 2008. in NIPS, pages 441-448.

H. Xu and C. Caramanis and S. Mannor.Robust Regression and Lasso. 2008. in NIPS, pages 1801-1808. (Full version in IEEE IT 2010)

J. Frank and S. Mannor and D. Precup.Reinforcement learning in the presence of rare events. 2008. in ICML, pages 336-343.

M. Farahmand and M. Ghavamzadeh and C. Szepesvari and S. Mannor.Regularized Fitted Q-Iteration: Application to Planning. 2008. in EWRL, pages 55-68.

K. Dyagilev and S. Mannor and N. Shimkin.Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. 2008. in EWRL, pages 41-54.

J. Y. Yu and S. Mannor and N. Shimkin.Markov Decision Processes with Arbitrary Reward Processes. 2008. in EWRL, pages 268-281. (A short version of 09′ MOR paper)

H. Xu and S. Mannor and C. Caramanis.Sparse Algorithms are not Stable: A No-free-lunch Theorem. 2008. in Proceedings of Forty-Sixth Allerton Conference on Communication, Control, and Computing, pages 1299 – 1303.

H. Xu and C. Caramanis and S. Mannor.Robust dimensionality reduction for high-dimension data. 2008. in Proceedings of Forty-Sixth Allerton Conference on Communication, Control, and Computing, pages 1291 – 1298.

Arcaute and R. Johari and S. Mannor.Local dynamics for network formation games.  in Proceedings of Forty-Sixth Allerton Conference on Communication, Control, and Computing, pages 937 – 938.

Kveton and J. Y. Yu and G. Theocharous and S. Mannor.A lazy approach to online learning with constraints.  in Proceedings of the International Symposium on Artificial Intelligence and Mathematics.

Kveton and J. Y. Yu and G. Theocharous and S. Mannor.Online learning with expert advice and finite-horizon constraints.  in Proceedings of the AAAI Conference on Artificial Intelligence, pages 331–336.

K. Cushon and W. J. Gross and S. Mannor.Bidirectional interleavers for LDPC decoders using transmission gates. 2009. in SiPS, pages 232-237.

J. Y. Yu and S. Mannor.Piecewise-stationary Bandit Problems with Side Observations. 2009. in Proceedings of the International Conference on Machine Learning (ICML).

Even-Dar and Robert Kleinbereg and S. Mannor and Y. Mansour.Online Learning for Global Cost Functions.  in COLT.

Sarkis and S. Mannor and W. J. Gross.Stochastic Decoding of LDPC Codes over GF(q).  in ICC, pages 1-5.

S. Sharifi Tehrani and A. Naderi and G. A. Kamendje and S. Mannor and W. J. Gross.Tracking Forecast Memories in stochastic decoders. 2009. in ICASSP, pages 561-564.

Leduc-Primeau and S. Hemati and W. J. Gross and S. Mannor.A Relaxed Half-Stochastic Iterative Decoder for LDPC Codes.  in GLOBECOM, pages 1-6.

J. Y. Yu and S. Mannor.Arbitrarily Modulated Markov Decision Processes. 2009. in Proceedings of the IEEE Conference on Decision and Control.

Xu and C. Caramanis and S. Mannor and S. Yun.Risk sensitive robust support vector machines.  in Proceedings of the IEEE Conference on Decision and Control, pages 4655-4661.

Xu and S. Mannor.Parametric regret in uncertain Markov decision processes.  in Proceedings of the IEEE Conference on Decision and Control, pages 3606-3613.

M. Farahmand and M. Ghavamzadeh and C. Szepesvari and S. Mannor.Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems. 2009. in American Control Conference, pages 725 – 730.

H. Xu and C. Caramanis and S. Mannor.High dimensional Principal Component Analysis with contaminated data. 2009. in Networking and Information Theory, pages 246 – 250.

Danak and S. Mannor.Bidding efficiently in repeated auctions with entry and observation costs.  in IEEE Game Theory for Networks (GAMENETS), pages 299 – 307.

Y. Yu and S. Mannor.Online learning in Markov decision processes with arbitrarily changing rewards and transitions. 2009. in IEEE Game Theory for Networks (GAMENETS), pages 314 – 322.

 

Conference Papers 2010 –

Danak and S. Mannor.Resource Allocation with Supply Adjustment in Distributed Computing Systems.  in ICDCS, pages 498-506.

Di Castro and S. Mannor.Adaptive Bases for Reinforcement Learning.  in ECML/PKDD, pages 312-327.

Frank and S. Mannor and D. Precup.Activity and Gait Recognition with Time-Delay Embeddings.  in AAAI.

H. Xu and S. Mannor.Robustness and Generalization. 2010. in COLT.

H. Xu and C. Caramanis and S. Mannor.Principal Component Analysis with Contaminated Data: The High Dimensional Case. 2010. in COLT.

Even-Dar and S. Mannor and Y. Mansour.Learning with Global Cost in Stochastic Environments.  in COLT.

Sarkis and S. Hemati and S. Mannor and W. Gross.Relaxed Half-Stochastic Decoding of LDPC Codes Over GF(q).  in Proceedings of Forty-Eighth Allerton Conference on Communication, Control, and Computing, pages xxx – xxx.

Kizilkale and S. Mannor.Volatility and Efficiency in Markets with Friction.  in Proceedings of Forty-Eighth Allerton Conference on Communication, Control, and Computing, pages xxx – xxx.

Xu and C. Caramanis and S. Mannor.A Distributional Interpretation of Robust Optimization.  in Proceedings of Forty-Eighth Allerton Conference on Communication, Control, and Computing, pages xxx – xxx.

D. Di Castro and S. Mannor.Tutor Learning Using Linear Constraints in Approximate Dynamic Programming. 2010. in Proceedings of Forty-Eighth Allerton Conference on Communication, Control, and Computing, pages xxx – xxx.

Xu and S. Mannor.Distributionally Robust Markov Decision Processes.  in Neural Information Processing Systems (NIPS), pages xxx-xxx.

Bernstein and S. Mannor and N. Shimkin.Online Classification with Specificity Constraints.  in Neural Information Processing Systems (NIPS), pages xxx-xxx.

Kizilkale and S. Mannor.Regulation and Efficiency in Markets with Friction.  in CDC, pages 4137 – 4144 .

D. Di Castro and S. Mannor.Adaptive Bases for Q-Learning. 2010. in CDC, pages 4587-4593.

Jagannathan and S. Mannor and I. Menache and E. Modiyano.A State Action Frequency Approach to Throughput Maximization over Uncertain Wireless Channels.  in INFOCOM, pages xxx – xxx.

H. Xu and S. Mannor.Probabilistic Goal Markov Decision Processes. 2011. in IJCAI, pages 2046-2052.

J-Y Yu and S. Mannor.Unimodal Bandits.  in Proceedings of the International Conference on Machine Learning (ICML).

S. Mannor and J. N. Tsitsiklis.Mean-variance optimization in Markov Decision processes. 2011. in Proceedings of the International Conference on Machine Learning (ICML).

Harel and S. Mannor.Learning from Multiple Outlooks.  in Proceedings of the International Conference on Machine Learning (ICML).

D. Vainsencher and O. Dekel and S. Mannor.Bundle Selling by Online Estimation of Valuation Functions. 2011. in Proceedings of the International Conference on Machine Learning (ICML).

S. Mannor, V. Perchet and G. Stoltz.Robust approachability and regret minimization in games with partial monitoring. 2011. in Proceedings of the Conference on Learning Theory (COLT).

D. Vainsencher and S. Mannor and A. Bruckstein.The Sample Complexity of Dictionary Learning. 2011. in Proceedings of the Conference on Learning Theory (COLT).

Kizilkale and S. Mannor.Regulation and Double Price Mechanisms in Markets with Friction.  in CDC, pages xxx-xxx.

O. Avner and S. Mannor.Stochastic Bandits with Pathwise Constraints. 2011. in CDC, pages xxx-xxx.

S. Mannor and O. Shamir.From Bandits to Experts: On the Value of Side-Observations. 2011. in Neural Information Processing Systems (NIPS), pages xxx – xxx.

Bui and R. Johari and S. Mannor.Committing Bandits.  in Neural Information Processing Systems (NIPS), pages xxx – xxx.

H. Xu and C. Caramanis and S. Mannor.Statistical Optimization in High Dimensions. 2012. in AISTATs.

Milling and C. Caramanis and S. Mannor and S. Shakkottai.Network Forensics: Random Infection vs. Spreading Epidemic.  in SIGMETRICS.

 

Book Chapters

S. Mannor.K-Armed Bandit. 2010. in Encyclopedia of Machine Learning, pages 561-563. (Editted by C. Sammut and G. I. Webb)

Caramanis and S. Mannor and H. Xu.Robust Optimization and Machine Learning.  in Optimization for Machine Learning, pages xxx-xxx. (In press, edited by S. Sra, S. Nowozin and S. Wright)

Ghamvazadeh and P. Poupart and S. Mannor and N. Vlassis.Bayesian Reinforcement Learning.  in Reinforcement Learning: State of the art, pages xxx-xxx. (In press, edited by M. Wiering and M. van Otterlo)

 

Preprints that do not appear above

H. Xu and S. Mannor.Robustness and Generalization. 2010. CoRR, vol abs/1005.2243.