Safe Reinforcement Learning by Imagining the Near Future

http://arxiv.org/abs/2202.07789v1

Abstract

Safe reinforcement learning is a promising path toward applying reinforcement learning algorithms to real-world problems, where suboptimal behaviors may lead to actual negative consequences. In this work, we focus on the setting where unsafe states can be avoided by planning ahead a short time into the future. In this setting, a model-based agent with a sufficiently accurate model can avoid unsafe states. We devise a model-based algorithm that heavily penalizes unsafe trajectories, and derive guarantees that our algorithm can avoid unsafe states under certain assumptions. Experiments demonstrate that our algorithm can achieve competitive rewards with fewer safety violations in several continuous control tasks.


Introduction

Reinforcement learning (RL) enables the discovery of effective policies for sequential decision-making tasks via trial and error [@mnih2015human; @gu2016deep; @bellemare2020autonomous]. However, in domains such as robotics, healthcare, and autonomous driving, certain kinds of mistakes pose danger to people and/or objects in the environment. Hence there is an emphasis on the safety of the policy, both at execution time and while interacting with the environment during learning. This issue, referred to as safe exploration, is considered an important problem in AI safety [@amodei2016concrete].

In this work, we advocate a model-based approach to safety, meaning that we estimate the dynamics of the system to be controlled and use the model for planning (or more accurately, policy improvement). The primary motivation for this is that a model-based method has the potential to anticipate safety violations before they occur. Often in real-world applications, the engineer has an idea of what states should be considered violations of safety: for example, a robot colliding rapidly with itself or surrounding objects, a car driving on the wrong side of the road, or a patient's blood glucose levels spiking. Yet model-free algorithms typically lack the ability to incorporate such prior knowledge and must encounter some safety violations before learning to avoid them.

An illustrative example. The agent controls the speed of a car by pressing the accelerator or brake (or neither), attempting to avoid any obstacles such as other cars or people in the road. The top car has not yet come into contact with the pedestrian, but cannot avoid the pedestrian from its current position and speed, even if it brakes immediately. The bottom car can slow down before hitting the pedestrian. If the bottom car plans several steps into the future, it could reduce its speed to avoid the "irrecoverable" situation faced by the top car.{#fig:didactic width="75%"}

We begin with the premise that in practice, forward prediction for relatively few timesteps is sufficient to avoid safety violations. Consider the illustrative example in Figure 1{reference-type="ref" reference="fig:didactic"}, in which an agent controls the acceleration (and thereby, speed) of a car by pressing the gas or brake (or nothing). Note that there is an upper bound on how far into the future the agent would have to plan to foresee and (if possible) avoid any collision, namely, the amount of time it takes to bring the car to a complete stop.
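To make this upper bound concrete, here is a minimal sketch (with made-up speed, deceleration, and control-interval values) that computes the stopping horizon for the car example; the kinematics and numbers are purely illustrative, not part of any task in this paper.

```python
import math

def stopping_horizon(speed, max_decel, dt):
    """Timesteps needed to brake from `speed` to a full stop.

    If maximal braking avoids collision for this many steps, the car reaches a
    full stop and remains safe thereafter, so a planner never needs to look
    further ahead. (Hypothetical kinematics; numbers below are illustrative.)
    """
    return math.ceil(speed / (max_decel * dt))

# e.g. 15 m/s, braking at 5 m/s^2, 0.1 s control interval -> 30 steps
print(stopping_horizon(speed=15.0, max_decel=5.0, dt=0.1))
```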

Assuming that the horizon required for detecting unsafe situations is not too large, we show how to construct a reward function with the property that an optimal policy will never incur a safety violation. A short prediction horizon is also beneficial for model-based RL, as the well-known issue of compounding error plagues long-horizon prediction [@asadi2019combating]: imperfect predictions are fed back into the model as inputs (possibly outside the distribution of inputs in the training data), leading to progressively worse accuracy as the prediction horizon increases.

Our main contribution is a model-based algorithm that utilizes a reward penalty -- the value of which is prescribed by our theoretical analysis -- to guarantee safety (under some assumptions). Experiments indicate that the practical instantiation of our algorithm, SMBPO, effectively reduces the number of safety violations on several continuous control tasks, achieving comparable performance with far fewer safety violations than several model-free safe RL algorithms. Code is made available at https://github.com/gwthomas/Safe-MBPO.

Background

In this work, we consider a deterministic[^1] Markov decision process (MDP) $M = (\mathcal{S}, \mathcal{A}, T, r, \gamma)$, where $\mathcal{S}$ is the state space, $\mathcal{A}$ the action space, $T : \mathcal{S}\times \mathcal{A}\to \mathcal{S}$ the transition dynamics, $r : \mathcal{S}\times \mathcal{A}\to [r_{\min}, r_{\max}]$ the reward function, and $\gamma \in [0,1)$ the discount factor. A policy $\pi : \mathcal{S}\to \Delta(\mathcal{A})$ determines what action to take at each state. A trajectory is a sequence of states and actions $\tau = (s_0, a_0, r_0, s_1, a_1, r_1, \dots)$ where $s_{t+1} = T(s_t,a_t)$ and $r_t = r(s_t,a_t)$.

Typically, the goal is to find a policy which maximizes the expected discounted return $\eta(\pi) = \mathbb{E}^\pi[\sum_{t=0}^\infty \gamma^t r_t]$. The notation $\mathbb{E}^\pi$ denotes that actions are sampled according to $a_t \sim \pi(s_t)$. The initial state $s_0$ is drawn from an initial distribution which we assume to be fixed and leave out of the notation for simplicity.

The $Q$ function $Q^\pi(s,a) = \mathbb{E}^\pi[\sum_{t=0}^\infty \gamma^t r_t \mid s_0=s, a_0=a]$ quantifies the conditional performance of a policy $\pi$ assuming it starts in a specific state $s$ and takes action $a$, and the value function $V^\pi(s) = \mathbb{E}_{a \sim \pi(s)}[Q^\pi(s,a)]$ averages this quantity over actions. The values of the best possible policies are denoted $Q^*(s,a) = \max_\pi Q^\pi(s,a)$ and $V^*(s) = \max_\pi V^\pi(s)$. The function $Q^*$ has the important property that any optimal policy $\pi^* \in \arg\max_\pi \eta(\pi)$ must satisfy $\mathbb{P}(a^* \in \arg\max_a Q^*(s,a)) = 1$ for all states $s$ and actions $a^* \sim \pi^*(s)$. $Q^*$ is the unique fixed point of the Bellman operator $$\mathcal{B}^*Q(s,a) = r(s,a) + \gamma \max_{a'} Q(s', a') \quad \text{where} ~ s' = T(s,a)$$
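For concreteness, the following sketch iterates the Bellman operator above to its fixed point on a small tabular, deterministic MDP; the transition table and rewards are toy values chosen only for illustration.

```python
import numpy as np

def bellman_backup(Q, T, r, gamma):
    """One application of B*: Q(s,a) <- r(s,a) + gamma * max_{a'} Q(T(s,a), a').

    Q: (S, A) array of Q-values.
    T: (S, A) integer array of deterministic next-state indices.
    r: (S, A) array of rewards.
    """
    return r + gamma * Q[T].max(axis=-1)

# Toy example (all values made up): 3 states, 2 actions.
S, A, gamma = 3, 2, 0.9
T = np.array([[1, 2], [2, 0], [0, 1]])
r = np.random.uniform(size=(S, A))
Q = np.zeros((S, A))
for _ in range(200):            # fixed-point iteration converges to Q*
    Q = bellman_backup(Q, T, r, gamma)
```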

In model-based RL, the algorithm estimates a dynamics model $\widehat{T}$ using the data observed so far, then uses the model for planning or data augmentation. The theoretical justification for model-based RL is typically based on some version of the "simulation lemma", which roughly states that if $\widehat{T}\approx T$ then $\hat\eta(\pi) \approx \eta(\pi)$ [@kearns2002near; @luo2018algorithmic].

Method

In this work, we train safe policies by modifying the reward function to penalize safety violations. We assume that the engineer specifies $\mathcal{S}_{\textup{unsafe}}$, the set of states which are considered safety violations.

We must also account for the existence of states which are not themselves unsafe, but lead inevitably to unsafe states regardless of what actions are taken.

::: {.definition} Definition 1. A state $s$ is said to be unsafe if $s \in \mathcal{S}_{\textup{unsafe}}$; irrecoverable if $s \not\in \mathcal{S}_{\textup{unsafe}}$ but for any sequence of actions $a_0, a_1, a_2, \dots$, the trajectory defined by $s_0 = s$ and $s_{t+1} = T(s_t, a_t)$ satisfies $s_{\bar t} \in \mathcal{S}_{\textup{unsafe}}$ for some $\bar t \ge 1$; and safe if it is neither unsafe nor irrecoverable. :::

We remark that these definitions are similar to those introduced in prior work on safe RL [@hans2008safe]. Crucially, we do not assume that the engineer specifies which states are (ir)recoverable, as that would require knowledge of the system dynamics. However, we do assume that a safety violation must come fairly soon after entering an irrecoverable region:

::: {#ass:fastfail .assumption} Assumption 1. There exists a horizon $H^* \in \mathbb{N}$ such that, for any irrecoverable state $s$, any sequence of actions $a_0, \dots, a_{H^*-1}$ will lead to an unsafe state. That is, if $s_0 = s$ and $s_{t+1} = T(s_{t}, a_{t})$ for all $t \in \{0, \dots, H^*-1\}$, then $s_{\bar{t}} \in \mathcal{S}_{\textup{unsafe}}$ for some $\bar{t} \in \{1, \dots, H^*\}$. :::

This assumption rules out the possibility that a state leads inevitably to termination but takes an arbitrarily long time to do so. The implication of this assumption is that a perfect lookahead planner which considers the next $H^*$ steps into the future can avoid not only the unsafe states, but also any irrecoverable states, with some positive probability.

Reward penalty framework

Now we present a reward penalty framework for guaranteeing safety. Let $\widetilde{M}_C = (\mathcal{S}, \mathcal{A}, \widetilde{T}, \tilde{r}, \gamma)$ be an MDP with reward function and dynamics $$\big(\tilde{r}(s,a), \widetilde{T}(s,a)\big) = \begin{cases} (r(s,a), T(s,a)) & s \not\in \mathcal{S}_{\textup{unsafe}} \\ (-C, s) & s \in \mathcal{S}_{\textup{unsafe}} \end{cases}$$ where the terminal cost $C \in \mathbb{R}$ is a constant (more on this below). That is, unsafe states are "absorbing" in that they transition back into themselves and receive the reward of $-C$ regardless of what action is taken.
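This construction can be mimicked with a simple environment wrapper. The sketch below assumes a hypothetical `reset()`/`step()` interface and a user-supplied `is_unsafe` predicate; it is an illustration of the penalized MDP, not the interface used in our released code.

```python
class PenaltyAbsorbingWrapper:
    """Environment wrapper mimicking the penalized MDP above (a sketch).

    Assumed interface: env.reset() -> state and
    env.step(a) -> (state, reward, done, info), plus a user-supplied
    predicate is_unsafe(state). Once an unsafe state is reached, the
    wrapper self-loops there with reward -C regardless of the action.
    """

    def __init__(self, env, is_unsafe, C):
        self.env = env
        self.is_unsafe = is_unsafe
        self.C = C
        self._unsafe_state = None   # set once we have entered S_unsafe

    def reset(self):
        self._unsafe_state = None
        return self.env.reset()

    def step(self, action):
        if self._unsafe_state is not None:
            # Absorbing unsafe state: same state back, reward -C, no termination.
            return self._unsafe_state, -self.C, False, {}
        state, reward, done, info = self.env.step(action)
        if self.is_unsafe(state):
            self._unsafe_state = state
        return state, reward, done, info
```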

The basis of our approach is to determine how large $C$ must be so that the Q values of actions leading to unsafe states are less than the Q values of safe actions.

::: {.lemma} Lemma 1. Suppose that Assumption 1{reference-type="ref" reference="ass:fastfail"} holds, and let $$C > \frac{r_{\max} - r_{\min}}{\gamma^{H^*}} - r_{\max}. \label{eqn:theory-C}$$ Then for any state $s$, if $a$ is a safe action (i.e. $T(s,a)$ is a safe state) and $a'$ is an unsafe action (i.e. $T(s,a')$ is unsafe or irrecoverable), it holds that $\widetilde{Q}^*(s,a) > \widetilde{Q}^*(s,a')$, where $\widetilde{Q}^*$ is the $Q^*$ function for the MDP $\widetilde{M}_C$. :::

[[lemma:theory-C]]{#lemma:theory-C label="lemma:theory-C"}

::: {.proof} Proof. Since $a'$ is unsafe, it leads to an unsafe state in at most $H^*$ steps by assumption. Thus the discounted reward obtained is at most $$\sum_{t=0}^{H^*-1} \gamma^t r_{\max} + \sum_{t=H^*}^\infty \gamma^t (-C) = \frac{r_{\max}(1-\gamma^{H^*}) - C\gamma^{H^*}}{1-\gamma}$$ By comparison, the safe action $a$ leads to another safe state, where it can be guaranteed to never encounter a safety violation. The reward of staying within the safe region forever must be at least $\frac{r_{\min}}{1-\gamma}$. Thus, it suffices to choose $C$ large enough that $$\frac{r_{\max}(1-\gamma^{H^*}) - C\gamma^{H^*}}{1-\gamma} < \frac{r_{\min}}{1-\gamma}$$ Rearranging, we arrive at the condition stated. ◻ :::
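As a quick numerical check of the bound in Lemma 1, the following sketch plugs in illustrative values (rewards in $[0,1]$, $\gamma = 0.99$, $H^* = 10$); these numbers are for illustration only and do not correspond to any task in the experiments.

```python
def terminal_cost_bound(r_min, r_max, gamma, H_star):
    """Right-hand side of the condition on C: (r_max - r_min)/gamma^H* - r_max."""
    return (r_max - r_min) / gamma**H_star - r_max

# Illustrative values only: rewards in [0, 1], gamma = 0.99, H* = 10.
print(terminal_cost_bound(r_min=0.0, r_max=1.0, gamma=0.99, H_star=10))  # ~0.106
# Any C strictly greater than this value satisfies the lemma's condition.
```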

The important consequence of this result is that an optimal policy for the MDP $\widetilde{M}_C$ will always take safe actions. However, in practice we cannot compute $\widetilde{Q}^*$ without knowing the dynamics model $T$. Therefore we extend our result to the model-based setting where the dynamics are imperfect.

Extension to model-based rollouts

We prove safety for the following theoretical setup. Suppose we have a dynamics model that outputs sets of states $\widehat{T}(s,a) \subseteq \mathcal{S}$ to account for uncertainty.

::: {.definition} Definition 2. We say that a set-valued dynamics model $\widehat{T}: \mathcal{S}\times \mathcal{A}\to \mathcal{P}(\mathcal{S})$[^2] is calibrated if $T(s,a) \in \widehat{T}(s,a)$ for all $(s,a) \in \mathcal{S}\times \mathcal{A}$. :::

[[def:calibration]]{#def:calibration label="def:calibration"}

We define the Bellmin operator: $$\underline{\mathcal{B}}^*Q(s,a) = \tilde{r}(s,a) + \gamma \min_{s' \in \widehat{T}(s,a)} \max_{a'} Q(s',a')$$
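To make the operator concrete, here is a minimal tabular sketch of one Bellmin backup when the set-valued model is represented by a finite collection of candidate next states (e.g., predictions from an ensemble); the array layout is a hypothetical choice for illustration.

```python
import numpy as np

def bellmin_backup(Q, ensemble_next_states, r_tilde, gamma):
    """One Bellmin backup on a tabular Q with a finite set-valued model.

    Q: (S, A) array of Q-values.
    ensemble_next_states: (N, S, A) integer array; entry [i, s, a] is model i's
        candidate next-state index for (s, a). Calibration means the true next
        state is always among the N candidates.
    r_tilde: (S, A) array of penalized rewards.
    """
    state_values = Q.max(axis=-1)                           # max_{a'} Q(s', a') for every s'
    candidate_values = state_values[ensemble_next_states]   # shape (N, S, A)
    return r_tilde + gamma * candidate_values.min(axis=0)   # pessimistic min over candidates
```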

::: {.lemma} Lemma 2. The Bellmin operator $\underline{\mathcal{B}}^*$ is a $\gamma$-contraction in the $\infty$-norm. :::

[[lemma:bellmin]]{#lemma:bellmin label="lemma:bellmin"} The proof is deferred to Appendix 7.1{reference-type="ref" reference="app:proofs"}. As a consequence of Lemma [lemma:bellmin]{reference-type="ref" reference="lemma:bellmin"} and Banach's fixed-point theorem, $\underline{\mathcal{B}}^*$ has a unique fixed point $\underline{Q}^*$ which can be obtained by iteration. This fixed point is a lower bound on the true Q function if the model is calibrated:

::: {.lemma} Lemma 3. If $\widehat{T}$ is calibrated in the sense of Definition [def:calibration]{reference-type="ref" reference="def:calibration"}, then $\underline{Q}^*(s,a) \leq \widetilde{Q}^*(s,a)$ for all $(s,a)$. :::

[[lemma:bellmin-lb]]{#lemma:bellmin-lb label="lemma:bellmin-lb"}

::: {.proof} Proof. Let $\tilde{\mathcal{B}}^*$ denote the Bellman operator with reward function $\tilde{r}$. First, observe that for any $Q, Q': \mathcal{S}\times \mathcal{A}\to \mathbb{R}$, $Q \leq Q'$ pointwise implies $\underline{\mathcal{B}}^*Q \leq \tilde{\mathcal{B}}^*Q'$ pointwise, because we have $\tilde{r}(s,a) + \gamma \max_{a'} Q(s',a') \leq \tilde{r}(s,a) + \gamma \max_{a'} Q'(s',a')$ pointwise and the min defining $\underline{\mathcal{B}}^*$ includes the true $s' = T(s,a)$.

Now let $Q_0$ be any initial $Q$ function. Define $\widetilde{Q}_k = (\tilde{\mathcal{B}}^*)^k Q_0$ and $\underline{Q}_k = (\underline{\mathcal{B}}^*)^k Q_0$. An inductive argument coupled with the previous observation shows that $\underline{Q}_k \leq \widetilde{Q}_k$ pointwise for all $k \in \mathbb{N}$. Hence, taking the limits $\widetilde{Q}^* = \lim_{k \to \infty} \widetilde{Q}_k$ and $\underline{Q}^* = \lim_{k \to \infty} \underline{Q}_k$, we obtain $\underline{Q}^* \leq \widetilde{Q}^*$ pointwise. ◻ :::

Now we are ready to present our main theoretical result.

::: {.theorem} Theorem 1. Let $\widehat{T}$ be a calibrated dynamics model and $\pi^*(s) = \arg\max_a \underline{Q}^*(s,a)$ the greedy policy with respect to $\underline{Q}^*$. Assume that Assumption 1{reference-type="ref" reference="ass:fastfail"} holds. Then for any $s \in \mathcal{S}$, if there exists an action $a$ such that $\underline{Q}^*(s,a) \ge \frac{r_{\min}}{1-\gamma}$, then $\pi^*(s)$ is a safe action. :::

[[thm:main]]{#thm:main label="thm:main"}

::: {.proof} Proof. Lemma [lemma:bellmin-lb]{reference-type="ref" reference="lemma:bellmin-lb"} implies that $\underline{Q}^*(s,a) \leq \widetilde{Q}^*(s,a)$ for all $(s,a) \in \mathcal{S}\times \mathcal{A}$.

As shown in the proof of Lemma [lemma:theory-C]{reference-type="ref" reference="lemma:theory-C"}, any unsafe action $a'$ satisfies $$\underline{Q}^*(s,a') \leq \widetilde{Q}^*(s,a') \leq \frac{r_{\max}(1-\gamma^{H^*}) - C\gamma^{H^*}}{1-\gamma}$$ Similarly, if $\underline{Q}^*(s,a) \geq \frac{r_{\min}}{1-\gamma}$, we also have $$\frac{r_{\min}}{1-\gamma} \leq \underline{Q}^*(s,a) \leq \widetilde{Q}^*(s,a)$$ so $a$ is a safe action. Taking $C$ as in inequality [eqn:theory-C]{reference-type="eqref" reference="eqn:theory-C"} guarantees that $\underline{Q}^*(s,a) > \underline{Q}^*(s,a')$, so the greedy policy $\pi^*$ will choose $a$ over $a'$. ◻ :::

This theorem gives us a way to establish safety using only short-horizon predictions. The conclusion conditionally holds for any state $s$, but for $s$ far from the observed states, we expect that $\widehat{T}(s,a)$ likely has to contain many states in order to satisfy the assumption that it contains the true next state, so that $\underline{Q}^*(s,a)$ will be very small and we may not have any action such that $\underline{Q}^*(s,a) \ge \frac{r_{\min}}{1-\gamma}$. However, it is plausible to believe that there can be such an $a$ for the set of states in the replay buffer, $\{s : (s,a,r,s') \in \mathcal{D}\}$.

Practical algorithm

[[alg:practical]]{#alg:practical label="alg:practical"}

Input: rollout horizon $H$. Initialize empty buffers $\mathcal{D}$ and $\widehat{\mathcal{D}}$, an ensemble of probabilistic dynamics models $\{\widehat{T}_{\theta_i}\}_{i=1}^N$, policy $\pi_{\phi}$, and critic $Q_{\psi}$.

Collect initial data using a random policy and add it to $\mathcal{D}$. Then repeat:

1. Collect an episode using $\pi_{\phi}$; add the samples to $\mathcal{D}$. Let $\ell$ be the length of the episode.
2. Re-fit the models $\{\widehat{T}_{\theta_i}\}_{i=1}^N$ by several epochs of SGD on $L_{\widehat{T}}(\theta_i)$ defined in [eqn:model-obj]{reference-type="eqref" reference="eqn:model-obj"}.
3. Compute the empirical $r_{\min}$ and $r_{\max}$, and update $C$ according to [eqn:theory-C]{reference-type="eqref" reference="eqn:theory-C"}.
4. Sample $s \sim \mathcal{D}$. Starting from $s$, roll out $H$ steps using $\pi_\phi$ and $\{\widehat{T}_{\theta_i}\}$; add the samples to $\widehat{\mathcal{D}}$.
5. Draw samples from $\mathcal{D}\cup \widehat{\mathcal{D}}$. Update $Q_{\psi}$ by SGD on $L_Q(\psi)$ defined in [eqn:q-obj]{reference-type="eqref" reference="eqn:q-obj"} and the target parameters $\bar\psi$ according to [eqn:target-update]{reference-type="eqref" reference="eqn:target-update"}. Update $\pi_{\phi}$ by SGD on $L_\pi(\phi)$ defined in [eqn:policy-obj]{reference-type="eqref" reference="eqn:policy-obj"}.

Based (mostly) on the framework described in the previous section, we develop a deep model-based RL algorithm. We build on practices established in previous deep model-based algorithms, particularly MBPO [@janner2019trust], a state-of-the-art model-based algorithm (which does not emphasize safety).

The algorithm, dubbed Safe Model-Based Policy Optimization (SMBPO), is described in Algorithm [alg:practical]{reference-type="ref" reference="alg:practical"}. It follows a common pattern used by online model-based algorithms: alternate between collecting data, re-fitting the dynamics models, and improving the policy.

Following prior work [@chua2018deep; @janner2019trust], we employ an ensemble of (diagonal) Gaussian dynamics models $\{\widehat{T}_{\theta_i}\}_{i=1}^N$, where $\widehat{T}_{\theta_i}(s, a) = \mathcal{N}(\mu_{\theta_i}(s,a), \mathop{\mathrm{diag}}(\sigma_{\theta_i}^2(s,a)))$, in an attempt to capture both aleatoric and epistemic uncertainties. Each model is trained via maximum likelihood on all the data observed so far: $$L_{\widehat{T}}(\theta_i) = -\mathbb{E}_{(s,a,r,s') \sim \mathcal{D}} \log \widehat{T}_{\theta_i}(s', r \mid s, a) \label{eqn:model-obj}$$ However, random differences in initialization and mini-batch order while training lead to different models. The model ensemble can be used to generate uncertainty-aware predictions. For example, a set-valued prediction can be computed using the means: $\widehat{T}(s,a) = \{\mu_{\theta_i}(s,a)\}_{i=1}^N$.
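For illustration, the sketch below shows what one such ensemble member and its negative log-likelihood loss might look like in PyTorch; the architecture, hidden size, and clamping range are illustrative assumptions, not the exact settings used in our experiments.

```python
import torch
import torch.nn as nn

class GaussianDynamics(nn.Module):
    """One ensemble member: a diagonal Gaussian over (next state, reward)."""

    def __init__(self, state_dim, action_dim, hidden=200):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.mean = nn.Linear(hidden, state_dim + 1)      # mean of (s', r)
        self.log_std = nn.Linear(hidden, state_dim + 1)   # log-std of (s', r)

    def forward(self, state, action):
        h = self.body(torch.cat([state, action], dim=-1))
        return self.mean(h), self.log_std(h).clamp(-10.0, 2.0)

def model_nll_loss(model, state, action, next_state, reward):
    """Negative log-likelihood of (s', r) under the model's diagonal Gaussian."""
    target = torch.cat([next_state, reward.unsqueeze(-1)], dim=-1)
    mean, log_std = model(state, action)
    dist = torch.distributions.Normal(mean, log_std.exp())
    return -dist.log_prob(target).sum(dim=-1).mean()
```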

The models are used to generate additional samples for fitting the $Q$ function and updating the policy. In MBPO, this takes the form of short model-based rollouts, starting from states in $\mathcal{D}$, to reduce the risk of compounding error. At each step in the rollout, a model $\widehat{T}_i$ is randomly chosen from the ensemble and used to predict the next state. The rollout horizon $H$ is chosen as a hyperparameter, and ideally exceeds the (unknown) $H^*$ from Assumption 1{reference-type="ref" reference="ass:fastfail"}. In principle, one can simply increase $H$ to ensure it is large enough, but this increases the opportunity for compounding error.
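A minimal sketch of such a rollout is given below; the model, policy, buffer, and `is_unsafe` interfaces are assumptions for illustration (e.g., each model is assumed to return a sampled next state and reward, as a Gaussian head like the one sketched above would).

```python
import random
import torch

@torch.no_grad()
def model_rollout(models, policy, start_states, horizon, model_buffer, is_unsafe):
    """Roll out `horizon` imagined steps from real states (sketch).

    Assumed interfaces: `models[i](s, a)` returns sampled (next_state, reward),
    `policy(s)` returns actions, `is_unsafe(s)` returns a boolean mask, and
    `model_buffer.add` stores transitions.
    """
    states = start_states
    for _ in range(horizon):
        actions = policy(states)
        model = random.choice(models)            # pick one ensemble member per step
        next_states, rewards = model(states, actions)
        dones = is_unsafe(next_states)           # imagined safety violations terminate
        model_buffer.add(states, actions, rewards, next_states, dones)
        if dones.all():
            break
        states = next_states[~dones]             # continue only from safe imagined states
```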

MBPO is based on the soft actor-critic (SAC) algorithm, a widely used off-policy maximum-entropy actor-critic algorithm [@haarnoja2018soft]. The $Q$ function is updated by taking one or more SGD steps on the objective $$\begin{aligned} L_Q(\psi) &= \mathbb{E}_{(s,a,r,s') \sim \mathcal{D}\cup \widehat{\mathcal{D}}}\big[\big(Q_\psi(s,a) - (r + \gamma V_{\bar\psi}(s'))\big)^2\big] \label{eqn:q-obj} \\ \text{where}\quad V_{\bar\psi}(s') &= \begin{cases} -C/(1-\gamma) & s' \in \mathcal{S}_{\textup{unsafe}} \\ \mathbb{E}_{a' \sim \pi(s')}[Q_{\bar\psi}(s', a') - \alpha \log \pi_\phi(a' \mid s')] & s' \not\in \mathcal{S}_{\textup{unsafe}} \end{cases} \label{eqn:v-def}\end{aligned}$$ The scalar $\alpha$ is a hyperparameter of SAC which controls the tradeoff between entropy and reward. We tune $\alpha$ using the procedure suggested by [@haarnoja2018soft2].
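The sketch below illustrates how this TD target might be computed, with unsafe next states bootstrapped to $-C/(1-\gamma)$; the policy and critic interfaces are assumptions for illustration.

```python
import torch

@torch.no_grad()
def td_target(reward, next_state, unsafe_mask, policy, target_q, C, gamma, alpha):
    """TD target r + gamma * V(s'), with V(s') = -C/(1-gamma) for unsafe s' (sketch).

    Assumed interfaces: `policy(s)` returns (action, log_prob) and
    `target_q(s, a)` returns the target critic value (in practice the
    clipped double-Q minimum).
    """
    next_action, log_prob = policy(next_state)
    soft_value = target_q(next_state, next_action) - alpha * log_prob
    terminal_value = torch.full_like(soft_value, -C / (1.0 - gamma))
    next_value = torch.where(unsafe_mask, terminal_value, soft_value)
    return reward + gamma * next_value

def q_loss(q_net, state, action, target):
    """Mean-squared Bellman error against the (fixed) TD target."""
    return ((q_net(state, action) - target) ** 2).mean()
```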

The $\bar\psi$ are parameters of a "target" Q function which is updated via an exponential moving average towards $\psi$: $$\bar\psi \leftarrow \tau\psi + (1-\tau)\bar\psi \label{eqn:target-update}$$ for a hyperparameter $\tau \in (0,1)$ which is often chosen small, e.g., 0.005. This is a common practice used to promote stability in deep RL, originating from @lhphetsw15. We also employ the clipped double-Q method [@fujimoto2018addressing] in which two copies of the parameters ($\psi_1$ and $\psi_2$) and target parameters ($\bar\psi_1$ and $\bar\psi_2$) are maintained, and the target value in equation [eqn:v-def]{reference-type="eqref" reference="eqn:v-def"} is computed using $\min_{i=1,2} Q_{\bar\psi_i}(s',a')$.
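A minimal sketch of the target update and the clipped double-Q minimum, assuming standard PyTorch modules for the critics:

```python
import torch

@torch.no_grad()
def ema_update(target_net, online_net, tau=0.005):
    """In-place target update: psi_bar <- tau * psi + (1 - tau) * psi_bar."""
    for p_bar, p in zip(target_net.parameters(), online_net.parameters()):
        p_bar.mul_(1.0 - tau).add_(tau * p)

def clipped_double_q(target_q1, target_q2, state, action):
    """Pessimistic value used in the TD target: elementwise min of the two target critics."""
    return torch.min(target_q1(state, action), target_q2(state, action))
```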

Note that in [eqn:q-obj]{reference-type="eqref" reference="eqn:q-obj"}, we are fitting to the average TD target across models, rather than the min, even though we proved Theorem [thm:main]{reference-type="ref" reference="thm:main"} using the Bellmin operator. We found that taking the average worked better empirically, likely because the min was overly conservative and harmed exploration.

The policy is updated by taking one or more steps to minimize $$L_\pi(\phi) = \mathbb{E}_{s \sim \mathcal{D}\cup \widehat{\mathcal{D}},\, a \sim \pi_\phi(s)}[\alpha \log \pi_{\phi}(a \mid s) - Q_{\psi}(s,a)]. \label{eqn:policy-obj}$$
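A corresponding sketch of the policy update, assuming the policy returns reparameterized actions together with their log-probabilities (as in standard SAC implementations):

```python
def policy_loss(policy, q_net, states, alpha):
    """Policy objective: E[alpha * log pi(a|s) - Q(s, a)] over buffer states (sketch)."""
    actions, log_probs = policy(states)   # assumed to use the reparameterization trick
    return (alpha * log_probs - q_net(states, actions)).mean()
```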

Experiments

In the experimental evaluation, we compare our algorithm to several model-free safe RL algorithms, as well as MBPO, on various continuous control tasks based on the MuJoCo simulator [@todorov2012mujoco]. Additional experimental details, including hyperparameter selection, are given in Appendix 7.3{reference-type="ref" reference="app:impl-details"}.

Tasks

Visualizations of the four tasks: Hopper, Cheetah-no-flip, Ant, and Humanoid.{#fig:tasks width="\textwidth"}

The four tasks are Hopper, Cheetah-no-flip, Ant, and Humanoid.

For all of the tasks, the reward corresponds to positive movement along the $x$-axis (minus some small cost on action magnitude), and safety violations cause the current episode to terminate. See Figure [fig:tasks]{reference-type="ref" reference="fig:tasks"} for visualizations of the termination conditions.

Algorithms

We compare against MBPO and several model-free safe RL algorithms, including Recovery RL, LR, SQRL, and RCPO.

All of the above algorithms except for MBPO are as implemented in the Recovery RL paper [@thananjeyan2020recovery] and its publicly available codebase. We follow the hyperparameter tuning procedure described in their paper; see Appendix 7.3{reference-type="ref" reference="app:impl-details"} for more details. A recent work [@bharadhwaj2020conservative] could also serve as a baseline, but its code has not been released.

Our algorithm requires very little hyperparameter tuning. We use $\gamma = 0.99$ in all experiments. We tried both $H = 5$ and $H = 10$ and found that $H = 10$ works slightly better, so we use $H = 10$ in all experiments.

Undiscounted return of policy vs. total safety violations. We run 5 independent seeds for each algorithm. The curves indicate the mean across seeds, and the shaded areas indicate one standard deviation centered at the mean. {#fig:return_vs_violations width="\textwidth"}

Results

The main criterion in which we are interested is performance (return) vs. the cumulative number of safety violations. The results are plotted in Figure 2{reference-type="ref" reference="fig:return_vs_violations"}. We see that our algorithm performs favorably compared to model-free alternatives in terms of this tradeoff, achieving similar or better performance with a fraction of the violations.

MBPO is competitive in terms of sample efficiency but incurs more safety violations because it isn't designed explicitly to avoid them.

Performance and cumulative safety violations with varying $C$.{#fig:vary-C width="\textwidth"}

We also show in Figure [fig:vary-C]{reference-type="ref" reference="fig:vary-C"} that hard-coding the value of $C$ leads to an intuitive tradeoff between performance and safety violations. With a larger $C$, SMBPO incurs substantially fewer safety violations, although the return improves more slowly.

Related Work

Safe Reinforcement Learning

Many prior works correct the action locally, i.e., they change the action when it is detected to lead to an unsafe state. @dalal2018safe linearizes the dynamics and adds a correction layer on top of the policy. @bharadhwaj2020conservative uses rejection sampling to ensure that actions meet the safety requirement. @thananjeyan2020recovery either trains a backup policy which is used only to guarantee safety, or uses model-predictive control (MPC) to find the best action sequence. MPC could also be applied in the short-horizon setting that we consider here, but it incurs a high runtime cost that may not be acceptable for real-time robotics control. Moreover, MPC only optimizes the reward within the short horizon and can lead to suboptimal performance on tasks that require longer-term considerations [@tamar2017learning].

Other works aim to solve the constrained MDP formulation more efficiently, with Lagrangian methods being widely applied. The Lagrangian multipliers can be a fixed hyperparameter, or adjusted by the algorithm [@tessler2018reward; @stooke2020responsive]. Policy training itself can also cause problems: the policy may change so quickly that it is no longer safe, which is addressed by constraining updates to a trust region of policies [@achiam2017constrained; @zanger2021safe] and further projecting to a safer policy [@yang2020accelerating]; overly optimistic policies are addressed by @bharadhwaj2020conservative via conservative policy updates. Expert information can greatly improve training-time safety: @srinivasan2020learning and @thananjeyan2020recovery are provided offline data, while @turchetta2020safe is provided interventions which are invoked at dangerous states, achieving zero safety violations during training.

Returnability is also considered in practice by @eysenbach2018leave, who train a policy to return to the initial state, and in theory by @roderick2021provably, who design a PAC algorithm to train a policy without safety violations. @bansal2017hamilton gives a brief overview of Hamilton-Jacobi reachability and its recent progress.

Model-based Reinforcement Learning

Model-based reinforcement learning, which additionally learns the dynamics model, has gained popularity due to its superior sample efficiency. @kurutach2018model uses an ensemble of models to produce imaginary samples to regularize learning and reduce instability. The use of model ensembles is further explored by @chua2018deep, which studies different methods of sampling trajectories from the model ensemble. Building on @chua2018deep, @wang2019exploring combines policy networks with online learning. @luo2019algorithmic derives a lower bound on a policy's performance in the real environment given its performance in the learned dynamics model, and then optimizes the lower bound stochastically. Our work is based on @janner2019trust, which shows that the learned dynamics model does not generalize well over long horizons and proposes to use short model-generated rollouts instead of full episodes. @dong2020expressivity studies the expressivity of the $Q$ function and the model, and shows that in some environments the model is much easier to learn than the $Q$ function.

Conclusion

We consider the problem of safe exploration in reinforcement learning, where the goal is to discover a policy that maximizes the expected return while incurring minimal safety violations during training. In this work, we assume access to a user-specified function which can be queried to determine whether or not a given state is safe. We have proposed a model-based algorithm that can exploit this information to anticipate safety violations before they happen and thereby avoid them. Our theoretical analysis shows that safety violations can be avoided with a sufficiently large penalty and a sufficiently accurate dynamics model. Empirically, our algorithm compares favorably to state-of-the-art model-free safe exploration methods in terms of the tradeoff between performance and total safety violations, as well as in terms of sample complexity.

Acknowledgements {#acknowledgements .unnumbered}

TM acknowledges support of Google Faculty Award, NSF IIS 2045685, the Sloan Fellowship, and JD.com. YL is supported by NSF, ONR, Simons Foundation, Schmidt Foundation, DARPA and SRC.

Appendix

Proofs {#app:proofs}

::: {.proof} Proof of Lemma [lemma:bellmin]{reference-type="ref" reference="lemma:bellmin"} ($\underline{\mathcal{B}}^*$ is a $\gamma$-contraction in $\infty$-norm). First observe that for any functions $f$ and $g$, $$|\max_x f(x) - \max_x g(x)| \le \max_x |f(x)-g(x)| \label{eqn:max-ineq}$$ To see this, suppose $\max_x f(x) > \max_x g(x)$ (the other case is symmetric) and let $\tilde{x} = \arg\max_x f(x)$. Then $$|\max_x f(x) - \max_x g(x)| = f(\tilde{x}) - \max_x g(x) \le f(\tilde{x}) - g(\tilde{x}) \le \max_x |f(x)-g(x)|$$ We also note that [eqn:max-ineq]{reference-type="eqref" reference="eqn:max-ineq"} implies $$|\min_x f(x) - \min_x g(x)| \le \max_x |f(x)-g(x)|$$ since $\min_x f(x) = -\max_x (-f(x))$. Thus for any $Q, Q' : \mathcal{S}\times \mathcal{A}\to \mathbb{R}$, $$\begin{aligned} \|\underline{\mathcal{B}}^*Q - \underline{\mathcal{B}}^*Q'\|_\infty &= \sup_{s,a} |\underline{\mathcal{B}}^*Q(s,a) - \underline{\mathcal{B}}^*Q'(s,a)| \\ &= \gamma \sup_{s,a} \left|\min_{s' \in \widehat{T}(s,a)} \max_{a'} Q(s',a') - \min_{s' \in \widehat{T}(s,a)} \max_{a'} Q'(s',a')\right| \\ &\le \gamma \sup_{s,a} \max_{s' \in \widehat{T}(s,a)} \left|\max_{a'} Q(s',a') - \max_{a'} Q'(s',a')\right| \\ &\le \gamma \sup_{s',a'} |Q(s',a') - Q'(s',a')| \\ &= \gamma \|Q - Q'\|_\infty\end{aligned}$$ Hence $\underline{\mathcal{B}}^*$ is indeed a $\gamma$-contraction. ◻ :::

Extension to stochastic dynamics {#app:stochastic}

Here we outline a possible extension to stochastic dynamics, although we leave experiments with stochastic systems for future work.

First, let us modify the definitions to accommodate stochastic dynamics:

Our rapid failure assumption must also be extended: There exists a horizon $H$ and threshold $q$ such that if $(s,a)$ is $p$-irrecoverable, then for any sequence of actions $\{a_t\}_{t=0}^\infty$ with $a_0 = a$, the probability of encountering an unsafe state within $H$ steps is at least $q$. (Note that necessarily $q \le p$.)

Analysis

Let $s$ be a $p$-safe state, and let $a$ and $a'$ be actions where $a$ is $p$-safe but $a'$ is $p$-irrecoverable[^4]. We want to have $\widetilde{Q}^*(s,a) > \widetilde{Q}^*(s,a')$ so that the greedy policy w.r.t. $\widetilde{Q}^*$, which is an optimal policy for $\widetilde{M}_C$, will only take $p$-safe actions. Our strategy is to bound $\widetilde{Q}^*(s,a')$ from above and $\widetilde{Q}^*(s,a)$ from below, then choose $C$ to make the desired inequality hold.

We consider $a'$ first, breaking it down into two cases:

From the reasoning above, we obtain $$\begin{aligned} \widetilde{Q}^*(s,a') &\le \mathbb{P}(\text{unsafe within $H$ steps}) \cdot (\text{max return} \mid \text{unsafe within $H$ steps}) \, + \\ &\quad\ \mathbb{P}(\text{safe for $H$ steps}) \cdot (\text{max return} \mid \text{safe for $H$ steps}) \\ &\le \max\{qR_C, R_C\} + (1-q)\frac{r_{\max}}{1-\gamma}\end{aligned}$$ Now consider $a$. Since $(s,a)$ is $p$-safe, $$\begin{aligned} \widetilde{Q}^*(s,a) &\ge \mathbb{P}(\text{unsafe}) \cdot (\text{min reward} \mid \text{unsafe}) + \mathbb{P}(\text{safe}) \cdot (\text{min reward} \mid \text{safe}) \\ &\ge p \left(\frac{-C}{1-\gamma}\right) + (1-p) \frac{r_{\min}}{1-\gamma} \\ &= \frac{-p C + (1-p)r_{\min}}{1-\gamma}\end{aligned}$$ Note that the second step assumes $C \ge 0$. (We will enforce this constraint when choosing $C$.)

To ensure $\widetilde{Q}^*(s,a) > \widetilde{Q}^*(s,a')$, it suffices to choose $C$ so that the following inequalities hold simultaneously: $$\begin{aligned} \frac{-pC + (1-p)r_{\min}}{1-\gamma} &> qR_C + (1-q)\frac{r_{\max}}{1-\gamma} \label{eq:ineq1} \\ \frac{-pC + (1-p)r_{\min}}{1-\gamma} &> R_C + (1-q)\frac{r_{\max}}{1-\gamma} \label{eq:ineq2}\end{aligned}$$ Multiplying both sides of [eq:ineq1]{reference-type="eqref" reference="eq:ineq1"} by $1-\gamma$ gives the equivalent $$-pC + (1-p)r_{\min} > qr_{\max}(1-\gamma^H) - qC\gamma^H + (1-q)r_{\max}$$ Rearranging, we need $$C > \frac{r_{\max}(1-q\gamma^H) - (1-p)r_{\min}}{q\gamma^H-p} =: \alpha_1$$ Similarly, multiplying both sides of [eq:ineq2]{reference-type="eqref" reference="eq:ineq2"} by $1-\gamma$ gives the equivalent $$-pC + (1-p)r_{\min} > r_{\max}(1-\gamma^H) - C\gamma^H + (1-q)r_{\max}$$ Rearranging, we need $$C > \frac{r_{\max}(2-q-\gamma^H) - (1-p)r_{\min}}{\gamma^H-p} =: \alpha_2$$ All things considered, the inequality $\widetilde{Q}^*(s,a) > \widetilde{Q}^*(s,a')$ holds if we set $$C > \max\{\alpha_1, \alpha_2, 0\}$$

Implementation details and hyperparameters {#app:impl-details}

In this appendix we provide additional details regarding the algorithmic implementation, including hyperparameter selection.

Here are some additional details regarding the (S)MBPO implementation:

The model-free algorithms have their own hyperparameters, but all share $\gamma_{\text{safe}}$ and $\epsilon_{\text{safe}}$. Following @thananjeyan2020recovery, we tune $\gamma_{\text{safe}}$ and $\epsilon_{\text{safe}}$ for recovery RL first, then hold those fixed for all algorithms and tune any remaining algorithm-specific hyperparameters. All these hyperparameters are given in the tables below:

| Name | Which algorithm(s)? | Choices | hopper | cheetah | ant | humanoid |
|---|---|---|---|---|---|---|
| $\gamma_{\text{safe}}$ | all | 0.5, 0.6, 0.7 | 0.6 | 0.5 | 0.6 | 0.6 |
| $\epsilon_{\text{safe}}$ | all | 0.2, 0.3, 0.4 | 0.3 | 0.2 | 0.2 | 0.4 |
| $\nu$ | LR | 1, 10, 100, 1000 | 1000 | 1000 | 1 | 1 |
| $\nu$ | SQRL | 1, 10, 100, 1000 | 1 | 1000 | 10 | 1 |
| $\lambda$ | RCPO | 1, 10, 100, 1000 | 10 | 10 | 1 | 10 |

We run our experiments using a combination of NVIDIA GeForce GTX 1080 Ti, TITAN Xp, and TITAN RTX GPUs from our internal cluster. A single run of (S)MBPO takes as long as 72 hours on a single GPU.

[^1]: Determinism makes safety essentially trivial in tabular MDPs. We focus on tasks with continuous state and/or action spaces. See Appendix 7.2{reference-type="ref" reference="app:stochastic"} for a possible extension of our approach to stochastic dynamics.

[^2]: $\mathcal{P}(X)$ is the powerset of a set $X$.

[^4]: Note that, as a consequence of the definitions, any action which is $p'$-safe with $p' < p$ is also $p$-safe, and similarly any action which is $p'$-irrecoverable with $p' > p$ is also $p$-irrecoverable.