ICML2022

November 8, 2025 · View on GitHub

会议论文列表

本会议共有 1234 篇论文

序号标题链接推荐理由推荐度摘要作者组织
1PAC-Bayesian Bounds on Rate-Efficient Classifiers0Alhabib Abbas, Yiannis Andreopoulos
2Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning0Lisha Chen, Momin Abbas, PinYu Chen, Quan Xiao, Tianyi Chen
3An Initial Alignment between Neural Network and Target is Needed for Gradient Descent to Learn0Christopher Marquis, Elisabetta Cornacchia, Emmanuel Abbe, Jan Hazla
4Active Sampling for Min-Max Fairness0Chris Russell, Jacob D. Abernethy, Jamie Morgenstern, Jie Zhang, Matthäus Kleindessner, Pranjal Awasthi
5Meaningfully debugging model mistakes using conceptual counterfactual explanations0Abubakar Abid, James Zou, Mert Yüksekgönül
6Batched Dueling Bandits0Arpit Agarwal, Rohan Ghuge, Viswanath Nagarajan
7Hierarchical Shrinkage: Improving the accuracy and interpretability of tree-based models0Abhineet Agarwal, Bin Yu, Chandan Singh, Omer Ronen, Yan Shuo Tan
8Deep equilibrium networks are sensitive to initialization statistics0Atish Agarwala, Samuel S. Schoenholz
9Learning of Cluster-based Feature Importance for Electronic Health Record Time-series0Henrique Aguiar, Mauro D. Santos, Peter J. Watkinson, Tingting Zhu
10On the Convergence of the Shapley Value in Parametric Bayesian Learning Games0Bryan Kian Hsiang Low, Lucas Agussurja, Xinyi Xu
11Individual Preference Stability for Clustering0Ali Vakilian, Jamie Morgenstern, Matthäus Kleindessner, Pattara Sukprasert, Pranjal Awasthi, Saba Ahmadi, Samir Khuller
12Understanding the unstable convergence of gradient descent0Jingzhao Zhang, Kwangjun Ahn, Suvrit Sra
13Minimum Cost Intervention Design for Causal Effect Identification0Jalal Etesami, Negar Kiyavash, Sina Akbari
14How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models0Ahmed M. Alaa, Boris van Breugel, Evgeny S. Saveliev, Mihaela van der Schaar
15A Natural Actor-Critic Framework for Zero-Sum Markov Games0Ahmet Alacaoglu, Luca Viano, Niao He, Volkan Cevher
16Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations0Edward Raff, James Holt, Mohammad Mahmudul Alam, Tim Oates
17Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer0Ana L. C. Bazzan, Bruno C. da Silva, Lucas Nunes Alegre
18Structured Stochastic Gradient MCMC0Alex J. Boyd, Antonios Alexos, Stephan Mandt
19XAI for Transformers: Better Explanations through Conservative Propagation0Ameen Ali, Grégoire Montavon, KlausRobert Müller, Lior Wolf, Oliver Eberle, Thomas Schnake
20RUMs from Head-to-Head Contests0Alessandro Panconesi, Andrew Tomkins, Flavio Chierichetti, Matteo Almanza, Ravi Kumar
21Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval0Dan Roth, Frank F. Xu, Graham Neubig, Junxian He, Sudipta Sengupta, Uri Alon
22Minimax Classification under Concept Drift with Multidimensional Adaptation and Performance Guarantees0José Antonio Lozano, Santiago Mazuelas, Verónica Álvarez
23Scalable First-Order Bayesian Optimization via Structured Automatic Differentiation0Carla P. Gomes, Sebastian E. Ament
24Public Data-Assisted Mirror Descent for Private Model Training0Abhradeep Thakurta, Arun Ganesh, Ehsan Amid, Om Thakkar, Rajiv Mathews, Shuang Song, Swaroop Ramaswamy, Thomas Steinke, Vinith M. Suriyakumar
25On Last-Iterate Convergence Beyond Zero-Sum Games0Gabriele Farina, Ioannis Anagnostides, Ioannis Panageas, Tuomas Sandholm
26Online Algorithms with Multiple Predictions0Amit Kumar, Debmalya Panigrahi, Keerti Anand, Rong Ge
27Learning to Hash Robustly, Guaranteed0Alexandr Andoni, Daniel Beaglehole
28Set Based Stochastic Subsampling0Bruno Andreis, Eunho Yang, Juho Lee, Seanie Lee, Sung Ju Hwang, Tuan A. Nguyen
29Towards Understanding Sharpness-Aware Minimization0Maksym Andriushchenko, Nicolas Flammarion
30Fair and Fast k-Center Clustering for Data Summarization0Adam Kurpisz, Haris Angelidakis, Leon Sering, Rico Zenklusen
31Interactive Correlation Clustering with Existential Cluster Constraints0Andrew McCallum, Nicholas Monath, Nishant Yadav, Rico Angell
32Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging0Amit Pal Singh Kohli, Anastasios N. Angelopoulos, Jitendra Malik, Michael I. Jordan, Srigokul Upadhyayula, Stephen Bates, Thayer Alshaabi, Yaniv Romano
33AdaGrad Avoids Saddle Points0Georgios Piliouras, Kimon Antonakopoulos, Panayotis Mertikopoulos, Xiao Wang
34UnderGrad: A Universal Black-Box Optimization Method with Almost Dimension-Free Convergence Rate Guarantees0Dong Quan Vu, Kfir Y. Levy, Kimon Antonakopoulos, Panayotis Mertikopoulos, Volkan Cevher
35Adapting the Linearised Laplace Model Evidence for Modern Deep Learning0David Janz, Eric T. Nalisnick, Erik A. Daxberger, James Urquhart Allingham, Javier Antorán, José Miguel HernándezLobato, Riccardo Barbano
36EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning0Chengqi Zhang, Guodong Long, Jing Jiang, Shuang Ao, Tianyi Zhou, Xuan Song
37Online Balanced Experimental Design0Anup B. Rao, David Arbour, Drew Dimmery, Tung Mai
38VariGrow: Variational Architecture Growing for Task-Agnostic Continual Learning based on Bayesian Novelty0Bobak J. Mortazavi, Randy Ardywibowo, Shuai Huang, Xiaoning Qian, Zepeng Huo, Zhangyang Wang
39Thresholded Lasso Bandit0Alexandre Proutière, Kaito Ariu, Kenshi Abe
40Gradient Based Clustering0Aleksandar Armacki, Dragana Bajovic, Dusan Jakovetic, Soummya Kar
41Understanding Gradient Descent on the Edge of Stability in Deep Learning0Abhishek Panigrahi, Sanjeev Arora, Zhiyuan Li
42Private optimization in the interpolation regime: faster rates and hardness results0Gary Cheng, Hilal Asi, John C. Duchi, Karan N. Chadha
43Optimal Algorithms for Mean Estimation under Local Differential Privacy0Hilal Asi, Kunal Talwar, Vitaly Feldman
44Asymptotically-Optimal Gaussian Bandits with Side Observations0Alexia Atsidakou, Constantine Caramanis, Orestis Papadigenopoulos, Sanjay Shakkottai, Sujay Sanghavi
45Congested Bandits: Optimal Routing via Short-term Resets0Kostas Kollias, Kush Bhatia, Pranjal Awasthi, Sreenivas Gollapudi
46Do More Negative Samples Necessarily Hurt In Contrastive Learning?0Nishanth Dikkala, Pranjal Awasthi, Pritish Kamath
47H-Consistency Bounds for Surrogate Loss Minimizers0Anqi Mao, Mehryar Mohri, Pranjal Awasthi, Yutao Zhong
48Iterative Hard Thresholding with Adaptive Regularization: Sparser Solutions Without Sacrificing Runtime0Kyriakos Axiotis, Maxim Sviridenko
49Proving Theorems using Incremental Learning and Hindsight Experience Replay0Ankit Anand, Doina Precup, Eser Aygün, Laurent Orseau, Lei M. Zhang, Shibl Mourad, Stephen Marcus McAleer, Vlad Firoiu, Xavier Glorot
50Near-optimal rate of consistency for linear models with missing values0Alexis Ayme, Aymeric Dieuleveut, Claire Boyer, Erwan Scornet
51How Tempering Fixes Data Augmentation in Bayesian Neural Networks0Gregor Bachmann, Lorenzo Noci, Thomas Hofmann
52ASAP.SGD: Instance-based Adaptiveness to Staleness in Asynchronous SGD0Karl Bäckström, Marina Papatriantafilou, Philippas Tsigas
53From Noisy Prediction to True Label: Noisy Prediction Calibration via Generative Model0Byeonghu Na, HeeSun Bae, IlChul Moon, JoonHo Jang, Kyungwoo Song, Seungjae Shin
54data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language0Alexei Baevski, Arun Babu, Jiatao Gu, Michael Auli, Qiantong Xu, WeiNing Hsu
55End-to-End Balancing for Causal Continuous Treatment-Effect Estimation0David Heckerman, Eric Tchetgen Tchetgen, Mohammad Taha Bahadori
56A Hierarchical Transitive-Aligned Graph Kernel for Un-attributed Graphs0Edwin R. Hancock, Lixin Cui, Lu Bai
57Near-Optimal Learning of Extensive-Form Games with Imperfect Information0Chi Jin, Song Mei, Tiancheng Yu, Yu Bai
58Gaussian Mixture Variational Autoencoder with Contrastive Learning for Multi-Label Classification0Carla P. Gomes, Junwen Bai, Shufeng Kong
59A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing0He Bai, JunKun Chen, Liang Huang, Mingbo Ma, Renjie Zheng, Xintong Li
60Stability Based Generalization Bounds for Exponential Family Langevin Dynamics0Arindam Banerjee, Tiancong Chen, Xinyan Li, Yingxue Zhou
61Certified Neural Network Watermarks with Randomized Smoothing0Arpit Bansal, Curtis Wigington, John P. Dickerson, Michael J. Curry, PingYeh Chiang, Rajiv Jain, Tom Goldstein, Varun Manjunatha
62Data Scaling Laws in NMT: The Effect of Noise and Architecture0Ankush Garg, Behnam Neyshabur, Behrooz Ghorbani, Biao Zhang, Colin Cherry, Orhan Firat, Yamini Bansal
63Learning Stable Classifiers by Transferring Unstable Features0Regina Barzilay, Shiyu Chang, Yujia Bao
64Fast Composite Optimization and Statistical Recovery in Federated Learning0Michael Crawshaw, Mingrui Liu, Shan Luo, Yajie Bao
65Generative Modeling for Multi-task Visual Learning0Martial Hebert, YuXiong Wang, Zhipeng Bao
66Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models0Bo Zhang, Chongxuan Li, Fan Bao, Jiacheng Sun, Jun Zhu
67On the Surrogate Gap between Contrastive and Supervised Losses0Han Bao, Kento Nozawa, Yoshihiro Nagano
68Representation Topology Divergence: A Method for Comparing Neural Network Representations0Evgeny Burnaev, Ilya Trofimov, Nikita Balabin, Serguei Barannikov
69Sparse Mixed Linear Regression with Guarantees: Taming an Intractable Problem with Invex Relaxation0Adarsh Barik, Jean Honorio
70Neural Fisher Discriminant Analysis: Optimal Neural Network Embeddings in Polynomial Time0Burak Bartan, Mert Pilanci
71Fictitious Play and Best-Response Dynamics in Identical Interest and Zero-Sum Stochastic Games0Lucas Baudin, Rida Laraki
72Information Discrepancy in Strategic Learning0Chara Podimata, Juba Ziani, Yahav Bechavod, Zhiwei Steven Wu
73On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces0Alec Koppel, Amrit Singh Bedi, Anjaly Parayil, Brian M. Sadler, Pratap Tokekar, Souradip Chakraborty
74Imitation Learning by Estimating Expertise of Demonstrators0Andy Shih, Dorsa Sadigh, Mark Beliaev, Ramtin Pedarsani, Stefano Ermon
75Matching Normalizing Flows and Probability Paths on Manifolds0Aditya Grover, Brandon Amos, Heli BenHamu, Joey Bose, Maximilian Nickel, Ricky T. Q. Chen, Samuel Cohen, Yaron Lipman
76Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models0Aadirupa Saha, Eyke Hüllermeier, Viktor Bengs
77Neural Inverse Kinematic0Lior Wolf, Nitsan Blau, Raphael Bensadoun, Shir Gur
78Volatility Based Kernels and Moving Average Means for Accurate Forecasting with Gaussian Processes0Andrew Gordon Wilson, Gregory W. Benton, Wesley J. Maddox
79Gradient Descent on Neurons and its Link to Approximate Second-order Optimization0Frederik Benzing
80Safe Learning in Tree-Form Sequential Decision Making: Handling Hard and Soft Constraints0Alberto Marchesi, Federico Cacciamani, Francesco Trovò, Martino Bernasconi, Matteo Castiglioni, Nicola Gatti
81Skin Deep Unlearning: Artefact and Instrument Debiasing in the Context of Melanoma Classification0Amir AtapourAbarghouei, Peter J. Bevan
82Approximate Bayesian Computation with Domain Expert in the Loop0Ayush Bharti, Louis Filstroff, Samuel Kaski
83Minimax M-estimation under Adversarial Contamination0Gennady Samorodnitsky, Guanhua Fang, Ping Li, Sujay Bhatt
84Nearly Optimal Catoni's M-estimator for Infinite Variance0Gennady Samorodnitsky, Guanhua Fang, Ping Li, Sujay Bhatt
85Personalization Improves Privacy-Accuracy Tradeoffs in Federated Learning0Alberto Bietti, ChenYu Wei, John Langford, Miroslav Dudík, Zhiwei Steven Wu
86Non-Vacuous Generalisation Bounds for Shallow Neural Networks0Benjamin Guedj, Felix Biggs
87Structure-preserving GANs0Jeremiah Birrell, Luc ReyBellet, Markos A. Katsoulakis, Wei Zhu
88Scalable Spike-and-Slab0Lester Mackey, Niloy Biswas, XiaoLi Meng
89Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core Quantities0Alexander Meinke, Julian Bitterwolf, Matthias Hein, Maximilian Augustin
90A query-optimal algorithm for finding counterfactuals0Caleb Koch, Guy Blanc, Jane Lange, LiYang Tan
91Popular decision tree algorithms are provably noise tolerant0Ali Malik, Guy Blanc, Jane Lange, LiYang Tan
92Optimizing Sequential Experimental Design with Deep Reinforcement Learning0Amir Dezfouli, Edwin V. Bonilla, Iadine Chades, Tom Blau
93Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)0Bojun Huang
94Generalized Results for the Existence and Consistency of the MLE in the Bradley-Terry-Luce Model0Alessandro Rinaldo, Heejong Bong
95How to Train Your Wide Neural Network Without Backprop: An Input-Weight Alignment Perspective0Akhilan Boopathy, Ila Fiete
96Improving Language Models by Retrieving from Trillions of Tokens0Aidan Clark, Albin Cassirer, Andy Brock, Arthur Mensch, Aurelia Guy, Bogdan Damoc, Chris Jones, Diego de Las Casas, Eliza Rutherford, Erich Elsen, Geoffrey Irving, George van den Driessche, Jack W. Rae, Jacob Menick, JeanBaptiste Lespiau, Jordan Hoffmann, Karen Simonyan, Katie Millican, Laurent Sifre, Loren Maggiore, Michela Paganini, Oriol Vinyals, Roman Ring, Saffron Huang, Sebastian Borgeaud, Simon Osindero, Tom Hennigan, Trevor Cai
97Lie Point Symmetry Data Augmentation for Neural PDE Solvers0Daniel E. Worrall, Johannes Brandstetter, Max Welling
98An iterative clustering algorithm for the Contextual Stochastic Block Model with optimality guarantees0Christophe Biernacki, Guillaume Braun, Hemant Tyagi
99Tractable Dendritic RNNs for Reconstructing Nonlinear Dynamical Systems0Daniel Durstewitz, Florian Hess, Jonas M. Mikhaeil, Leonard F. Bereska, Manuel Brenner, PoChen Kuo, Zahra Monfared
100Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters0Céline Brouard, Florence d'AlchéBuc, Juho Rousu, Luc BrogatMotte, Rémi Flamary
101Efficient Learning of CNNs using Patch Based Features0Alon Brutzkus, Alon Regev Netser, Amir Globerson, Eran Malach, Shai ShalevShwartz
102Causal structure-based root cause analysis of outliers0Dominik Janzing, Kailash Budhathoki, Lenon Minorics, Patrick Blöbaum
103IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages0Desmond Elliott, Edoardo Maria Ponti, Emanuele Bugliarello, Fangyu Liu, Ivan Vulic, Jonas Pfeiffer, Siva Reddy
104Interactive Inverse Reinforcement Learning for Cooperative Games0AnneMarie George, Christos Dimitrakakis, Thomas Kleine Büning
105Convolutional and Residual Networks Provably Contain Lottery Tickets0Rebekka Burkholz
106Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path0Haoyuan Cai, Simon S. Du, Tengyu Ma
107Convergence of Invariant Graph Networks0Chen Cai, Yusu Wang
108Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency0Qi Cai, Zhaoran Wang, Zhuoran Yang
109Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times0Alessandro Lazaric, Daniele Calandriello, Lorenzo Rosasco, Luigi Carratino, Michal Valko
110Adaptive Gaussian Process Change Point Detection0Andreas Krause, Edoardo Caldarelli, Philippe Wenk, Stefan Bauer
111Measuring dissimilarity with diffeomorphism invariance0Alessandro Rudi, Benjamin Guedj, Carlo Ciliberto, Théophile Cantelobre
112A Model-Agnostic Randomized Learning Framework based on Random Hypothesis Subspace Sampling0Chao Lan, Yiting Cao
113Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications0Alexandre Capone, Armin Lederer, Sandra Hirche
114Burst-Dependent Plasticity and Dendritic Amplification Support Target-Based Learning and Hierarchical Imitation Learning0Cosimo Lupo, Cristiano Capone, Paolo Muratore, Pier Stanislao Paolucci
115A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving0Federico Cacciamani, Luca Carminati, Marco Ciccone, Nicola Gatti
116RECAPP: Crafting a More Efficient Catalyst for Convex Optimization0Aaron Sidford, Arun Jambulapati, Yair Carmon, Yujia Jin
117Estimating and Penalizing Induced Preference Shifts in Recommender Systems0Anca D. Dragan, Dylan HadfieldMenell, Micah D. Carroll, Stuart Russell
118YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone0Arnaldo Cândido Júnior, Christopher Dane Shulby, Edresson Casanova, Eren Gölge, Julian Weber, Moacir A. Ponti
119The Infinite Contextual Graph Markov Model0Alessio Micheli, Daniele Castellana, Davide Bacciu, Federico Errica
120Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data0Anirban Das, Shiqiang Wang, Stacy Patterson, Timothy J. Castiglia
121Online Learning with Knapsacks: the Best of Both Worlds0Andrea Celli, Christian Kroer, Matteo Castiglioni
122Stabilizing Off-Policy Deep Reinforcement Learning from Pixels0Edoardo Cetin, Oya Çeliktutan, Philip J. Ball, Stephen J. Roberts
123Accelerated, Optimal and Parallel: Some results on model-based stochastic optimization0Gary Cheng, John C. Duchi, Karan N. Chadha
124Robust Imitation Learning against Variations in Environment Dynamics0Jongseong Chae, Myungsik Cho, Seungyul Han, Sungho Choi, Whiyoung Jung, Youngchul Sung
125Fairness with Adaptive Weights0Junyi Chai, Xiaoqian Wang
126UNIREX: A Unified Learning Framework for Language Model Rationale Extraction0Aaron Chan, Hamed Firooz, Lambert Mathias, Liang Tan, Maziar Sanjabi, Shaoliang Nie, Xiang Ren, Xiaochang Peng
127Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing?0Keshigeyan Chandrasegaran, NgaiMan Cheung, NgocTrung Tran, Yunqing Zhao
128Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models0Ashish Shrivastava, Hema Koppula, JenHao Rick Chang, Oncel Tuzel, Xiaoshuai Zhang
129Learning Bellman Complete Representations for Offline Policy Evaluation0Jonathan D. Chang, Kaiwen Wang, Nathan Kallus, Wen Sun
130Sample Efficient Learning of Predictors that Complement Humans0David A. Sontag, Hussein Mozannar, MohammadAmin Charusaie, Samira Samadi
131Nyström Kernel Mean Embeddings0Alessandro Rudi, Antoine Chatalic, Lorenzo Rosasco, Nicolas Schreuder
132Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets0Tianlong Chen, Xiaolong Ma, Xuxi Chen, Yanzhi Wang, Zhangyang Wang
133Learning Domain Adaptive Object Detection with Probabilistic Teacher0Di Xie, Donglian Qi, Jie Song, Lei Zhang, Meilin Chen, Shicai Yang, Shiliang Pu, Weijie Chen, Xinchao Wang, Yueting Zhuang, Yunfeng Yan
134The Fundamental Price of Secure Aggregation in Differentially Private Federated Learning0Ananda Theertha Suresh, Christopher A. ChoquetteChoo, Peter Kairouz, WeiNing Chen
135Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning0Avanika Narayan, Christopher Ré, Daniel Y. Fu, Kayvon Fatahalian, Mayee F. Chen, Michael Zhang, Zhao Song
136Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk0Aditya Gangrade, Tianrui Chen, Venkatesh Saligrama
137On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs0Jiafan He, Quanquan Gu, Yuanzhou Chen
138Streaming Algorithms for Support-Aware Histograms0Justin Y. Chen, Piotr Indyk, Tal Wagner
139Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP0Haipeng Luo, Liyu Chen, Rahul Jain
140Learning Infinite-horizon Average-reward Markov Decision Process with Constraints0Haipeng Luo, Liyu Chen, Rahul Jain
141Active Multi-Task Representation Learning0Kevin G. Jamieson, Simon S. Du, Yifang Chen
142On Collective Robustness of Bagging Against Data Poisoning0Chentao Wu, Jie Li, Junchi Yan, Ruoxin Chen, Zenan Li
143Online Active Regression0Cheng Chen, Yi Li, Yiming Sun
144Selling Data To a Machine Learner: Pricing via Costly Signaling0Haifeng Xu, Junjie Chen, Minming Li
145ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases0Danny Z. Chen, Haochao Ying, Jian Wu, Jintai Chen, Kuanlun Liao, Kun Wei
146Weisfeiler-Lehman Meets Gromov-Wasserstein0Facundo Mémoli, Samantha Chen, Sunhyuk Lim, Yusu Wang, Zhengchao Wan
147On Non-local Convergence Analysis of Deep Linear Networks0Dachao Lin, Kun Chen, Zhihua Zhang
148Flow-based Recurrent Belief State Learning for POMDPs0Jianyu Chen, Ping Luo, Shengbo Li, Xiaoyu Chen, Yao Mark Mu
149Structure-Aware Transformer for Graph Representation Learning0Dexiong Chen, Karsten M. Borgwardt, Leslie O'Bray
150The Poisson Binomial Mechanism for Unbiased Federated Learning with Secure Aggregation0Ayfer Özgür, Peter Kairouz, WeiNing Chen
151Learning Mixtures of Linear Dynamical Systems0H. Vincent Poor, Yanxi Chen
152On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation0Xiaohong Chen, Zhengling Qi
153Faster Fundamental Graph Algorithms via Learned Predictions0Ali Vakilian, Fred Zhang, Justin Y. Chen, Sandeep Silwal
154Improve Single-Point Zeroth-Order Optimization Using High-Pass and Low-Pass Filters0Na Li, Xin Chen, Yujie Tang
155Deep Variational Graph Convolutional Recurrent Network for Multivariate Time Series Anomaly Detection0Bo Chen, Liang Dai, Long Tian, Mingyuan Zhou, Wenchao Chen, Zhibin Duan
156Auxiliary Learning with Joint Task and Data Scheduling0Chaoyu Guan, Hong Chen, Wenwu Zhu, Xin Wang, Yue Liu
157Optimization-Induced Graph Implicit Nonlinear Diffusion0Jiansheng Yang, Qi Chen, Yifei Wang, Yisen Wang, Zhouchen Lin
158Robust Meta-learning with Sampling Noise and Label Noise via Eigen-Reptile0Bo Long, Dong Chen, Lingfei Wu, Siliang Tang, Xiao Yun, Yueting Zhuang
159Adaptive Model Design for Markov Decision Process0Donglin Yang, Jiayang Li, Senmiao Wang, Siyu Chen, Zhaoran Wang, Zhuoran Yang
160State Transition of Dendritic Spines Improves Learning of Sparse Spiking Neural Networks0Tiejun Huang, Wei Fang, Yanqi Chen, Yonghong Tian, Zhaofei Yu, Zhengyu Ma
161Efficient Online ML API Selection for Multi-Label Classification Tasks0James Zou, Lingjiao Chen, Matei Zaharia
162Data-Efficient Double-Win Lottery Tickets from Robust Pre-training0Shiyu Chang, Sijia Liu, Tianlong Chen, Yang Zhang, Zhangyang Wang, Zhenyu Zhang
163Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness0Huan Zhang, PinYu Chen, Shiyu Chang, Sijia Liu, Tianlong Chen, Zhangyang Wang, Zhenyu Zhang
164Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation0Han Zhong, Liwei Wang, Xiaoyu Chen, Zhaoran Wang, Zhuoran Yang
165Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis0RongRong Chen, Shaofeng Zou, Yi Zhou, Ziyi Chen
166Task-aware Privacy Preservation for Multi-dimensional Data0Ao Tang, Jiangnan Cheng, Sandeep Chinchali
167Adversarially Trained Actor Critic for Offline Reinforcement Learning0Alekh Agarwal, ChingAn Cheng, Nan Jiang, Tengyang Xie
168Quantum-Inspired Algorithms from Randomized Numerical Linear Algebra0David P. Woodruff, Honghao Lin, Kenneth L. Clarkson, Lior Horesh, Nadiia Chepurko
169RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests0Vasilis Syrgkanis, Victor Chernozhukov, Victor QuintasMartinez, Whitney Newey
170Self-supervised learning with random-projection quantizer for speech recognition0ChungCheng Chiu, James Qin, Jiahui Yu, Yonghui Wu, Yu Zhang
171Discrete Probabilistic Inverse Optimal Transport0Patrick Shafto, Pei Wang, WeiTing Chiu
172Selective Network Linearization for Efficient Private Inference0Ameya Joshi, Brandon Reagen, Chinmay Hegde, Minsu Cho, Siddharth Garg
173From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers0Adrian Weller, Arijit Sehanobish, Han Lin, Haoxian Chen, Jack ParkerHolder, Krzysztof Choromanski, Tamás Sarlós, Thomas Weingarten, Tianyi Zhang, Valerii Likhosherstov
174Shuffle Private Linear Contextual Bandits0Sayak Ray Chowdhury, Xingyu Zhou
175DNA: Domain Generalization with Diversified Neural Averaging0Hong Mei, Shanghang Zhang, Wenwu Zhu, Xin Wang, Xu Chu, Yasha Wang, Yujie Jin
176TPC: Transformation-Specific Smoothing for Point Cloud Models0Bo Li, Linyi Li, Wenda Chu
177Unified Scaling Laws for Routed Language Models0Aidan Clark, Albin Cassirer, Arthur Mensch, Aurelia Guy, Blake A. Hechtman, Bogdan Damoc, Chris Jones, David Budden, Diego de Las Casas, Elena Buchatskaya, Eliza Rutherford, Erich Elsen, George van den Driessche, Jack W. Rae, Jordan Hoffmann, Karen Simonyan, Koray Kavukcuoglu, Laurent Sifre, Marc'Aurelio Ranzato, Matthew J. Johnson, Michela Paganini, Oriol Vinyals, Sebastian Borgeaud, Simon Osindero, Tom Hennigan, Trevor Cai
178Context-Aware Drift Detection0Arnaud Van Looveren, Oliver Cobb
179On the Robustness of CountSketch to Adaptive Inputs0Edith Cohen, Jelani Nelson, Moshe Shechner, Tamás Sarlós, Uri Stemmer, Xin Lyu
180Diffusion bridges vector quantized variational autoencoders0Charles Ollion, Eric Moulines, Guillaume Quispe, Max Cohen, Sylvain Le Corff
181Online and Consistent Correlation Clustering0Andreas Maggiori, Nikos Parotsidis, Silvio Lattanzi, Vincent CohenAddad
182Massively Parallel k-Means Clustering for Perturbation Resilient Instances0Peilin Zhong, Vahab S. Mirrokni, Vincent CohenAddad
183One-Pass Diversified Sampling with Application to Terabyte-Scale Genomic Sequence Streams0Anshumali Shrivastava, Benito Geordie, Benjamin Coleman, Li Chou, Ryan A. Leo Elworth, Todd J. Treangen
184Transfer and Marginalize: Explaining Away Label Noise with Privileged Information0Effrosyni Kokiopoulou, Jesse Berent, Mark Collier, Rodolphe Jenatton
185MAML and ANIL Provably Learn Representations0Aryan Mokhtari, Liam Collins, Sanjay Shakkottai, Sewoong Oh
186Entropic Causal Inference: Graph Identifiability0Dmitriy A. Katz, Kristjan H. Greenewald, Murat Kocaoglu, Spencer Compton
187Mitigating Gender Bias in Face Recognition using the von Mises-Fisher Mixture Model0JeanRémy Conti, Nathan Noiry, Stéphan Clémençon, Stéphane Gentric, Vincent Despiegel
188Counterfactual Transportability: A Formal Approach0Elias Bareinboim, Juan D. Correa, Sanghack Lee
189Label-Free Explainability for Unsupervised Models0Jonathan Crabbé, Mihaela van der Schaar
190Evaluating the Adversarial Robustness of Adaptive Test-time Defenses0A. Taylan Cemgil, Evan Shelhamer, Francesco Croce, Matthias Hein, Sven Gowal, Thomas Brunner
191Adversarial Robustness against Multiple and Single lp-Threat Models via Quick Fine-Tuning of Robust Classifiers0Francesco Croce, Matthias Hein
192Self-conditioning Pre-Trained Language Models0Luca Zappella, Nicholas Apostoloff, Xavier Suau Cuadros
193Only tails matter: Average-Case Universality and Robustness in the Convex Regime0Courtney Paquette, Damien Scieur, Fabian Pedregosa, Gauthier Gidel, Leonardo Cunha
194Principal Component Flows0Adam D. Cobb, Edmond Cunningham, Susmit Jha
195Deep symbolic regression for recurrence prediction0François Charton, Guillaume Lample, PierreAlexandre Kamienny, Stéphane d'Ascoli
196Continuous Control with Action Quantization from Demonstrations0Anton Raichuk, Damien Vincent, Léonard Hussenot, Matthieu Geist, Olivier Pietquin, Robert Dadashi, Sertan Girgin
197Dialog Inpainting: Turning Documents into Dialogs0Aida Amini, Arun Tejasvi Chaganty, Kelvin Guu, Mike Green, Qazi Mamunur Rashid, Vincent Y. Zhao, Zhuyun Dai
198DisPFL: Towards Communication-Efficient Personalized Federated Learning via Decentralized Sparse Training0Dacheng Tao, Fengxiang He, Li Shen, Rong Dai, Xinmei Tian
199Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization0Bo Dai, Dale Schuurmans, Hanjun Dai, Mengjiao Yang, Yuan Xue
200Balancing Sample Efficiency and Suboptimality in Inverse Reinforcement Learning0Alberto Maria Metelli, Angelo Damiani, Giorgio Manganini, Marcello Restelli
201Understanding Robust Generalization in Learning Regular Languages0Dan Roth, Osbert Bastani, Soham Dan
202Unsupervised Image Representation Learning with Deep Latent Particles0Aviv Tamar, Tal Daniel
203Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation0Ayush Sekhari, Christoph Dann, Karthik Sridharan, Mehryar Mohri, Yishay Mansour
204Monarch: Expressive Structured Matrices for Efficient and Accurate Training0Alexander Liu, Aniruddh Rao, Arjun D. Desai, Atri Rudra, Beidi Chen, Christopher Ré, Jessica Grogan, Michael Poli, Nimit Sharad Sohoni, Tri Dao
205Score-Guided Intermediate Level Optimization: Fast Langevin Mixing for Inverse Problems0Alex Dimakis, Constantinos Daskalakis, Giannis Daras, Yuval Dagan
206Test-Time Training Can Close the Natural Distribution Shift Performance Gap in Deep Learning Based Compressed Sensing0Jiayu Liu, Mohammad Zalbagi Darestani, Reinhard Heckel
207Knowledge Base Question Answering by Case-based Reasoning over Subgraphs0Ameya Godbole, Andrew McCallum, Ankita Naik, Elliot Tower, Hannaneh Hajishirzi, Manzil Zaheer, Rajarshi Das, Robin Jia
208Framework for Evaluating Faithfulness of Local Explanations0Michal Moshkovitz, Nave Frost, Sanjoy Dasgupta
209Distinguishing rule and exemplar-based generalization in learning systems0Erin Grant, Ishita Dasgupta, Tom Griffiths
210Robust Multi-Objective Bayesian Optimization Under Input Noise0Enlu Zhou, Eytan Bakshy, Maximilian Balandat, Michael A. Osborne, Sait Cakmak, Samuel Daulton
211Attentional Meta-learners for Few-shot Polythetic Classification0Ben J. Day, Nikola Simidjievski, Pietro Lió, Ramón Viñas Torné
212Adversarial Vulnerability of Randomized Ensembles0Hassan Dbouk, Naresh R. Shanbhag
213Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization0Eva Silverstein, Giuseppe Bruno De Luca
214Error-driven Input Modulation: Solving the Credit Assignment Problem without a Backward Pass0Gabriel Kreiman, Giorgia Dellaferrera
215DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations0Fei Deng, Ingook Jang, Sungjin Ahn
216NeuralEF: Deconstructing Kernels by Deep Neural Networks0Jiaxin Shi, Jun Zhu, Zhijie Deng
217Deep Causal Metric Learning0Xiang Deng, Zhongfei Zhang
218On the Convergence of Inexact Predictor-Corrector Methods for Linear Programming0Agniva Chowdhury, Gregory Dexter, Haim Avron, Petros Drineas
219Analysis of Stochastic Processes through Replay Buffers0Dotan Di Castro, Shie Mannor, Shirli DiCastro Shashua
220Streaming Algorithms for High-Dimensional Robust Statistics0Ankit Pensia, Daniel M. Kane, Ilias Diakonikolas, Thanasis Pittas
221Learning General Halfspaces with Adversarial Label Noise via Online Gradient Descent0Christos Tzamos, Ilias Diakonikolas, Nikos Zarifis, Vasilis Kontonis
222Variational Feature Pyramid Networks0Christophoros Nikou, Giorgos Sfikas, Panagiotis Dimitrakopoulos
223Understanding Doubly Stochastic Clustering0Benjamin D. Haeffele, Derek Lim, René Vidal, Tianjiao Ding
224Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence0ChenYu Wei, Dongsheng Ding, Kaiqing Zhang, Mihailo R. Jovanovic
225Generalization and Robustness Implications in Object-Centric Learning0Andrea Dittadi, Bernhard Schölkopf, Francesco Locatello, Michele De Vita, Ole Winther, Samuele S. Papa
226Fair Generalized Linear Models with a Convex Penalty0Axel S. Martin, Hyungrok Do, Judy Zhong, Padhraic Smyth, Preston Putzel
227Bayesian Learning with Information Gain Provably Bounds Risk for a Robust Adversarial Defense0Bao Gia Doan, Damith C. Ranasinghe, Ehsan Abbasnejad, Javen Qinfeng Shi
228On the Adversarial Robustness of Causal Algorithmic Recourse0AmirHossein Karimi, Bernhard Schölkopf, Ricardo DominguezOlmedo
229Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks0Kaisheng Ma, Linfeng Zhang, Mengdi Wu, Runpei Dong, Zhanhong Tan
230PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs0Fuhai Li, Muhan Zhang, Yixin Chen, Zehao Dong
231Privacy for Free: How does Dataset Condensation Help Privacy?0Bo Zhao, Lingjuan Lyu, Tian Dong
232Fast rates for noisy interpolation require rethinking the effect of inductive bias0Fanny Yang, Konstantin Donhauser, Nicolò Ruggeri, Stefan Stojanovic
233Adapting to Mixing Time in Stochastic Optimization with Markovian Data0Kfir Yehuda Levy, Ron Dorfman
234TACTiS: Transformer-Attentional Copulas for Time Series0Alexandre Drouin, Nicolas Chapados, Étienne Marcotte
235Branching Reinforcement Learning0Wei Chen, Yihan Du
236Bayesian Imitation Learning for End-to-End Mobile Manipulation0Alex Alemi, Daniel Ho, Eric Jang, Mohi Khansari, Yuqing Du
237GLaM: Efficient Scaling of Language Models with Mixture-of-Experts0Adams Wei Yu, Andrew M. Dai, Barret Zoph, Claire Cui, Dmitry Lepikhin, Kathleen S. MeierHellstern, Kellie Webster, Kevin Robinson, Kun Zhang, Liam Fedus, Lucas Dixon, Maarten P. Bosma, Marie Pellat, Maxim Krikun, Nan Du, Orhan Firat, Quoc V. Le, Simon Tong, Tao Wang, Toju Duke, Yanping Huang, Yanqi Zhou, Yonghui Wu, Yu Emma Wang, Yuanzhong Xu, Zhifeng Chen, Zongwei Zhou
238Learning Iterative Reasoning through Energy Minimization0Igor Mordatch, Joshua B. Tenenbaum, Shuang Li, Yilun Du
239SE(3) Equivariant Graph Neural Networks with Complete Local Frames0Bin Shao, He Zhang, Nanning Zheng, Qi Meng, TieYan Liu, Wei Chen, Weitao Du, Yuanqi Du
240A Context-Integrated Transformer-Based Neural Network for Auction Design0Jingwu Tang, Manzil Zaheer, Xiang Yan, Xiaotie Deng, Yutong Yin, Zhe Feng, Zhijian Duan
241Augment with Care: Contrastive Learning for Combinatorial Problems0Chris J. Maddison, Haonan Duan, Max B. Paulus, Pashootan Vaezipoor, Yangjun Ruan
242Parametric Visual Program Induction with Function Modularization0Wenwu Zhu, Xin Wang, Xuguang Duan, Ziwei Zhang
243Bayesian Deep Embedding Topic Meta-Learner0Bo Chen, Chaojie Wang, Jianqiao Sun, Mingyuan Zhou, Wenchao Chen, Yishi Xu, Zhibin Duan
244Deletion Robust Submodular Maximization over Matroids0Ashkan NorouziFard, Federico Fusco, Morteza Zadimoghaddam, Paul Duetting, Silvio Lattanzi
245From data to functa: Your data point is a function and you can treat it like one0Dan Rosenbaum, Danilo Jimenez Rezende, Emilien Dupont, Hyunjik Kim, S. M. Ali Eslami
246Efficient Low Rank Convex Bounds for Pairwise Discrete Graphical Models0George Katsirelos, Thomas Schiex, Valentin Durante
247Robust Counterfactual Explanations for Tree-Based Ensembles0Cecilia Tilli, Daniele Magazzeni, Jason Long, Sanghamitra Dutta, Saumitra Mishra
248On the Difficulty of Defending Self-Supervised Learning against Model Extraction0Adam Dziedzic, Jonas Guan, Muhammad Ahmad Kaleem, Nicolas Papernot, Nikita Dhawan
249LIMO: Latent Inceptionism for Targeted Molecule Generation0Bo Zhao, Kunyang Sun, Michael K. Gilson, Mudong Feng, Peter Eckmann, Rose Yu
250Inductive Biases and Variable Creation in Self-Attention Mechanisms0Benjamin L. Edelman, Cyril Zhang, Sham M. Kakade, Surbhi Goel
251Provable Reinforcement Learning with a Short-Term Memory0Akshay Krishnamurthy, Chi Jin, Sobhan Miryoosefi, Yonathan Efroni
252Sparsity in Partially Controllable Linear Systems0Akshay Krishnamurthy, Cyril Zhang, Sham M. Kakade, Yonathan Efroni
253FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning0Amrit Singh Bedi, Anis Elgabli, Chaouki Ben Issaid, Ketan Rajawat, Mehdi Bennis, Vaneet Aggarwal
254pathGCN: Learning General Graph Spatial Operators from Paths0Eldad Haber, Eran Treister, Moshe Eliasof
255Discrete Tree Flows via Tree-Structured Permutations0David I. Inouye, Hyung Zin Lim, Mai Elkady
256For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria0Andrew Critch, Caspar Oesterheld, Scott Emmons, Stuart Russell, Vincent Conitzer
257Streaming Algorithm for Monotone k-Submodular Maximization with Cardinality Constraints0Alina Ene, Huy L. Nguyen
258Towards Scaling Difference Target Propagation by Learning Backprop Targets0Abhinav Moudgil, Blake A. Richards, Eugene Belilovsky, Fabrice Normandin, Irina Rish, Maxence Ernoult, Sean Spinney, Yoshua Bengio
259Understanding Dataset Difficulty with V-Usable Information0Kawin Ethayarajh, Swabha Swayamdipta, Yejin Choi
260Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning0Hugo Larochelle, Michael C. Mozer, Utku Evci, Vincent Dumoulin
261Variational Sparse Coding with Learned Thresholding0Christopher J. Rozell, Kion Fallah
262Training Discrete Deep Generative Models via Gapped Straight-Through Estimator0Alexander I. Rudnicky, Peter J. Ramadge, TaChung Chi, TingHan Fan
263DRIBO: Robust Deep Reinforcement Learning via Multi-View Information Bottleneck0Jiameng Fan, Wenchao Li
264Generalized Data Distribution Iteration0Changnan Xiao, Jiajun Fan
265Variational Wasserstein gradient flow0Amirhossein Taghvaei, Jiaojiao Fan, Qinsheng Zhang, Yongxin Chen
266Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP)0Achal Dave, Alex Fang, Gabriel Ilharco, Ludwig Schmidt, Mitchell Wortsman, Vaishaal Shankar, Yuhao Wan
267Bayesian Continuous-Time Tucker Decomposition0Akil Narayan, Robert M. Kirby, Shandian Zhe, Shikai Fang
268Byzantine Machine Learning Made Easy By Resilient Averaging of Momentums0John Stephan, Nirupam Gupta, Rachid Guerraoui, Rafael Pinot, Sadegh Farhadkhani
269An Equivalence Between Data Poisoning and Byzantine Gradient Attacks0Lê Nguyên Hoang, Oscar Villemaud, Rachid Guerraoui, Sadegh Farhadkhani
270Investigating Generalization by Controlling Normalized Margin0Alexander R. Farhang, Jeremy D. Bernstein, Kushal Tirumala, Yang Liu, Yisong Yue
271Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games0Christian Kroer, ChungWei Lee, Gabriele Farina, Haipeng Luo
272Local Linear Convergence of Douglas-Rachford for Linear Programming: a Probabilistic Analysis0Hamza Fawzi, Oisin Faust
273Matching Structure for Dual Learning0Hao Fei, Meishan Zhang, Shengqiong Wu, Yafeng Ren
274Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning0Ruitu Xu, Yingjie Fei
275Private frequency estimation via projective geometry0Huy L. Nguyen, Jelani Nelson, Kunal Talwar, Vitaly Feldman
276An Intriguing Property of Geophysics Inversion0Peng Jin, Shihang Feng, Yinan Feng, Yinpeng Chen, Youzuo Lin, Zicheng Liu
277Principled Knowledge Extrapolation with GANs0Deli Zhao, Jie Xiao, Jingren Zhou, Kecheng Zheng, Qibin Sun, Ruili Feng, ZhengJun Zha
278A Resilient Distributed Boosting Algorithm0Idan Mehalel, Shay Moran, Yuval Filmus
279Model-Value Inconsistency as a Signal for Epistemic Uncertainty0Abram L. Friesen, André Barreto, Angelos Filos, Diana Borsa, Eszter Vértes, Feryal M. P. Behbahani, Gregory Farquhar, Simon Osindero, Tom Schaul, Zita Marinho
280Coordinated Double Machine Learning0Matteo Sesia, Nitai Fingerhut, Yaniv Romano
281Conformal Prediction Sets with Limited False Positives0Adam Fisch, Regina Barzilay, Tal Schuster, Tommi S. Jaakkola
282Fast Population-Based Reinforcement Learning on a Single Machine0Arthur Flajolet, Claire Bizon Monroc, Karim Beguir, Thomas Pierrot
283Fast Relative Entropy Coding with A* coding0Gergely Flamich, José Miguel HernándezLobato, Stratis Markou
284Contrastive Mixture of Posteriors for Counterfactual Inference, Data Integration and Fairness0Aaron Sim, Adam Foster, Craig A. Glastonbury, Páidí Creed, Samer Abujudeh, Árpi Vezér
285Label Ranking through Nonparametric Regression0Alkis Kalavasis, Dimitris Fotakis, Eleni Psaroudaki
286A Neural Tangent Kernel Perspective of GANs0Emmanuel de Bézenac, Ibrahim Ayed, JeanYves Franceschi, Mickaël Chen, Patrick Gallinari, Sylvain Lamprier
287Extracting Latent State Representations with Linear Dynamics from Rich Observations0Abraham Frandsen, Holden Lee, Rong Ge
288SPDY: Accurate Pruning with Speedup Guarantees0Dan Alistarh, Elias Frantar
289Revisiting the Effects of Stochasticity for Hamiltonian Samplers0Dimitrios Milios, Giulio Franzese, Maurizio Filippone, Pietro Michiardi
290Bregman Neural Networks0Gilles Gasso, Jordan Frécon, Massimiliano Pontil, Saverio Salzo
291(Non-)Convergence Results for Predictive Coding Networks0Simon Frieder, Thomas Lukasiewicz
292Scaling Structured Inference with Randomization0John P. Cunningham, Mirella Lapata, Yao Fu
293Greedy when Sure and Conservative when Uncertain about the Opponents0Haobo Fu, Hongxiang Yu, Jiechao Xiong, Junliang Xing, Kai Li, Qiang Fu, Shuang Wu, Wei Yang, Weiming Liu, Ye Tian, Ying Wen
294DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks0Cheng Wan, Haichuan Yang, Jiayi Yuan, Meng Li, Raghuraman Krishnamoorthi, Vikas Chandra, Yingyan Lin, Yonggan Fu
295Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning0Chao Yu, Jiaqi Yang, Wei Fu, Yi Wu, Zelai Xu
296p-Laplacian Based Graph Neural Networks0Guoji Fu, Peilin Zhao, Yatao Bian
297Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error0David Meger, Doina Precup, Ofir Nachum, Scott Fujimoto, Shixiang Shane Gu
298Robin Hood and Matthew Effects: Differential Privacy Has Disparate Impact on Synthetic Data0Bristena Oprisanu, Emiliano De Cristofaro, Georgi Ganev
299The Complexity of k-Means Clustering when Little is Known0Karolina Okrasa, Kirill Simonov, Robert Ganian, Thekla Hamm, Viktoriia Korchemna
300IDYNO: Learning Nonparametric DAGs from Interventional Dynamic Data0Debarun Bhattacharjya, Elliot Nelson, Miao Liu, Tian Gao, Yue Yu
301Loss Function Learning for Domain Generalization by Implicit Gradient0Boyan Gao, Henry Gouk, Timothy M. Hospedales, Yongxin Yang
302On the Convergence of Local Stochastic Compositional Gradient Descent with Momentum0Heng Huang, Hongchang Gao, Junyi Li
303Deep Reference Priors: What is the best way to pretrain a model?0Pratik Chaudhari, Rahul Ramesh, Yansong Gao
304On the Equivalence Between Temporal and Static Equivariant Graph Representations0Bruno Ribeiro, Jianfei Gao
305Generalizing Gaussian Smoothing for Random Search0Katelyn Gao, Ozan Sener
306Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems0Ilia Shumailov, Kassem Fawaz, Yue Gao
307Lazy Estimation of Variable Importance for Large Neural Networks0Abby Stevens, Garvesh Raskutti, Rebecca Willett, Yue Gao
308Fast and Reliable Evaluation of Adversarial Robustness with Minimum-Margin Attack0Binghui Xie, Bo Han, Feng Liu, Gang Niu, James Cheng, Jiongxiao Wang, Kaiwen Zhou, Ruize Gao
309Value Function based Difference-of-Convex Algorithm for Bilevel Hyperparameter Selection Problems0Haian Yin, Jane J. Ye, Jin Zhang, Lucy L. Gao, Shangzhi Zeng
310Learning to Incorporate Texture Saliency Adaptive Attention to Image Cartoonization0Xiang Gao, Yingjie Tian, Yuqi Zhang
311Stochastic smoothing of the top-K calibrated hinge loss for deep imbalanced classification0Alexis Joly, Camille Garcin, Joseph Salmon, Maximilien Servajean
312PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation0Andrea Martinelli, Andrea Zanelli, John Lygeros, Matilde Gargiani, Tyler H. Summers
313The power of first-order smooth optimization for black-box non-smooth problems0Aleksandr Beznosikov, Alexander V. Gasnikov, Anton Novitskii, Bin Gu, Dmitry Kamzolov, Farshed Abdukhakimov, Martin Takác, Pavel E. Dvurechensky, Vasilii Novitskii
314A Functional Information Perspective on Model Interpretation0Itai Gat, Nitay Calderon, Roi Reichart, Tamir Hazan
315UniRank: Unimodal Bandit Algorithms for Online Ranking0CamilleSovanneary Gauthier, Romaric Gaudel, Élisa Fromont
316Variational Inference with Locally Enhanced Bounds for Hierarchical Models0Justin Domke, Tomas Geffner
317Inducing Causal Structure for Interpretable Neural Networks0Atticus Geiger, Christopher Potts, Elisa Kreiss, Hanson Lu, Josh Rozner, Noah D. Goodman, Thomas Icard, Zhengxuan Wu
318Achieving Minimax Rates in Pool-Based Batch Active Learning0Claudio Gentile, Tong Zhang, Zhilei Wang
319Near-Exact Recovery for Tomographic Inverse Problems via Deep Learning0Ingo Gühring, Jan MacDonald, Martin Genzel, Maximilian März
320Online Learning for Min Sum Set Cover and Pandora's Box0Christos Tzamos, Evangelia Gergatsouli
321Equivariance versus Augmentation for Spherical Images0Christoffer Petersson, Daniel Persson, Fredrik Ohlsson, Hampus Linander, Jan E. Gerken, Oscar Carlsson
322A Regret Minimization Approach to Multi-Agent Control0Elad Hazan, Naomi Ehrich Leonard, Udari Madhushani, Udaya Ghai
323Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning0Byron David, Daniel Freeman, Igor Mordatch, Satoshi Kataoka, Seyed Kamyar Seyed Ghasemipour, Shixiang Shane Gu
324Faster Privacy Accounting via Evolving Discretization0Badih Ghazi, Pasin Manurangsi, Pritish Kamath, Ravi Kumar
325Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations0Amin Ghiasi, Chen Zhu, Hamid Kazemi, Micah Goldblum, Steven Reich, Tom Goldstein
326Offline RL Policies Should Be Trained to be Adaptive0Anurag Ajay, Dibya Ghosh, Pulkit Agrawal, Sergey Levine
327Breaking the T\sqrt{T} Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits0Abishek Sankararaman, Avishek Ghosh
328SCHA-VAE: Hierarchical Context Aggregation for Few-Shot Generation0Giorgio Giannone, Ole Winther
329A Joint Exponential Mechanism For Differentially Private Top-k0Andres Muñoz Medina, Jennifer Gillenwater, Matthew Joseph, Mónica Ribero Diaz
330Neuro-Symbolic Hierarchical Rule Induction0Claire Glanois, Dong Li, Jianye Hao, Matthieu Zimmer, Paul Weng, Wulong Liu, Xuening Feng, Zhaohui Jiang
331It's Raw! Audio Generation with State-Space Models0Albert Gu, Chris Donahue, Christopher Ré, Karan Goel
332RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression0Frederick Tung, Greg Mori, Yu Gong
333How to Fill the Optimum Set? Population Gradient Descent with Harmless Diversity0Chengyue Gong, Lemeng Wu, Qiang Liu
334Partial Label Learning via Label Influence Function0Dong Yuan, Wei Bao, Xiuwen Gong
335Secure Distributed Training at Scale0Alexander Borzunov, Eduard Gorbunov, Max Ryabinin, Michael Diskin
336Retrieval-Augmented Reinforcement Learning0Abram L. Friesen, Adrià Puigdomènech Badia, Andrea Banino, Anirudh Goyal, Arthur Guez, Charles Blundell, Ksenia Konyushkova, Mehdi Mirza, Michal Valko, Nan Rosemary Ke, Nicolas Heess, Peter Conway Humphreys, Simon Osindero, Theophane Weber, Timothy P. Lillicrap
337The State of Sparse Training in Deep Reinforcement Learning0Erich Elsen, Laura Graesser, Pablo Samuel Castro, Utku Evci
338Causal Inference Through the Structural Causal Marginal Problem0Bernhard Schölkopf, Dominik Janzing, Elke Kirschbaum, Jonas M. Kübler, Julius von Kügelgen, Luigi Gresele
339Mirror Learning: A Unifying Framework of Policy Optimisation0Christian A. Schröder de Witt, Jakob N. Foerster, Jakub Grudzien Kuba
340Adapting k-means Algorithms for Outliers0Christoph Grunau, Václav Rozhon
341Variational Mixtures of ODEs for Inferring Cellular Gene Expression Dynamics0David T. Blaauw, Joshua D. Welch, Yichen Gu
342Learning Pseudometric-based Action Representations for Offline Reinforcement Learning0Bo An, Chen Chen, Dong Li, Jianye Hao, Mengchen Zhao, Pengjie Gu
343NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields0Huayu Deng, Shanyan Guan, Xiaokang Yang, Yunbo Wang
344Fast-Rate PAC-Bayesian Generalization Bounds for Meta-Learning0Jiechao Guan, Zhiwu Lu
345Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill Diversity0Lin Guan, Sarath Sreedharan, Subbarao Kambhampati
346Large-Scale Graph Neural Architecture Search0Chaoyu Guan, Hong Chen, Wenwu Zhu, Xin Wang, Ziwei Zhang
347Identifiability Conditions for Domain Adaptation0Ishaan Gulrajani, Tatsunori Hashimoto
348A Parametric Class of Approximate Gradient Updates for Policy Optimization0Dale Schuurmans, Junfeng Wen, Ramki Gummadi, Saurabh Kumar
349Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes0Hongyi Guo, Qi Cai, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang
350No-Regret Learning in Partially-Informed Auctions0Ellen Vitercik, Michael I. Jordan, Wenshuo Guo
351Bounding Training Data Reconstruction in Private (Deep) Learning0Brian Karrer, Chuan Guo, Kamalika Chaudhuri, Laurens van der Maaten
352Adversarially trained neural representations are already as robust as biological neural representations0Aleksander Madry, Chong Guo, Guillaume Leclerc, James J. DiCarlo, Joel Dapello, Michael J. Lee, Yug Rao
353Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding0LanZhe Guo, Yufeng Li
354Deep Squared Euclidean Approximation to the Levenshtein Distance for DNA Storage0Alan J. X. Guo, Cong Liang, QingHu Hou
355Online Continual Learning through Mutual Information Maximization0Bing Liu, Dongyan Zhao, Yiduo Guo
356Fast Provably Robust Decision Trees and Boosting0JunQi Guo, MingZhuo Teng, Wei Gao, ZhiHua Zhou
357Understanding and Improving Knowledge Graph Embedding for Entity Alignment0Huajun Chen, Lingbing Guo, Mingyang Chen, Qiang Zhang, Wei Hu, Zequn Sun
358NISPA: Neuro-Inspired Stability-Plasticity Adaptation for Continual Learning in Sparse Networks0Constantine Dovrolis, Mustafa Burak Gurbuz
359Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets0Avihu Dekel, Daphna Weinshall, Guy Hacohen
360You Only Cut Once: Boosting Data Augmentation with a Single Cut0Hongdong Li, Ian D. Reid, Jie Hong, Junlin Han, Lars Petersson, Mohammad Ali Armin, Pengfei Fang, Weihao Li
361Scalable MCMC Sampling for Nonsymmetric Determinantal Point Processes0Amin Karbasi, Elvis Dohmatob, Insu Han, Mike Gartrell
362G-Mixup: Graph Data Augmentation for Graph Classification0Ninghao Liu, Xia Hu, Xiaotian Han, Zhimeng Jiang
363Private Streaming SCO in ℓp geometry with Applications in High Dimensional Online Decision Making0Jiheng Zhang, Yang Wang, Yuan Yao, Yuxuan Han, Zhicong Liang, Zhipeng Liang
364Off-Policy Reinforcement Learning with Delayed Rewards0Beining Han, Jian Peng, Yuan Zhou, Zhizhou Ren, Zuofan Wu
365Adversarial Attacks on Gaussian Process Bandits0Eric Han, Jonathan Scarlett
366Random Gegenbauer Features for Scalable Kernel Methods0Amir Zandieh, Haim Avron, Insu Han
367Stochastic Reweighted Gradient Descent0Ayoub El Hanchi, Chris J. Maddison, David A. Stephens
368Dual Perspective of Label-Specific Feature Learning for Multi-Label Classification0JunYi Hang, MinLing Zhang
369Temporal Difference Learning for Model Predictive Control0Hao Su, Nicklas Hansen, Xiaolong Wang
370Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning0Amy Zhang, Ashvin Nair, Patrick Yin, Philippe HansenEstruch, Sergey Levine
371TURF: Two-Factor, Universal, Robust, Fast Distribution Learning Algorithm0Alon Orlitsky, Ayush Jain, Vaishakh Ravindrakumar, Yi Hao
372Contextual Information-Directed Sampling0Botao Hao, Chao Qin, Tor Lattimore
373GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing0Chengyang Ying, Hang Su, Jian Song, Jun Zhu, Yinpeng Dong, Zhongkai Hao
374Implicit Regularization with Polynomial Growth in Deep Tensor Factorization0Hachem Kadri, Kais Hariz, Maher Moakher, Stéphane Ayache, Thierry Artières
375Strategic Instrumental Variable Regression: Recovering Causal Relationships From Strategic Responses0Dung Daniel T. Ngo, Hoda Heidari, Keegan Harris, Logan Stapleton, Steven Wu
376C*-algebra Net: A New Approach Generalizing Neural Network Parameters to C*-algebra0Tomoko Matsui, Yuka Hashimoto, Zhao Wang
377General-purpose, long-context autoregressive modeling with Perceiver AR0Andrew Jaegle, Catalina Cangea, Charlie Nash, Curtis Hawthorne, Hannah Sheahan, Ian Simon, JeanBaptiste Alayrac, Jesse H. Engel, João Carreira, Mateusz Malinowski, Matthew M. Botvinick, Neil Zeghidour, Oriol Vinyals, Sander Dieleman, Sebastian Borgeaud
378On Distribution Shift in Learning-based Bug Detectors0Jingxuan He, Luca BeurerKellner, Martin T. Vechev
379GNNRank: Learning Global Rankings from Pairwise Comparisons via Directed Graph Neural Networks0David Wipf, Gesine D. Reinert, Junchi Yan, Mihai Cucuringu, Quan Gan, Yixuan He
380Exploring the Gap between Collapsed & Whitened Features in Self-Supervised Learning0Bobby He, Mete Ozay
381Sparse Double Descent: Where Network Pruning Aggravates Overfitting0Quanzhi Zhu, Zeke Xie, Zengchang Qin, Zheng He
382A Reduction from Linear Contextual Bandit Lower Bounds to Estimation Lower Bounds0Jiahao He, Jiheng Zhang, Rachel Q. Zhang
383HyperPrompt: Prompt-based Task-Conditioning of Transformers0Donald Metzler, Ed H. Chi, HengTze Cheng, Huaixiu Steven Zheng, Jai Prakash Gupta, Vamsi Aribandi, YaGuang Li, Yi Tay, Yu Du, Yun He, Zhao Chen, Zhe Zhao
384Label-Descriptive Patterns and Their Application to Characterizing Classification Errors0Dietrich Klakow, Jilles Vreeken, Jonas Fischer, Michael A. Hedderich
385NOMU: Neural Optimization-based Model Uncertainty0Hanna S. Wutte, Jakob Heiss, Jakob Weissteiner, Josef Teichmann, Sven Seuken
386Scaling Out-of-Distribution Detection for Real-World Settings0Andy Zou, Dan Hendrycks, Dawn Song, Jacob Steinhardt, Joseph Kwon, Mantas Mazeika, Mohammadreza Mostajabi, Steven Basart
387Generalization Bounds using Lower Tail Exponents in Stochastic Optimizers0Liam Hodgkinson, Michael W. Mahoney, Rajiv Khanna, Umut Simsekli
388Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology0Hinrich Schütze, Janet B. Pierrehumbert, Valentin Hofmann
389Neural Laplace: Learning diverse classes of differential equations in the Laplace domain0Mihaela van der Schaar, Samuel Holt, Zhaozhi Qian
390Deep Hierarchy in Bandits0Branislav Kveton, Joey Hong, Manzil Zaheer, Mohammad Ghavamzadeh, Sumeet Katariya
391DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning0Robert D. Mullins, Robert Hönig, Yiren Zhao
392Equivariant Diffusion for Molecule Generation in 3D0Clément Vignac, Emiel Hoogeboom, Max Welling, Victor Garcia Satorras
393Conditional GANs with Auxiliary Discriminative Classifier0Huawei Shen, Liang Hou, Qi Cao, Siyuan Pan, Xiaoshuang Li, Xueqi Cheng
394AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems0Qianqian Xu, Qingming Huang, Shilong Bao, Wenzheng Hou, Yuan He, Zhiyong Yang
395Wide Bayesian neural networks have a simple weight posterior: theory and accelerated sampling0Jascha SohlDickstein, Jeffrey Pennington, Jiri Hron, Roman Novak
396Learning inverse folding from millions of predicted structures0Adam Lerer, Alexander Rives, Brian Hie, Chloe Hsu, Jason Liu, Robert Verkuil, Tom Sercu, Zeming Lin
397Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation0Longbo Huang, Pihe Hu, Yu Chen
398Neuron Dependency Graphs: A Causal Abstraction of Neural Networks0Jin Tian, Yaojie Hu
399Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL0Chuanlong Xie, Siyi Hu, Xiaodan Liang, Xiaojun Chang
400On the Role of Discount Factor in Offline Reinforcement Learning0Chongjie Zhang, Hao Hu, Qianchuan Zhao, Yiqin Yang
401Transformer Quality in Linear Time0Hanxiao Liu, Quoc V. Le, Weizhe Hua, Zihang Dai
402Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents0Deepak Pathak, Igor Mordatch, Pieter Abbeel, Wenlong Huang
403Forward Operator Estimation in Generative Models with Kernel Transfer Operators0Rudrasis Chakraborty, Vikas Singh, Zhichun Huang
404Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits0Jiatai Huang, Longbo Huang, Yan Dai
405Frustratingly Easy Transferability Estimation0Junzhou Huang, LongKai Huang, Qiang Yang, Ying Wei, Yu Rong
406Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)0Chang Zhou, Hongxia Yang, Junyang Lin, Longbo Huang, Yu Huang
407Action-Sufficient State Representation Learning for Control with Structural Constraints0Bernhard Schölkopf, Biwei Huang, Chaochao Lu, Clark Glymour, José Miguel HernándezLobato, Kun Zhang, Liu Leqi
4083DLinker: An E(3) Equivariant Variational Autoencoder for Molecular Linker Design0Jianzhu Ma, Muhan Zhang, Xingang Peng, Yinan Huang
409SDQ: Stochastic Differentiable Quantization with Mixed Precision0Eric P. Xing, Jeffry Wicaksana, KwangTing Cheng, Shichao Li, Xianghong Hu, Xijie Huang, Zechun Liu, Zhiqiang Shen
410Tackling Data Heterogeneity: A New Unified Framework for Decentralized SGD with Sample-induced Topology0Changzhi Yan, Jinming Xu, Yan Huang, Ying Sun, Zehan Zhu
411Efficient Representation Learning via Adaptive Context Pooling0Chen Huang, Joshua M. Susskind, Navdeep Jaitly, Walter Talbott
412On the Learning of Non-Autoregressive Transformers0Fei Huang, Hao Zhou, Lei Li, Minlie Huang, Tianhua Tao
413Going Deeper into Permutation-Sensitive Graph Neural Networks0Chaozhuo Li, Huiguang He, Yingheng Wang, Zhongyu Huang
414Directed Acyclic Transformer for Non-Autoregressive Machine Translation0Fei Huang, Hang Li, Hao Zhou, Minlie Huang, Yang Liu
415Unsupervised Ground Metric Learning Using Wasserstein Singular Vectors0Gabriel Peyré, GeertJan Huizing, Laura Cantini
416Robust Kernel Density Estimation with Median-of-Means principle0Batiste Le Bars, Ludovic Minvielle, Pierre Humbert
417A data-driven approach for learning to control computers0Adam Santoro, Alistair Muldal, David Raposo, Gregory Thornton, Josh Abramson, Peter Conway Humphreys, Petko Georgiev, Rachita Chhaparia, Timothy P. Lillicrap, Tobias Pohlen
418Proximal Denoiser for Convergent Plug-and-Play Optimization with Nonconvex Regularization0Arthur Leclaire, Nicolas Papadakis, Samuel Hurault
419Inverse Contextual Bandits: Learning How Behavior Evolves over Time0Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar
420Datamodels: Understanding Predictions with Data and Data with Predictions0Aleksander Madry, Andrew Ilyas, Guillaume Leclerc, Logan Engstrom, Sung Min Park
421Parsimonious Learning-Augmented Caching0Aditya Petety, Manish Purohit, Ravi Kumar, Sungjin Im
422Bayesian Optimization for Distributionally Robust Chance-constrained Problem0Ichiro Takeuchi, Masayuki Karasuyama, Shion Takeno, Yu Inatsu
423LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation0David Ireland, Giovanni Montana
424The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention0Jürgen Schmidhuber, Kazuki Irie, Róbert Csordás
425A Modern Self-Referential Weight Matrix That Learns to Modify Itself0Imanol Schlag, Jürgen Schmidhuber, Kazuki Irie, Róbert Csordás
426Revisiting Online Submodular Minimization: Gap-Dependent Regret Bounds, Best of Both Worlds and Adversarial Robustness0Shinji Ito
427Modeling Strong and Human-Like Gameplay with KL-Regularized Search0Adam Lerer, Anton Bakhtin, Athul Paul Jacob, David J. Wu, Gabriele Farina, Hengyuan Hu, Jacob Andreas, Noam Brown
428A deep convolutional neural network that is invariant to time rescaling0Aakash Sarkar, Brandon G. Jacques, Marc W. Howard, Per B. Sederberg, Zoran Tiganj
429Input Dependent Sparse Gaussian Processes0Bahram Jafrasteh, Carlos VillacampaCalvo, Daniel HernándezLobato
430Regret Minimization with Performative Feedback0Celestine MendlerDünner, Meena Jagadeesan, Tijana Zrnic
431Biological Sequence Design with GFlowNets0Alex HernándezGarcía, Bonaventure F. P. Dossou, Chanakya Ajit Ekbote, Dinghuai Zhang, Emmanuel Bengio, Jarrid RectorBrooks, Jie Fu, Lena Simine, Michael Kilgour, Moksh Jain, Payel Das, Tianyu Zhang, Yoshua Bengio
432Combining Diverse Feature Priors0Aleksander Madry, Dimitris Tsipras, Saachi Jain
433Training Your Sparse Neural Network Better with Any Mask0Ajay Kumar Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, Zhangyang Wang
434Sequential Covariate Shift Detection Using Classifier Two-Sample Tests0Insup Lee, Osbert Bastani, Sangdon Park, Sooyong Jang
435Surrogate Likelihoods for Variational Annealed Importance Sampling0Du Phan, Martin Jankowiak
436Planning with Diffusion for Flexible Behavior Synthesis0Joshua B. Tenenbaum, Michael Janner, Sergey Levine, Yilun Du
437HyperImpute: Generalized Iterative Imputation with Automatic Model Selection0Alicia Curth, Bogdan Cebere, Daniel Jarrett, Mihaela van der Schaar, Tennison Liu
438Mitigating Modality Collapse in Multimodal VAEs via Impartial Optimization0Adrián Javaloy, Isabel Valera, Maryam Meghdadi
439Towards understanding how momentum improves generalization in deep learning0Samy Jelassi, Yuanzhi Li
440MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer0Jeewon Jeon, Whiyoung Jung, Woojun Kim, Youngchul Sung
441An Exact Symbolic Reduction of Linear Smart Predict+Optimize to Mixed Integer Linear Programming0Andrew Butler, Jihwan Jeong, Parth Jaggi, Scott Sanner
442Agnostic Learnability of Halfspaces via Logistic Loss0Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, Stefani Karp, Ziwei Ji
443Improving Policy Optimization with Generalist-Specialist Learning0Hao Su, Shuang Liu, Xuanlin Li, Yiran Wu, Zhan Ling, Zhiwei Jia
444Translatotron 2: High-quality direct speech-to-speech translation with voice preservation0Michelle Tadmor Ramanovich, Roi Pomerantz, Tal Remez, Ye Jia
445Online Learning and Pricing with Reusable Resources: Linear Bandits with Sub-Exponential Rewards0Cong Shi, Huiwen Jia, Siqian Shen
446The Role of Deconfounding in Meta-learning0Fei Wu, Kun Kuang, Luotian Yuan, Xinhai Ye, Ying Wei, Yinjie Jiang, Zhengyu Chen, Zhihua Wang
447Subspace Learning for Effective Meta-Learning0James T. Kwok, Weisen Jiang, Yu Zhang
448Optimal Algorithms for Stochastic Multi-Level Compositional Optimization0Bokun Wang, Lijun Zhang, Tianbao Yang, Wei Jiang, Yibo Wang
449Antibody-Antigen Docking and Design via Hierarchical Structure Refinement0Regina Barzilay, Tommi S. Jaakkola, Wengong Jin
450Sharpened Quasi-Newton Methods: Faster Superlinear Rate and Larger Local Convergence Neighborhood0Alec Koppel, Aryan Mokhtari, Ketan Rajawat, Qiujiang Jin
451The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces0Chi Jin, Qinghua Liu, Tiancheng Yu
452Domain Adaptation for Time Series Forecasting via Attention Sharing0Danielle C. Maddix, Hao Wang, Xiaoyong Jin, Youngsuk Park, Yuyang Wang
453Accelerated Federated Learning with Decoupled Adaptive Optimization0Dejing Dou, Ji Liu, Jiaxiang Ren, Jiayin Jin, Lingjuan Lyu, Yang Zhou
454Supervised Off-Policy Ranking0Houqiang Li, Jian Yuan, Tao Qin, TieYan Liu, Xudong Zhang, Yue Jin, Yue Zhang
455Input-agnostic Certified Group Fairness via Gaussian Parameter Smoothing0Jiayin Jin, Lingfei Wu, Yang Zhou, Zeru Zhang
456Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations0Jaehyeong Jo, Seul Lee, Sung Ju Hwang
457Choosing Answers in Epsilon-Best-Answer Identification for Linear Bandits0Marc Jourdan, Rémy Degenne
458Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees0Dongyue Li, Haotian Ju, Hongyang R. Zhang
459Robust alignment of cross-session recordings of neural population activity by behaviour via unsupervised domain adaptation0Justin Jude, Lee E. Miller, Matthew G. Perich, Matthias H. Hennig
460On Measuring Causal Contributions via do-interventions0Dominik Janzing, Elias Bareinboim, Jin Tian, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, Yonghan Jung
461Efficient Approximate Inference for Stationary Kernel on Frequency Domain0Jinkyoo Park, Kyungwoo Song, Yohan Jung
462Sketching Algorithms and Lower Bounds for Ridge Regression0David P. Woodruff, Praneeth Kacham
463Flashlight: Enabling Innovation in Tools for Machine Learning0Ann Lee, Awni Y. Hannun, Benoit Steiner, Edouard Grave, Gabriel Synnaeve, Gilad Avidov, Jacob D. Kahn, Jeff Cai, Paden Tomasello, Qiantong Xu, Ronan Collobert, Tatiana Likhomanenko, Vineel Pratap, Vitaliy Liptchinsky
464Learning-based Optimisation of Particle Accelerators Under Partial Observability Without Real-World Training0Annika Eichler, Jan Kaiser, Oliver Stein
465Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning0Konstantinos Kalais, Sotirios Chatzis
466Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning0Kaiwen Wang, Nathan Kallus, Xiaojie Mao, Zhengyuan Zhou
467Improved Rates for Differentially Private Stochastic Convex Optimization with Heavy-Tailed Data0Gautam Kamath, Huanyu Zhang, Xingtu Liu
468Comprehensive Analysis of Negative Sampling in Knowledge Graph Representation Learning0Hidetaka Kamigaito, Katsuhiko Hayashi
469Matching Learned Causal Effects of Neural Networks with Domain Priors0Abbavaram Gowtham Reddy, Amit Sharma, Sai Srinivas Kancheti, Vineeth N. Balasubramanian
470Deduplicating Training Data Mitigates Privacy Risks in Language Models0Colin Raffel, Eric Wallace, Nikhil Kandpal
471Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control0Claire J. Tomlin, Jason J. Choi, Katie Kang, Michael Janner, Paula Gradu, Sergey Levine
472Forget-free Continual Learning with Winning Subnetworks0Chang D. Yoo, Haeyong Kang, Jaehong Yoon, Mark HasegawaJohnson, Rusty John Lloyd Mina, Sultan Rizky Hikmawan Madjid, Sung Ju Hwang
473Differentially Private Approximate Quantiles0Haim Kaplan, Shachar Schnapp, Uri Stemmer
474Simultaneous Graph Signal Clustering and Graph Learning0Abdullah Karaaslanli, Selin Aviyente
475Composing Partial Differential Equations with Physics-Aware Neural Networks0Martin V. Butz, Matthias Karlbauer, Sebastian Otte, Sergey Oladyshkin, Timothy Praditia, Wolfgang Nowak
476Meta-Learning Hypothesis Spaces for Sequential Decision-making0Andreas Krause, Jonas Rothfuss, Parnian Kassraie
477FOCUS: Familiar Objects in Common and Uncommon Settings0Priyatham Kattakinda, Soheil Feizi
478Training OOD Detectors in their Natural Habitats0Julia B. Nakhleh, Julian KatzSamuels, Robert D. Nowak, Yixuan Li
479Robustness Implies Generalization via Data-Dependent Generalization Bounds0Jiaoyang Huang, Kenji Kawaguchi, Kyle Luh, Zhun Deng
480Generating Distributional Adversarial Examples to Evade Statistical Detectors0Krishnaram Kenthapadi, Muhammad Bilal Zafar, Nathalie Rauschmayr, Sergül Aydöre, Yigitcan Kaya
481Secure Quantized Training for Deep Learning0Ke Sun, Marcel Keller
482A Convergent and Dimension-Independent Min-Max Optimization Algorithm0Nisheeth K. Vishnoi, Oren Mangoubi, Sushant Sachdeva, Vijay Keswani
483Neural Network Poisson Models for Behavioural and Neural Spike Train Data0Amir Dezfouli, Ehsan Arabzadeh, Forough Habibollahi, Moein Khajehnejad, Peter Dayan, Richard Nock
484Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling0Gauri Joshi, Pranay Sharma, Sajad Khodadadian, Siva Theja Maguluri
485Multi-Level Branched Regularization for Federated Learning0Bohyung Han, Geeho Kim, Jinkyu Kim
486Learning fair representation with a parametric integral probability metric0Dongha Kim, Ilsang Ohn, Insung Kong, Kunwoong Kim, Yongdai Kim
487Dataset Condensation via Efficient Synthetic-Data Parameterization0Hwanjun Song, Hyun Oh Song, JangHyun Kim, Jinuk Kim, Joonhyun Jeong, JungWoo Ha, Sangdoo Yun, Seong Joon Oh
488Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance0Heeseung Kim, Sungroh Yoon, Sungwon Kim
489Variational On-the-Fly Personalization0Jangho Kim, Juntae Lee, Nojun Kwak, Simyung Chang
490Fisher SAM: Information Geometry and Sharpness Aware Minimisation0Da Li, Minyoung Kim, Shell Xu Hu, Timothy M. Hospedales
491ViT-NeT: Interpretable Vision Transformers with Neural Tree Decoder0ByoungChul Ko, JaeYeal Nam, Sangwon Kim
492Sanity Simulations for Saliency Methods0Ameet Talwalkar, Gregory Plumb, Joon Sik Kim
493Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation0Dongjun Kim, IlChul Moon, Kyungwoo Song, Seungjae Shin, Wanmo Kang
494Rotting Infinitely Many-Armed Bandits0JungHun Kim, Milan Vojnovic, SeYoung Yun
495Accelerated Gradient Methods for Geodesically Convex Optimization: Tractable Algorithms and Convergence Analysis0Insoon Yang, Jungbin Kim
496Generalizing to New Physical Systems via Context-Informed Dynamics Model0Alain Rakotomamonjy, Jérémie Donà, Matthieu Kirchmeyer, Nicolas Baskiotis, Patrick Gallinari, Yuan Yin
497SoQal: Selective Oracle Questioning for Consistency Based Active Learning of Cardiac Signals0Dani Kiyasseh, David A. Clifton, Tingting Zhu
498Curriculum Reinforcement Learning via Constrained Optimal Transport0Carlo D'Eramo, Haoyi Yang, Jan Peters, Joni Pajarinen, Pascal Klink
499Exploiting Redundancy: Separable Group Convolutional Networks on Lie Groups0David M. Knigge, David W. Romero, Erik J. Bekkers
500Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework0ChingYun Ko, Jeet Mohapatra, Lily Weng, Luca Daniel, PinYu Chen, Sijia Liu
501Transfer Learning In Differential Privacy's Hybrid-Model0Or Sheffet, Refael Kohen
502Markov Chain Monte Carlo for Continuous-Time Switching Dynamical Systems0Bastian Alt, Heinz Koeppl, Lukas Köhs
503Partial disentanglement for domain adaptation0Guangyi Chen, Kun Zhang, Lingjing Kong, Petar Stojanov, Shaoan Xie, Victor Akinwande, Weiran Yao, Yujia Zheng
504Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback0Fang Kong, Shuai Li, Yichi Zhou
505Adaptive Data Analysis with Correlated Observations0Aryeh Kontorovich, Menachem Sadigurschi, Uri Stemmer
506Controlling Conditional Language Models without Catastrophic Forgetting0Germán Kruszewski, Hady Elsahar, Marc Dymetman, Tomasz Korbak
507Batch Greenkhorn Algorithm for Entropic-Regularized Multimarginal Optimal Transport: Linear Rate of Convergence and Iteration Complexity0Massimiliano Pontil, Saverio Salzo, Vladimir R. Kostic
508Certified Adversarial Robustness Under the Bounded Support Set0Qinyuan Zheng, Yisen Wang, Yiwen Kou
509Exact Learning of Preference Structure: Single-peaked Preferences and Beyond0Edith Elkind, Sonja Kraiczy
510Reconstructing Nonlinear Dynamical Systems from Multi-Modal Time Series0Carlo Tombolini, Daniel Durstewitz, Daniel Kramer, Georgia Koppe, Philine Lou Bommer
511Probabilistic ODE Solutions in Millions of Dimensions0Jonathan Schmidt, Nathanael Bosch, Nicholas Krämer, Philipp Hennig
512Active Nearest Neighbor Regression Through Delaunay Refinement0Alexander Kravberg, Anastasiia Varava, Danica Kragic, Florian T. Pokorny, Giovanni Luca Marchetti, Vladislav Polianskii
513Functional Generalized Empirical Likelihood Estimation for Conditional Moment Restrictions0Bernhard Schölkopf, Heiner Kremer, JiaJie Zhu, Krikamol Muandet
514Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation0Shachi Deshpande, Volodymyr Kuleshov
515ActiveHedge: Hedge meets Active Learning0Bhuvesh Kumar, Jacob D. Abernethy, Venkatesh Saligrama
516Balancing Discriminability and Transferability for Source-Free Domain Adaptation0Akshay R. Kulkarni, Deepesh Mehta, Jogendra Nath Kundu, Shreyas Anand Kulkarni, Suvaansh Bhambri, Varun Jampani, Venkatesh Babu Radhakrishnan
517Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters0Sergey Kolesnikov, Vladislav Kurenkov
518Equivariant Priors for compressed sensing with unknown orientation0Anna Kuzina, Arash Behboodi, Fabio Valerio Massoli, Kumar Pratik
519Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms0Constantine Caramanis, Jeongyeol Kwon, Shie Mannor, Yonathan Efroni
520Large Batch Experience Replay0Emmanuel Rachelson, Matthieu Geist, Thibault Lahire
521FedScale: Benchmarking Model and System Performance of Federated Learning at Scale0Fan Lai, Harsha V. Madhyastha, Jiachen Liu, Mosharaf Chowdhury, Sanjay Sri Vallabh Singapuram, Xiangfeng Zhu, Yinwei Dai
522Smoothed Adaptive Weighting for Imbalanced Semi-Supervised Learning: Improve Reliability Against Unknown Distribution Data0Chao Wang, ChenNee Chuah, Henrry Gunawan, SenChing S. Cheung, Zhengfeng Lai
523Functional Output Regression with Infimal Convolution: Exploring the Huber and ε-insensitive Losses0Alex Lambert, Dimitri Bouche, Florence d'AlchéBuc, Zoltán Szabó
524Tell me why! Explanations support learning relational and causal structure0Adam Santoro, Allison C. Tam, Andrew K. Lampinen, Chen Yan, Felix Hill, Ishita Dasgupta, James L. McClelland, Jane X. Wang, Neil C. Rabinowitz, Nicholas A. Roy, Stephanie C. Y. Chan
525Generative Cooperative Networks for Natural Language Generation0Antoine Chaffin, Benjamin Piwowarski, Ewa Kijak, Jacopo Staiano, Sylvain Lamprier, Thomas Scialom, Vincent Claveau
526DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting0Hongyu Yang, Pyang Li, Shiyong Lan, Weikang Huang, Wenwu Wang, Yitong Ma
527Cooperative Online Learning in Stochastic and Adversarial MDPs0Aviv Rosenberg, Tal Lancewicki, Yishay Mansour
528PINs: Progressive Implicit Networks for Multi-Scale Neural Representations0Alexander SorkineHornung, Ricardo Silveira Cabral, Zoe Landgraf
529Co-training Improves Prompt-based Learning for Large Language Models0David A. Sontag, Hunter Lang, Monica N. Agrawal, Yoon Kim
530Goal Misgeneralization in Deep Reinforcement Learning0David Krueger, Jack Koch, Jacob Pfau, Lauro Langosco di Langosco, Lee D. Sharkey
531Marginal Tail-Adaptive Normalizing Flows0Asja Fischer, Johannes Lederer, Mike Laszkiewicz
532Bregman Proximal Langevin Monte Carlo via Bregman-Moreau Envelopes0Han Liu, Tim TszKit Lau
533Scalable Deep Reinforcement Learning Algorithms for Mean Field Games0Ayush Jain, Georgios Piliouras, Julien Pérolat, Mathieu Laurière, Matthieu Geist, Olivier Pietquin, Paul Muller, Romuald Elie, Sarah Perrin, Sertan Girgin, Theophile Cabannes
534Implicit Bias of Linear Equivariant Networks0Andrew K. Dienes, Bobak Toussi Kiani, Hannah Lawrence, Kristian G. Georgiev
535Differentially Private Maximal Information Coefficients0Aaron Johnson, Emmanuel Adéníran, John Lazarsfeld
536Entropic Gromov-Wasserstein between Gaussian Distributions0Dat Do, Dung Q. Le, Huy Nguyen, Khang Le, Nhat Ho, Tung Pham
537Neurocoder: General-Purpose Computation Using Stored Neural Programs0Hung Le, Svetha Venkatesh
538Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime0Bekzhan Kerimkulov, David Siska, JamesMichael Leahy, Lukasz Szpruch
539A Random Matrix Analysis of Data Stream Clustering: Coping With Limited Memory Resources0Florent Chatelain, Hugo Lebeau, Romain Couillet
540Neural Tangent Kernel Analysis of Deep Narrow Neural Networks0Albert No, Ernest K. Ryu, Jongmin Lee, Joo Young Choi
541Dataset Condensation with Contrastive Signals0Saehyung Lee, Sangdoo Yun, Sanghyuk Chun, Sangwon Jung, Sungroh Yoon
542Confidence Score for Source-Free Unsupervised Domain Adaptation0Dahuin Jung, Jonghyun Lee, Junho Yim, Sungroh Yoon
543A Statistical Manifold Framework for Point Cloud Data0Frank Chongwoo Park, Jinwon Choi, Seungyeon Kim, Yonghyeon Lee
544Low-Complexity Deep Convolutional Neural Networks on Fully Homomorphic Encryption Using Multiplexed Parallel Convolutions0Eunsang Lee, JongSeon No, JoonWoo Lee, Junghyun Lee, Woosuk Choi, Yongjune Kim, YoungSik Kim
545Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert0JoongHo Won, Sungdong Lee, Yoonhyung Lee
546Maslow's Hammer in Catastrophic Forgetting: Node Re-Use vs. Node Activation0Andrew M. Saxe, Claudia Clopath, Sebastian Goldt, Sebastian Lee, Stefano Sarao Mannelli
547Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian Optimization0Deokjae Lee, Hyun Oh Song, Junhyeok Lee, Seungyong Moon
548Least Squares Estimation using Sketched Data with Heteroskedastic Errors0Serena Ng, Sokbae Lee
549Why the Rich Get Richer? On the Balancedness of Random Partition Models0Changwoo J. Lee, Huiyan Sang
550Model Selection in Batch Policy Optimization0Bo Dai, George Tucker, Jonathan Lee, Ofir Nachum
551Supervised Learning with General Risk Functionals0Audrey Huang, Kamyar Azizzadenesheli, Liu Leqi, Zachary C. Lipton
552Generalized Strategic Classification and the Case of Aligned Incentives0Nir Rosenfeld, Sagi Levanon
553A Simple Unified Framework for High Dimensional Bandit Problems0Adarsh Barik, Jean Honorio, Wenjie Li
554Robust Training of Neural Networks Using Scale Invariant Architectures0Manzil Zaheer, Sanjiv Kumar, Sashank J. Reddi, Srinadh Bhojanapalli, Zhiyuan Li
555Spatial-Channel Token Distillation for Vision MLPs0Chang Xu, Minjing Dong, Xinghao Chen, Yanxi Li, Yehui Tang, Yunhe Wang
556An Analytical Update Rule for General Policy Optimization0Haibo He, Hepeng Li, Nicholas Clavette
557On Convergence of Gradient Descent Ascent: A Tight Local Analysis0Ali Jadbabaie, Farzan Farnia, Haochuan Li, Subhro Das
558On the Finite-Time Performance of the Knowledge Gradient Algorithm0Siyang Gao, Yanwen Li
559Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning0Huazhe Xu, Jiaqi Yang, Tian Gao, Yi Wu, Yunfei Li
560G2CN: Graph Gaussian Convolution Networks with Concentrated Graph Filters0Mingjie Li, Xiaojun Guo, Yifei Wang, Yisen Wang, Zhouchen Lin
561Decomposing Temporal High-Order Interactions via Latent ODEs0Robert M. Kirby, Shandian Zhe, Shibo Li
562Neural Inverse Transform Sampler0Henry Li, Yuval Kluger
563PLATINUM: Semi-Supervised Model Agnostic Meta-Learning using Submodular Mutual Information0Changbin Li, Feng Chen, Rishabh K. Iyer, Suraj Kothawade
564Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning0Baoxiang Wang, Changjie Fan, Fei Wu, Furui Liu, Jiahui Li, Jun Xiao, Kun Kuang, Long Chen
565C-MinHash: Improving Minwise Hashing with Circulant Permutation0Ping Li, Xiaoyun Li
566BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation0Caiming Xiong, Dongxu Li, Junnan Li, Steven C. H. Hoi
567Restarted Nonconvex Accelerated Gradient Descent: No More Polylogarithmic Factor in the O(ε-7/4) Complexity0Huan Li, Zhouchen Lin
568Achieving Fairness at No Utility Cost via Data Reweighing with Influence0Hongfu Liu, Peizhao Li
569High Probability Guarantees for Nonconvex Stochastic Gradient Descent with Heavy Tails0Shaojie Li, Yong Liu
570MetAug: Contrastive Learning via Meta Feature Augmentation0Bing Su, Changwen Zheng, Hui Xiong, Jiangmeng Li, Wenwen Qiang
571PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration0Hongyao Tang, Jianye Hao, Matthew E. Taylor, Pengyi Li, Tianpei Yang, Tong Sang, Wenyuan Tao, Xiaotian Hao, Yan Zheng, Zhen Wang
572CerDEQ: Certifiable Deep Equilibrium Model0Mingjie Li, Yisen Wang, Zhouchen Lin
573Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling0Hongkang Li, Jinjun Xiong, Meng Wang, PinYu Chen, Sijia Liu
574Let Invariant Rationale Discovery Inspire Graph Contrastive Learning0An Zhang, Sihang Li, TatSeng Chua, Xiang Wang, Xiangnan He, Yingxin Wu
575Difference Advantage Estimation for Multi-Agent Policy Gradients0Guangming Xie, Yueheng Li, Zongqing Lu
576Private Adaptive Optimization with Side information0Manzil Zaheer, Sashank J. Reddi, Tian Li, Virginia Smith
577Permutation Search of Tensor Network Structures via Local Sampling0Chao Li, Junhua Zeng, Qibin Zhao, Zerui Tao
578Hessian-Free High-Resolution Nesterov Acceleration For Sampling0Hongyuan Zha, Molei Tao, Ruilin Li
579Double Sampling Randomized Smoothing0Bo Li, Jiawei Zhang, Linyi Li, Tao Xie
580HousE: Knowledge Graph Embedding with Householder Parameterization0Chaozhuo Li, Di He, Hao Sun, Jianan Zhao, Qi Zhang, Rui Li, Senzhang Wang, Weiwei Deng, Xing Xie, Yanming Shen, Yiqi Wang, Yuming Liu
581Learning Multiscale Transformer Models for Sequence Generation0Bei Li, Chengbo Jiao, Jingbo Zhu, Tong Xiao, Tong Zheng, Yi Jing
582Finding Global Homophily in Graph Neural Networks When Meeting Heterophily0Caihua Shan, Dongsheng Li, Renyu Zhu, Siqiang Luo, Weining Qian, Xiang Li, Yao Cheng
583Fat-Tailed Variational Inference with Anisotropic Tail Adaptive Flows0Feynman T. Liang, Liam Hodgkinson, Michael W. Mahoney
584Exploring and Exploiting Hubness Priors for High-Quality GAN Latent Sampling0Jing Wu, Yipeng Qin, YuKun Lai, Yuanbang Liang
585Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks0Alexander Ihler, Dailin Hu, Litian Liang, Pieter Abbeel, Roy Fox, Stephen McAleer, Yaosheng Xu
586TSPipe: Learn from Teacher Faster with Pipelines0Dongsu Han, Hwijoon Lim, Jinwoo Shin, Sukmin Yun, Yechan Kim
587Order Constraints in Optimal Transport0Fabian Lim, Laura Wynter, Shiau Hong Lim
588Flow-Guided Sparse Transformer for Video Deblurring0Haoqian Wang, Henghui Ding, Jing Lin, Luc Van Gool, Radu Timofte, Xiaowan Hu, Xueyi Zou, Youliang Yan, Yuanhao Cai, Yulun Zhang
589Federated Learning with Positive and Unlabeled Data0Chao Xu, Hanting Chen, Xiaolin Gui, Xinyang Lin, Yiping Deng, Yixing Xu, Yunhe Wang
590Decentralized Online Convex Optimization in Networked Systems0Adam Wierman, Guannan Qu, Judy Gan, Yash Kanoria, Yiheng Lin
591Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration0Haoqian Wang, Jing Lin, Luc Van Gool, Xiaowan Hu, Xueyi Zou, Youliang Yan, Yuanhao Cai, Yulun Zhang
592Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks0Keane Lucas, Lujo Bauer, Mahmood Sharif, Michael K. Reiter, Weiran Lin
593Learning Augmented Binary Search Trees0David P. Woodruff, Honghao Lin, Tian Luo
594Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback0Aldo Pacchiano, Michael I. Jordan, Tianyi Lin, Yaodong Yu
595Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments0Anqi Zhang, Aurojit Panda, Jinkun Lin, Jinyang Li, Mathias Lécuyer, Siddhartha Sen
596Interactively Learning Preference Constraints in Linear Bandits0Andreas Krause, David Lindner, Katja Hofmann, Sebastian Tschiatschek
597Delayed Reinforcement Learning by Imitation0Davide Maran, Lorenzo Bisi, Marcello Restelli, Pierre Liotet
598CITRIS: Causal Identifiability from Temporal Intervened Sequences0Phillip Lippe, Sara Magliacane, Sindy Löwe, Stratis Gavves, Taco Cohen, Yuki M. Asano
599StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models0Adam Liska, Angeliki Lazaridou, Cyprien de Masson d'Autume, Devang Agrawal, Elena Gribovskaya, Ellen GilsenanMcMahon, Eren Sezener, Manzil Zaheer, Phil Blunsom, Sophia Austin, Susannah Young, Tayfun Terzi, Tim Scholtes, Tomás Kociský
600Distributionally Robust Q-Learning0Jose H. Blanchet, Perry Dong, Qinxun Bai, Wei Xu, Zhengqing Zhou, Zhengyuan Zhou, Zijian Liu
601Constrained Variational Policy Optimization for Safe Reinforcement Learning0Bo Li, Ding Zhao, Vladislav Isenbaev, Wei Liu, Zhepeng Cen, Zhiwei Steven Wu, Zuxin Liu
602Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint0Hao Liu, Minshuo Chen, Siawpeng Er, Tong Zhang, Tuo Zhao, Wenjing Liao
603Boosting Graph Structure Learning with Dummy Nodes0Jiayang Cheng, Xin Jiang, Xin Liu, Yangqiu Song
604Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent0Bin Li, Houqiang Li, Huacong Jiang, Weiming Liu
605Deep Probability Estimation0Aakash Kaku, Boyang Yu, Carlos FernandezGranda, Haoxiang Huang, Jonathan NilesWeed, Laure Zanna, Matan Leibovich, Narges Razavian, Sheng Liu, Sreyas Mohan, Weicheng Zhu
606Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers0Alexandre Muzio, Hany Hassan, Rui Liu, Young Jin Kim
607Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games0Luke Marris, Marc Lanctot, Nicolas Heess, Siqi Liu
608Rethinking Attention-Model Explainability through Faithfulness Violation Test0Chenqi Kong, Haoliang Li, Jing Li, Shiqi Wang, Yangyang Guo, Yibing Liu
609Optimization-Derived Learning with Essential Convergence Analysis of Training and Hyper-training0Jin Zhang, Risheng Liu, Shangzhi Zeng, Xuan Liu, Yixuan Zhang
610Deep Neural Network Fusion via Graph Matching with Applications to Model Ensemble and Federated Learning0Alan Yuhan Xi, Chang Liu, Chenfei Lou, Junchi Yan, Li Shen, Runzhong Wang
611Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy0Miao Lu, Michael I. Jordan, Zhaoran Wang, Zhihan Liu, Zhuoran Yang
612Generating 3D Molecules for Target Protein Binding0Kanji Uchino, Koji Maruhashi, Meng Liu, Shuiwang Ji, Youzhi Luo
613Communication-efficient Distributed Learning for Large Batch Optimization0Barzan Mozafari, Rui Liu
614Adaptive Accelerated (Extra-)Gradient Methods with Variance Reduction0Alina Ene, Huy L. Nguyen, Ta Duy Nguyen, Zijian Liu
615REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer0Deepak Pathak, Kris Kitani, Xingyu Liu
616Kill a Bird with Two Stones: Closing the Convergence Gaps in Non-Strongly Convex Optimization by Directly Accelerated SVRG with Double Compensation and Snapshots0Fanhua Shang, Hongying Liu, Weixin An, Yuanyuan Liu, Zhouchen Lin
617Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits0Chi Jin, Qinghua Liu, Yuanhao Wang
618Local Augmentation for Graph Neural Networks0Dinghao Wu, Hanze Dong, Junzhou Huang, Lanqing Li, Peilin Zhao, Rex Ying, Songtao Liu, Tingyang Xu, Yu Rong
619Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using Language0Alexander G. Schwing, IouJen Liu, MarcAlexandre Côté, PierreYves Oudeyer, Xingdi Yuan
620Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation0Yufeng Zhang, Zhaoran Wang, Zhihan Liu, Zhuoran Yang, Zuyue Fu
621GACT: Activation Compressed Training for Generic Network Architectures0Alvin Cheung, Dequan Wang, Jianfei Chen, Jie Tang, Joey Gonzalez, Lianmin Zheng, Michael W. Mahoney, Weize Chen, Xiaoxuan Liu, Xu Han, Yukuo Cen, Zhiyuan Liu
622Robust Training under Label Noise by Over-parameterization0Chong You, Qing Qu, Sheng Liu, Zhihui Zhu
623Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization0Jianye Hao, Jun Wang, Minghuan Liu, Weinan Zhang, Yong Yu, Yuzheng Zhuang, Zhengbang Zhu
624On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games0Frans A. Oliehoek, Robert Tyler Loftin
625AutoIP: A United Framework to Integrate Physics into Gaussian Processes0Aditi S. Krishnapriyan, Da Long, Michael W. Mahoney, Robert M. Kirby, Shandian Zhe, Zheng Wang
626Bayesian Model Selection, the Marginal Likelihood, and Generalization0Andrew Gordon Wilson, Gregory W. Benton, Micah Goldblum, Pavel Izmailov, Sanae Lotfi
627Feature Learning and Signal Propagation in Deep Neural Networks0Chris E. Mingard, Soufiane Hayou, Yizhang Lou
628Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension0Bruno Loureiro, Cédric Gerbelot, Florent Krzakala, Gabriele Sicuro, Maria Refinetti
629A Single-Loop Gradient Descent and Perturbed Ascent Algorithm for Nonconvex Functional Constrained Optimization0Songtao Lu
630Additive Gaussian Processes Revisited0Alexis Boukouvalas, James Hensman, Xiaoyu Lu
631ModLaNets: Learning Generalisable Dynamics via Modularity and Physical Inductive Bias0Guanqi Chen, Jia Pan, Shijie Lin, Yupu Lu
632Model-Free Opponent Shaping0Christian A. Schröder de Witt, Christopher Lu, Jakob N. Foerster, Timon Willi
633Multi-slots Online Matching with High Entropy0Qintong Wu, Wenliang Zhong, Xingyu Lu
634Maximum Likelihood Training for Score-based Diffusion ODEs by High Order Denoising Score Matching0Cheng Lu, Chongxuan Li, Fan Bao, Jianfei Chen, Jun Zhu, Kaiwen Zheng
635Orchestra: Unsupervised Federated Learning via Globally Consistent Clustering0Akhil Mathur, Chi Ian Tang, Ekdeep Singh Lubana, Fahim Kawsar, Robert P. Dick
636A Rigorous Study of Integrated Gradients Method and Extensions to Internal Neuron Attributions0Daniel Lundström, Meisam Razaviyayn, Tianjian Huang
637BAMDT: Bayesian Additive Semi-Multivariate Decision Trees for Nonparametric Regression0Bani K. Mallick, Huiyan Sang, Zhao Tang Luo
638Disentangled Federated Learning for Tackling Attributes Skew via Invariant Aggregation and Diversity Transferring0Tieniu Tan, Yunlong Wang, Zhenan Sun, Zhengquan Luo, Zilei Wang
639Channel Importance Matters in Few-Shot Image Classification0Jing Xu, Xu Luo, Zenglin Xu
640Learning Dynamics and Generalization in Deep Reinforcement Learning0Clare Lyle, Mark Rowland, Marta Kwiatkowska, Will Dabney, Yarin Gal
641On Finite-Sample Identifiability of Contrastive Learning-Based Nonlinear Independent Component Analysis0Qi Lyu, Xiao Fu
642Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning0Boxiang Lyu, Mladen Kolar, Zhaoran Wang, Zhuoran Yang
643Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching0Andrew Shen, Dinesh Jayaraman, Osbert Bastani, Yecheng Jason Ma
644Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding0Fan Zhou, Hao Zhang, Haotian Ma, Quanshi Zhang, Yinqing Zhang
645Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings0Jan MacDonald, Mathieu Besançon, Sebastian Pokutta
646A Tighter Analysis of Spectral Clustering, and Beyond0He Sun, Peter Macgregor
647Zero-Shot Reward Specification via Grounded Natural Language0Deepak Pathak, Parsa Mahmoudieh, Trevor Darrell
648Feature selection using e-values0Snigdhansu Chatterjee, Subhabrata Majumdar
649SSL Enables Learning from Sparse Rewards in Image-Goal Navigation0Arjun Majumdar, Dhruv Batra, Gaurav S. Sukhatme, Gunnar A. Sigurdsson, Jesse Thomason, Robinson Piramuthu
650Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations0Bodhisattwa Prasad Majumder, Julian J. McAuley, Oana Camburu, Thomas Lukasiewicz
651Nonparametric Involutive Markov Chain Monte Carlo0Carol Mak, Fabian Zaiser, Luke Ong
652Architecture Agnostic Federated Learning for Neural Networks0Disha Makhija, Joydeep Ghosh, Nhat Ho, Xing Han
653Robustness in Multi-Objective Submodular Optimization: a Quantile Approach0Cédric Malherbe, Kevin Scaman
654More Efficient Sampling for Tensor Decomposition With Worst-Case Guarantees0Osman Asif Malik
655Unaligned Supervision for Automatic Music Transcription in The Wild0Amit H. Bermano, Ben Maman
656Decision-Focused Learning: Through the Lens of Learning to Rank0Jayanta Mandi, Maxime Mulamba Ke Tchomba, Tias Guns, Víctor Bucarey
657Differentially Private Coordinate Descent for Composite Empirical Risk Minimization0Aurélien Bellet, Joseph Salmon, Marc Tommasi, Paul Mangold
658Refined Convergence Rates for Maximum Likelihood Estimation under Finite Mixture Models0Nhat Ho, Tudor A. Manole
659On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning0Kaiqing Zhang, Lin Yang, Tamer Basar, Weichao Mao
660On the Effects of Artificial Data Modification0Adam PrügelBennett, Antonia Marcu
661Personalized Federated Learning through Local Memorization0Giovanni Neglia, Laetitia Kameni, Othmane Marfoq, Richard Vidal
662Nested Bandits0Houssam Zenati, Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier
663Closed-Form Diffeomorphic Transformations for Time Series Alignment0Elisabeth Viles, Igor G. Olaizola, Iñigo Martinez
664SPECTRE: Spectral Conditioning Helps to Overcome the Expressivity Limits of One-shot Graph Generators0Andreas Loukas, Karolis Martinkus, Nathanaël Perraudin, Roger Wattenhofer
665Modular Conformal Calibration0Charles Marx, Shengjia Zhao, Stefano Ermon, Willie Neiswanger
666Continual Repeated Annealed Flow Transport Monte Carlo0Alexander G. de G. Matthews, Arnaud Doucet, Danilo Jimenez Rezende, Michael Arbel
667How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation0Augustine N. MavorParker, Caswell Barry, Kimberly A. Young, Lewis D. Griffin
668How to Steer Your Adversary: Targeted and Efficient Model Stealing Defenses with Gradient Redirection0Bo Li, David A. Forsyth, Mantas Mazeika
669Quant-BnB: A Scalable Branch-and-Bound Method for Optimal Decision Trees with Continuous Features0Haoyue Wang, Rahul Mazumder, Xiang Meng
670Optimizing Tensor Network Contraction Using Reinforcement Learning0Eli A. Meirom, Gal Chechik, Haggai Maron, Shie Mannor
671Causal Transformer for Estimating Counterfactual Outcomes0Dennis Frauen, Stefan Feuerriegel, Valentyn Melnychuk
672Steerable 3D Spherical Neurons0Michael Felsberg, Mårten Wadenbäck, Pavlo Melnyk
673Transformers are Meta-Reinforcement Learners0Luckeciano C. Melo
674ButterflyFlow: Building Invertible Layers with Butterfly Matrices0Chenlin Meng, Kristy Choi, Linqi Zhou, Stefano Ermon, Tri Dao
675In defense of dual-encoders for neural ranking0Aditya Krishna Menon, Ankit Singh Rawat, Sadeep Jayasumana, Sanjiv Kumar, Sashank J. Reddi, Seungyeon Kim
676Equivariant Quantum Graph Circuits0Ismail Ilkan Ceylan, Konstantinos Meichanetzidis, Péter Mernyei
677Stochastic Rising Bandits0Alberto Maria Metelli, Francesco Trovò, Marcello Restelli, Matteo Pirola
678Minimizing Control for Credit Assignment with Strong Feedback0Alexander Meulemans, Benjamin F. Grewe, João Sacramento, Maria R. Cervera, Matilde Tristany Farinha
679A Dynamical System Perspective for Lipschitz Neural Networks0Alexandre Allauzen, Alexandre Araujo, Blaise Delattre, Laurent Meunier
680Distribution Regression with Sliced Wasserstein Kernels0Carlo Ciliberto, Dimitri Meunier, Massimiliano Pontil
681Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism0Mia Liu, Pan Li, Siqi Miao
682Modeling Structure with Undirected Neural Networks0André F. T. Martins, Tsvetomila Mihaylova, Vlad Niculae
683Universal Hopfield Networks: A General Framework for Single-Shot Associative Memory Models0Beren Millidge, Rafal Bogacz, Thomas Lukasiewicz, Tommaso Salvatori, Yuhang Song
684Learning Stochastic Shortest Path with Linear Function Approximation0Jiafan He, Quanquan Gu, Tianhao Wang, Yifei Min
685Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt0Adrien Morisot, Aidan N. Gomez, Andreas Kirsch, Benedikt Höltgen, Jan Markus Brauner, Mrinank Sharma, Muhammed Razzak, Sebastian Farquhar, Sören Mindermann, Winnie Xu, Yarin Gal
686POEM: Out-of-Distribution Detection with Posterior Sampling0Yifei Ming, Ying Fan, Yixuan Li
687A Simple Reward-free Approach to Constrained Reinforcement Learning0Chi Jin, Sobhan Miryoosefi
688Wide Neural Networks Forget Less Catastrophically0Arslan Chaudhry, Dilan Görür, Dong Yin, Huiyi Hu, Mehrdad Farajtabar, Razvan Pascanu, SeyedIman Mirzadeh
689Proximal and Federated Random Reshuffling0Ahmed Khaled, Konstantin Mishchenko, Peter Richtárik
690ProxSkip: Yes! Local Gradient Steps Provably Lead to Communication Acceleration! Finally!0Grigory Malinovsky, Konstantin Mishchenko, Peter Richtárik, Sebastian U. Stich
691Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions0Aaron Mishkin, Arda Sahiner, Mert Pilanci
692Memory-Based Model Editing at Scale0Antoine Bosselut, Charles Lin, Chelsea Finn, Christopher D. Manning, Eric Mitchell
693Invariant Ancestry Search0Jonas Peters, Nikolaj Thams, Phillip B. Mogensen
694Differentially Private Community Detection for Stochastic Block Models0Anil Vullikanti, Dung Nguyen, Mohamed S. Mohamed, Ravi Tandon
695A Multi-objective / Multi-task Learning Framework Induced by Pareto Stationarity0Chaosheng Dong, Jia Liu, Michinari Momma
696EqR: Equivariant Representations for Data-Efficient Reinforcement Learning0Arnab Kumar Mondal, Kaleem Siddiqi, Siamak Ravanbakhsh, Vineet Jain
697Feature and Parameter Selection in Stochastic Linear Bandits0Ahmadreza Moradipari, Berkay Turan, Mahnoosh Alizadeh, Mohammad Ghavamzadeh, Yasin AbbasiYadkori
698Power-Law Escape Rate of SGD0Kangqiao Liu, Liu Ziyin, Masahito Ueda, Takashi Mori
699Rethinking Fano's Inequality in Ensemble Learning0Gaku Morio, Hiroaki Ozaki, Nobuo Nukaga, Shota Horiguchi, Terufumi Morishita
700SpeqNets: Sparsity-aware permutation-equivariant graph networks0Christopher Morris, Gaurav Rattan, Sandra Kiefer, Siamak Ravanbakhsh
701CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer0Jianyu Chen, Mingyu Ding, Ping Luo, Runjian Chen, Shoufa Chen, Yao Mark Mu
702Generalized Beliefs for Cooperative AI0Christian A. Schröder de Witt, Darius Muglich, Jakob N. Foerster, Luisa M. Zintgraf, Shimon Whiteson
703Bounding the Width of Neural Networks via Coupled Initialization A Worst Case Analysis0Alexander Munteanu, David P. Woodruff, Simon Omlor, Zhao Song
704Constants Matter: The Performance Gains of Active Learning0Sanjoy Dasgupta, Stephen O. Mussmann
705On the Generalization Analysis of Adversarial Learning0Marius Kloft, Waleed Mustafa, Yunwen Lei
706Universal and data-adaptive algorithms for model selection in linear contextual bandits0Akshay Krishnamurthy, Vidya K. Muthukumar
707The Importance of Non-Markovianity in Maximum State Entropy Exploration0Marcello Restelli, Mirco Mutti, Riccardo De Santi
708PAC-Net: A Model Pruning Approach to Inductive Transfer Learning0Changwook Jeong, Daesin Kim, In Huh, Jae Myung Choe, Jisu Ryu, KeeEung Kim, Sanghoon Myung, Wonik Jang
709AutoSNN: Towards Energy-Efficient Spiking Neural Networks0Byunggook Na, Dongjin Lee, Hyeokjun Choe, Jisoo Mok, Seongsik Park, Sungroh Yoon
710Implicit Bias of the Step Size in Linear Diagonal Neural Networks0Daniel Soudry, Kavya Ravichandran, Mor Shpigel Nacson, Nathan Srebro
711DNNR: Differential Nearest Neighbors Regression0Leon Sixt, Tim Landgraf, Youssef Nader
712Overcoming Oscillations in Quantization-Aware Training0Marios Fournarakis, Markus Nagel, Tijmen Blankevoort, Yelysei Bondarenko
713Strategic Representation0Ganesh Ghalme, Inbal TalgamCohen, Nir Rosenfeld, Vineet Nair
714Improving Ensemble Distillation With Weight Averaging and Diversifying Perturbation0Byeongho Heo, Giung Nam, Hyungi Lee, Juho Lee
715Measuring Representational Robustness of Neural Networks Through Shared Invariances0Adrian Weller, Camila Kolling, John P. Dickerson, Krishna P. Gummadi, Till Speicher, Vedant Nanda
716Tight and Robust Private Mean Estimation with Few Users0Hossein Esfandiari, Shyam Narayanan, Vahab S. Mirrokni
717Fast Aquatic Swimmer Optimization with Differentiable Projective Dynamics and Neural Network Hydrodynamic Models0Benjamin F. Grewe, Elvis Nava, John Z. Zhang, Mike Yan Michelis, Pingchuan Ma, Robert Kevin Katzschmann, Tao Du, Wojciech Matusik
718Multi-Task Learning as a Bargaining Game0Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Haggai Maron, Idan Achituve, Kenji Kawaguchi
719Variational Inference for Infinitely Deep Neural Networks0Achille Nazaret, David M. Blei
720Stable Conformal Prediction Sets0Eugène Ndiaye
721Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning0Aviv Netanyahu, Joshua B. Tenenbaum, Pulkit Agrawal, Tianmin Shu
722Sublinear-Time Clustering Oracle for Signed Graphs0Pan Peng, Stefan Neumann
723Improved Regret for Differentially Private Exploration in Linear MDP0Dung Daniel T. Ngo, Giuseppe Vietri, Steven Wu
724A Framework for Learning to Request Rich and Contextually Useful Information from Humans0Hal Daumé III, Khanh X. Nguyen, Yonatan Bisk
725Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling0Aditya Grover, Tung Nguyen
726Improving Transformers with Probabilistic Attention Keys0Dung D. D. Le, Duy Khuong Nguyen, Nhat Ho, Richard G. Baraniuk, Stanley J. Osher, Tam Minh Nguyen, Tan Minh Nguyen, VietAnh Tran
727On Transportation of Mini-batches: A Hierarchical Approach0Dang Nguyen, Dinh Phung, Hung Bui, Khai Nguyen, Nhat Ho, Quoc Dinh Nguyen, Trung Le, Tung Pham
728Improving Mini-batch Optimal Transport via Partial Transportation0Dang Nguyen, Khai Nguyen, Nhat Ho, TheAnh VuLe, Tung Pham
729Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs0Benjamin Eysenbach, Ruslan Salakhutdinov, Tianwei Ni
730Optimal Estimation of Policy Gradient via Double Fitted Iteration0Chengzhuo Ni, Mengdi Wang, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang
731GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models0Aditya Ramesh, Alexander Quinn Nichol, Bob McGrew, Ilya Sutskever, Mark Chen, Pamela Mishkin, Prafulla Dhariwal, Pranav Shyam
732Diffusion Models for Adversarial Purification0Animashree Anandkumar, Arash Vahdat, Brandon Guo, Chaowei Xiao, Weili Nie, Yujia Huang
733The Primacy Bias in Deep Reinforcement Learning0Aaron C. Courville, Evgenii Nikishin, Max Schwarzer, Pierluca D'Oro, PierreLuc Bacon
734Causal Conceptions of Fairness and their Consequences0Hamed Nilforoshan, Johann D. Gaebler, Ravi Shroff, Sharad Goel
735Efficient Test-Time Model Adaptation without Forgetting0Jiaxiang Wu, Mingkui Tan, Peilin Zhao, Shijian Zheng, Shuaicheng Niu, Yaofo Chen, Yifan Zhang
736Generative Trees: Adversarial and Copycat0Mathieu GuillameBert, Richard Nock
737Path-Aware and Structure-Preserving Generation of Synthetically Accessible Molecules0DaeWoong Jeong, Honglak Lee, Juhwan Noh, Kiyoung Kim, Moontae Lee, Sehui Han, Yousung Jung
738Utilizing Expert Features for Contrastive Learning of Time-Series Representations0David Reeb, Ingo Steinwart, Lukas Oldenburg, Manuel T. Nonnenmacher
739Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval0Aidan N. Gomez, Debora S. Marks, Javier MarchenaHurtado, Jonathan Frazer, Mafalda Dias, Pascal Notin, Yarin Gal
740Fast Finite Width Neural Tangent Kernel0Jascha SohlDickstein, Roman Novak, Samuel S. Schoenholz
741Multicoated Supermasks Enhance Hidden Networks0Jaehoon Yu, Kazushi Kawamura, Kazutoshi Hirose, Kota Ando, Masato Motomura, Thiem Van Chu, Yasuyuki Okoshi, Ángel López GarcíaArias
742Generalized Leverage Scores: Geometric Interpretation and Applications0Antonis Matakos, Aristides Gionis, Bruno Ordozgoiti
743Practical Almost-Linear-Time Approximation Algorithms for Hybrid and Overlapping Graph Clustering0Charalampos E. Tsourakakis, Konstantinos Ameranis, Kunal Talwar, Lorenzo Orecchia
744Anticorrelated Noise Injection for Improved Generalization0Antonio Orvieto, Aurélien Lucchi, Francis R. Bach, Frank Proske, Hans Kersting
745Scalable Deep Gaussian Markov Random Fields for General Graphs0Fredrik Lindsten, Joel Oskarsson, Per Sidén
746Zero-shot AutoML with Pretrained Models0Ekrem Öztürk, Fabio Ferreira, Frank Hutter, Hadi S. Jomaa, Josif Grabocka, Lars SchmidtThieme
747History Compression via Language Models in Reinforcement Learning0Angela BittoNemling, Fabian Paischer, Hamid EghbalZadeh, Markus Holzleitner, Sebastian Lehner, Sepp Hochreiter, Thomas Adler, Vihang Patil
748A Study on the Ramanujan Graph Property of Winning Lottery Tickets0Arindam Biswas, Biswajit Basu, Bithika Pal, Pabitra Mitra, Sudeshna Kolay
749On Learning Mixture of Linear Regressions in the Non-Realizable Setting0Arya Mazumdar, Avishek Ghosh, Rajat Sen, Soumyabrata Pal
750Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification0Huazhe Xu, Ling Pan, Longbo Huang, Tengyu Ma
751A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks0Ao Liu, Jingquan Wang, Nannan Li, Yu Pan, Zenglin Xu, Zeyong Su
752Robustness and Accuracy Could Be Reconcilable by (Proper) Definition0Jun Zhu, Min Lin, Shuicheng Yan, Tianyu Pang, Xiao Yang
753Towards Coherent and Consistent Use of Entities in Narrative Generation0Kris Cao, Pinelopi Papalampidi, Tomás Kociský
754Constrained Discrete Black-Box Optimization using Mixed-Integer Programming0Christian Tjandraatmadja, David Belanger, Juan Pablo Vielma, Ross Anderson, Theodore P. Papalexopoulos
755A Theoretical Comparison of Graph Neural Network Extensions0Pál András Papp, Roger Wattenhofer
756Validating Causal Inference Methods0Carlos Varjao, Eric Tchetgen Tchetgen, Harsh Parikh, Louise Xu
757The Unsurprising Effectiveness of Pre-Trained Vision Models for Control0Abhinav Gupta, Aravind Rajeswaran, Senthil Purushwalkam, Simone Parisi
758Learning Symmetric Embeddings for Equivariant World Models0JanWillem van de Meent, Jung Yeon Park, Linfeng Zhao, Ondrej Biza, Robin Walters
759Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness0Namuk Park, Songkuk Kim
760Exact Optimal Accelerated Complexity for Fixed-Point Iterations0Ernest K. Ryu, Jisun Park
761Kernel Methods for Radial Transformed Compositional Data with Many Zeros0Changwon Yoon, Cheolwoo Park, Jeongyoun Ahn, Junyoung Park
762Evolving Curricula with Regret-Based Environment Design0Edward Grefenstette, Jack ParkerHolder, Jakob N. Foerster, Michael Dennis, Mikayel Samvelyan, Minqi Jiang, Tim Rocktäschel
763Neural Language Models are not Born Equal to Fit Brain Data, but Training Helps0Alexandre Pasquiou, Bertrand Thirion, Christophe Pallier, John T. Hale, Yair Lakretz
764A new similarity measure for covariate shift with applications to nonparametric regression0Cong Ma, Martin J. Wainwright, Reese Pathak
765Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution0Johannes Brandstetter, José Antonio ArjonaMedina, MariusConstantin Dinu, Markus Hofmarcher, Matthias Dorfer, Patrick M. Blies, Sepp Hochreiter, Vihang Patil
766POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging0Ion Stoica, Joseph Gonzalez, Paras Jain, Prabal Dutta, Shishir G. Patil
767Learning to Cut by Looking Ahead: Cutting Plane Selection via Imitation Learning0Andreas Krause, Chris J. Maddison, Giulia Zarpellon, Laurent Charlin, Max B. Paulus
768Neural Network Pruning Denoises the Features and Makes Local Connectivity Emerge in Visual Tasks0Franco Pellegrini, Giulio Biroli
769Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding0Ian R. Lane, Shinji Watanabe, Siddharth Dalmia, Yifan Peng
770Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets0Jian Peng, Jianzhu Ma, Jiaqi Guan, Qi Xie, Shitong Luo, Xingang Peng
771Differentiable Top-k Classification Learning0Christian Borgelt, Felix Petersen, Hilde Kuehne, Oliver Deussen
772Multi-scale Feature Learning Dynamics: Insights for Double Descent0Amartya Mitra, Guillaume Lajoie, Mohammad Pezeshki, Yoshua Bengio
773A Differential Entropy Estimator for Training Neural Networks0Georg Pichler, Günther Koliander, Malik Boudiaf, Pablo Piantanida, Pierre Jean A. Colombo
774Federated Learning with Partial Model Personalization0Abdelrahman Mohamed, Krishna Pillutla, Kshitiz Malik, Lin Xiao, Maziar Sanjabi, Michael G. Rabbat
775Deep Networks on Toroids: Removing Symmetries Reveals the Structure of Flat Regions in the Landscape Geometry0Antonio Ferraro, Carlo Baldassi, Christoph Feinauer, Fabrizio Pittorino, Gabriele Perugini, Riccardo Zecchina
776Geometric Multimodal Contrastive Representation Learning0Ana Paiva, Danica Kragic, Francisco S. Melo, Hang Yin, Miguel Vasco, Petra Poklukar
777Constrained Offline Policy Optimization0Bruno C. da Silva, Jithin Jagannath, Madalina Fiterau, Nicholas Polosky
778Offline Meta-Reinforcement Learning with Online Self-Supervision0Ashvin Nair, Catherine Huang, Laura Smith, Sergey Levine, Vitchyr H. Pong
779Debiaser Beware: Pitfalls of Centering Regularized Transport Maps0AramAlexandre Pooladian, Jonathan NilesWeed, Marco Cuturi
780Adaptive Second Order Coresets for Data-efficient Machine Learning0Baharan Mirzasoleiman, David Davini, Omead Pooladzandi
781On the Practicality of Deterministic Epistemic Uncertainty0Federico Tombari, Fisher Yu, Janis Postels, Luc Van Gool, Luca Daniel Sieber, Mattia Segù, Tao Sun
782A Simple Guard for Learned Optimizers0Isabeau PrémontSchwarz, Jan Feyereisl, Jaroslav Vitku
783Hardness and Algorithms for Robust and Sparse Optimization0Eric Price, Samson Zhou, Sandeep Silwal
784Nonlinear Feature Diffusion on Hypergraphs0Austin R. Benson, Francesco Tudisco, Konstantin Prokopchik
785Universal Joint Approximation of Manifolds and Densities by Simple Injective Flows0Ivan Dokmanic, Maarten V. de Hoop, Matti Lassas, Michael Puthawala
786The Teaching Dimension of Regularized Kernel Learners0Aimin Zhou, ChenXi Su, Hong Qian, XuHui Liu, Yang Yu
787ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers0ChengI Lai, David D. Cox, Heting Gao, Junrui Ni, Kaizhi Qian, Mark HasegawaJohnson, Shiyu Chang, Yang Zhang
788Interventional Contrastive Learning with Meta Semantic Regularizer0Bing Su, Changwen Zheng, Hui Xiong, Jiangmeng Li, Wenwen Qiang
789Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost0Dan Qiao, Ming Min, Ming Yin, YuXiang Wang
790Generalizing to Evolving Domains with Latent Structure-Aware Sequential Autoencoder0Haoliang Li, Shiqi Wang, Tiexin Qin
791Graph Neural Architecture Search Under Distribution Shifts0Pengtao Xie, Wenwu Zhu, Xin Wang, Yijian Qin, Ziwei Zhang
792Spectral Representation of Robustness Measures for Optimization Under Input Uncertainty0Ivo Couckuyt, Jixiang Qing, Tom Dhaene
793Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence0Lijun Zhang, Quanqi Hu, Tianbao Yang, Yongjian Zhong, ZiHao Qiu
794Latent Outlier Exposure for Anomaly Detection with Contaminated Data0Aodong Li, Chen Qiu, Maja Rudolph, Marius Kloft, Stephan Mandt
795Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning0Chenjia Bai, Lingxiao Wang, Shuang Qiu, Zhaoran Wang, Zhuoran Yang
796Fast and Provable Nonconvex Tensor RPCA0Deyu Meng, Haiquan Qiu, Quanming Yao, Shaojie Tang, Yao Wang
797Generalized Federated Learning via Sharpness Aware Minimization0Bo Tang, Rui Duan, Xingyu Li, Yao Liu, Zhe Qu, Zhuo Lu
798Particle Transformer for Jet Tagging0Congqiao Li, Huilin Qu, Sitian Qian
799Winning the Lottery Ahead of Time: Efficient Early Network Pruning0Bertrand Charpentier, Daniel Zügner, John Rachwan, Morgane Ayle, Simon Geisler, Stephan Günnemann
800Convergence of Uncertainty Sampling for Active Learning0Anant Raj, Francis R. Bach
801DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale0Ammar Ahmad Awan, Conglong Li, Jeff Rasley, Minjia Zhang, Reza Yazdani Aminabadi, Samyam Rajbhandari, Yuxiong He, Zhewei Yao
802Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization0Alexandre Ramé, Corentin Dancette, Matthieu Cord
803A Closer Look at Smoothness in Domain Adversarial Training0Arihant Jain, Harsh Rangwani, Mayank Mishra, Sumukh K. Aithal, Venkatesh Babu Radhakrishnan
804Linear Adversarial Concept Erasure0Michael Twiton, Ryan Cotterell, Shauli Ravfogel, Yoav Goldberg
805Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks0Asaf Maman, Nadav Cohen, Noam Razin
806One-Pass Algorithms for MAP Inference of Nonsymmetric Determinantal Point Processes0Anup B. Rao, Aravind Reddy, Eunyee Koh, Gang Wu, Nedim Lipka, Nesreen K. Ahmed, Ryan A. Rossi, Tung Mai, Zhao Song
807Universality of Winning Tickets: A Renormalization Group Perspective0Akshunna S. Dogra, Tianlong Chen, William T. Redman, Zhangyang Wang
808The dynamics of representation learning in shallow, non-linear autoencoders0Maria Refinetti, Sebastian Goldt
809Proximal Exploration for Model-guided Protein Sequence Design0Fan Ding, Jiahan Li, Jian Peng, Jianzhu Ma, Yuan Zhou, Zhizhou Ren
810Towards Theoretical Analysis of Transformation Complexity of ReLU DNNs0Jie Ren, Meng Zhou, Mingjie Li, Quanshi Zhang, ShihHan Chan
811Benchmarking and Analyzing Point Cloud Classification under Corruptions0Jiawei Ren, Liang Pan, Ziwei Liu
812A Unified View on PAC-Bayes Bounds for Meta-Learning0Arezou Rezazadeh
8133PC: Three Point Compressors for Communication-Efficient Distributed Training and a Better Theory for Lazy Aggregation0Eduard Gorbunov, Elnur Gasanov, Igor Sokolov, Ilyas Fatkhullin, Peter Richtárik, Zhize Li
814Robust SDE-Based Variational Formulations for Solving Linear PDEs via Deep Learning0Julius Berner, Lorenz Richter
815Probabilistically Robust Learning: Balancing Average and Worst-case Performance0Alexander Robey, George J. Pappas, Hamed Hassani, Luiz F. O. Chamon
816LyaNet: A Lyapunov Framework for Training Neural ODEs0Aaron D. Ames, Ivan Dario Jimenez Rodriguez, Yisong Yue
817Short-Term Plasticity Neurons Learning to Learn and Forget0Hector Garcia Rodriguez, Qinghai Guo, Timoleon Moraitis
818Function-space Inference with Sparse Implicit Processes0Bryan Zaldivar, Daniel HernándezLobato, Simón Rodríguez Santana
819Score Matching Enables Causal Discovery of Nonlinear Additive Noise Models0Bernhard Schölkopf, Chris Russell, Dominik Janzing, Francesco Locatello, Matthäus Kleindessner, Paul Rolland, Volkan Cevher
820Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images0Michal WeilerSagie, Tamir Hazan, Tom Ron
821A Consistent and Efficient Evaluation Strategy for Attribution Methods0Enkelejda Kasneci, Gjergji Kasneci, Tobias Leemann, Vadim Borisov, Yao Rong
822Efficiently Learning the Topology and Behavior of a Networked Dynamical System Via Active Queries0Abhijin Adiga, Anil Vullikanti, Daniel J. Rosenkrantz, Madhav V. Marathe, Richard Edwin Stearns, S. S. Ravi, Zirou Qiu
823Learning to Infer Structures of Network Games0Emanuele Rossi, Federico Monti, Michael M. Bronstein, Xiaowen Dong, Yan Leng
824Direct Behavior Specification via Constrained Reinforcement Learning0Christopher J. Pal, Joshua Romoff, Julien Roy, PierreLuc Bacon, Roger Girgis
825Constraint-based graph network simulator0Alvaro SanchezGonzalez, Peter W. Battaglia, Tobias Pfaff, Yulia Rubanova
826Continual Learning via Sequential Function-Space Variational Inference0Freddie Bickford Smith, Qixuan Feng, Tim G. J. Rudner, Yarin Gal, Yee Whye Teh
827Graph-Coupled Oscillator Networks0Ben Chamberlain, James Rowbottom, Michael M. Bronstein, Siddhartha Mishra, T. Konstantin Rusch
828Hindering Adversarial Attacks with Implicit Neural Representations0Andrei A. Rusu, Dan Andrei Calian, Raia Hadsell, Sven Gowal
829Exploiting Independent Instruments: Identification and Distribution Generalization0Jonas Peters, Leonard Henckel, Niklas Pfister, Sorawit Saengkyongam
830FedNL: Making Newton-Type Methods Applicable to Federated Learning0Mher Safaryan, Peter Richtárik, Rustem Islamov, Xun Qian
831Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences0Aadirupa Saha, Pierre Gaillard
832Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits0Aadirupa Saha, Shubham Gupta
833Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers0Arda Sahiner, Batu Ozturkler, John M. Pauly, Mert Pilanci, Morteza Mardani, Tolga Ergen
834Off-Policy Evaluation for Large Action Spaces via Embeddings0Thorsten Joachims, Yuta Saito
835Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training0Brian Zimmer, Brucek Khailany, Charbel Sakr, Rangharajan Venkatesan, Steve Dai, William J. Dally
836A Convergence Theory for SVGD in the Population Limit under Talagrand's Inequality T10Adil Salim, Lukang Sun, Peter Richtárik
837FITNESS: (Fine Tune on New and Similar Samples) to detect anomalies in streams with drift and outliers0Abishek Sankararaman, Balakrishnan Narayanaswamy, Vikramank Y. Singh, Zhao Song
838The Algebraic Path Problem for Graph Metrics0Enrique Fita Sanmartín, Fred A. Hamprecht, Sebastian Damrich
839LSB: Local Self-Balancing MCMC in Discrete Spaces0Emanuele Sansone
840PoF: Post-Training of Feature Extractor for Improving Generalization0Ikuro Sato, Masayuki Tanaka, Nakamasa Inoue, Rei Kawakami, Ryota Yamada
841Re-evaluating Word Mover's Distance0Hisashi Kashima, Makoto Yamada, Ryoma Sato
842Understanding Contrastive Learning Requires Incorporating Inductive Biases0Akshay Krishnamurthy, Cyril Zhang, Dipendra Misra, Jordan T. Ash, Nikunj Saunshi, Sanjeev Arora, Sham M. Kakade, Surbhi Goel
843The Neural Race Reduction: Dynamics of Abstraction in Gated Networks0Andrew M. Saxe, Sam Jay Lewallen, Shagun Sodhani
844Convergence Rates of Non-Convex Stochastic Gradient Descent Under a Generic Lojasiewicz Condition and Local Smoothness0Cédric Malherbe, Kevin Scaman, Ludovic Dos Santos
845An Asymptotic Test for Conditional Independence using Analytic Kernel Embeddings0Laurent Meunier, Meyer Scetbon, Yaniv Romano
846Linear-Time Gromov Wasserstein Distances using Low Rank Couplings and Costs0Gabriel Peyré, Marco Cuturi, Meyer Scetbon
847Streaming Inference for Infinite Feature Models0Gabrielle K. Liu, Ila Fiete, Rylan Schaeffer, Yilun Du
848Modeling Irregular Time Series with Continuous Recurrent Units0Maja Rudolph, Mazin Eltayeb, Mona Schirmer, Stefan Lessmann
849Structure Preserving Neural Networks: A Case Study in the Entropy Closure of the Boltzmann Equation0Cory D. Hauck, Martin Frank, Steffen Schotthöfer, Tianbai Xiao
850Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification0An Nguyen, Bjoern M. Eskofier, Dario Zanca, Doina Precup, Falk Pulsmeyer, Leo Schwinn, Leon Bungert, René Raab
851Symmetric Machine Theory of Mind0Graham Neubig, Melanie Sclar, Yonatan Bisk
852Data-SUITE: Data-centric identification of in-distribution incongruous examples0Jonathan Crabbé, Mihaela van der Schaar, Nabeel Seedat
853Continuous-Time Modeling of Counterfactual Outcomes Using Neural Controlled Differential Equations0Alexis Bellot, Fergus Imrie, Mihaela van der Schaar, Nabeel Seedat, Zhaozhi Qian
854Neural Tangent Kernel Beyond the Infinite-Width Limit: Effects of Depth and Initialization0Gitta Kutyniok, Mariia Seleznova
855Reinforcement Learning with Action-Free Pre-Training from Videos0Kimin Lee, Pieter Abbeel, Stephen James, Younggyo Seo
856Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation0Andreas Krause, Maryam Kamgarpour, Pier Giuseppe Sessa
857Selective Regression under Fairness Criteria0Abhin Shah, Gregory W. Wornell, Joshua K. Lee, Prasanna Sattigeri, Rameswar Panda, Subhro Das, Yuheng Bu
858Utility Theory for Sequential Decision Making0Mehran Shakerinava, Siamak Ravanbakhsh
859Translating Robot Skills: Learning Unsupervised Skill Correspondences Across Robots0Aravind Rajeswaran, Jean Oh, Stuart Anderson, Tanmay Shankar, Vikash Kumar, Yixin Lin
860A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning0Archit Sharma, Chelsea Finn, Rehaan Ahmad
861Content Addressable Memory Without Catastrophic Forgetting by Heteroassociation with a Fixed Scaffold0Ila R. Fiete, Sarthak Chandra, Sugandha Sharma
862Federated Minimax Optimization: Improved Convergence Analyses and Algorithms0Gauri Joshi, Pramod K. Varshney, Pranay Sharma, Rohan Panda
863DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning0Hassam Sheikh, Kizza Frisbee, Mariano Phielipp
864Instance Dependent Regret Analysis of Kernelized Bandits0Shubhanshu Shekhar, Tara Javidi
865Data Augmentation as Feature Manipulation0Ruoqi Shen, Suriya Gunasekar, Sébastien Bubeck
866Metric-Fair Active Learning0Jie Shen, Jing Wang, Nan Cui
867PDO-s3DCNNs: Partial Differential Operator Based Steerable 3D CNNs0Jinwen Ma, Qi She, Tao Hong, Zhengyang Shen, Zhouchen Lin
868Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation0Ananya Kumar, Jeff Z. HaoChen, Kendrick Shen, Percy Liang, Robbie M. Jones, Sang Michael Xie, Tengyu Ma
869Constrained Optimization with Dynamic Bound-scaling for Effective NLP Backdoor Defense0Guangyu Shen, Guanhong Tao, Qiuling Xu, Shengwei An, Shiqing Ma, Xiangyu Zhang, Yingqi Liu, Zhuo Zhang
870Staged Training for Transformer Language Models0Iz Beltagy, Jesse Dodge, Kurt Keutzer, Matthew E. Peters, Pete Walsh, Sheng Shen
871Deep Network Approximation in Terms of Intrinsic Parameters0Haizhao Yang, Shijun Zhang, Zuowei Shen
872Gradient-Free Method for Heavily Constrained Nonconvex Optimization0Bin Gu, Hongchang Gao, Wanli Shi
873Global Optimization of K-Center Clustering0Jiayang Ren, Kaixun Hua, Mingfei Shi, Yankai Cao
874Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity0Gen Li, Laixi Shi, Yuejie Chi, Yuting Wei, Yuxin Chen
875Adversarial Masking for Self-Supervised Learning0Adam R. Kosiorek, N. Siddharth, Philip H. S. Torr, Yuge Shi
876Visual Attention Emerges from Recurrent Sparse Reconstruction0Baifeng Shi, Neel Joshi, Trevor Darrell, Xin Wang, Yale Song
877A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes0Chengchun Shi, Jiawei Huang, Masatoshi Uehara, Nan Jiang
878Robust Group Synchronization via Quadratic Programming0Cole M. Wyeth, Gilad Lerman, Yunpeng Shi
879Log-Euclidean Signatures for Intrinsic Distances Between Unaligned Datasets0Justin M. Solomon, Kristjan H. Greenewald, Mikhail Yurochkin, Tal Shnitzer
880Scalable Computation of Causal Bounds0Garud Iyengar, Madhumitha Shridharan
881Bit Prioritization in Variational Autoencoders via Progressive Coding0Rui Shu, Stefano Ermon
882Fair Representation Learning through Implicit Path Alignment0Boyu Wang, Changjian Shui, Christian Gagné, Jiaqi Li, Qi Chen
883Faster Algorithms for Learning Convex Functions0Ali Siahkamari, Brian Kulis, Christopher Liao, Durmus Alp Emre Acar, Kelly L. Geyer, Venkatesh Saligrama
884Coin Flipping Neural Networks0Assaf Schuster, Gal Yehuda, Nitzan Hodos, Yuval Sieradzki
885Reverse Engineering the Neural Tangent Kernel0James Benjamin Simon, Michael Robert DeWeese, Sajant Anand
886Demystifying the Adversarial Robustness of Random Transformation Defenses0Chawin Sitawarin, David A. Wagner, Zachary J. GolanStrieb
887Smoothed Adversarial Linear Contextual Bandits with Knapsacks0Arindam Banerjee, Shiliang Zuo, Vidyashankar Sivakumar
888GenLabel: Mixup Relabeling using Generative Models0Dimitris S. Papailiopoulos, Hongxu Chen, Jaekyun Moon, Jyyong Sohn, Kangwook Lee, Liang Shang
889Communicating via Markov Decision Processes0Christian A. Schröder de Witt, J. Zico Kolter, Jakob N. Foerster, Luisa M. Zintgraf, Martin Strohmeier, Maximilian Igl, Philip H. S. Torr, Samuel Sokota, Shimon Whiteson
890The Multivariate Community Hawkes Model for Dependent Relational Events in Continuous-time Networks0Hadeel Soliman, Kevin S. Xu, Lingfei Zhao, Subhadeep Paul, Zhipeng Huang
891Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning0Jinwoo Shin, Junsu Kim, Kyunghwan Son, Roben Delos Reyes, Sungsoo Ahn, Yung Yi
892TAM: Topology-Aware Margin Loss for Class-Imbalanced Node Classification0Eunho Yang, Jaeyun Song, Joonhyung Park
893A General Recipe for Likelihood-free Bayesian Optimization0Jiaming Song, Lantao Yu, Stefano Ermon, Willie Neiswanger
894Fully-Connected Network on Noncompact Symmetric Space and Ridgelet Transform based on Helgason-Fourier Analysis0Isao Ishikawa, Masahiro Ikeda, Sho Sonoda
895Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation0Aivar Sootla, Alexander I. CowenRivers, David Henry Mguni, Haitham Ammar, Jun Wang, Taher Jafferjee, Ziyan Wang
896Lightweight Projective Derivative Codes for Compressed Asynchronous Gradient Descent0Haibin Guan, Ilia Ilmer, Jun Li, Pedro Soto
897Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders0Andrew Gordon Wilson, Emily Delaney, Nate Gruver, Peyton Greenside, Phillip M. Maffettone, Samuel Stanton, Wesley J. Maddox
8983D Infomax improves GNNs for Molecular Property Prediction0Christian Dallago, Dominique Beaini, Gabriele Corso, Hannes Stärk, Pietro Lió, Prudencio Tossou, Stephan Günnemann
899EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction0Hannes Stärk, Lagnajit Pattanaik, Octavian Ganea, Regina Barzilay, Tommi S. Jaakkola
900Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks0Antonia Adler, Antonio De Almeida Correia, Dominik Hintersdorf, Kristian Kersting, Lukas Struppek
901Scaling-up Diverse Orthogonal Convolutional Networks by a Paraunitary Framework0Furong Huang, Jiahao Su, Wonmin Byeon
902Divergence-Regularized Multi-Agent Actor-Critic0Kefan Su, Zongqing Lu
903Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems0Frans A. Oliehoek, Jinke He, Matthijs T. J. Spaan, Miguel Suau
904Improved StyleGAN-v2 based Inversion for Out-of-Distribution Images0Andreas Spanias, Jayaraman J. Thiagarajan, Mark Naufel, Rakshith Subramanyam, Vivek Sivaraman Narayanaswamy
905Continuous-Time Analysis of Accelerated Gradient Methods via Conservation Laws in Dilated Coordinate Systems0Ernest K. Ryu, Gyumin Roh, Jaewook J. Suh
906Do Differentiable Simulators Give Better Policy Gradients?0Hyung Ju Terry Suh, Kaiqing Zhang, Max Simchowitz, Russ Tedrake
907Intriguing Properties of Input-Dependent Randomized Smoothing0Aleksei Kuvshinov, Peter Súkeník, Stephan Günnemann
908Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments0Benjamin Black, John P. Dickerson, Jordan K. Terry, Ryan Sullivan
909AGNAS: Attention-Guided Micro and Macro-Architecture Search0Jilin Mei, Longxing Yang, Shun Lu, Xiaowei Li, Yinhe Han, Yu Hu, Zihao Sun
910Adaptive Random Walk Gradient Descent for Decentralized Optimization0Bao Wang, Dongsheng Li, Tao Sun
911MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection0Hao Li, Ming Lin, Rong Jin, Xiuyu Sun, Zhenhong Sun, Zhiyu Tan
912Out-of-Distribution Detection with Deep Nearest Neighbors0Xiaojin Zhu, Yifei Ming, Yixuan Li, Yiyou Sun
913Black-Box Tuning for Language-Model-as-a-Service0Hong Qian, Tianxiang Sun, Xipeng Qiu, Xuanjing Huang, Yunfan Shao
914Correlated Quantization for Distributed Mean Estimation and Optimization0Ananda Theertha Suresh, Felix X. Yu, Jae Ro, Ziteng Sun
915Causal Imitation Learning under Temporally Correlated Noise0Drew Bagnell, Gokul Swamy, Sanjiban Choudhury, Steven Wu
916Being Properly Improper0Lalitha Sankar, Richard Nock, Tyler Sypherd
917Distributionally-Aware Kernelized Bandit Problems for Risk Aversion0Sho Takemori
918Sequential and Parallel Constrained Max-value Entropy Search via Information Lower Bound0Kazuki Shitara, Masayuki Karasuyama, Shion Takeno, Tomoyuki Tamura
919SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization0ChiehHsin Lai, Junki Ohmura, Naoki Murata, Shusuke Takahashi, Takashi Shibuya, Toshimitsu Uesaka, Toshiyuki Kumakura, WeiHsiang Liao, Yuhta Takida, Yuki Mitsufuji
920A Tree-based Model Averaging Approach for Personalized Treatment Effect Estimation from Heterogeneous Data Sources0ChungChou H. Chang, Ling Zhou, Lu Tang, Xiaoqing Tan
921N-Penetrate: Active Learning of Neural Collision Handler for Complex 3D Mesh Deformations0Breannan Smith, Dinesh Manocha, Qingyang Tan, Takaaki Shiratori, Zherong Pan
922Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning0Yunhao Tang
923Rethinking Graph Neural Networks for Anomaly Detection0Jia Li, Jiajin Li, Jianheng Tang, Ziqi Gao
924Deep Safe Incomplete Multi-view Clustering: Theorem and Algorithm0Huayi Tang, Yong Liu
925Virtual Homogeneity Learning: Defending against Data Heterogeneity in Federated Learning0Bo Han, Shaohuai Shi, Xiaowen Chu, Xin He, Yonggang Zhang, Zhenheng Tang
926Cross-Space Active Learning on Graph Convolutional Networks0Hao Wu, Shiyuan Deng, Yufei Tao
927FedNest: Federated Bilevel, Minimax, and Compositional Optimization0Christos Thrampoulidis, Davoud Ataee Tarzanagh, Mingchen Li, Samet Oymak
928Efficient Distributionally Robust Bayesian Optimization with Worst-case Sensitivity0Bryan Kian Hsiang Low, Chuan Sheng Foo, Daisuke Urano, Richalynn Leong, Sebastian Shenghong Tay
929LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood0Adam Golinski, Jacek Tabor, Lukasz Garncarek, Piotr Tempczyk, Przemyslaw Spurek, Rafal Michaluk
930LCANets: Lateral Competition Improves Robustness Against Corruption and Attack0Ben Migliori, Garrett T. Kenyon, Juston Moore, Michael A. Teti
931Reverse Engineering ℓp attacks: A block-sparse optimization approach with recovery guarantees0Darshan Thaker, Paris Giampouras, René Vidal
932Generalised Policy Improvement with Geometric Policy Composition0André Barreto, Diana Borsa, Mark Rowland, Rémi Munos, Shantanu Thakoor, Will Dabney
933Algorithms for the Communication of Samples0Lucas Theis, Noureldin Y. Ahmed
934Consistent Polyhedral Surrogates for Top-k Classification and Variants0Anish Thilagar, Emma Goodwill, Jessica Finocchiaro, Rafael M. Frongillo
935On the Finite-Time Complexity and Practical Computation of Approximate Stationarity Concepts of Lipschitz Functions0Anthony ManCho So, Kaiwen Zhou, Lai Tian
936From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses0Alexey Naumov, Daniil Tiapkin, Denis Belomestny, Eric Moulines, Michal Valko, Pierre Ménard, Sergey Samsonov, Yunhao Tang
937Nonparametric Sparse Tensor Factorization with Hierarchical Gamma Processes0Conor Tillinghast, Shandian Zhe, Zheng Wang
938Deciphering Lasso-based Classification Through a Large Dimensional Analysis of the Iterative Soft-Thresholding Algorithm0Aladin Virmaux, Ekkehard Schnoor, Igor Colin, Malik Tiomoko, Mohamed El Amine Seddik
939Extended Unconstrained Features Model for Exploring Deep Neural Collapse0Joan Bruna, Tom Tirer
940Object Permanence Emerges in a Random Walk along Memory0Adrien Gaidon, Allan Jabri, Jie Li, Pavel Tokmakov
941Generic Coreset for Scalable Learning of Monotonic Kernels: Logistic Regression, Sigmoid and more0Dan Feldman, Elad Tolochinsky, Ibrahim Jubran
942Failure and success of the spectral bias prediction for Laplace Kernel Ridge Regression: the case of low-dimensional data0Antonio Sclocchi, Matthieu Wyart, Umberto M. Tomasini
943Quantifying and Learning Linear Symmetry-Based Disentanglement0Jim Portegies, Loek Tonnaer, Luis Armando Pérez Rey, Mike Holenderski, Vlado Menkovski
944A Temporal-Difference Approach to Policy Gradient Estimation0Andrew Patterson, Martha White, Rupam Mahmood, Samuele Tosatto
945Simple and near-optimal algorithms for hidden stratification and multi-group learning0Christopher J. Tosh, Daniel Hsu
946Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization0Aviral Kumar, Brandon Trabucco, Sergey Levine, Xinyang Geng
947AnyMorph: Learning Transferable Polices By Inferring Agent Morphology0Brandon Trabucco, Glen Berseth, Mariano Phielipp
948Detecting Adversarial Examples Is (Nearly) As Hard As Classifying Them0Florian Tramèr
949Nesterov Accelerated Shuffling Gradient Method for Convex Optimization0Katya Scheinberg, Lam M. Nguyen, Trang H. Tran
950A Completely Tuning-Free and Robust Approach to Sparse Precision Matrix Estimation0Chau Tran, Guo Yu
951Tackling covariate shift with node-based Bayesian neural networks0Luigi Acerbi, Markus Heinonen, Samuel Kaski, Trung Q. Trinh
952Fenrir: Physics-Enhanced Regression for Initial Value Problems0Filip Tronarp, Nathanael Bosch, Philipp Hennig
953Interpretable Off-Policy Learning via Hyperbox Search0Daniel Tschernutter, Stefan Feuerriegel, Tobias Hatt
954FriendlyCore: Practical Differentially Private Aggregation0Edith Cohen, Eliad Tsfadia, Haim Kaplan, Uri Stemmer, Yishay Mansour
955Pairwise Conditional Gradients without Swap Steps and Sparser Kernel Herding0Kazuma Tsuji, Ken'ichiro Tanaka, Sebastian Pokutta
956Prototype Based Classification from Hierarchy to Fairness0Julie A. Shah, Mycal Tucker
957Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures0Nelson Vadori, Rahul Savani, Sumitra Ganesh, Thomas Spooner
958Self-Supervised Models of Audio Effectively Explain Human Cortical Responses to Speech0Aditya R. Vaidya, Alexander Huth, Shailee Jain
959Path-Gradient Estimators for Continuous Normalizing Flows0Kim Andrea Nicoli, Lorenz Vaitl, Pan Kessel, Shinichi Nakajima
960Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning0Alberto Bernacchia, DaShan Shiu, Jonathan Scarlett, Sattar Vakili
961EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Learning0Amit Portnoy, Gal Mendelson, Michael Mitzenmacher, Ran Ben Basat, Shay Vargaftik, Yaniv BenItzhak
962Towards Noise-adaptive, Problem-adaptive (Accelerated) Stochastic Gradient Descent0Benjamin DuboisTaine, Reza Babanezhad, Sharan Vaswani
963Correlation Clustering via Strong Triadic Closure Labeling: Fast Approximation Algorithms and Practical Lower Bounds0Nate Veldt
964The CLRS Algorithmic Reasoning Benchmark0Adrià Puigdomènech Badia, Andrea Banino, Charles Blundell, David Budden, Misha Dashevskiy, Petar Velickovic, Raia Hadsell, Razvan Pascanu
965Bregman Power k-Means for Clustering Exponential Family Data0Adithya Vellal, Jason Q. Xu, Saptarshi Chakraborty
966Estimation in Rotationally Invariant Generalized Linear Models via Approximate Message Passing0Kevin Kögler, Marco Mondelli, Ramji Venkataramanan
967Bayesian Optimization under Stochastic Delayed Feedback0Arun Verma, Bryan Kian Hsiang Low, Zhongxiang Dai
968VarScene: A Deep Generative Model for Realistic Scene Graph Synthesis0Abir De, Soumen Chakrabarti, Tathagat Verma, Vishwa Vinay, Yateesh Agrawal
969Calibrated Learning to Defer with One-vs-All Classifiers0Eric T. Nalisnick, Rajeev Verma
970Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation0Advait Parulekar, Daniel Vial, R. Srikant, Sanjay Shakkottai
971On Implicit Bias in Overparameterized Bilevel Optimization0David Duvenaud, Fabian Pedregosa, Jonathan P. Lorraine, Paul Vicol, Roger B. Grosse
972Multiclass learning with margin: exponential rates with no bias-variance trade-off0Ernesto De Vito, Giacomo Meanti, Lorenzo Rosasco, Stefano Vigogna
973Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning0Adam R. Villaflor, Jeff Schneider, John M. Dolan, Swapnil Pande, Zhe Huang
974Bayesian Nonparametrics for Offline Skill Discovery0Chris J. Maddison, Gabriel LoaizaGanem, Harry J. Braviner, Panteha Naderian, Valentin Villecroze
975Hermite Polynomial Features for Private Data Generation0Frederik Harder, Kamil Adamczewski, Margarita Vinaroz, Mijung Park, MohammadAmin Charusaie
976What Can Linear Interpolation of Neural Network Loss Landscapes Tell Us?0Jonathan Frankle, Tiffany J. Vlaar
977Multirate Training of Neural Networks0Benedict J. Leimkuhler, Tiffany J. Vlaar
978Provably Adversarially Robust Nearest Prototype Classifiers0Matthias Hein, Václav Vorácek
979First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach0Andrew J. Wagenmaker, Kevin G. Jamieson, Max Simchowitz, Simon S. Du, Yifang Chen
980Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes0Andrew J. Wagenmaker, Kevin G. Jamieson, Max Simchowitz, Simon S. Du, Yifang Chen
981Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four0Felix Huber, Sebastian Pokutta, Stephan Wäldchen
982Retroformer: Pushing the Limits of End-to-end Retrosynthesis Transformer0Ben Liao, ChangYu Hsieh, Shengyu Zhang, Yue Wan
983Safe Exploration for Efficient Policy Evaluation and Comparison0Branislav Kveton, Rui Song, Runzhe Wan
984Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning0Lipeng Wan, Nanning Zheng, Xingyu Chen, Xuguang Lan, Zeyang Liu
985Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods0Ali RahimiKalahroudi, Harm van Seijen, Ida Momennejad, Janarthanan Rajendran, Sarath Chandar, Yi Wan
986Fast Lossless Neural Compression with Integer-Only Discrete Flows0Bo Zhang, Chongxuan Li, Jianfei Chen, Jun Zhu, Siyu Wang
987Accelerating Shapley Explanation via Contributive Cooperator Selection0Fan Yang, Guanchu Wang, Mengnan Du, Pushkar Tripathi, Quan Zhou, Xia Ben Hu, Xuanting Cai, YuNeng Chuang
988Denoised MDPs: Learning World Models Better Than the World Itself0Amy Zhang, Antonio Torralba, Phillip Isola, Simon S. Du, Tongzhou Wang, Yuandong Tian
989Neural Implicit Dictionary Learning via Mixture-of-Expert Training0Peihao Wang, Tianlong Chen, Zhangyang Wang, Zhiwen Fan
990Robust Models Are More Interpretable Because Attributions Look Normal0Anupam Datta, Matt Fredrikson, Zifan Wang
991Disentangling Disease-related Representation from Obscure for Disease Prediction0Churan Wang, Fandong Zhang, Fangwei Zhong, Fei Gao, Yizhou Wang, Yizhou Yu
992Solving Stackelberg Prediction Game with Least Squares Loss via Spherically Constrained Least Squares Reformulation0Alex L. Wang, Jiali Wang, Rujun Jiang, Wen Huang, Xudong Li
993VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix0Chengguo Yin, Feng Zheng, Ping Luo, Ran Cheng, Teng Wang, Wenhao Jiang, Zhichao Lu
994DynaMixer: A Vision MLP Architecture with Dynamic Mixing0Li Yuan, Wei Liu, Wenhao Jiang, Yibing Song, Yiming Zhu, Ziyu Wang
995Improving Screening Processes via Calibrated Subset Selection0Lequn Wang, Manuel Gomez Rodriguez, Thorsten Joachims
996The Geometry of Robust Value Functions0Bryan Hooi, Jiashi Feng, Kaixin Wang, Kuangqi Zhou, Navdeep Kumar, Shie Mannor
997What Dense Graph Do You Need for Self-Attention?0ChuTak Lee, Qipeng Guo, Xipeng Qiu, Xuanjing Huang, Yunhua Zhou, Yuxin Wang, Zhangyue Yin
998Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation0Alexander Levine, Soheil Feizi, Wenxiao Wang
999Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond0Bo Li, Han Zhao, Haoxiang Wang
1000Communication-Efficient Adaptive Federated Learning0Jinghui Chen, Lu Lin, Yujia Wang
1001Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Lojasiewicz Functions when the Non-Convexity is Averaged-Out0Andre Wibisono, Bin Hu, ChiHeng Lin, JunKun Wang
1002Robustness Verification for Contrastive Learning0Weiwei Liu, Zekai Wang
1003Convergence and Recovery Guarantees of the K-Subspaces Method for Subspace Clustering0Anthony ManCho So, Huikang Liu, Laura Balzano, Peng Wang
1004NP-Match: When Neural Processes meet Semi-Supervised Learning0Alexandros Neophytou, Daniela Massiceti, Jianfeng Wang, Thomas Lukasiewicz, Vladimir Pavlovic, Xiaolin Hu
1005Iterative Double Sketching for Faster Least-Squares Optimization0Rui Wang, Wangli Xu, Yanyan Ouyang
1006What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization?0Adam Roberts, Colin Raffel, Daniel Hesslow, Hyung Won Chung, Iz Beltagy, Julien Launay, Teven Le Scao, Thomas Wang
1007Improving Task-free Continual Learning by Distributionally Robust Memory Evolution0Le Fang, Li Shen, Mingchen Gao, Qiuling Suo, Tiehang Duan, Zhenyi Wang
1008Risk-Averse No-Regret Learning in Online Convex Games0Michael M. Zavlanos, Yi Shen, Zifan Wang
1009Provable Domain Generalization via Invariant-Feature Subspace Recovery0Bo Li, Han Zhao, Haoxiang Wang, Haozhe Si
1010ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training0HuiPo Wang, Mario Fritz, Sebastian U. Stich, Yang He
1011Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search0Herke van Hoof, Qi Wang
1012Approximately Equivariant Networks for Imperfectly Symmetric Dynamics0Robin Walters, Rose Yu, Rui Wang
1013Three-stage Evolution and Fast Equilibrium for SGD with Non-degerate Critical Points0Yi Wang, Zhiren Wang
1014Understanding Instance-Level Impact of Fairness Constraints0Jialu Wang, Xin Eric Wang, Yang Liu
1015Tractable Uncertainty for Structure Learning0Benjie Wang, Marta Kwiatkowska, Matthew Wicker
1016Causal Dynamics Learning for Task-Independent State Abstraction0Peter Stone, Xuesu Xiao, Yuke Zhu, Zifan Xu, Zizhao Wang
1017Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms0Hong Xie, John C. S. Lui, Xuchuang Wang
1018Generative Coarse-Graining of Molecular Conformations0Benjamin Kurt Miller, Chen Cai, Jian Tang, Minkai Xu, Rafael GómezBombarelli, Tess E. Smidt, Wujie Wang, Yusu Wang
1019Nonparametric Embeddings of Sparse High-Order Interaction Events0Akil Narayan, Conor Tillinghast, Shandian Zhe, Shibo Li, Yiming Xu, Zheng Wang
1020When Are Linear Stochastic Bandits Attackable?0Haifeng Xu, Hongning Wang, Huazheng Wang
1021DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks0Anshumali Shrivastava, T. S. Eugene Ng, Xinyu Crystal Wu, Zhaozhuo Xu, Zhuang Wang
1022Finite-Sum Coupled Compositional Stochastic Optimization: Theory and Applications0Bokun Wang, Tianbao Yang
1023OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework0An Yang, Chang Zhou, Hongxia Yang, Jianxin Ma, Jingren Zhou, Junyang Lin, Peng Wang, Rui Men, Shuai Bai, Zhikang Li
1024How Powerful are Spectral Graph Neural Networks0Muhan Zhang, Xiyuan Wang
1025Thompson Sampling for Robust Transfer in Multi-Task Bandits0Chicheng Zhang, Kamalika Chaudhuri, Zhi Wang
1026Individual Reward Assisted Multi-Agent Reinforcement Learning0Changjie Fan, Chongjie Zhang, Jianye Hao, Li Wang, Tangjie Lv, Weixun Wang, Yang Gao, Yujing Hu, Yupeng Zhang
1027Removing Batch Normalization Boosts Adversarial Training0Aston Zhang, Haotao Wang, Mu Li, Shuai Zheng, Xingjian Shi, Zhangyang Wang
1028Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition0Alex J. Smola, Aston Zhang, Haotao Wang, Mu Li, Shuai Zheng, Yi Zhu, Zhangyang Wang
1029Nonparametric Factor Trajectory Learning for Dynamic Tensor Decomposition0Shandian Zhe, Zheng Wang
1030Thompson Sampling for (Combinatorial) Pure Exploration0Jun Zhu, Siwei Wang
1031Policy Gradient Method For Robust Reinforcement Learning0Shaofeng Zou, Yue Wang
1032Certifying Out-of-Domain Generalization for Blackbox Functions0Bo Li, Boxin Wang, Ce Zhang, Linyi Li, Maurice Weber, Zhikuan Zhao
1033More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize0Alexander Wei, Jacob Steinhardt, Wei Hu
1034To Smooth or Not? When Label Smoothing Meets Noisy Labels0Gang Niu, Hangyu Liu, Jiaheng Wei, Masashi Sugiyama, Tongliang Liu, Yang Liu
1035Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets0Bo An, Hongxin Wei, Lei Feng, Lue Tao, Renchunzi Xie
1036Mitigating Neural Network Overconfidence with Logit Normalization0Bo An, Hao Cheng, Hongxin Wei, Lei Feng, Renchunzi Xie, Yixuan Li
1037Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics0Animesh Garg, Matthias Weissenbacher, Samarth Sinha, Yoshinobu Kawahara
1038Fishing for User Data in Large-Batch Federated Learning via Gradient Magnification0Jonas Geiping, Liam Fowl, Micah Goldblum, Tom Goldstein, Yuxin Wen
1039BabelTower: Learning to Auto-parallelized Program Translation0Chao Wang, Jianxing Xu, Ling Li, Qi Guo, Qiang Fu, Xiaqing Li, Xing Hu, Xuehai Zhou, Yanlin Tang, Yongwei Zhao, Yuanbo Wen, Yunji Chen, Zidong Du
1040Random Forest Density Estimation0Hanyuan Hang, Hongwei Wen
1041Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming0Chuan Wen, Dinesh Jayaraman, Jianing Qian, Jiaye Teng, Jierui Lin, Yang Gao
1042Preconditioning for Scalable Gaussian Process Hyperparameter Optimization0Geoff Pleiss, Jacob R. Gardner, John P. Cunningham, Jonathan Wenger, Philipp Hennig
1043Measure Estimation in the Barycentric Coding Model0Abiy Tasissa, James M. Murphy, Matthew Werenski, Ruijie Jiang, Shuchin Aeron
1044COLA: Consistent Learning with Opponent-Learning Awareness0Alistair Letcher, Jakob N. Foerster, Johannes Treutlein, Timon Willi
1045Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning0David Meger, Harley E. Wiltzer, Marc G. Bellemare
1046Easy Variational Inference for Categorical Models via an Independent Binary Approximation0Eric L. Miller, Michael C. Hughes, Michael T. Wojnowicz, Shuchin Aeron
1047Continual Learning with Guarantees via Weight Interval Constraints0Bartosz Wójcik, Jacek Tabor, Karol J. Piczak, Lukasz Pustelnik, Maciej Wolczyk, Pawel Morawiecki, Przemyslaw Spurek, Tomasz Trzcinski
1048A Deep Learning Approach for the Segmentation of Electroencephalography Data in Eye Tracking Applications0Alexander Veicht, Ard Kastrati, Dustin Klebe, JieMing Li, Lukas Wolf, Martyna Beata Plomecka, Nicolas Langer, Roger Wattenhofer
1049Leverage Score Sampling for Tensor Product Matrices in Input Sparsity Time0Amir Zandieh, David P. Woodruff
1050Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time0Ali Farhadi, Ari S. Morcos, Gabriel Ilharco, Hongseok Namkoong, Ludwig Schmidt, Mitchell Wortsman, Raphael Gontijo Lopes, Rebecca Roelofs, Samir Yitzhak Gadre, Simon Kornblith, Yair Carmon
1051Metric-Fair Classifier Derandomization0Jimmy Wu, Yang Liu, Yatong Chen
1052Structural Entropy Guided Graph Hierarchical Pooling0Junran Wu, Ke Xu, Shangzhe Li, Xueyuan Chen
1053Self-supervised Models are Good Teaching Assistants for Vision Transformers0Haiyan Wu, Ke Li, Shaohui Lin, Xing Sun, Yinqi Zhang, Yuan Xie, Yuting Gao
1054Characterizing and Overcoming the Greedy Nature of Learning in Multi-modal Deep Neural Networks0Krzysztof J. Geras, Kyunghyun Cho, Nan Wu, Stanislaw Jastrzebski
1055Instrumental Variable Regression with Confounder Balancing0Anpeng Wu, Bo Li, Fei Wu, Kun Kuang
1056MemSR: Training Memory-efficient Lightweight Model for Image Super-Resolution0ChungKuei Lee, Kailu Wu, Kaisheng Ma
1057Delay-Adaptive Step-sizes for Asynchronous Learning0Hamid Reza Feyzmahdavian, Mikael Johansson, Sindri Magnússon, Xuyang Wu
1058Variational nearest neighbor Gaussian process0Geoff Pleiss, John P. Cunningham, Luhuan Wu
1059Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach0Guangjian Tian, Jun Wang, Ling Shi, Shuang Wu
1060DAVINZ: Data Valuation using Deep Neural Networks at Initialization0Bryan Kian Hsiang Low, Yao Shu, Zhaoxuan Wu
1061Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum0Junlin Wu, Yevgeniy Vorobeychik
1062Revisiting Consistency Regularization for Deep Partial Label Learning0DengBao Wang, DongDong Wu, MinLing Zhang
1063Flowformer: Linearizing Transformers with Conservation Flows0Haixu Wu, Jialong Wu, Jianmin Wang, Jiehui Xu, Mingsheng Long
1064Nearly Optimal Policy Optimization with Stable at Any Time Guarantee0Han Zhong, Jiantao Jiao, Liwei Wang, Simon S. Du, Tianhao Wu, Yunchang Yang
1065RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval0Heng Huang, Hongyang Zhang, Yihan Wu
1066Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression0Difan Zou, Jingfeng Wu, Quanquan Gu, Sham M. Kakade, Vladimir Braverman
1067Optimal Clustering with Noisy Queries via Multi-Armed Bandit0Jinghui Xia, Zengfeng Huang
1068ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning0Ge Wang, Jintao Chen, Jun Xia, Lirong Wu, Stan Z. Li
1069Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm0Jeffrey Pennington, Lechao Xiao
1070Identification of Linear Non-Gaussian Latent Hierarchical Structure0Biwei Huang, Feng Xie, Kun Zhang, Yangbo He, Zhengming Chen, Zhi Geng
1071COAT: Measuring Object Compositionality in Emergent Representations0Ari S. Morcos, Ramakrishna Vedantam, Sirui Xie, SongChun Zhu
1072Robust Policy Learning over Multiple Uncertainty Sets0Amy Zhang, Annie Xie, Chelsea Finn, Joelle Pineau, Shagun Sodhani
1073Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum0Huishuai Zhang, Issei Sato, Masashi Sugiyama, Xinrui Wang, Zeke Xie
1074Self-Supervised Representation Learning via Latent Graph Prediction0Shuiwang Ji, Yaochen Xie, Zhao Xu
1075Efficient Computation of Higher-Order Subgraph Attribution via Message Passing0Grégoire Montavon, KlausRobert Müller, Ping Xiong, Shinichi Nakajima, Thomas Schnake
1076A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games0Chengshuai Shi, Cong Shen, Han Zhong, Tong Zhang, Wei Xiong
1077Importance Weighted Kernel Bayes' Rule0Arnaud Doucet, Arthur Gretton, Liyuan Xu, Yutian Chen
1078Learning to Separate Voices by Spatial Regions0Alan Xu, Romit Roy Choudhury
1079Detached Error Feedback for Distributed SGD with Random Sparsification0An Xu, Heng Huang
1080Accurate Quantization of Measures via Interacting Particle-based Optimization0Anna Korba, Dejan Slepcev, Lantian Xu
1081Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous Spaces0Edgar Dobriban, Jiahui Lei, Kostas Daniilidis, Yinshuang Xu
1082Inferring Cause and Effect in the Presence of Heteroscedastic Noise0Alexander Marx, Jilles Vreeken, Osman Mian, Sascha Xu
1083Prompting Decision Transformer for Few-Shot Policy Generalization0Chuang Gan, Ding Zhao, Joshua B. Tenenbaum, Mengdi Xu, Shun Zhang, Yikang Shen, Yuchen Lu
1084Analyzing and Mitigating Interference in Neural Architecture Search0Jian Li, Jin Xu, Kaitao Song, Renqian Luo, Tao Qin, TieYan Liu, Xu Tan, Yichong Leng
1085On the Statistical Benefits of Curriculum Learning0Ambuj Tewari, Ziping Xu
1086A Difference Standardization Method for Mutual Transfer Learning0Beilun Wang, Haoqing Xu, Meng Wang
1087SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks0ChinYi Cheng, Joseph G. Lambourne, Karl D. D. Willis, Pradeep Kumar Jayaraman, Xiang Xu, Yasutaka Furukawa
1088Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations0Haoran Xu, Honglei Yin, Huiling Qin, Xianyuan Zhan
1089Adversarial Attack and Defense for Non-Parametric Two-Sample Tests0Feng Liu, Jingfeng Zhang, Masashi Sugiyama, Mohan S. Kankanhalli, Xilie Xu
1090Adversarially Robust Models may not Transfer Better: Sufficient Conditions for Domain Transferability from the View of Regularization0Bo Li, Evelyn Ma, Hyun Ho Son, Jacky Y. Zhang, Sanmi Koyejo, Xiaojun Xu
1091A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization0Peng Cui, Renzhe Xu, Tong Zhang, Xingxuan Zhang, Zheyan Shen
1092Langevin Monte Carlo for Contextual Bandits0Animashree Anandkumar, Eric V. Mazumdar, Hongkai Zheng, Kamyar Azizzadenesheli, Pan Xu
1093Investigating Why Contrastive Learning Benefits Robustness against Label Noise0Baharan Mirzasoleiman, Kyle Whitecross, Yihao Xue
1094Diversified Adversarial Attacks based on Conjugate Gradient Method0Haruki Sato, Hiroki Ishikura, Issa Oe, Katsuki Fujisawa, Keiichiro Yamamura, Nariaki Tateiwa, Nozomi Hata, Toru Mitsutake
1095Cycle Representation Learning for Inductive Relation Prediction0Chao Chen, Liangcai Gao, Tengfei Ma, Zhi Tang, Zuoyu Yan
1096Optimally Controllable Perceptual Lossy Compression0Fei Wen, Peilin Liu, Zeyu Yan
1097Active fairness auditing0Chicheng Zhang, Tom Yan
1098Self-Organized Polynomial-Time Coordination Graphs0Chongjie Zhang, Jianhao Wang, Qianlan Yang, Tonghan Wang, Weijun Dong, Zhizhou Ren
1099Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning0Mingyuan Zhou, Shentao Yang, Shujian Zhang, Yihao Feng
1100A Psychological Theory of Explainability0Patrick Shafto, Scott ChengHsin Yang, Tomas Folke
1101Omni-Granular Ego-Semantic Propagation for Self-Supervised Graph Representation Learning0Ling Yang, Shenda Hong
1102Unsupervised Time-Series Representation Learning with Iterative Bilinear Temporal-Spectral Fusion0Ling Yang, Shenda Hong
1103Searching for BurgerFormer with Micro-Meso-Macro Space Design0Jilin Mei, Longxing Yang, Shun Lu, Xiaowei Li, Yinhe Han, Yu Hu, Zihao Sun
1104Efficient Variance Reduction for Meta-learning0Hansi Yang, James T. Kwok
1105Injecting Logical Constraints into Neural Networks via Straight-Through Estimators0Chiyoun Park, Joohyung Lee, Zhun Yang
1106Locally Sparse Neural Networks for Tabular Biomedical Data0Junchen Yang, Ofir Lindenbaum, Yuval Kluger
1107Not All Poisons are Created Equal: Robust Training against Data Poisoning0Baharan Mirzasoleiman, Tian Yu Liu, Yu Yang
1108Does the Data Induce Capacity Control in Deep Learning?0Jialin Mao, Pratik Chaudhari, Rubing Yang
1109Informed Learning by Wide Neural Networks: Convergence, Generalization and Sampling Complexity0Jianyi Yang, Shaolei Ren
1110Linear Bandit Algorithms with Sublinear Time Complexity0Eric Price, Inderjit S. Dhillon, Sanjay Shakkottai, Shuo Yang, Sujay Sanghavi, Tongzheng Ren
1111A New Perspective on the Effects of Spectrum in Graph Neural Networks0Baocai Yin, Heng Qi, Mingqi Yang, Qiang Zhang, Rui Li, Yanming Shen
1112Fourier Learning with Cyclical Data0Chong Wang, Taiqing Wang, Tianyi Liu, Yingxiang Yang, Zhihan Xiong
1113Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network0Bo Han, Erkun Yang, Gang Niu, Min Xu, Shuo Yang, Tongliang Liu, Yang Liu
1114A Study of Face Obfuscation in ImageNet0Jacqueline H. Yau, Jia Deng, Kaiyu Yang, Li FeiFei, Olga Russakovsky
1115Anarchic Federated Learning0Haibo Yang, Jia Liu, Prashant Khanduri, Xin Zhang
1116Identity-Disentangled Adversarial Augmentation for Self-supervised Learning0Dacheng Tao, Kaiwen Yang, Tianyi Zhou, Xinmei Tian
1117Learning from a Learning User for Optimal Recommendations0Chuanhao Li, Denis Nekipelov, Fan Yao, Haifeng Xu, Hongning Wang
1118Improving Out-of-Distribution Robustness via Selective Augmentation0Chelsea Finn, Huaxiu Yao, James Zou, Linjun Zhang, Sai Li, Weixin Liang, Yu Wang
1119NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework0Xiaocong Yang, Xingcheng Yao, Yanan Zheng, Zhilin Yang
1120Feature Space Particle Inference for Neural Network Ensembles0Ikuro Sato, Kohta Ishikawa, Rei Kawakami, Shingo Yashima, Teppei Suzuki
1121Centroid Approximation for Bootstrap: Improving Particle Quality at Inference0Mao Ye, Qiang Liu
1122Be Like Water: Adaptive Floating Point for Machine Learning0Alexander Ihler, Brandon Chuang, Max Sterner, Thomas Yeh, Zerlina Lai
1123QSFL: A Two-Level Uplink Communication Optimization Framework for Federated Learning0Gang Wang, Liping Yi, Xiaoguang Liu
1124De novo mass spectrometry peptide sequencing with a transformer model0Melih Yilmaz, Sewoong Oh, William Fondrie, William S. Noble, Wout Bittremieux
1125Bayesian Nonparametric Learning for Point Processes with Spatial Homogeneity: A Spatial Analysis of NBA Shot Locations0Fan Yin, Guanyu Hu, Jieying Jiao, Jun Yan
1126Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization0Geon Park, Jaehong Yoon, Sung Ju Hwang, Wonyong Jeong
1127ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks0Baopu Li, Haoran You, Huihong Shi, Yingyan Lin, Yonggan Fu
1128Molecular Representation Learning via Heterogeneous Motif Graph Neural Networks0Hongyang Gao, Zhaoning Yu
1129Understanding Robust Overfitting of Adversarial Training and Beyond0Bo Han, Chaojian Yu, Chen Gong, Jun Yu, Li Shen, Mingming Gong, Tongliang Liu
1130How to Leverage Unlabeled Data in Offline Reinforcement Learning0Aviral Kumar, Chelsea Finn, Karol Hausman, Sergey Levine, Tianhe Yu, Yevgen Chebotar
1131Reachability Constrained Reinforcement Learning0Dongjie Yu, Haitong Ma, Jianyu Chen, Shengbo Li
1132Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning0Ali Jannesari, Arya Mazaheri, Sixing Yu
1133The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks0Shandian Zhe, Srikumar Ramalingam, Thiago Serra, Xin Yu
1134GraphFM: Improving Large-Scale GNN Training via Feature Momentum0Bokun Wang, Haiyang Yu, Limei Wang, Meng Liu, Shuiwang Ji, Tianbao Yang
1135Latent Diffusion Energy-Based Model for Interpretable Text Modelling0Baoxiong Jia, Bo Pang, Peiyu Yu, Ruiqi Gao, Sirui Xie, SongChun Zhu, Xiaojian Ma, Ying Nian Wu, Yixin Zhu
1136Predicting Out-of-Distribution Error with the Projection Norm0Alexander Wei, Jacob Steinhardt, Yaodong Yu, Yi Ma, Zitong Yang
1137Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning0Haoqi Yuan, Zongqing Lu
1138Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance0Denny Zhou, Lijun Zhang, Tianbao Yang, Xianzhi Du, Yuexin Wu, Zhuoning Yuan, ZiHao Qiu
1139Neural Tangent Kernel Empowered Federated Learning0ChauWai Wong, Dror Baron, Huaiyu Dai, Kai Yue, Richeng Jin, Ryan Pilgrim
1140Time Is MattEr: Temporal Self-supervision for Video Transformers0Dongyoon Han, Hwanjun Song, Jaehyung Kim, Jinwoo Shin, JungWoo Ha, Sukmin Yun
1141Pure Noise to the Rescue of Insufficient Data: Improving Imbalanced Classification by Training on Random Noise Images0Itay Benou, Michal Irani, Shiran Zada
1142Adaptive Conformal Predictions for Time Series0Aymeric Dieuleveut, Julie Josse, Margaux Zaffran, Olivier Féron, Yannig Goude
1143Actor-Critic based Improper Reinforcement Learning0Aditya Gopalan, Avi Mohan, Mohammadi Zaki, Shie Mannor
1144Stabilizing Q-learning with Linear Architectures for Provable Efficient Learning0Andrea Zanette, Martin J. Wainwright
1145Multi Resolution Analysis (MRA) for Approximate Self-Attention0Glenn Moo Fung, Jeffery Kline, Sourav Pal, Vikas Singh, Zhanpeng Zeng
1146Efficient PAC Learning from the Crowd with Pairwise Comparisons0Jie Shen, Shiwei Zeng
1147Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts0Hang Li, Xinsong Zhang, Yan Zeng
1148Position Prediction as an Effective Pretraining Strategy0Chen Huang, Dan Busbridge, Hanlin Goh, Jason Ramapuram, Joseph Y. Cheng, Joshua M. Susskind, Navdeep Jaitly, Shuangfei Zhai, Tatiana Likhomanenko, Walter Talbott
1149Anytime Information Cascade Popularity Prediction via Self-Exciting Processes0Akshay Aravamudan, Georgios C. Anagnostopoulos, Xi Zhang
1150Understanding Clipping for Federated Learning: Convergence and Client-Level Differential Privacy0Jinfeng Yi, Mingyi Hong, Steven Wu, Xiangyi Chen, Xinwei Zhang
1151Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs0Yikang Zhang, Zhao Zhong, Zhuo Chen
1152PDE-Based Optimal Strategy for Unconstrained Online Learning0Ashok Cutkosky, Ioannis Ch. Paschalidis, Zhiyu Zhang
1153Stochastic Continuous Submodular Maximization: Boosting via Non-oblivious Function0Haoyuan Hu, Qixin Zhang, Yu Yang, Zaiyi Chen, Zengde Deng
1154When and How Mixup Improves Calibration0James Zou, Kenji Kawaguchi, Linjun Zhang, Zhun Deng
1155UAST: Uncertainty-Aware Siamese Tracking0Dawei Zhang, Yanwei Fu, Zhonglong Zheng
1156Examining Scaling and Transfer of Language Model Architectures for Machine Translation0Ankur Bapna, Behrooz Ghorbani, Biao Zhang, Jonathan Shen, Orhan Firat, Xavier Garcia, Yong Cheng
1157Revisiting End-to-End Speech-to-Text Translation From Scratch0Barry Haddow, Biao Zhang, Rico Sennrich
1158A Stochastic Multi-Rate Control Framework For Modeling Distributed Optimization Algorithms0Mingyi Hong, Nicola Elia, Sairaj V. Dhople, Xinwei Zhang
1159GALAXY: Graph-based Active Learning at the Extreme0Jifan Zhang, Julian KatzSamuels, Robert D. Nowak
1160Fairness Interventions as (Dis)Incentives for Strategic Manipulation0Kun Jin, Mingyan Liu, Mohammad Mahdi Khalili, Parinaz Naghizadeh, Xueru Zhang
1161Role-based Multiplex Network Embedding0Gang Kou, Hegui Zhang
1162Dynamic Topic Models for Temporal Document Networks0Delvin Ce Zhang, Hady W. Lauw
1163Personalized Federated Learning via Variational Bayesian Inference0Kaiyang Guo, Wenpeng Li, Xu Zhang, Yinchuan Li, Yunfeng Shao
1164Federated Learning with Label Distribution Skew via Logits Calibration0Bo Li, Chao Wu, Jianghe Xu, Jie Zhang, Shouhong Ding, Shuang Wu, Zhiqi Li
1165Neural Network Weights Do Not Converge to Stationary Points: An Invariant Measure Perspective0Ali Jadbabaie, Haochuan Li, Jingzhao Zhang, Suvrit Sra
1166Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity0Ali Jadbabaie, Hongzhou Lin, Jingzhao Zhang, Subhro Das, Suvrit Sra
1167Deep and Flexible Graph Neural Architecture Search0Bin Cui, Wentao Zhang, Yang Li, Yu Shen, Zheyu Lin, Zhi Yang
1168A Langevin-like Sampler for Discrete Distributions0Qiang Liu, Ruqi Zhang, Xingchao Liu
1169Rich Feature Construction for the Optimization-Generalization Dilemma0David LopezPaz, Jianyu Zhang, Léon Bottou
1170Generative Flow Networks for Discrete Probabilistic Modeling0Aaron C. Courville, Alexandra Volokhova, Dinghuai Zhang, Nikolay Malkin, Yoshua Bengio, Zhen Liu
1171Neurotoxin: Durable Backdoors in Federated Learning0Ashwinee Panda, Joseph Gonzalez, Kannan Ramchandran, Linyue Song, Michael W. Mahoney, Prateek Mittal, Yaoqing Yang, Zhengming Zhang
1172Making Linear MDPs Practical via Contrastive Representation Learning0Bo Dai, Dale Schuurmans, Joseph Gonzalez, Mengjiao Yang, Tianjun Zhang, Tongzheng Ren
1173NAFS: A Simple yet Tough-to-beat Baseline for Graph Representation Learning0Bin Cui, Mingyu Yang, Wentao Zhang, Yang Li, Yu Shen, Zeang Sheng, Zhi Yang
1174Correct-N-Contrast: a Contrastive Approach for Improving Robustness to Spurious Correlations0Chelsea Finn, Christopher Ré, Hongyang R. Zhang, Michael Zhang, Nimit Sharad Sohoni
1175Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach0Alekh Agarwal, Masatoshi Uehara, Mengdi Wang, Wen Sun, Xuezhou Zhang, Yuda Song
1176Partial Counterfactual Identification from Observational and Experimental Data0Elias Bareinboim, Jin Tian, Junzhe Zhang
1177Set Norm and Equivariant Skip Connections: Putting the Deep in Deep Sets0John M. Higgins, Lily H. Zhang, Rajesh Ranganath, Veronica Tozzo
1178Learning to Estimate and Refine Fluid Motion with Physical Dynamics0James B. Tlhomole, Jianhong Wang, Matthew D. Piggott, Mingrui Zhang
1179A Branch and Bound Framework for Stronger Adversarial Attacks of ReLU Networks0ChoJui Hsieh, Huan Zhang, J. Zico Kolter, Kaidi Xu, Shiqi Wang, Suman Jana, Yihan Wang
1180A Simple yet Universal Strategy for Online Convex Optimization0Guanghui Wang, Jinfeng Yi, Lijun Zhang, Tianbao Yang
1181Low-Precision Stochastic Gradient Langevin Dynamics0Andrew Gordon Wilson, Christopher De Sa, Ruqi Zhang
1182Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control0Bo Du, Jianqing Wu, Jun Shen, Liang Zhang, Linyuan Lü, Qiang Wu
1183Uncertainty Modeling in Generative Compressed Sensing0Jian Wang, Mengchu Xu, Xiaojun Mao, Yilang Zhang
1184Building Robust Ensembles via Margin Boosting0Aaron C. Courville, Arun Sai Suggala, Dinghuai Zhang, Hongyang Zhang, Pradeep Ravikumar, Yoshua Bengio
1185Revisiting and Advancing Fast Adversarial Training Through The Lens of Bi-Level Optimization0Guanhua Zhang, Mingyi Hong, Prashant Khanduri, Shiyu Chang, Sijia Liu, Yihua Zhang
1186Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory0Chengzhuo Ni, Mengdi Wang, Ruiqi Zhang, Xuezhou Zhang
1187ROCK: Causal Inference Principles for Reasoning about Commonsense Causality0Dan Roth, Hongming Zhang, Jiayao Zhang, Weijie J. Su
1188No-Regret Learning in Time-Varying Zero-Sum Games0Haipeng Luo, Mengxiao Zhang, Peng Zhao, ZhiHua Zhou
1189PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance0Alexander Bukharin, Chen Liang, Pengcheng He, Qingru Zhang, Simiao Zuo, Tuo Zhao, Weizhu Chen
1190NysADMM: faster composite convex optimization via low-rank approximation0Madeleine Udell, Shipu Zhao, Zachary Frangella
1191Toward Compositional Generalization in Object-Oriented World Modeling0Lawson L. S. Wong, Linfeng Zhao, Lingzhi Kong, Robin Walters
1192Dynamic Regret of Online Markov Decision Processes0Longfei Li, Peng Zhao, ZhiHua Zhou
1193Learning to Solve PDE-constrained Inverse Problems with Graph Networks0David B. Lindell, Gordon Wetzstein, Qingqing Zhao
1194Learning from Counterfactual Links for Link Prediction0Daheng Wang, Gang Liu, Meng Jiang, Tong Zhao, Wenhao Yu
1195Global Optimization Networks0Erez Louidor, Maya R. Gupta, Sen Zhao
1196Certified Robustness Against Natural Language Attacks by Causal Intervention0Anh Tuan Luu, Chang Ma, Haiteng Zhao, Hanwang Zhang, Xinshuai Dong, ZhiHong Deng
1197Efficient Learning for AlphaZero via Path Consistency0Dengwei Zhao, Lei Xu, Shikui Tu
1198Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning0Hao Zhang, Xiuyuan Hu, Yang Zhao
1199Ripple Attention for Visual Perception with Sub-quadratic Complexity0Huijie Pan, Lin Zheng, Lingpeng Kong
1200Linear Complexity Randomized Self-attention Mechanism0Chong Wang, Lin Zheng, Lingpeng Kong
1201Online Decision Transformer0Aditya Grover, Amy Zhang, Qinqing Zheng
1202Learning Efficient and Robust Ordinary Differential Equations via Invertible Neural Networks0Edwin V. Bonilla, Fabio Ramos, Lionel Ott, Tin Lai, Weiming Zhi
1203HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning0Andrey Zhmoginov, Maksym Vladymyrov, Mark Sandler
1204Describing Differences between Text Distributions with Natural Language0Charlie Snell, Dan Klein, Jacob Steinhardt, Ruiqi Zhong
1205Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets0Han Zhong, Jiyuan Tan, Liwei Wang, Tong Zhang, Wei Xiong, Zhaoran Wang, Zhuoran Yang
1206Dimension-free Complexity Bounds for High-order Nonconvex Finite-sum Optimization0Dongruo Zhou, Quanquan Gu
1207A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines0Weichao Zhou, Wenchao Li
1208On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features0Chong You, Jinxin Zhou, Qing Qu, Tianyu Ding, Xiao Li, Zhihui Zhu
1209Model Agnostic Sample Reweighting for Out-of-Distribution Learning0Peng Cui, Renjie Pi, Renzhe Xu, Tong Zhang, Weizhong Zhang, Xiao Zhou, Yong Lin
1210Sparse Invariant Risk Minimization0Tong Zhang, Weizhong Zhang, Xiao Zhou, Yong Lin
1211Prototype-Anchored Learning for Learning with Imperfect Annotations0Deming Zhai, Junjun Jiang, Xiangyang Ji, Xianming Liu, Xin Gao, Xiong Zhou
1212FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting0Liang Sun, Qingsong Wen, Rong Jin, Tian Zhou, Xue Wang, Ziqing Ma
1213Probabilistic Bilevel Coreset Selection0Renjie Pi, Tong Zhang, Weizhong Zhang, Xiao Zhou, Yong Lin, Zonghao Chen
1214Approximate Frank-Wolfe Algorithms over Graph-structured Support Sets0Baojian Zhou, Yifan Sun
1215Improving Adversarial Robustness via Mutual Information Estimation0Bo Han, Dawei Zhou, Nannan Wang, Tongliang Liu, Xiaoyu Wang, Xinbo Gao, Yibing Zhan
1216Modeling Adversarial Noise for Adversarial Training0Bo Han, Dawei Zhou, Nannan Wang, Tongliang Liu
1217Contrastive Learning with Boosted Memorization0Bo Han, Jiangchao Yao, Ya Zhang, Yanfeng Wang, Zhihan Zhou
1218Understanding The Robustness in Vision Transformers0Animashree Anandkumar, Chaowei Xiao, Daquan Zhou, Enze Xie, Jiashi Feng, José M. Álvarez, Zhiding Yu
1219VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training0Shizhe Diao, Wangchunshu Zhou, Xinsong Zhang, Yan Zeng
1220Detecting Corrupted Labels Without Training a Model to Predict0Yang Liu, Zhaowei Zhu, Zihao Dong
1221Contextual Bandits with Large Action Spaces: Made Practical0Dylan J. Foster, John Langford, Paul Mineiro, Yinglun Zhu
1222Neural-Symbolic Models for Logical Queries on Knowledge Graphs0Jian Tang, Mikhail Galkin, Zhaocheng Zhu, Zuobai Zhang
1223Topology-aware Generalization of Decentralized SGD0Dacheng Tao, Fengxiang He, Lan Zhang, Mingli Song, Tongtian Zhu, Zhengyang Niu
1224Resilient and Communication Efficient Learning for Heterogeneous Federated Systems0Jiayu Zhou, Junyuan Hong, Steve Drew, Zhuangdi Zhu
1225On Numerical Integration in Neural Ordinary Differential Equations0Aiqing Zhu, Beibei Zhu, Pengzhan Jin, Yifa Tang
1226When AUC meets DRO: Optimizing Partial AUC for Deep Learning with Non-Convex Convergence Guarantee0Bokun Wang, Dixian Zhu, Gang Li, Tianbao Yang, Xiaodong Wu
1227Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces0Paul Mineiro, Yinglun Zhu
1228Residual-Based Sampling for Online Outlier-Robust PCA0Jie Shen, Tianhao Zhu
1229Region-Based Semantic Factorization in GANs0Deli Zhao, Jiapeng Zhu, Qifeng Chen, Yinghao Xu, Yujun Shen
1230Beyond Images: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features0Jialu Wang, Yang Liu, Zhaowei Zhu
1231Towards Uniformly Superhuman Autonomy via Subdominance Minimization0Brian D. Ziebart, Paul Vernaza, Sanjiban Choudhury, Xinyan Yan
1232Inductive Matrix Completion: No Bad Local Minima and a Fast Algorithm0Boaz Nadler, Pini Zilber
1233Counterfactual Prediction for Outcome-Oriented Treatments0Bo Li, Hao Zou, Jiangang Han, Peng Cui, Shuiping Chen, Xuetao Ding
1234SpaceMAP: Visualizing High-Dimensional Data by Space Expansion0Qian Tao, Xinrui Zu