ICML2023

November 8, 2025 · View on GitHub

会议论文列表

本会议共有 1828 篇论文

序号标题链接推荐理由推荐度摘要作者组织
1Data Structures for Density Estimation0Alexandr Andoni, Anders Aamand, Justin Y. Chen, Piotr Indyk, Sandeep Silwal, Shyam Narayanan
2ClusterFuG: Clustering Fully connected Graphs by Multicut0Ahmed Abbas, Paul Swoboda
3Generalization on the Unseen, Logic Reasoning and Degree Curriculum0Aryo Lotfi, Emmanuel Abbe, Kevin Rizk, Samy Bengio
4Toward Large Kernel Models0Amirhesam Abedsoltan, Mikhail Belkin, Parthe Pandit
5Expertise Trees Resolve Knowledge Limitations in Collective Decision-Making0Ann Nowé, Axel Abels, Tom Lenaerts, Vito Trianni
6Comparison of meta-learners for estimating multi-valued treatment heterogeneous effects0Antoine Bertoncello, Josselin Garnier, Naoufal Acharki, Ramiro Lugo
7BNN-DP: Robustness Certification of Bayesian Neural Networks via Dynamic Programming0Andrea Patane, Luca Laurenti, Morteza Lahijanian, Steven Adams
8SAM operates far from home: eigenvalue regularization as a dynamical phenomenon0Atish Agarwala, Yann N. Dauphin
9Second-order regression models exhibit progressive sharpening to the edge of stability0Atish Agarwala, Fabian Pedregosa, Jeffrey Pennington
10Global optimality of Elman-type RNNs in the mean-field regime0Andrea Agazzi, Jianfeng Lu, Sayan Mukherjee
11SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification0Ameet Deshpande, Karthik R. Narasimhan, Pranjal Aggarwal
12Adaptive IMLE for Few-shot Pretraining-free Generative Modelling0Ke Li, Mehran Aghabozorgi, Shichong Peng
13Scaling Laws for Generative Mixed-Modal Language Models0Alexis Conneau, Armen Aghajanyan, Karen Hambardzumyan, Lili Yu, Luke Zettlemoyer, Naman Goyal, Omer Levy, Stephen Roller, Susan Zhang, WeiNing Hsu
14Hypothesis Transfer Learning with Surrogate Classification Losses: Generalization Bounds through Algorithmic Stability0Anass Aghbalou, Guillaume Staerman
15Constrained Causal Bayesian Optimization0Alan Malek, Ira Ktena, Silvia Chiappa, Virginia Aglietti
16Explaining the effects of non-convergent MCMC in the training of Energy-Based Models0Aurélien Decelle, Beatriz Seoane, Elisabeth Agoritsas, Giovanni Catania
17Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies0Adam Tauman Kalai, Gati V. Aher, Rosa I. Arriaga
18Interventional Causal Representation Learning0Divyat Mahajan, Kartik Ahuja, Yixin Wang, Yoshua Bengio
19Sequential Underspecified Instrument Selection for Cause-Effect Estimation0Elisabeth Ailer, Jason S. Hartford, Niki Kilbertus
20Atari-5: Distilling the Arcade Learning Environment down to Five Games0Marcus Hutter, Matthew Aitchison, Penny Sweetser
21Towards credible visual model interpretation with path attribution0Mohammad A. A. K. Jalwana, Naveed Akhtar
22Convergence of First-Order Methods for Constrained Nonconvex Optimization with Dependent Data0Ahmet Alacaoglu, Hanbaek Lyu
23Recasting Self-Attention with Holographic Reduced Representations0Edward Raff, James Holt, Mohammad Mahmudul Alam, Stella Biderman, Tim Oates
24The Saddle-Point Method in Differential Privacy0Flávio P. Calmon, Juan Felipe Gómez, Lalitha Sankar, Oliver Kosut, Shahab Asoodeh, Wael Alghamdi
25Nonlinear Advantage: Trained Networks Might Not Be As Complex as You Think0Christian H. X. Ali MehmetiGöpel, Jan Disselhoff
26A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models0Balaji Lakshminarayanan, Dustin Tran, James Urquhart Allingham, Jeremiah Zhe Liu, Jie Ren, Michael W. Dusenberry, Xiuye Gu, Yin Cui
27On the Privacy-Robustness-Utility Trilemma in Distributed Learning0John Stephan, Nirupam Gupta, Rachid Guerraoui, Rafael Pinot, Youssef Allouah
28Differentially Private Distributed Bayesian Linear Regression with MCMC0Baris Alparslan, S. Ilker Birbil, Sinan Yildirim
29Robust and Scalable Bayesian Online Changepoint Detection0FrançoisXavier Briol, Jeremias Knoblauch, Matías Altamirano
30Neural Wasserstein Gradient Flows for Discrepancies with Riesz Kernels0Fabian Altekrüger, Gabriele Steidl, Johannes Hertrich
31Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost0András György, Lin Yang, Sanae Amani, Tor Lattimore
32A Kernelized Stein Discrepancy for Biological Sequences0Alan Nawzad Amin, Debora Susan Marks, Eli N. Weinstein
33The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation0Csaba Szepesvári, Nan Jiang, Philip Amortila
34Meta Optimal Transport0Brandon Amos, Giulia Luise, Ievgen Redko, Samuel Cohen
35Near-Optimal Φ-Regret Learning in Extensive-Form Games0Gabriele Farina, Ioannis Anagnostides, Tuomas Sandholm
36A Modern Look at the Relationship between Sharpness and Generalization0Francesco Croce, Maksym Andriushchenko, Matthias Hein, Maximilian Müller, Nicolas Flammarion
37SGD with Large Step Sizes Learns Sparse Features0Aditya Vardhan Varre, Loucas PillaudVivien, Maksym Andriushchenko, Nicolas Flammarion
38Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series0Abdul Fatir Ansari, Alvin Heng, Andre Lim, Harold Soh
39Paging with Succinct Predictions0Adam Polak, Antonios Antoniadis, Bertrand Simon, Joan Boyar, Kim S. Larsen, Lene Monrad Favrholdt, Marek Eliás, Ruben Hoeksma
40Mixing Predictions for Online Metric Algorithms0Adam Polak, Antonios Antoniadis, Bertrand Simon, Christian Coester, Marek Eliás
41Exponential Smoothing for Off-Policy Learning0Anna Korba, David Rohde, Imad Aouali, VictorEmmanuel Brunel
42Polynomial Time and Private Learning of Unbounded Gaussian Mixture Models0Christopher Liaw, Hassan Ashtiani, Jamil Arbas
43Principled Acceleration of Iterative Numerical Methods Using Machine Learning0Qianxiao Li, Sohei Arisaka
44Faster Rates of Convergence to Stationary Points in Differentially Private Optimization0Cristóbal Guzmán, Enayat Ullah, Michael Menart, Raef Bassily, Raman Arora, Tomás González
45Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning0Eugene Belilovsky, MohammadReza Davari, Nader Asadi, Rahaf Aljundi, Sudhir P. Mudur
46Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime0Hilal Asi, Kunal Talwar, Tomer Koren, Vitaly Feldman
47From Robustness to Privacy and Back0Hilal Asi, Jonathan R. Ullman, Lydia Zakynthinou
48SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance0Amit Attia, Tomer Koren
49Adversarially Robust PAC Learnability of Real-Valued Functions0Idan Attias, Steve Hanneke
50Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning0Andreas Loukas, Mattia Atzeni, Mrinmaya Sachan
51Learning to Initiate and Reason in Event-Driven Cascading Processes0Eli A. Meirom, Gal Chechik, Shie Mannor, Yuval Atzmon
52On the convergence of the MLE as an estimator of the learning rate in the Exp3 algorithm0Julien Aubert, Luc Lehéricy, Patricia ReynaudBouret
53Dirichlet Diffusion Score Model for Biological Sequence Generation0Chenlai Shi, Jian Zhou, Kseniia Dudnyk, Pavel Avdeyev, Yuhao Tan
54Gradient Descent Converges Linearly for Logistic Regression on Separable Data0Kyriakos Axiotis, Maxim Sviridenko
55Naive imputation implicitly regularizes high-dimensional linear models0Alexis Ayme, Aymeric Dieuleveut, Claire Boyer, Erwan Scornet
56Half-Hop: A graph upsampling approach for slowing down message passing0ChiHeng Lin, Eva L. Dyer, Lakshmi Sathidevi, Mehdi Azabou, Michal Valko, Petar Velickovic, Ran Liu, Shantanu Thakoor, Venkataramana Ganesh
57CLUTR: Curriculum Learning via Unsupervised Task Representation Learning0Abdus Salam Azad, Aleksandra Faust, Ion Stoica, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Pieter Abbeel
58Personalized Subgraph Federated Learning0Jaehong Yoon, Jinheon Baek, Jiongdao Jin, Sung Ju Hwang, Wonyong Jeong
59Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language0Alexei Baevski, Arun Babu, Michael Auli, WeiNing Hsu
60Efficient preconditioned stochastic gradient descent for estimation in latent variable models0Charlotte Baey, Estelle Kuhn, JeanBenoist Leger, Maud Delattre, Sarah Lemler
61Feed Two Birds with One Scone: Exploiting Wild Data for Both Out-of-Distribution Generalization and Detection0Gregory Canal, Haoyue Bai, Jeongyeol Kwon, Robert D. Nowak, Xuefeng Du, Yixuan Li
62Answering Complex Logical Queries on Knowledge Graphs via Query Computation Tree Optimization0Juanzi Li, Lei Hou, Xin Lv, Yushi Bai
63Linear optimal partial transport embedding0Ivan Vladimir Medri, Rana Muhammad Shahroz Khan, Rocio Diaz Martin, Soheil Kolouri, Yikun Bai
64Implicit Graph Neural Networks: A Monotone Operator Viewpoint0Bao Wang, Cory D. Hauck, Justin M. Baker, Qingsong Wang
65Tensor Decompositions Meet Control Theory: Learning General Mixtures of Linear Dynamical Systems0Ainesh Bakshi, Allen Liu, Ankur Moitra, Morris Yau
66Block Subsampled Randomized Hadamard Transform for Nyström Approximation on Distributed Architectures0Laura Grigori, Matthias Beaupère, Oleg Balabanov, Victor Lederer
67Efficient Online Reinforcement Learning with Offline Data0Ilya Kostrikov, Laura Smith, Philip J. Ball, Sergey Levine
68Mirror Sinkhorn: Fast Online Optimization on Transport Polytopes0Marin Ballu, Quentin Berthet
69On the Functional Similarity of Robust and Non-Robust Neural Representations0András Balogh, Márk Jelasity
70Robust Budget Pacing with a Single Sample0Balasubramanian Sivan, Di Wang, Rachitesh Kumar, Santiago R. Balseiro, Vahab Mirrokni
71Dynamic Constrained Submodular Optimization with Polylogarithmic Update Time0Kiarash Banihashem, Leyla Biabani, MohammadTaghi Hajiaghayi, Morteza Monemizadeh, Peyman Jabbarzade, Samira Goudarzi
72One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale0Chongxuan Li, Fan Bao, Gang Yue, Hang Su, Jun Zhu, Kaiwen Xue, Shen Nie, Shi Pu, Yaole Wang, Yue Cao
73Optimizing the Collaboration Structure in Cross-Silo Federated Learning0Haohan Wang, Jingrui He, Jun Wu, Wenxuan Bao
74MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation0Lior Yariv, Omer BarTal, Tali Dekel, Yaron Lipman
75Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space0Anas Barakat, Ilyas Fatkhullin, Niao He
76Interpretable Neural-Symbolic Concept Reasoning0Alberto Tonda, Francesco Giannini, Frédéric Precioso, Gabriele Ciravegna, Giuseppe Marra, Lucie Charlotte Magister, Mateja Jamnik, Mateo Espinosa Zarlenga, Pietro Barbiero, Pietro Lio
77Moccasin: Efficient Tensor Rematerialization for Neural Networks0Bistra Dilkina, Burak Bartan, Christopher Lott, Haoming Li, Harris Teague
78User-level Private Stochastic Convex Optimization with Optimal Rates0Raef Bassily, Ziteng Sun
79A Statistical Perspective on Retrieval-Based Models0Ankit Singh Rawat, Manzil Zaheer, Soumya Basu
80Human-Timescale Adaptation in an Open-Ended Task Space0Adrian Collister, Alexander Zacherl, Avishkar Bhoopchand, Edward Hughes, Feryal M. P. Behbahani, Hannah Openshaw, Jack ParkerHolder, Jakob Bauer, Jakub Sygnowski, Karl Tuyls, Karol Gregor, Kate Baumli, Lei M. Zhang, Lucy Gonzalez, Maria LoksThompson, Michael Chang, Natalie Clay, Nathalie BradleySchmieg, Nemanja Rakicevic, Nicolas Perez Nieves, Sarah York, Satinder Singh, Sheleem Kashem, Shreya Pathak, Tim Rocktäschel, Vibhavari Dasagi, Yannick Schroecker
81A Kernel Stein Test of Goodness of Fit for Sequential Models0Arthur Gretton, Heishiro Kanagawa, Jerome Baum
82Individually Fair Learning with One-Sided Feedback0Aaron Roth, Yahav Bechavod
83Predicting Ordinary Differential Equations with Transformers0Alexander Neitz, Giambattista Parascandolo, Michal Klein, Niki Kilbertus, Sören Becker
84Explaining Reinforcement Learning with Shapley Values0Daniel Beechey, Thomas M. S. Smith, Özgür Simsek
85TIDE: Time Derivative Diffusion for Deep Learning on Graphs0Maks Ovsjanikov, Maximilian Krahn, Maysam Behmanesh
86Fast as CHITA: Neural Network Pruning with Combinatorial Optimization0Hussein Hazimeh, Natalia Ponomareva, Rahul Mazumder, Riade Benbaki, Wenyu Chen, Xiang Meng, Zhe Zhao
87Continuously Parameterized Mixture Models0Christopher M. Bender, Junier Oliva, Marc Niethammer, Yifeng Shi
88Controllable Neural Symbolic Regression0Luca Biggio, PierreAlexandre Kamienny, Tommaso Bendinelli
89On Second-Order Scoring Rules for Epistemic Uncertainty Quantification0Eyke Hüllermeier, Viktor Bengs, Willem Waegeman
90Certified Robust Neural Networks: Generalization and Corruption Resistance0Bart P. G. Van Parys, M. Amine Bennouna, Ryan Lucas
91Gaussian processes at the Helm(holtz): A more fluid model for ocean currents0Brian L. Trippe, David R. Burt, Junfei Xia, Kaushik Srinivasan, Renato Berlinghieri, Ryan James Giordano, Tamara Broderick, Tamay M. Özgökmen
92Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion0Alberto Marchesi, Andrea Celli, Francesco Trovò, Martino Bernasconi, Matteo Castiglioni, Nicola Gatti
93Constrained Phi-Equilibria0Alberto Marchesi, Francesco Trovò, Martino Bernasconi, Matteo Castiglioni, Nicola Gatti
94Differentiable and Transportable Structure Learning0Fergus Imrie, Jeroen Berrevoets, Mihaela van der Schaar, Nabeel Seedat
95Polyhedral Complex Extraction from ReLU Networks using Edge Subdivision0Arturs Berzins
96Robust One-Class Classification with Signed Distance Function using 1-Lipschitz Neural Networks0Andres TroyaGalvis, Guillaume Coiffier, Louis Béthune, Mathieu Serrurier, Paul Novello, Quentin Vincenot, Thibaut Boissin
97Neural Algorithmic Reasoning with Causal Regularisation0Beatrice Bevilacqua, Borja Ibarz, Charles Blundell, Ioana Bica, Jovana Mitrovic, Kyriacos Nikiforou, Michela Paganini, Petar Velickovic
98Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference0Ayush Bharti, FrançoisXavier Briol, Masha Naslidnyk, Oscar Key, Samuel Kaski
99Bandit Online Linear Optimization with Hints and Queries0Aditya Bhaskara, Ashok Cutkosky, Manish Purohit, Ravi Kumar
100Improved Online Conformal Prediction via Strongly Adaptive Online Learning0Aadyot Bhatnagar, Caiming Xiong, Huan Wang, Yu Bai
101Data-Copying in Generative Models: A Formal Framework0Kamalika Chaudhuri, Robi Bhattacharjee, Sanjoy Dasgupta
102Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling0Aviya Skowron, Edward Raff, Eric Hallahan, Hailey Schoelkopf, Herbie Bradley, Kyle O'Brien, Lintang Sutawika, Mohammad Aflah Khan, Oskar van der Wal, Quentin Gregory Anthony, Shivanshu Purohit, Stella Biderman, USVSN Sai Prashanth
103StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes0N. M. Anoop Krishnan, Sahil Manchanda, Sayan Ranu, Srikanth Sastry, Vaibhav Bihani
104Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion0Anderson Schneider, Kashif Rasul, Marin Bilos, Stephan Günnemann, Yuriy Nevmyvaka
105In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation0Julian Bitterwolf, Matthias Hein, Maximilian Müller
106Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames0Aravindh Mahendran, Gamaleldin Fathy Elsayed, Mehdi S. M. Sajjadi, Ondrej Biza, Sjoerd van Steenkiste, Thomas Kipf
107Understanding Oversquashing in GNNs through the Lens of Effective Resistance0Amir Nayyeri, Mitchell Black, Yusu Wang, Zhengchao Wan
108Unit Scaling: Out-of-the-Box Low-Precision Training0Carlo Luschi, Charlie Blake, Douglas Orr
109FLEX: an Adaptive Exploration Algorithm for Nonlinear Systems0Marc Lelarge, Matthieu Blanke
110Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits0Markus Bläser
111Learning the Dynamics of Sparsely Observed Interacting Systems0Adeline Fermanian, Agathe Guilloux, AnneSophie Jannot, Linus Bleistein
112Subset Selection Based On Multiple Rankings in the Presence of Bias: Effectiveness of Fairness Constraints for Multiwinner Voting Score Functions0Anay Mehrotra, L. Elisa Celis, Lingxiao Huang, Niclas Boehmer, Nisheeth K. Vishnoi
113Properties of the Mallows Model Depending on the Number of Alternatives: A Warning for an Experimentalist0Niclas Boehmer, Piotr Faliszewski, Sonja Kraiczy
114A Robust Optimisation Perspective on Counterexample-Guided Repair of Neural Networks0David Boetius, Stefan Leue, Tobias Sutter
115Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels0Marco Mondelli, Shayan Kiyani, Simone Bombari
116Sliced-Wasserstein on Symmetric Positive Definite Matrices for M/EEG Signals0Alain Rakotomamonjy, Benoît Malézieux, Clément Bonet, Lucas Drumetz, Matthieu Kowalski, Nicolas Courty, Thomas Moreau
117Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere0Anima Anandkumar, Boris Bonev, Christian Hundt, Jaideep Pathak, Karthik Kashinath, Maximilian Baust, Thorsten Kurth
118The Regret of Exploration and the Control of Bad Episodes in Reinforcement Learning0Bruno Gaujal, Victor Boone
119Model-agnostic Measure of Generalization Difficulty0Akhilan Boopathy, Asaad Mohammedsaleh, Ila Fiete, Jaedong Hwang, Kevin Liu, Shu Ge
120Returning The Favour: When Regression Benefits From Probabilistic Causal Knowledge0Dino Sejdinovic, Jake Fawkes, Shahine Bouabid
121In Search for a Generalizable Method for Source Free Domain Adaptation0Bart van Merrienboer, Eleni Triantafillou, Malik Boudiaf, Tom Denton, Vincent Dumoulin
122Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling0Aaron Sidford, Adam Bouland, Kevin Tian, Yosheb M. Getachew, Yujia Jin
123Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?0Akash Nagaraj, Julien Colin, Lakshya Singhal, Rishav Mukherji, Thomas Fel, Thomas Serre, Victor Boutin
124Settling the Reward Hypothesis0David Abel, John D. Martin, Michael Bowling, Will Dabney
125ILLUME: Rationalizing Vision-Language Models through Human Interactions0Björn Deiseroth, Kristian Kersting, Manuel Brack, Patrick Schramowski
126Provably Learning Object-Centric Representations0Bernhard Schölkopf, Jack Brady, Julius von Kügelgen, Roland S. Zimmermann, Wieland Brendel, Yash Sharma
127Quantifying Human Priors over Social and Navigation Networks0Gecia Bravo Hermsdorff
128Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss0Guido Montúfar, Jing An, Katerina Papagiannouli, Pierre Bréchet
129Emergence of Sparse Representations from Noise0Bruno A. Olshausen, Gabriel Kreiman, Rylan Schaeffer, Trenton Bricken
130Differentially Private Optimization on Large Model at Small Cost0George Karypis, Sheng Zha, YuXiang Wang, Zhiqi Bu
131Machine Learning Force Fields with Data Cost Aware Training0Alexander Bukharin, Shengjie Wang, Simiao Zuo, Tianyi Liu, Tuo Zhao, Weihao Gao, Wen Yan
132Label differential privacy and private training data release0Andrés Muñoz Medina, Róbert Istvan BusaFekete, Sergei Vassilvitskii, Umar Syed
133The SSL Interplay: Augmentations, Inductive Bias, and Generalization0Alberto Bietti, Bobak Toussi Kiani, Randall Balestriero, Vivien Cabannes, Yann LeCun
134Online Mechanism Design for Information Acquisition0Federico Cacciamani, Matteo Castiglioni, Nicola Gatti
135MyoDex: A Generalizable Prior for Dexterous Manipulation0Sudeep Dasari, Vikash Kumar, Vittorio Caggiano
136What Can Be Learnt With Wide Convolutional Neural Networks?0Alessandro Favero, Francesco Cagnetta, Matthieu Wyart
137Causal Discovery with Latent Confounders Based on Higher-Order Cumulants0Kun Zhang, Ruichu Cai, Wei Chen, Zhifeng Hao, Zhiyi Huang
138On the Connection Between MPNN and Graph Transformer0Chen Cai, Rose Yu, Truong Son Hy, Yusu Wang
139Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition0Anbang Yao, Dongqi Cai, Yangyuxuan Kang, Yurong Chen
140Extrapolated Random Tree for Regression0Hanfang Yang, Yiwei Dong, Yuchao Cai, Yuheng Ma
141Cyclic Block Coordinate Descent With Variance Reduction for Composite Nonconvex Optimization0Chaobing Song, Jelena Diakonikolas, Stephen J. Wright, Xufeng Cai
142Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?0Ruisi Cai, Zhangyang Wang, Zhenyu Zhang
143Doubly Optimal No-Regret Learning in Monotone Games0Weiqiang Zheng, Yang Cai
144Multi-Agent Learning from Learners0Francesco Chini, Mine Melodi Caliskan, Setareh Maghsudi
145Efficient Learning of Mesh-Based Physical Simulation with Bi-Stride Multi-Scale Graph Neural Network0Chenfanfu Jiang, Menglei Chai, Minchen Li, Yadi Cao
146Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization0Felix Jimenez, Florian Tobias Schäfer, Huiyan Sang, Jian Cao, Matthias Katzfuss, Myeongjong Kang
147Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation0Deva Ramanan, James Hays, Liangyan Gui, Mengtian Li, Shengcao Cao, YuXiong Wang
148One-sided Matrix Completion from Two Observations Per Row0Gregory Valiant, Percy Liang, Steven Cao
149State and parameter learning with PARIS particle Gibbs0Eric Moulines, Gabriel Cardoso, Jimmy Olsson, Sylvain Le Corff, Yazid Janati El Idrissi
150Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning0Clément Romac, Olivier Sigaud, PierreYves Oudeyer, Sylvain Lamprier, Thomas Carta, Thomas Wolf
151Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning0Nicolas Castanet, Olivier Sigaud, Sylvain Lamprier
152Scalable Safe Policy Improvement via Monte Carlo Tree Search0Alberto Castellini, Alessandro Farinelli, Edoardo Zorzi, Federico Bianchi, Matthijs T. J. Spaan, Thiago D. Simão
153LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning0Nathalie Baracaldo, Shiqiang Wang, Stacy Patterson, Swanand Kadhe, Timothy Castiglia, Yi Zhou
154On the Robustness of Text Vectorizers0Damien Garreau, Rémi Catellier, Samuel Vaiter
155Learning Globally Smooth Functions on Manifolds0Alejandro Ribeiro, Benjamin David Haeffele, Juan Cerviño, Luiz F. O. Chamon, René Vidal
156Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond0Chulhee Yun, Jaewook Lee, Jaeyoung Cha
157Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations0Jaehoon Cha, Jeyan Thiyagalingam
158STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning0Alec Koppel, Amrit S. Bedi, Dinesh Manocha, Furong Huang, Mengdi Wang, Souradip Chakraborty
159Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits0Ambuj Tewari, Saptarshi Roy, Sunrit Chakraborty
160Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition0Diana L. Borsa, Rémi Munos, Shantanu Thakoor, Will Dabney, Yash Chandak, Yunhao Tang, Zhaohan Daniel Guo
161Memory-Based Dual Gaussian Processes for Sequential Learning0Arno Solin, Mohammad Emtiyaz Khan, Paul Edmund Chang, Prakhar Verma, S. T. John
162Muse: Text-To-Image Generation via Masked Generative Transformers0Aaron Maschinot, Dilip Krishnan, Han Zhang, Huiwen Chang, Jarred Barber, José Lezama, Kevin Patrick Murphy, Lu Jiang, Michael Rubinstein, MingHsuan Yang, William T. Freeman, Yuanzhen Li
163On Investigating the Conservative Property of Score-Based Generative Models0BoWun Cheng, ChenHao Chao, ChunYi Lee, WeiFang Sun
164Robust and private stochastic linear bandits0Hossein Esfandiari, Vahab Mirrokni, Vasileios Charisopoulos
165Streaming Submodular Maximization with Differential Privacy0Anamay Chaturvedi, Huy L. Nguyen, Thy Dinh Nguyen
166Why does Throwing Away Data Improve Worst-Group Error?0David LopezPaz, Kamalika Chaudhuri, Kartik Ahuja, Martín Arjovsky
167Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits0Daniel Vial, R. Srikant, Ronshee Chawla, Sanjay Shakkottai
168Correcting discount-factor mismatch in on-policy policy gradient methods0A. Rupam Mahmood, Fengdi Che, Gautham Vasan
169Fast Federated Machine Unlearning with Nonlinear Functional Theory0Da Yan, Dejing Dou, Ji Liu, Jun Huan, Lingjuan Lyu, Tianshi Che, Yang Zhou, Zijie Zhang
170On the Statistical Benefits of Temporal Difference Learning0Daniel Russo, David Cheikhi
171Multi-Layer Neural Networks as Trainable Ladders of Hilbert Spaces0Zhengdao Chen
172Beyond the Edge of Stability via Two-step Gradient Updates0Joan Bruna, Lei Chen
173Trompt: Towards a Better Deep Neural Network for Tabular Data0HsinRung Chou, KuanYu Chen, PingHan Chiang, TienHao Chang, TingWei Chen
174Differentially Private Stochastic Convex Optimization under a Quantile Loss Function0Du Chen, Geoffrey A. Chua
175Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers0Alex Dimakis, Giannis Daras, Sitan Chen
176Provably Convergent Schrödinger Bridge with Applications to Probabilistic Time Series Imputation0Anderson Schneider, Fengpei Li, Kashif Rasul, Nicole Tianjiao Yang, Shandian Zhe, Shikai Fang, Wei Deng, Yikai Zhang, Yu Chen, Yuriy Nevmyvaka
177ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines0Phillip B. Gibbons, Pratik Pramod Fegade, Siyuan Chen, Tianqi Chen, Todd C. Mowry
178Is Learning Summary Statistics Necessary for Likelihood-free Inference?0Adrian Weller, Michael U. Gutmann, Yanzhi Chen
179Subequivariant Graph Reinforcement Learning in 3D Environments0Fuchun Sun, Jiaqi Han, Runfa Chen, Wenbing Huang
180GuardHFL: Privacy Guardian for Heterogeneous Federated Learning0Guowen Xu, Hanxiao Chen, Hongwei Li, Kangjie Chen, Meng Hao, Tianwei Zhang, Xilin Zhang
181Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling0Jiaxing He, Liping Liu, Xiaohui Chen, Xu Han
182Evolving Semantic Prototype Improves Generative Zero-Shot Learning0Kun Zhang, Shiming Chen, Tongliang Liu, Wenjin Hou, Xiaohan Ding, Xinge You, Yibing Song, Ziming Hong
183Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization0Fengwei Zhou, Tianyang Hu, Yimeng Chen, Zhenguo Li, ZhiMing Ma
184Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity0Krishna Balasubramanian, Minhui Huang, Shiqian Ma, Xuxing Chen
185Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data0Kaixuan Huang, Mengdi Wang, Minshuo Chen, Tuo Zhao
186Sample Complexity of Probability Divergences under Group Symmetry0Luc ReyBellet, Markos A. Katsoulakis, Wei Zhu, Ziyu Chen
187Improved Analysis of Score-based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions0Holden Lee, Hongrui Chen, Jianfeng Lu
188Bidirectional Looking with A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers0Bo Du, Hai Zhao, Lefei Zhang, Yineng Chen, Zuchao Li
189HarsanyiNet: Computing Accurate Shapley Values in a Single Forward Propagation0Jin Huang, Keyan Zhang, Lu Chen, Quanshi Zhang, Siyu Lou
190Generalized Implicit Follow-The-Regularized-Leader0Francesco Orabona, Keyi Chen
191Fisher Information Embedding for Node and Graph Learning0Dexiong Chen, Karsten M. Borgwardt, Paolo Pellizzoni
192Rethinking Visual Reconstruction: Experience-Based Content Completion Guided by Visual Cues0Gang Pan, Jiaxuan Chen, Yu Qi
193Stratified Adversarial Robustness with Rejection0Jayaram Raghuram, Jiefeng Chen, Jihye Choi, Somesh Jha, Xi Wu, Yingyu Liang
194Multi-task Hierarchical Adversarial Inverse Reinforcement Learning0Dipesh Tamboli, Jiayu Chen, Tian Lan, Vaneet Aggarwal
195Model Transferability with Responsive Decision Subjects0Kun Zhang, Yang Liu, Yatong Chen, Zeyu Tang
196Layered State Discovery for Incremental Autonomous Exploration0Alessandro Lazaric, Andrea Tirinzoni, Liyu Chen, Matteo Pirotta
197Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization0Lijun Zhang, Peng Zhao, Sijia Chen, WeiWei Tu
198Learning to Optimize Differentiable Games0Nelson Vadori, Tianlong Chen, Xuxi Chen, Zhangyang Wang
199Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets0Haoran Sun, Qian Wang, Xiang Yan, Xiaotie Deng, Yurong Chen, Zhaohua Chen, Zhijian Duan
200Semi-Offline Reinforcement Learning for Optimized Text Generation0Changyu Chen, Jie Cao, Li Dong, Rui Yan, Victor Ye Dong, Xiting Wang, Yi Liu, Yiqiao Jin
201Lower Bounds for Learning in Revealing POMDPs0Caiming Xiong, Fan Chen, Huan Wang, Song Mei, Yu Bai
202Implicit Neural Spatial Representations for Time-dependent PDEs0Changxi Zheng, Eitan Grinspun, Honglin Chen, Peter Yichen Chen, Rundi Wu
203BEATs: Audio Pre-Training with Acoustic Tokenizers0Chengyi Wang, Daniel Tompkins, Furu Wei, Sanyuan Chen, Shujie Liu, Wanxiang Che, Xiangzhan Yu, Yu Wu, Zhuo Chen
204Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model0Jibang Wu, Siyu Chen, Yifan Wu, Zhuoran Yang
205Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization0Jing Xu, Lesi Chen, Luo Luo
206Efficient Personalized Federated Learning via Sparse Model-Adaptation0Bolin Ding, Daoyuan Chen, Dawei Gao, Liuyi Yao, Yaliang Li
207A Gromov-Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening0Jie Chen, Rentian Yao, Yifan Chen, Yun Yang
208How to address monotonicity for model risk management?0Dangxing Chen, Weicheng Ye
209Sketched Ridgeless Linear Regression: The Role of Downsampling0Qiang Sun, Siyue Yang, Xin Chen, Yicheng Zeng
210Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning0Dingyang Chen, Qi Zhang
211Bidirectional Learning for Offline Model-based Biological Sequence Design0Can Chen, Mark Coates, Xue Liu, Yingxue Zhang
212Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling0Mingyuan Zhou, Tianqi Chen
213Lifelong Language Pretraining with Distribution-Specialized Experts0Claire Cui, James Laudon, Nan Du, Wuyang Chen, Yanping Huang, Yanqi Zhou, Zhifeng Chen
214Generalized-Smooth Nonconvex Optimization is As Efficient As Smooth Nonconvex Optimization0Yi Zhou, Yingbin Liang, Zhaosong Lu, Ziyi Chen
215Weakly Supervised Regression with Interval Targets0Bo An, Lei Feng, Ximing Li, Xin Cheng, Yuzhou Cao
216PLay: Parametrically Conditioned Layout Generation using Latent Diffusion0ChinYi Cheng, Forrest Huang, Gang Li, Yang Li
217Identification of the Adversary from a Single Adversarial Example0Haochen Sun, Minhao Cheng, PinYu Chen, Rui Min
218Parallel Online Clustering of Bandits via Hedonic Game0Cheng Pan, Setareh Maghsudi, Xiaotong Cheng
219Mu2SLAM: Multitask, Multilingual Speech and Language Models0Ankur Bapna, Melvin Johnson, Wolfgang Macherey, Yong Cheng, Yu Zhang
220Understanding the Role of Feedback in Online Learning with Switching Costs0Bo Ji, Duo Cheng, Xingyu Zhou
221Tighter Bounds on the Expressivity of Transformer Encoders0Anand Pillay, David Chiang, Peter Cholak
222Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup0Chenwei Wu, Muthu Chidambaram, Rong Ge, Xiang Wang
223Hiding Data Helps: On the Benefits of Masking for Sparse Coding0Chenwei Wu, Muthu Chidambaram, Rong Ge, Yu Cheng
224PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation0ChoJui Hsieh, Eli Chien, HsiangFu Yu, Jiong Zhang, JyunYu Jiang, Olgica Milenkovic, WeiCheng Chang
225Tight Certification of Adversarially Trained Neural Networks via Nonconvex Low-Rank Semidefinite Relaxations0HongMing Chiu, Richard Y. Zhang
226Neural Latent Aligner: Cross-trial Alignment for Learning Representations of Complex, Naturalistic Neural Data0Cheol Jun Cho, Edward F. Chang, Gopala Krishna Anumanchipalli
227On the Convergence of Federated Averaging with Cyclic Client Participation0Gauri Joshi, Pranay Sharma, Satyen Kale, Tong Zhang, Yae Jee Cho, Zheng Xu
228GREAD: Graph Neural Reaction-Diffusion Networks0Jeongwhan Choi, Noseong Park, Seoyoung Hong, SungBae Cho
229Is Overfitting Necessary for Implicit Video Representation?0Dokwan Oh, Hee Min Choi, Hyoa Kang
230Semi-Parametric Contextual Pricing Algorithm using Cox Proportional Hazards Model0GiSoo Kim, Minhwan Oh, Myunghee Cho Paik, Wooseong Cho, YoungGeun Choi, Yunseo Choi
231Restoration based Generative Models0Jaemoo Choi, Myungjoo Kang, Yesom Park
232Concept-based Explanations for Out-of-Distribution Detectors0Atul Prakash, Jayaram Raghuram, Jiefeng Chen, Jihye Choi, Ryan Feng, Somesh Jha
233Active causal structure learning with advice0Arnab Bhattacharyya, Davin Choo, Themistoklis Gouleakis
234New metrics and search algorithms for weighted causal DAGs0Davin Choo, Kirankumar Shiragur
235Computational Doob h-transforms for Online Filtering of Discretely Observed Diffusions0Alexandre H. Thiery, Andras Fulop, Jeremy Heng, Nicolas Chopin
236Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning0Abhradeep Guha Thakurta, Christopher A. ChoquetteChoo, Hugh Brendan McMahan, J. Keith Rush
237Taming graph kernels with random features0Krzysztof Marcin Choromanski
238Efficient Graph Field Integrators Meet Point Clouds0Adrian Weller, Alvin Pan, Arijit Sehanobish, David Watkins, Deepali Jain, Eli Berger, Han Lin, Krzysztof Marcin Choromanski, Kumar Avinava Dubey, Snigdha Chaturvedi, Somnath Basu Roy Chowdhury, Tamás Sarlós, Tetiana Parshakova, Tianyi Zhang, Valerii Likhosherstov, Yunfan Zhao
239ContraBAR: Contrastive Bayes-Adaptive Deep RL0Aviv Tamar, Era Choshen
240Forget Unlearning: Towards True Data-Deletion in Machine Learning0Neil Shah, Rishav Chourasia
241Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks0Meng Wang, Mohammed Nowaz Rabbani Chowdhury, PinYu Chen, Shuai Zhang, Sijia Liu
242What do CNNs Learn in the First Layer and Why? A Linear Systems Perspective0Rhea Chowers, Yair Weiss
243Unifying Molecular and Textual Representations via Multi-task Language Modelling0Dimitrios Christofidellis, Giorgio Giannone, Jannis Born, Matteo Manica, Ole Winther, Teodoro Laino
244Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks0Hong Mei, Shanghang Zhang, Wenwu Zhu, Xin Wang, Xu Chu, Yasha Wang, Yujie Jin
245Shape-Guided Dual-Memory Learning for 3D Anomaly Detection0Chieh Liu, HwannTzong Chen, TingI Hsieh, TyngLuh Liu, YuMin Chu
246Multiply Robust Off-policy Evaluation and Learning under Truncation by Death0Jianing Chu, Shu Yang, Wenbin Lu
247InfoOT: Information Maximizing Optimal Transport0ChingYao Chuang, David AlvarezMelis, Stefanie Jegelka
248A Toy Model of Universality: Reverse Engineering how Networks Learn Group Operations0Bilal Chughtai, Lawrence Chan, Neel Nanda
249Distribution Free Prediction Sets for Node Classification0Jase Clarkson
250Sequential Strategic Screening0Ali Vakilian, Juba Ziani, Kevin Stangl, Lee Cohen, Saeed SharifiMalvajerdi
251Few-Sample Feature Selection via Feature Manifold Learning0David Cohen, Ronen Talmon, Tal Shnitzer, Yuval Kluger
252Spatial Implicit Neural Representations for Global-Scale Species Mapping0Alexander Shepard, Christian Lange, Elijah Cole, Grant Van Horn, Oisin Mac Aodha, Patrick Leary, Pietro Perona, Scott Loarie
253K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs0Andrea Coletta, Svitlana Vyetrenko, Tucker Balch
254Inferring Relational Potentials in Interacting Systems0Armand Comas Massague, Christian Fernandez Lopez, Joshua B. Tenenbaum, Mario Sznaier, Octavia I. Camps, Sandesh Ghimire, Yilun Du
255Task-specific experimental design for treatment effect estimation0Alexander Adam, Bethany Connolly, Christopher Frye, Gary Willis, Ilya Feige, Kim Moore, Tobias Schwedes
256A Mathematical Model for Curriculum Learning for Parities0Elchanan Mossel, Elisabetta Cornacchia
257Learning to Maximize Mutual Information for Dynamic Feature Selection0Ian Connick Covert, Mingyu Lu, Nathan J. White, Nayoon Kim, SuIn Lee, Wei Qiu
258Rethinking Weak Supervision in Helping Contrastive Learning0Jingyi Cui, Weiran Huang, Yifei Wang, Yisen Wang
259Bayes-optimal Learning of Deep Random Networks of Extensive-width0Florent Krzakala, Hugo Cui, Lenka Zdeborová
260A General Representation Learning Framework with Generalization Performance Guarantees0Jianqing Liang, Jiye Liang, Junbiao Cui, Qin Yue
261IRNeXt: Rethinking Convolutional Network Design for Image Restoration0Alois Knoll, Sining Yang, Wenqi Ren, Xiaochun Cao, Yuning Cui
262Scaling Up Dataset Distillation to ImageNet-1K with Constant Memory0ChoJui Hsieh, Justin Cui, Ruochen Wang, Si Si
263Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation0Haichao Yu, Linjie Yang, Yiming Cui
264Adaptive Identification of Populations with Treatment Benefit in Clinical Trials: Machine Learning Challenges and Solutions0Alicia Curth, Alihan Hüyük, Mihaela van der Schaar
265In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation0Alicia Curth, Mihaela van der Schaar
266Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion0Ashok Cutkosky, Francesco Orabona, Harsh Mehta
267Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps0Marco Cuturi, Michal Klein, Pierre Ablin
268From Noisy Fixed-Point Iterations to Private ADMM for Centralized and Federated Learning0Aurélien Bellet, Debabrota Basu, Edwige Cyffers
269Chameleon: Adapting to Peer Images for Planting Durable Backdoors in Federated Learning0Songze Li, Yanbo Dai
270Refined Regret for Adversarial MDPs with Linear Function Approximation0ChenYu Wei, Haipeng Luo, Julian Zimmert, Yan Dai
271MultiRobustBench: Benchmarking Robustness Against Multiple Attacks0Chong Xiang, PinYu Chen, Prateek Mittal, Saeed Mahloujifar, Sihui Dai, Vikash Sehwag
272Moderately Distributional Exploration for Domain Generalization0Bo Han, Rui Dai, Xinmei Tian, Yonggang Zhang, Zhen Fang
273Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning0Brett Daley, Christopher Amato, Marlos C. Machado, Martha White
274Efficient displacement convex optimization with particle gradient descent0Chi Jin, Hadi Daneshmand, Jason D. Lee
275Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation0Chengju Liu, Liuyi Wang, Lu Chen, Qijun Chen, Ronghao Dang, Zongtao He
276Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data0Hien Dang, Hung TranThe, Nhat Ho, Stanley J. Osher, Tan Minh Nguyen, Tho Tran Huu
277Reinforcement Learning Can Be More Efficient with Multiple Rewards0Christoph Dann, Mehryar Mohri, Yishay Mansour
278Best of Both Worlds Policy Optimization0ChenYu Wei, Christoph Dann, Julian Zimmert
279Image generation with shortest path diffusion0Alberto Bernacchia, Anil Batra, Ayan Das, DaShan Shiu, Farhang Nabiei, Fengting Liao, Sattar Vakili, Stathi Fotiadis
280Efficient List-Decodable Regression using Batches0Abhimanyu Das, Ayush Jain, Rajat Sen, Weihao Kong
281Beyond Uniform Lipschitz Condition in Differentially Private Optimization0Rudrajit Das, Satyen Kale, Sujay Sanghavi, Tong Zhang, Zheng Xu
282Understanding Self-Distillation in the Presence of Label Noise0Rudrajit Das, Sujay Sanghavi
283Interval Bound Interpolation for Few-shot Learning with Few Tasks0Anish Chakrabarty, Sankha Subhra Mullick, Shounak Datta, Swagatam Das
284Hypervolume Knowledge Gradient: A Lookahead Approach for Multi-Objective Bayesian Optimization with Partial Information0Eytan Bakshy, Maximilian Balandat, Samuel Daulton
285Fast Combinatorial Algorithms for Min Max Correlation Clustering0Benjamin Moseley, Heather Newman, Sami Davies
286Predictive Flows for Faster Ford-Fulkerson0Benjamin Moseley, Sami Davies, Sergei Vassilvitskii, Yuyan Wang
287The Persistent Laplacian for Data Science: Evaluating Higher-Order Persistent Spectral Representations of Data0Rubén J. SánchezGarcía, Thomas Davies, Zhengchao Wan
288Mitigating Propagation Failures in Physics-informed Neural Networks using Retain-Resample-Release (R3) Sampling0Anuj Karpatne, Arka Daw, Jie Bu, Paris Perdikaris, Sifan Wang
289On the Robustness of Randomized Ensembles to Adversarial Perturbations0Hassan Dbouk, Naresh R. Shanbhag
290Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute0Fei Sha, Joshua Ainslie, Michiel de Jong, Nicholas FitzGerald, Sumit Sanghai, William W. Cohen, Yury Zemlyanskiy
291Continuous Spatiotemporal Transformer0Antonio Henrique de Oliveira Fonseca, David van Dijk, Emanuele Zappala, Josue Ortega Caro
292The Value of Out-of-Distribution Data0Ashwin De Silva, Carey E. Priebe, Joshua T. Vogelstein, Pratik Chaudhari, Rahul Ramesh
293High Fidelity Image Counterfactuals with Probabilistic Causal Models0Ben Glocker, Fabio De Sousa Ribeiro, Miguel Monteiro, Nick Pawlowski, Tian Xia
294Learning Noisy OR Bayesian Networks with Max-Product Belief Propagation0Antoine Dedieu, Dileep George, Guangyao Zhou, Miguel LázaroGredilla
295Learning-Rate-Free Learning by D-Adaptation0Aaron Defazio, Konstantin Mishchenko
296Scaling Vision Transformers to 22 Billion Parameters0Alexander Kolesnikov, Alexey A. Gritsenko, Andreas Peter Steiner, Anurag Arnab, Aravindh Mahendran, Avital Oliver, Basil Mustafa, Carlos Riquelme Ruiz, Cristina Nader Vasconcelos, Daniel Keysers, Dustin Tran, Fantine Huot, Filip Pavetic, Fisher Yu, Gamaleldin Fathy Elsayed, Ibrahim Alabdulmohsin, Jasmijn Bastings, Jeremiah J. Harmsen, Joan Puigcerver, Jonathan Heek, Josip Djolonga, Justin Gilmer, Lucas Beyer, Manoj Kumar, Mario Lucic, Mark Collier, Mathilde Caron, Matthias Minderer, Michael Tschannen, Mostafa Dehghani, Neil Houlsby, Piotr Padlewski, Robert Geirhos, Rodolphe Jenatton, Sjoerd van Steenkiste, Thomas Kipf, Thomas Mensink, Utku Evci, Vighnesh Birodkar, Xiao Wang, Xiaohua Zhai, Yi Tay
297Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration0Alexandre Allauzen, Alexandre Araujo, Blaise Delattre, Quentin Barthélemy
298Blossom: an Anytime Algorithm for Computing Optimal Decision Trees0Emir Demirovic, Emmanuel Hebrard, Louis Jean
299Optimizing NOTEARS Objectives via Topological Swaps0Bryon Aragam, Chang Deng, Kevin Bello, Pradeep Kumar Ravikumar
300Uncertainty Estimation by Fisher Information-based Evidential Deep Learning0Danruo Deng, Furui Liu, Guangyong Chen, PhengAnn Heng, Yang Yu
301Multi-channel Autobidding with Budget and ROI Constraints0Jason Cheuk Nam Liang, Negin Golrezaei, Patrick Jaillet, Vahab Mirrokni, Yuan Deng
302Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks0Hao Lin, Shi Gu, Shikuang Deng, Yuhang Li
303Confidence and Dispersity Speak: Characterizing Prediction Matrix for Unsupervised Accuracy Estimation0Liang Zheng, Stephen Gould, Weijian Deng, Yumin Suh
304Great Models Think Alike: Improving Model Reliability via Inter-Model Latent Agreement0Ailin Deng, Bryan Hooi, Miao Xiong
305Hyperbolic Image-text Representations0Justin Johnson, Karan Desai, Maximilian Nickel, Shanmukha Ramakrishna Vedantam, Tanmay Rajpurohit
306Hardware-Aware Compression with Random Operation Access Specific Tile (ROAST) Hashing0Aditya Desai, Anshumali Shrivastava, Keren Zhou
307The case for 4-bit precision: k-bit Inference Scaling Laws0Luke Zettlemoyer, Tim Dettmers
308Fairness in Matching under Uncertainty0Aleksandra Korolova, David Kempe, Siddartha Devic, Vatsal Sharan
309Efficient Parametric Approximations of Neural Network Function Space Distance0Juhan Bae, Nikita Dhawan, Roger Baker Grosse, Sicong Huang
310A Large-Scale Study of Probabilistic Calibration in Neural Network Regression0Souhaib Ben Taieb, Victor Dheur
311Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path0Dongruo Zhou, Jiafan He, Qiwei Di, Quanquan Gu
312On Over-Squashing in Message Passing Neural Networks: The Impact of Width, Depth, and Topology0Federico Barbero, Francesco Di Giovanni, Giulia Luise, Lorenzo Giusti, Michael M. Bronstein, Pietro Lio
313Nearly-Linear Time and Streaming Algorithms for Outlier-Robust PCA0Ankit Pensia, Daniel Kane, Ilias Diakonikolas, Thanasis Pittas
314Near-Optimal Cryptographic Hardness of Agnostically Learning Halfspaces and ReLU Regression under Gaussian Marginals0Daniel Kane, Ilias Diakonikolas, Lisheng Ren
315Improving Graph Generation by Restricting Graph Bandwidth0Alex M. Tseng, Gabriele Scalia, Kangway V. Chuang, Nathaniel Lee Diamant, Tommaso Biancalani
316Forward-Backward Gaussian Variational Inference via JKO in the Bures-Wasserstein Space0Adil Salim, Krishna Balasubramanian, Michael Ziyang Diao, Sinho Chewi
317Subset-Based Instance Optimality in Private Estimation0Alex Kulesza, Ananda Theertha Suresh, Travis Dick, Ziteng Sun
318Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models0François Fleuret, Nikolaos Dimitriadis, Pascal Frossard
319Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models0Ding Zhao, Marco Pavone, Tong Che, Wenhao Ding
320DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm0Bicheng Ying, Kexin Jin, Kun Yuan, Lisang Ding, Wotao Yin
321Open-Vocabulary Universal Image Segmentation with MaskCLIP0Jieke Wang, Zheng Ding, Zhuowen Tu
322Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning0Junpeng Yue, Tiejun Huang, Wanpeng Zhang, Xiangjun Wang, Ziluo Ding, Zongqing Lu
323PixelAsParam: A Gradient View on Diffusion Sampling with Guidance0AnhDung Dinh, Chang Xu, Daochang Liu
324Second-Order Optimization with Lazy Hessians0El Mahdi Chayti, Martin Jaggi, Nikita Doikov
325Polynomial Preconditioning for Gradient Methods0Anton Rodomanov, Nikita Doikov
326On Data Manifolds Entailed by Structural Causal Models0AmirHossein Karimi, Bernhard Schölkopf, Georgios Arvanitidis, Ricardo DominguezOlmedo
327Towards Understanding and Reducing Graph Structural Noise for GNNs0Mingze Dong, Yuval Kluger
328SpeedDETR: Speed-aware Transformers for End-to-end Object Detection0ChihHsien Chou, Hao Tang, Peiyan Dong, Peng Zhang, Xin Meng, Yanzhi Wang, Zhenglun Kong
329Understand and Modularize Generator Optimization in ELECTRA-style Pretraining0Chengyu Dong, Hao Cheng, Jianfeng Gao, Jingbo Shang, Liyuan Liu, Xiaodong Liu
330Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation0Bo Han, Feng Liu, Gang Niu, Haoang Chi, Masashi Sugiyama, Mingming Gong, Ruijiang Dong, Tongliang Liu
331PASTA: Pessimistic Assortment Optimization0Cong Shi, Ethan X. Fang, Juncheng Dong, Vahid Tarokh, Weibin Mo, Zhengling Qi
332Adaptively Weighted Data Augmentation Consistency Regularization for Robust Optimization under Concept Shift0Rachel A. Ward, Yijun Dong, Yuege Xie
333Does Sparsity Help in Learning Misspecified Linear Bandits?0Jialin Dong, Lin Yang
334Symmetry-Aware Robot Design with Structured Subgroups0Chongjie Zhang, Heng Dong, Junyu Zhang, Tonghan Wang
335DoCoFL: Downlink Compression for Cross-Device Federated Learning0Kfir Yehuda Levy, Ron Dorfman, Shay Vargaftik, Yaniv BenItzhak
336Meta-Learning the Inductive Bias of Simple Neural Circuits0Maria Yuffa, Peter E. Latham, Will Dorrell
337Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains0Do Young Eun, Jie Hu, Vishwaraj Doshi
338Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains0Il Memming Park, Matthew Dowling, Yuan Zhao
339On the Convergence Rate of Gaussianization with Random Rotations0Armand Rousselot, Christoph Schnörr, Felix Draxler, Jens Müller, Lars Kühmichel, Ullrich Köthe
340PaLM-E: An Embodied Multimodal Language Model0Aakanksha Chowdhery, Andy Zeng, Ayzaan Wahid, Brian Ichter, Corey Lynch, Daniel Duckworth, Danny Driess, Fei Xia, Igor Mordatch, Jonathan Tompson, Karol Hausman, Klaus Greff, Marc Toussaint, Mehdi S. M. Sajjadi, Pete Florence, Pierre Sermanet, Quan Vuong, Sergey Levine, Tianhe Yu, Vincent Vanhoucke, Wenlong Huang, Yevgen Chebotar
341Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC0Arnaud Doucet, Conor Durkan, Jascha SohlDickstein, Joshua B. Tenenbaum, Rob Fergus, Robin Strudel, Sander Dieleman, Will Sussman Grathwohl, Yilun Du
342Multi-task Representation Learning for Pure Exploration in Linear Bandits0Longbo Huang, Wen Sun, Yihan Du
343Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows0Chao Du, Min Lin, Shuicheng Yan, Tianbo Li, Tianyu Pang
344Subsample Ridge Ensembles: Equivalences and Generalized Cross-Validation0Arun K. Kuchibhotla, JinHong Du, Pratik Patil
345On Uni-Modal Feature Learning in Supervised Multi-Modal Learning0Chenzhuang Du, Hang Zhao, Jiaye Teng, Tianyuan Yuan, Tingle Li, Yang Yuan, Yichen Liu, Yue Wang
346Guiding Pretraining in Reinforcement Learning with Large Language Models0Abhishek Gupta, Cédric Colas, Jacob Andreas, Olivia Watkins, Pieter Abbeel, Trevor Darrell, Yuqing Du, Zihan Wang
347A Flexible Diffusion Model0He Zhang, Tao Yang, Weitao Du, Yuanqi Du
348Fast Excess Risk Rates via Offset Rademacher Complexity0Chenguang Duan, Jerry Zhijian Yang, Lican Kang, Xiliang Lu, Yuling Jiao
349Are Diffusion Models Vulnerable to Membership Inference Attacks?0Fei Kong, Jinhao Duan, Kaidi Xu, Shiqi Wang, Xiaoshuang Shi
350Bayesian Progressive Deep Topic Model with Knowledge Informed Textual Data Coarsening Process0Bo Chen, Mingyuan Zhou, Xinyang Liu, Yishi Xu, Yudi Su, Zhibin Duan
351Are Equivariant Equilibrium Approximators Beneficial?0Xiaotie Deng, Yunxuan Ma, Zhijian Duan
352Evaluating Self-Supervised Learning via Risk Decomposition0Percy Liang, Tatsunori Hashimoto, Yann Dubois
353Fully Dynamic Submodular Maximization over Matroids0Ashkan NorouziFard, Federico Fusco, Morteza Zadimoghaddam, Paul Duetting, Silvio Lattanzi
354Optimal No-Regret Learning for One-Sided Lipschitz Functions0Guru Guruganesh, Jon Schneider, Joshua Ruizhi Wang, Paul Duetting
355Integrating Prior Knowledge in Contrastive Learning with Kernel0Benoit Dufumier, Carlo Alberto Barbano, Edouard Duchesnay, Pietro Gori, Robin Louiset
356Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows0Di Luo, Marin Soljacic, Owen M. Dugan, Peter Y. Lu, Rumen Dangovski
357Adaptive Whitening in Neural Populations with Gain-modulating Interneurons0David J. Heeger, David Lipshutz, Dmitri B. Chklovskii, Eero P. Simoncelli, Lyndon R. Duong
358Generalization Bounds using Data-Dependent Fractal Dimensions0Benjamin Dupuis, George Deligiannidis, Umut Simsekli
359Multi-Objective Population Based Training0Alexander Chebykin, Arkadiy Dushatskiy, Peter A. N. Bosman, Tanja Alderliesten
360Neural Diffusion Processes0Alan Saul, Fergus Simpson, Vincent Dutordoir, Zoubin Ghahramani
361FAENet: Frame Averaging Equivariant GNN for Materials Modeling0Alex HernándezGarcía, Alexandre Duval, David Rolnick, Fragkiskos D. Malliaros, Santiago Miret, Victor Schmidt, Yoshua Bengio
362Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces0Javier E. Santos, Nicholas Lubbers, Yen Ting Lin, Zachary R. Fox
363The Computational Complexity of Concise Hypersphere Classification0Eduard Eiben, Iyad A. Kanj, Robert Ganian, Sebastian Ordyniak, Stefan Szeider
364E(n) Equivariant Message Passing Simplicial Networks0Erik J. Bekkers, Floor Eijkelboom, Rob Hesselink
365Performative Recommendation: Diversifying Content via Strategic Incentives0Itay Eilat, Nir Rosenfeld
366Hyperparameters in Reinforcement Learning and How To Tune Them0Marius Lindauer, Roberta Raileanu, Theresa Eimer
367Fairness in Streaming Submodular Maximization over a Matroid Constraint0Ashkan NorouziFard, Federico Fusco, Jakab Tardos, Jakub Tarnawski, Marwa El Halabi
368Difference of submodular minimization via DC programming0George Orfanides, Marwa El Halabi, Tim Hoheisel
369Graph Positional Encoding via Random Feature Propagation0Beatrice Bevilacqua, Eran Treister, Fabrizio Frasca, Gal Chechik, Haggai Maron, Moshe Eliasof
370Improving Graph Neural Networks with Learnable Propagation Operators0Eran Treister, Lars Ruthotto, Moshe Eliasof
371Phase Transitions in the Detection of Correlated Databases0Dor Elimelech, Wasim Huleihel
372A new near-linear time algorithm for k-nearest neighbor search using a compressed cover tree0Vitaliy Kurlin, Yury Elkin
373Motion Question Answering via Modular Motion Programs0Jiajun Wu, Jiaman Li, Joy Hsu, Mark Endo
374Learning Perturbations to Explain Time Series Predictions0Joseph Enguehard
375Regret Minimization and Convergence to Equilibria in General-sum Markov Games0Liad Erez, Tal Lancewicki, Tomer Koren, Uri Sherman, Yishay Mansour
376Delayed Bandits: When Do Intermediate Observations Help?0Dirk van der Hoeven, Emmanuel Esposito, Hao Qiu, Nicolò CesaBianchi, Saeed Masoudian, Yevgeny Seldin
377Scaling Spherical CNNs0Ameesh Makadia, Carlos Esteves, JeanJacques E. Slotine
378Stochastic Gradient Descent under Markovian Sampling Schemes0Mathieu Even
379Continual Learning in Linear Classification on Separable Data0Badea Marjieh, Daniel Soudry, Edward Moroshko, Gon Buzaglo, Itay Evron, Maroun Khriesh, Nathan Srebro
380A Connection between One-Step RL and Critic Regularization in Reinforcement Learning0Benjamin Eysenbach, Matthieu Geist, Ruslan Salakhutdinov, Sergey Levine
381Neural Status Registers0Lukas Faber, Roger Wattenhofer
382Learning Rate Schedules in the Presence of Distribution Shift0Adel Javanmard, Matthew Fahrbach, Pratik Worah, Vahab Mirrokni
383Predicting Rare Events by Shrinking Towards Proportional Odds0Gregory Faletto, Jacob Bien
384Free-Form Variational Inference for Gaussian Process State-Space Models0Edwin V. Bonilla, Scott A. Sisson, Terence J. O'Kane, Xuhui Fan
385Optimizing DDPM Sampling with Shortcut Fine-Tuning0Kangwook Lee, Ying Fan
386LSDS++ : Dual Sampling for Accelerated k-means++0Chenglin Fan, Ping Li, Xiaoyun Li
387Smart Initial Basis Selection for Linear Programs0Abdullah Ali Sivas, Oleksandr Yakovenko, Owen Ren, Xinglu Wang, Yong Zhang, Zhenan Fan, Zirui Zhou
388General Covariance Data Augmentation for Neural PDE Solvers0Alexander Rudikov, Ivan V. Oseledets, Tianchi Yu, Vladimir Fanaskov
389The Fast Johnson-Lindenstrauss Transform Is Even Faster0Kasper Green Larsen, Mikael Møller Høgsgaard, Ora Nova Fandina
390Regression with Label Permutation in Generalized Linear Model0Guanhua Fang, Ping Li
391Robust Collaborative Learning with Linear Gradient Overhead0John Stephan, LêNguyên Hoang, Nirupam Gupta, Rachid Guerraoui, Rafael Pinot, Sadegh Farhadkhani
392Neural FIM for learning Fisher information metrics from point cloud data0Alexander Tong, Guillaume Huguet, Guy Wolf, Ian Adelstein, Maximilian Nickel, Oluwadamilola Fasina, Smita Krishnaswamy, Yanlei Zhang
393Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies0Anas Barakat, Anastasia Kireeva, Ilyas Fatkhullin, Niao He
394Parallel Neurosymbolic Integration with Concordia0Efthymia Tsamoura, Jonathan Feldstein, Modestas Jurcius
395Why Target Networks Stabilise Temporal Difference Methods0Matthew J. A. Smith, Mattie Fellows, Shimon Whiteson
396Weighted Sampling without Replacement for Deep Top-k Classification0Bart Selman, Carla P. Gomes, Dieqiao Feng, Yuanqi Du
397Improved Online Learning Algorithms for CTR Prediction in Ad Auctions0Christopher Liaw, Zhe Feng, Zixin Zhou
398Fractional Denoising for 3D Molecular Pre-training0Shikun Feng, WeiYing Ma, Yanyan Lan, Yuyan Ni, ZhiMing Ma
399Improved Algorithms for White-Box Adversarial Streams0David P. Woodruff, Ying Feng
400Non-stationary Reinforcement Learning under General Function Approximation0Jing Yang, Ming Yin, Ruiquan Huang, Songtao Feng, Yingbin Liang, YuXiang Wang
401Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption0Aladin Virmaux, Malik Tiomoko, Vasilii Feofanov
402SurCo: Learning Linear SURrogates for COmbinatorial Nonlinear Optimization Problems0Aaron M. Ferber, Benoit Steiner, Bistra Dilkina, Daochen Zha, Martin Schubert, Taoan Huang, Yuandong Tian
403Scaling Laws for Multilingual Neural Machine Translation0Behrooz Ghorbani, Markus Freitag, Orhan Firat, Patrick Fernandes, Xavier Garcia
404Constant Matters: Fine-grained Error Bound on Differentially Private Continual Observation0Hendrik Fichtenberger, Jalaj Upadhyay, Monika Henzinger
405Adapting to game trees in zero-sum imperfect information games0Côme Fiegel, Michal Valko, Pierre Ménard, Rémi Munos, Tadashi Kozuno, Vianney Perchet
406User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems0Andrew Gordon Wilson, Anudhyan Boral, Fei Sha, Leonardo ZepedaNúñez, Marc Anton Finzi
407ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical Imaging0Alessandro Fontanella, Amos J. Storkey, Antreas Antoniou, Emanuele Trucco, Grant Mair, Joanna M. Wardlaw, Wenwen Li
408Explainable Data-Driven Optimization: From Context to Decision and Back Again0Alexandre Forel, Axel Parmentier, Thibaut Vidal
409Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games0Dylan J. Foster, Noah Golowich, Sham M. Kakade
410Disentangled Generative Models for Robust Prediction of System Dynamics0Anil Anthony Bharath, Chris D. Cantwell, Mario Lino Valencia, Shunlong Hu, Stathi Fotiadis, Stef Garasto
411Can Forward Gradient Match Backpropagation?0Edouard Oyallon, Eugene Belilovsky, Louis Fournier, Michael Eickenberg, Stéphane Rivaud
412Last Switch Dependent Bandits with Monotone Payoff Functions0Assaf Zeevi, Ayoub Foussoul, Orestis Papadigenopoulos, Vineet Goyal
413A Theoretical Analysis of the Learning Dynamics under Class Imbalance0Aurélien Lucchi, Emanuele Francazi, Marco BaityJesi
414SparseGPT: Massive Language Models Can be Accurately Pruned in One-Shot0Dan Alistarh, Elias Frantar
415Learning Temporally AbstractWorld Models without Online Experimentation0Benjamin Freed, Guillaume Adrien Sartoretti, Howie Choset, Jeff Schneider, Siddarth Venkatraman
416A Coupled Flow Approach to Imitation Learning0Elad Sarafian, Gideon Joseph Freund, Sarit Kraus
417Simple Hardware-Efficient Long Convolutions for Sequence Modeling0Armin W. Thomas, Atri Rudra, Christopher Ré, Daniel Y. Fu, Elliot L. Epstein, Eric Nguyen, Michael Zhang, Tri Dao
418MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Poses0Ishan Misra, Xiaolong Wang, Yang Fu
419Go Beyond Imagination: Maximizing Episodic Reachability with World Models0Honglak Lee, Run Peng, Yao Fu
420Specializing Smaller Language Models towards Multi-Step Reasoning0Ashish Sabharwal, Hao Peng, Litu Ou, Tushar Khot, Yao Fu
421Accelerated Stochastic Optimization Methods under Quasar-convexity0Ashia Camage Wilson, Dongchu Xu, Qiang Fu
422Meta-learning Parameterized Skills0George Konidaris, Haotian Fu, Michael Littman, Saket Tiwari, Shangqun Yu
423NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations0Shang Wu, Shunyao Zhang, Souvik Kundu, Ye Yuan, Yingyan Celine Lin, Yonggan Fu
424Hierarchies of Reward Machines0Alessandra Russo, Anders Jonsson, Daniel FurelosBlanco, Krysia Broda, Mark Law
425Why Random Pruning Is All We Need to Start Sparse0Advait Harshal Gadhikar, Rebekka Burkholz, Sohom Mukherjee
426Cell-Free Latent Go-Explore0Emmanuel Dellandréa, Quentin Gallouédec
427Graph Reinforcement Learning for Network Control via Bi-Level Optimization0Daniele Gammelli, Filipe Rodrigues, Francisco C. Pereira, James Harrison, Kaidi Yang, Marco Pavone
428Why Is Public Pretraining Necessary for Private Model Training?0Abhradeep Guha Thakurta, Arun Ganesh, Lun Wang, Mahdi Haghifam, Milad Nasr, Om Thakkar, Sewoong Oh, Thomas Steinke
429Do Perceptually Aligned Gradients Imply Robustness?0Bahjat Kawar, Michael Elad, Roy Ganz
430Solving Linear Programs with Fast Online Learning Algorithms0Chunlin Sun, Dongdong Ge, Wenzhi Gao, Yinyu Ye
431Gradient Descent Finds the Global Optima of Two-Layer Physics-Informed Neural Networks0Michael Ng, Yihang Gao, Yiqi Gu
432Generalizing Neural Wave Functions0Nicholas Gao, Stephan Günnemann
433On the Impact of Algorithmic Recourse on Social Segregation0Himabindu Lakkaraju, Ruijiang Gao
434DDGR: Continual Learning with Deep Diffusion-based Generative Replay0Rui Gao, Weiwei Liu
435PAL: Program-aided Language Models0Aman Madaan, Graham Neubig, Jamie Callan, Luyu Gao, Pengfei Liu, Shuyan Zhou, Uri Alon, Yiming Yang
436Out-of-Domain Robustness via Targeted Augmentations0Irena Gao, Pang Wei Koh, Percy Liang, Shiori Sagawa, Tatsunori Hashimoto
437Scaling Laws for Reward Model Overoptimization0Jacob Hilton, John Schulman, Leo Gao
438The Unreasonable Effectiveness of Few-shot Learning for Machine Translation0Colin Cherry, George F. Foster, Maxim Krikun, Melvin Johnson, Orhan Firat, Xavier Garcia, Yamini Bansal
439RLSbench: Domain Adaptation Under Relaxed Label Shift0Alex Smola, James Sharpnack, Nick Erickson, Saurabh Garg, Sivaraman Balakrishnan, Zachary Chase Lipton
440RankMe: Assessing the Downstream Performance of Pretrained Self-Supervised Representations by Their Rank0Laurent Najman, Quentin Garrido, Randall Balestriero, Yann LeCun
441Self-supervised learning of Split Invariant Equivariant representations0Laurent Najman, Quentin Garrido, Yann LeCun
442Federated Heavy Hitter Recovery under Linear Sketching0Adrià Gascón, Ananda Theertha Suresh, Peter Kairouz, Ziteng Sun
443On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization0Mridul Agarwal, Mudit Gaur, Vaneet Aggarwal
444A Reinforcement Learning Framework for Dynamic Mediation Analysis0Chengchun Shi, Jitao Wang, Lin Ge, Rui Song, Zhenke Wu
445Compositional Score Modeling for Simulation-Based Inference0Andriy Mnih, George Papamakarios, Tomas Geffner
446Cramming: Training a Language Model on a single GPU in one day0Jonas Geiping, Tom Goldstein
447Transformers Meet Directed Graphs0Ali Taylan Cemgil, Cosmin Paduraru, Daniel J. Mankowitz, Simon Geisler, Stephan Günnemann, Yujia Li
448Memory-Based Meta-Learning on Non-Stationary Distributions0Anian Ruoss, Elliot Catt, Grégoire Delétang, Joel Veness, Jordi GrauMoya, Laurent Orseau, Li Kevin Wenliang, Marcus Hutter, Tim Genewein, Vincent Dutordoir
449Towards Reliable Neural Specifications0Arie Gurfinkel, Chuqin Geng, Nham Le, Xiaojie Xu, Xujie Si, Zhaoyue Wang
450Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning0David C. Parkes, Matthias Gerstgrasser
451Approximately Optimal Core Shapes for Tensor Decompositions0Gang Fu, Matthew Fahrbach, Mehrdad Ghadiri, Vahab Mirrokni
452GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks0Jingfeng Zhang, Masashi Sugiyama, Maxime Cordy, Mike Papadakis, Salah Ghamizi, Yves Le Traon
453On User-Level Private Convex Optimization0Badih Ghazi, Chiyuan Zhang, Pasin Manurangsi, Pritish Kamath, Raghu Meka, Ravi Kumar
454Contextual Reliability: When Different Features Matter in Different Contexts0Aditi Raghunathan, Amrith Setlur, Anca D. Dragan, Daniel S. Brown, Gaurav Rohit Ghosal
455Reinforcement Learning from Passive Data via Latent Intentions0Chethan Anand Bhateja, Dibya Ghosh, Sergey Levine
456Harmonic Neural Networks0Antonio Andrea Gentile, Atiyo Ghosh, Chul Lee, Dongho Kim, Hyukgeun Cha, JeongIl Kye, Mario Dagrada, SeongHyok Sean Kim, Vincent Emanuel Elfving, Yunjun Choi
457Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat0Forough Arabshahi, Kayhan Batmanghelich, Ke Yu, Shantanu Ghosh
458Looped Transformers as Programmable Computers0Angeliki Giannou, Dimitris Papailiopoulos, Jason D. Lee, Jyyong Sohn, Kangwook Lee, Shashank Rajput
459Generalized Disparate Impact for Configurable Fairness Solutions in ML0Eleonora Misino, Luca Giuliani, Michele Lombardi
460Multicalibration as Boosting for Regression0Aaron Roth, Declan Harrison, Ira GlobusHarris, Jessica Sorrell, Michael Kearns
461Adversarial robustness of amortized Bayesian inference0Jakob H. Macke, Manuel Glöckler, Michael Deistler
462Efficient RL via Disentangled Environment and Agent Representations0Deepak Pathak, Kevin Gmelin, Russell Mendonca, Shikhar Bahl
463Aligning Language Models with Preferences through f-divergence Minimization0Dongyoung Go, Germán Kruszewski, Jos Rozen, Marc Dymetman, Nahyeon Ryu, Tomasz Korbak
464Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues0Clément Calauzènes, Ekhine Irurozki, Morgane Goibert, Stéphan Clémençon
465Learning Distributions over Quantum Measurement Outcomes0Scott Aaronson, Weiyuan Gong
466Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity0Adrien B. Taylor, Eduard Gorbunov, Gauthier Gidel, Samuel Horváth
467Adaptive Annealed Importance Sampling with Constant Rate Progress0Fernando PérezCruz, Shirin Goshtasbpour, Victor Cohen
468Formalizing Preferences Over Runtime Distributions0Devon R. Graham, Kevin LeytonBrown, Tim Roughgarden
469Topological Point Cloud Clustering0Michael T. Schaub, Vincent Peter Grande
470On Sampling with Approximate Transport Maps0Alain Oliviero Durmus, Eric Moulines, Louis Grenioux, Marylou Gabrié
471Hidden Symmetries of ReLU Networks0David Rolnick, J. Elisenda Grigsby, Kathryn Lindsey
472EF21-P and Friends: Improved Theoretical Communication Complexity for Distributed Optimization with Bidirectional Compression0Alexander Tyurin, Kaja Gruntkowska, Peter Richtárik
473NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion0Alex Trevithick, Christian Theobalt, Jiatao Gu, Joshua M. Susskind, KaiEn Lin, Lingjie Liu, Ravi Ramamoorthi
474DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design0Jian Peng, Jianzhu Ma, Jiaqi Guan, Liang Wang, Qiang Liu, Quanquan Gu, Xiangxin Zhou, Yu Bao, Yuwei Yang
475On Excess Mass Behavior in Gaussian Mixture Models with Orlicz-Wasserstein Distances0Aritra Guha, Nhat Ho, XuanLong Nguyen
476Conformalization of Sparse Generalized Linear Models0Etash Kumar Guha, Eugène Ndiaye, Xiaoming Huo
477Privacy-Aware Compression for Federated Learning Through Numerical Mechanism Design0Chuan Guo, Kamalika Chaudhuri, Michael G. Rabbat, Pierre Stock
478Out-of-Distribution Generalization of Federated Learning via Implicit Invariant Relationships0Kai Guo, Tieru Wu, Xiaofeng Cao, Yaming Guo, Yi Chang
479FeDXL: Provable Federated Learning for Deep X-Risk Optimization0Jiebo Luo, Rong Jin, Tianbao Yang, Zhishuai Guo
480Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP0Huazheng Wang, Jiacheng Guo, Mengdi Wang, Xuezhou Zhang, Zhuoran Yang, Zihao Li
481Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano0Alexandre Sablayrolles, Chuan Guo, Maziar Sanjabi
482Linkless Link Prediction via Relational Distillation0Neil Shah, Nitesh V. Chawla, Shichang Zhang, Tong Zhao, William Shiao, Yozen Liu, Zhichun Guo
483FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction0Tao Lin, Xiaoying Tang, Yongxin Guo
484Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction0Adithya Balachandran, Jie Chen, Minghao Guo, Payel Das, Samuel W. Song, Veronika Thost, Wojciech Matusik
485Graph Neural Networks with Learnable and Optimal Polynomial Bases0Yuhe Guo, Zhewei Wei
486LongCoder: A Long-Range Pre-trained Language Model for Code Completion0Canwen Xu, Daya Guo, Jian Yin, Julian J. McAuley, Nan Duan
487Estimating Heterogeneous Treatment Effects: Mutual Information Bounds and Learning Algorithms0Jianmin Wang, Mingsheng Long, Xingzhuo Guo, Yuchen Zhang
488Identifying Useful Learnwares for Heterogeneous Label Spaces0LanZhe Guo, Yufeng Li, Zhi Zhou, ZhiHua Zhou
489High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors0Eric Price, Jasper C. H. Lee, Shivam Gupta
490GRAFENNE: Learning on Graphs with Heterogeneous and Dynamic Feature Sets0Sahil Manchanda, Sayan Ranu, Shubham Gupta, Srikanta J. Bedathur
491Online Platt Scaling with Calibeating0Aaditya Ramdas, Chirag Gupta
492Multi-Task Structural Learning using Local Task Similarity induced Neuron Creation and Removal0Bahram Zonooz, Elahe Arani, NareshKumar Gurulingan
493Conditionally Strongly Log-Concave Generative Models0Etienne Lempereur, Florentin Guth, Joan Bruna, Stéphane Mallat
494DRew: Dynamically Rewired Message Passing with Delay0Benjamin Gutteridge, Francesco Di Giovanni, Michael M. Bronstein, Xiaowen Dong
495Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network0Lionel Fillatre, Marie Guyomard, Susana Barbosa
496Conformal Prediction Sets for Graph Neural Networks0Aleksandar Bojchevski, Simone Antonelli, Soroush H. Zargarbashi
497Social learning spontaneously emerges by searching optimal heuristics with deep reinforcement learning0Hawoong Jeong, Seungwoong Ha
498Convex Geometry of ReLU-layers, Injectivity on the Ball and Local Reconstruction0Daniel Haider, Martin Ehler, Péter Balázs
499Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees0Daniele Magazzeni, Erfaun Noorani, Faisal Hamman, Sanghamitra Dutta, Saumitra Mishra
500Wrapped Cauchy Distributed Angular Softmax for Long-Tailed Visual Recognition0Boran Han
501On the Impact of Knowledge Distillation for Model Interpretability0Hyeongrok Han, HyunSoo Choi, Siwon Kim, Sungroh Yoon
502Alternately Optimized Graph Neural Networks0Feng Shi, Haitao Mao, Haoyu Han, Jiliang Tang, MohamadAli Torkamani, Victor Lee, Xiaorui Liu
503System Identification of Neural Systems: If We Got It Right, Would We Know?0Brian Cheung, Tomaso A. Poggio, Yena Han
504Total Variation Graph Neural Networks0Filippo Maria Bianchi, Jonas Berg Hansen
505Learning Physical Models that Can Respect Conservation Laws0Danielle C. Maddix, Derek Hansen, Gaurav Gupta, Michael W. Mahoney, Shima Alizadeh
506On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline0Aravind Rajeswaran, Hao Su, Huazhe Xu, Nicklas Hansen, Tongzhou Mu, Xiaolong Wang, Yanjie Ze, Zhecheng Yuan
507Leveraging Demonstrations to Improve Online Learning: Quality Matters0Benjamin Van Roy, Botao Hao, Rahul Jain, Tor Lattimore, Zheng Wen
508Coupled Variational Autoencoder0Patrick Shafto, Xiaoran Hao
509GNOT: A General Neural Operator Transformer for Operator Learning0Chengyang Ying, Hang Su, Jian Song, Jun Zhu, Songming Liu, Yinpeng Dong, Ze Cheng, Zhengyi Wang, Zhongkai Hao
510Algorithmic Collective Action in Machine Learning0Celestine MendlerDünner, Eric Mazumdar, Moritz Hardt, Tijana Zrnic
511Gaussian Process Priors for Systems of Linear Partial Differential Equations with Constant Coefficients0Bogdan Raita, Marc Härkönen, Markus LangeHegermann
512Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting0Bernie Wang, Danielle C. Maddix, Gaurav Gupta, Hilaf Hasson, Youngsuk Park
513Global Context Vision Transformers0Ali Hatamizadeh, Greg Heinrich, Hongxu Yin, Jan Kautz, Pavlo Molchanov
514Counterfactual Analysis in Dynamic Latent State Models0Martin B. Haugh, Raghav Singal
515Sampling-based Nyström Approximation and Kernel Quadrature0Harald Oberhauser, Satoshi Hayakawa, Terry J. Lyons
516Width and Depth Limits Commute in Residual Networks0Greg Yang, Soufiane Hayou
517A Generalization of ViT/MLP-Mixer to Graphs0Adam Perold, Bryan Hooi, Thomas Laurent, Xavier Bresson, Xiaoxin He, Yann LeCun
518Domain Adaptation for Time Series Under Feature and Label Shifts0Consuelo Cuevas, Huan He, Marinka Zitnik, Owen Queen, Teddy Koker, Theodoros Tsiligkaridis
519Contrastive Learning Meets Homophily: Two Birds with One Stone0Di Jin, Dongxiao He, Jitao Zhao, Rui Guo, Weixiong Zhang, Yuxiao Huang, Zhen Wang, Zhiyong Feng
520Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes0Dongruo Zhou, Heyang Zhao, Jiafan He, Quanquan Gu
521CRISP: Curriculum based Sequential neural decoders for Polar code family0Ashok Vardhan Makkuva, Pramod Viswanath, S. Ashwin Hebbar, Sewoong Oh, Suma Bhat, Viraj Vivek Nadkarni
522Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting0Daniel Ting, Graham Cormode, Jonathan Hehir
523Functional Neural Networks: Shift invariant models for functional data with applications to EEG classification0Corinna Weber, Florian Heinrichs, Mavin Heim
524Distance Weighted Supervised Learning for Offline Interaction Data0Dorsa Sadigh, Jensen Gao, Joey Hejna
525Group Equivariant Fourier Neural Operators for Partial Differential Equations0Cong Fu, Jacob Helwig, Jerry Kurtin, Shuiwang Ji, Stephan Wojtowytsch, Xuan Zhang
526Training-Free Neural Active Learning with Initialization-Robustness Guarantees0Apivich Hemachandra, Bryan Kian Hsiang Low, Jasraj Singh, SeeKiong Ng, Zhongxiang Dai
527A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs0Mikael Henaff, Minqi Jiang, Roberta Raileanu
528Robust Camera Pose Refinement for Multi-Resolution Hash Encoding0Hwan Heo, Hyunwoo J. Kim, Jaewon Lee, JinHwa Kim, Jiyoung Lee, Soohyun Kim, Taekyung Kim
529Generalized Teacher Forcing for Learning Chaotic Dynamics0Daniel Durstewitz, Florian Hess, Manuel Brenner, Zahra Monfared
530Causal Modeling of Policy Interventions From Treatment-Outcome Sequences0Anne Tuulikki Juuti, Caglar Hizli, Kirsi Hannele Pietiläinen, Pekka Marttinen, S. T. John, Tuure Tapani Saarinen
531Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes0Christopher van der Heide, Fred Roosta, Liam Hodgkinson, Michael W. Mahoney
532AdaBoost is not an Optimal Weak to Strong Learner0Kasper Green Larsen, Martin Ritzert, Mikael Møller Høgsgaard
533Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons0Christopher Zach, D. Staudt, Rasmus Kjær Høier
534Multi-Task Off-Policy Learning from Bandit Feedback0Branislav Kveton, Joey Hong, Manzil Zaheer, Mohammad Ghavamzadeh, Sumeet Katariya
535Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching0Ilgee Hong, Michael W. Mahoney, Mladen Kolar, Sen Na
536Revisiting Data-Free Knowledge Distillation with Poisoned Teachers0Jiayu Zhou, Junyuan Hong, Lingjuan Lyu, Ruoxi Jia, Shuyang Yu, Yi Zeng
537simple diffusion: End-to-end diffusion for high resolution images0Emiel Hoogeboom, Jonathan Heek, Tim Salimans
538Causal Strategic Classification: A Tale of Two Shifts0Guy Horowitz, Nir Rosenfeld
539Fair and Accurate Decision Making through Group-Aware Learning0Bhanu Garg, Li Zhang, Pengtao Xie, Ramtin Hosseini
540Approximation Algorithms for Fair Range Clustering0Ali Vakilian, Sepideh Mahabadi, Sèdjro Salomon Hotegni
541Decoding Layer Saliency in Language Transformers0Elizabeth Mary Hou, Gregory David Castañón
542PromptBoosting: Black-Box Text Classification with Ten Forward Passes0Bairu Hou, Jacob Andreas, Joe O'Connor, Shiyu Chang, Yang Zhang
543Sparse Learning of Dynamical Systems in RKHS: An Operator-Theoretic Approach0Boya Hou, Nathan Dahlin, Sina Sanjari, Subhonmesh Bose, Umesh Vaidya
544Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits0Vincent Y. F. Tan, Yunlong Hou, Zixin Zhong
545Automatic Data Augmentation via Invariance-Constrained Learning0Alejandro Ribeiro, Ignacio Hounie, Luiz F. O. Chamon
546Thompson Sampling with Diffusion Generative Prior0Branislav Kveton, Patrick Blöbaum, Shiva Prasad Kasiviswanathan, YuGuan Hsieh
547Tighter Analysis for ProxSkip0Heng Huang, Zhengmian Hu
548Omnipredictors for Constrained Optimization0Chutong Yang, Inbal Rachel Livni Navon, Lunjia Hu, Omer Reingold
549GFlowNet-EM for Learning Compositional Latent Variable Models0Alexandros Graikos, Edward J. Hu, Katie E. Everett, Moksh Jain, Nikolay Malkin, Yoshua Bengio
550Blockwise Stochastic Variance-Reduced Methods with Parallel Speedup for Multi-Block Bilevel Optimization0Lijun Zhang, Quanqi Hu, Tianbao Yang, Zhishuai Guo, ZiHao Qiu
551Language Instructed Reinforcement Learning for Human-AI Coordination0Dorsa Sadigh, Hengyuan Hu
552Surface Snapping Optimization Layer for Single Image Object Shape Reconstruction0Alexander G. Schwing, Raymond A. Yeh, YuanTing Hu
553Learning to Learn from APIs: Black-Box Data-Free Meta-Learning0Baoyuan Wu, Chun Yuan, Dacheng Tao, Li Shen, Zhenyi Wang, Zixuan Hu
554For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal0Li Erran Li, Renhao Wang, Yang Gao, Yingdong Hu
555Beyond Lipschitz Smoothness: A Tighter Analysis for Nonconvex Optimization0Heng Huang, Xidong Wu, Zhengmian Hu
556Understanding the Impact of Adversarial Robustness on Accuracy Disparity0Fan Wu, Han Zhao, Hongyang Zhang, Yuzheng Hu
557Reinforcement Learning in Low-rank MDPs with Density Features0Audrey Huang, Jinglin Chen, Nan Jiang
558Composer: Creative and Controllable Image Synthesis with Composable Conditions0Deli Zhao, Di Chen, Jingren Zhou, Lianghua Huang, Yu Liu, Yujun Shen
559Model-Aware Contrastive Learning: Towards Escaping the Dilemmas0Bo Wang, Chao Zhang, Chunlin Chen, Haoxing Chen, Huaxiong Li, Ziqi Wen, Zizheng Huang
560High-dimensional Clustering onto Hamiltonian Cycle0Shenghui Cheng, Stan Z. Li, Tianyi Huang, Zhengjun Zhang
561Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning0Jiatai Huang, Longbo Huang, Yan Dai
562Fast Algorithms for Distributed k-Clustering with Outliers0Jianxin Wang, Jinhui Xu, Junyu Huang, Qilong Feng, Ziyun Huang
563Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning0Aaron M. Ferber, Benoit Steiner, Bistra Dilkina, Taoan Huang, Yuandong Tian
564On Coresets for Clustering in Small Dimensional Euclidean spaces0Lingxiao Huang, Ruiyuan Huang, Xuan Wu, Zengfeng Huang
565Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models0Dongchao Yang, Jiawei Huang, Jinglin Liu, Luping Liu, Mingze Li, Rongjie Huang, Xiang Yin, Yi Ren, Zhenhui Ye, Zhou Zhao
566The Power of Uniform Sampling for k-Median0Jianing Lou, Lingxiao Huang, Shaofeng H.C. Jiang
567Reparameterized Policy Learning for Multimodal Trajectory Optimization0Chuang Gan, Hao Su, Litian Liang, Xuanlin Li, Zhan Ling, Zhiao Huang
568Theoretical Bounds on the Network Community Profile from Low-rank Semi-definite Programming0C. Seshadhri, David F. Gleich, Yufan Huang
569NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition0Jia Zhang, Qi Meng, TieYan Liu, Wenlei Shi, Xiaotian Gao, Xinquan Huang, Yue Wang
570Policy Contrastive Imitation Learning0Jialei Huang, Yang Gao, Yingdong Hu, ZhaoHeng Yin
571Are Large Kernels Better Teachers than Transformers for ConvNets?0Li Shen, Lu Yin, Meng Fang, Mykola Pechenizkiy, Shiwei Liu, Tianjin Huang, Zhangyang Wang, Zhenyu Zhang
572Achieving Linear Speedup in Non-IID Federated Bilevel Learning0Dewei Zhang, Kaiyi Ji, Minhui Huang
573Federated Linear Contextual Bandits with User-level Differential Privacy0Huanyu Zhang, Jing Yang, Luca Melis, Meisam Hejazinia, Milan Shen, Ruiquan Huang
574Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks0Brian Cheung, Minyoung Huh, Phillip Isola, Pulkit Agrawal
575Cut your Losses with Squentropy0Like Hui, Mikhail Belkin, Stephen Wright
576SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series0Arthur Andreas Nijdam, Iris A. M. Huijben, Merel M. van Gilst, Ruud van Sloun, Sebastiaan Overeem
577One-Shot Federated Conformal Prediction0Aurélien Bellet, Batiste Le Bars, Pierre Humbert, Sylvain Arlot
578The Impact of Exploration on Convergence and Performance of Multi-Agent Q-Learning Dynamics0Aamal Abbas Hussain, Dario Paccagnan, Francesco Belardinelli
579Combinatorial Neural Bandits0Kyuwook Chai, Minhwan Oh, Taehyun Hwang
580MAGANet: Achieving Combinatorial Generalization by Modeling a Group Action0Geonho Hwang, Hyunsoo Cho, Jaewoong Choi, Myungjoo Kang
581Information-Theoretic State Space Model for Multi-View Reinforcement Learning0GeonHyeong Kim, HyeongJoo Hwang, KeeEung Kim, Seokin Seo, Seunghoon Hong, Sungyoon Kim, Youngsoo Jang
582Under-Counted Tensor Completion with Neural Incorporation of Attributes0Eugene Seo, Rebecca A. Hutchinson, Shahana Ibrahim, Xiao Fu
583On the Identifiability and Estimation of Causal Location-Scale Noise Models0Alexander Immer, Alexander Marx, Bernhard Schölkopf, Christoph Schultheiss, Julia E. Vogt, Peter Bühlmann
584Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels0Alexander Immer, Bernhard Schölkopf, Gunnar Rätsch, Mark van der Wilk, Tycho F. A. van der Ouderaa
585Differentially Private Hierarchical Clustering with Provable Approximation Guarantees0Alessandro Epasto, Jacob Imola, Mohammad Mahdian, Vahab Mirrokni, Vincent CohenAddad
586Neural Network Accelerated Implicit Filtering: Integrating Neural Network Surrogates With Provably Convergent Derivative Free Optimization Methods0Avi Ziv, Brian Irwin, Eldad Haber, Raviv Gal
587Principled Offline RL in the Presence of Rich Exogenous Information0Alex Lamb, Aniket Rajiv Didolkar, Dipendra Misra, Harm van Seijen, Hongyu Zang, John Langford, Manan Tomar, Remi Tachet des Combes, Riashat Islam, Xin Li, Yonathan Efroni
588Unveiling the Latent Space Geometry of Push-Forward Generative Models0David Picard, Jérémie Mary, Thibaut Issenhuth, Ugo Tanielian
589CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design0Adam Foster, Cheng Zhang, Desi R. Ivanova, Joel Jennings, Tom Rainforth
590DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule0Maor Ivgi, Oliver Hinder, Yair Carmon
591Maximal Initial Learning Rates in Deep ReLU Networks0Boris Hanin, David Rolnick, Gaurav Iyer
592Data-Driven Subgroup Identification for Linear Regression0James Zou, Ruishan Liu, Zachary Izzo
593Efficient Training of Language Models using Few-Shot Learning0Sanjiv Kumar, Sashank J. Reddi, Satyen Kale, Seungyeon Kim, Shankar Krishnan, Sobhan Miryoosefi, Stefani Karp
594Scalable Adaptive Computation for Iterative Generation0Allan Jabri, David J. Fleet, Ting Chen
595Unconstrained Online Learning with Unbounded Losses0Andrew Jacobsen, Ashok Cutkosky
596Multi-Objective GFlowNets0Alex HernándezGarcía, Emmanuel Bengio, Jarrid RectorBrooks, Moksh Jain, Santiago Miret, Sharath Chandra Raparthy, Yoshua Bengio
597The Price of Differential Privacy under Continual Observation0Adam D. Smith, Palak Jain, Satchit Sivakumar, Sofya Raskhodnikova
598Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication0Ajay Kumar Jaiswal, Shiwei Liu, Tianlong Chen, Ying Ding, Zhangyang Wang
599Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models0Ajay Kumar Jaiswal, Shiwei Liu, Tianlong Chen, Ying Ding, Zhangyang Wang
600Exploring the Benefits of Training Expert Language Models over Instruction Tuning0Doyoung Kim, Joel Jang, Kyungjae Lee, Lajanugen Logeswaran, Minjoon Seo, Moontae Lee, Seonghyeon Ye, Seungone Kim
601Learning to Boost Training by Periodic Nowcasting Near Future Weights0ByungOk Han, Jaehong Kim, Jaeyeon Lee, Jinhyeok Jang, Won Hwa Kim, Woohan Yun, Youngwoo Yoon
602Unscented Autoencoder0Faris Janjos, J. Marius Zoellner, Lars Rosenbaum, Maxim Dolgov
603Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments0Corentin Tallec, Daniel Jarrett, Florent Altché, Michal Valko, Rémi Munos, Thomas Mesnard
604BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning0Bahram Zonooz, Elahe Arani, Kishaan Jeeveswaran, Prashant Shivaram Bhat
605Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing0Hye Won Chung, Hyeonsu Jeong
606Leveraging Label Non-Uniformity for Node Classification in Graph Neural Networks0Feng Ji, Hanyang Meng, Jielong Yang, Kai Zhao, See Hian Lee, Wee Peng Tay
607Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions0JieJing Shao, LanZhe Guo, LinHan Jia, Yufeng Li, Yuke Xiang, Zhi Zhou
608Short-lived High-volume Bandits0Andrew A. Li, Ian Anderson, Nishant Oli, Paul Duff, R. Ravi, Su Jia
609Smooth Non-stationary Bandits0Nathan Kallus, Peter I. Frazier, Qian Xie, Su Jia
610A Unified Optimization Framework of ANN-SNN Conversion: Towards Optimal Mapping from Activation Values to Firing Rates0Bin Gu, Giulia De Masi, Haiyan Jiang, Huan Xiong, Srinivas Anumasa
611VIMA: Robot Manipulation with Multimodal Prompts0Agrim Gupta, Anima Anandkumar, Guanzhi Wang, Li FeiFei, Linxi Fan, Yanjun Chen, Yongqiang Dou, Yuke Zhu, Yunfan Jiang, Zichen Zhang
612Estimating Causal Effects using a Multi-task Deep Ensemble0David E. Carlson, Keyu Li, Yiling Liu, Yiman Ren, Zhuoran Hou, Ziyang Jiang
613Online Restless Bandits with Unobserved States0Bo Jiang, Bowen Jiang, Chenghu Zhou, Jian Li, Tao Lin, Xinbing Wang
614Detecting Out-of-distribution Data through In-distribution Class Prior0Bo Han, Feng Liu, Feng Zheng, Hong Chen, Tongliang Liu, Xue Jiang, Zhen Fang
615Towards Stable and Efficient Adversarial Training against l1 Bounded Adversarial Attacks0Chen Liu, Mathieu Salzmann, Sabine Süsstrunk, Yulun Jiang, Zhichao Huang
616Learning Unnormalized Statistical Models via Compositional Optimization0Changyou Chen, Jiayu Qin, Lijun Zhang, Lingyu Wu, Tianbao Yang, Wei Jiang
617Approximate Causal Effect Identification under Weak Confounding0Lai Wei, Murat Kocaoglu, Ziwei Jiang
618MEWL: Few-shot multimodal word learning with referential uncertainty0Chi Zhang, Guangyuan Jiang, Manjie Xu, Shiji Xin, Wei Liang, Yixin Zhu, Yujia Peng
619NeuralSlice: Neural 3D Triangle Mesh Reconstruction via Slicing 4D Tetrahedral Meshes0Chenbo Jiang, Jie Yang, Lin Gao, Shwai He, YuKun Lai
620Effective Structured Prompting by Meta-Learning and Representative Verbalizer0James T. Kwok, Weisen Jiang, Yu Zhang
621Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing0Jason D. Lee, Jikai Jin, Kaifeng Lyu, Simon Shaolei Du, Zhiyuan Li
622Thompson Sampling with Less Exploration is Fast and Optimal0Pan Xu, Tianyuan Jin, Xianglin Yang, Xiaokui Xiao
623R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents0Christian Walder, Daniel D. Johnson, Daniel Tarlow
624Automatically Auditing Large Language Models via Discrete Optimization0Aditi Raghunathan, Anca D. Dragan, Erik Jones, Jacob Steinhardt
625On the Expressive Power of Geometric Graph Neural Networks0Chaitanya K. Joshi, Cristian Bodnar, Pietro Lio, Simon V. Mathis, Taco Cohen
626Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least0Baharan Mirzasoleiman, Siddharth Joshi
627Robust Subtask Learning for Compositional Generalization0Kishor Jothimurugan, Osbert Bastani, Rajeev Alur, Steve Hsu
628On Bridging the Gap between Mean Field and Finite Width Deep Random Multilayer Perceptron with Batch Normalization0Amir Joudaki, Francis R. Bach, Hadi Daneshmand
629FARE: Provably Fair Representation Learning with Practical Certificates0Dimitar Iliev Dimitrov, Martin T. Vechev, Mislav Balunovic, Nikola Jovanovic
630Scaling of Class-wise Training Losses for Post-hoc Calibration0Jongwon Choi, Seungjin Jung, Seungmo Seo, Yonghyun Jeong
631Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation0Eunho Yang, Hajin Shim, June Yong Yang, Yeonsung Jung
632Estimating Joint Treatment Effects by Combining Multiple Experiments0Elias Bareinboim, Jin Tian, Yonghan Jung
633The Catalog Problem: Clustering and Ordering Variable-Sized Sets0Graham W. Taylor, Leon Derczynski, Mateusz Maria Jurewicz
634Equivariance with Learned Canonicalization Functions0Arnab Kumar Mondal, Siamak Ravanbakhsh, SékouOumar Kaba, Yan Zhang, Yoshua Bengio
635Biases in Evaluation of Molecular Optimization Methods and Bias Reduction Strategies0Hiroshi Kajino, Kohei Miyaguchi, Takayuki Osogami
636Statistical Indistinguishability of Learning Algorithms0Alkis Kalavasis, Amin Karbasi, Grigoris Velegkas, Shay Moran
637Identifying Interpretable Subspaces in Image Representations0C. Bayan Bruss, Hamed Firooz, Maziar Sanjabi, Neha Mukund Kalibhat, Shweta Bhardwaj, Soheil Feizi
638Nonlinear Causal Discovery with Latent Confounders0David Kaltenpoth, Jilles Vreeken
639Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search0Guillaume Lample, Marco Virgolin, PierreAlexandre Kamienny, Sylvain Lamprier
640One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training0Hiroshi Takahashi, Kentaro Ohno, Masanori Yamada, Sekitoshi Kanai, Shin'ya Yamaguchi, Yasutoshi Ida
641Large Language Models Struggle to Learn Long-Tail Knowledge0Adam Roberts, Colin Raffel, Eric Wallace, Haikang Deng, Nikhil Kandpal
642Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models0Anisha Mascarenhas, Brian Lester, Colin Raffel, Haokun Liu, Mohammed Muqeeth, Monty Evans, Nikhil Kandpal, Tenghao Huang, Vishal Baskaran
643A Deep Conjugate Direction Method for Iteratively Solving Linear Systems0Ayano Kaneda, David Hyde, Jingyu Chen, Joseph Teran, Osman Akar, Victoria Alicia Trevino Kala
644Leveraging Proxy of Training Data for Test-Time Adaptation0Donghyeon Kwon, Jungseul Ok, Juwon Kang, Nayeong Kim, Suha Kwak
645Beyond Reward: Offline Preference-guided Policy Optimization0Diyuan Shi, Donglin Wang, Jinxin Liu, Li He, Yachen Kang
646Poisoning Generative Replay in Continual Learning to Promote Forgetting0Siteng Kang, Xinhua Zhang, Zhan Shi
647Node Embedding from Neural Hamiltonian Orbits in Graph Neural Networks0Kai Zhao, Qiyu Kang, Sijie Wang, Wee Peng Tay, Yang Song
648Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias0Kazuki Osawa, Ryo Karakida, Tomohiro Hayase, Tomoumi Takase
649Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning0Amin Karbasi, Nikki Lijing Kuang, Siddharth Mitra, YiAn Ma
650On the Relationship Between Explanation and Prediction: A Causal View0AmirHossein Karimi, Been Kim, Bernhard Schölkopf, Krikamol Muandet, Simon Kornblith
651Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis0Chuan Guo, G. Edward Suh, HsienHsin S. Lee, Kiwan Maeng, Moinuddin K. Qureshi, Sanjay Kariyappa, Wenjie Xiong
652General Sequential Episodic Memory Model0Arjun Karuvally, Hava T. Siegelmann, Terrence J. Sejnowski
653Regression with Sensor Data Containing Incomplete Observations0Takayuki Katsuki, Takayuki Osogami
654Data Representations' Study of Latent Image Manifolds0Ilya Kaufman, Omri Azencot
655Multi-Modal Classifiers for Open-Vocabulary Object Detection0Andrew Zisserman, Prannay Kaul, Weidi Xie
656Learning Mixtures of Markov Chains and MDPs0Ambuj Tewari, Chinmaya Kausik, Kevin Tan
657Curious Replay for Model-based Adaptation0Chris Doyle, Isaac Kauvar, Linqi Zhou, Nick Haber
658How Does Information Bottleneck Help Deep Learning?0Jiaoyang Huang, Kenji Kawaguchi, Xu Ji, Zhun Deng
659Instrumental Variable Estimation of Average Partial Causal Effects0Jin Tian, Manabu Kuroki, Yuta Kawakami
660The Test of Tests: A Framework for Differentially Private Hypothesis Testing0Adam Groce, Andrew P. Bray, Kaiyan Shi, Zeki Kazan
661Exact Inference in High-order Structured Prediction0Chuyang Ke, Jean Honorio
662Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally Coupled Oscillatory Recurrent Neural Networks0Max Welling, T. Anderson Keller
663Homomorphism AutoEncoder - Learning Group Structured Representations from Observed Transitions0Benjamin F. Grewe, Bernhard Schölkopf, Hamza Keurti, HsiaoRu Pan, Michel Besserve
664Rethinking Backdoor Attacks0Alaa Khaddaj, Aleksandar Makelov, Aleksander Madry, Andrew Ilyas, Guillaume Leclerc, Hadi Salman, Kristian Georgiev
665PAC Prediction Sets for Large Language Models of Code0Adam Khakhar, Osbert Bastani, Stephen Mell
666Accelerated Primal-Dual Methods for Convex-Strongly-Concave Saddle Point Problems0Digvijay Boob, Mohammad Khalafi
667Loss Balancing for Fair Supervised Learning0Mahed Abroshan, Mohammad Mahdi Khalili, Xueru Zhang
668Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach0Ioannis C. Tsaknakis, Jia Liu, Jiawei Zhang, Mingyi Hong, Prashant Khanduri, Sijia Liu, Yihua Zhang
669Emergent Asymmetry of Precision and Recall for Measuring Fidelity and Diversity of Generative Models in High Dimensions0Mahyar Khayatkhoei, Wael AbdAlmageed
670Learning-augmented private algorithms for multiple quantile release0Kareem Amin, Mikhail Khodak, Sergei Vassilvitskii, Travis Dick
671CrossSplit: Mitigating Label Noise Memorization through Data Splitting0Aristide Baratin, Jihye Kim, Simon LacosteJulien, Yan Zhang
672Trainability, Expressivity and Interpretability in Gated Neural ODEs0Kamesh Krishnamurthy, Tankut Can, Timothy Doyeon Kim
673SAAL: Sharpness-Aware Active Learning0Byeonghu Na, IlChul Moon, JoonHo Jang, Kyungwoo Song, Wanmo Kang, Yeongmin Kim, YoonYeong Kim, Youngjae Cho
674Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum0Daesol Cho, H. Jin Kim, Jigang Kim
675Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback0Assaf Zeevi, Garud Iyengar, Wonyoung Kim
676Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming0Deokjae Lee, Hyun Oh Song, Jinuk Kim, Yeonwoo Jeong
677Probabilistic Concept Bottleneck Models0Dahuin Jung, Eunji Kim, Sangha Park, Siwon Kim, Sungroh Yoon
678DevFormer: A Symmetric Transformer for Context-Aware Device Placement0Federico Berto, Haeyeon Kim, Jinkyoo Park, Joungho Kim, Minsu Kim
679Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models0Dongjun Kim, IlChul Moon, Se Jung Kwon, Wanmo Kang, Yeongmin Kim
680Robust Non-Linear Feedback Coding via Power-Constrained Deep Learning0Christopher G. Brinton, David J. Love, Junghoon Kim, Taejoon Kim
681LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework0Jeonghye Kim, Woojun Kim, Youngchul Sung
682BPipe: Memory-Balanced Pipeline Parallelism for Training Large Language Models0ByungGon Chun, GyeongIn Yu, Hyoungjoo Kim, Taebum Kim
683Probabilistic Imputation for Time-series Classification with Missing Data0Eunggu Yun, Hwangrae Lee, Hyunsu Kim, Jaehun Lee, Juho Lee, Seunghyun Kim
684Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills0Jaesik Choi, Kyowoon Lee, Seongun Kim
685Margin-based Neural Network Watermarking0Byungjoo Kim, Seanie Lee, Sooel Son, Sung Ju Hwang, Suyoung Lee
686Regularizing Towards Soft Equivariance Under Mixed Symmetries0Hongseok Yang, Hyungi Lee, Hyunsu Kim, Juho Lee
687Model-based Offline Reinforcement Learning with Count-based Conservatism0Byeongchan Kim, Minhwan Oh
688Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization0Chanyeong Kim, Hyunglip Bae, Jongwoong Park, Woo Chang Kim
689SurProGenes: Survival Risk-Ordered Representation of Cancer Patients and Genes for the Identification of Prognostic Genes0Hanseok Jeong, Jeongseon Kim, Junetae Kim, Kyoungsuk Park, SunYoung Kim, Youngwook Kim
690Stable and Consistent Prediction of 3D Characteristic Orientation via Invariant Residual Learning0Chunghyun Park, Jaesik Park, Minsu Cho, Seungwook Kim, Yoonwoo Jeong
691Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning0Dongyeop Kang, Jaehyung Kim, Jinwoo Shin
692An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning0Woojun Kim, Youngchul Sung
693Practical and Matching Gradient Variance Bounds for Black-Box Variational Bayesian Inference0Jacob R. Gardner, Jisu Oh, Kaiwen Wu, Kyurae Kim
694Learnability and Algorithm for Continual Learning0Bing Liu, Changnan Xiao, Gyuhak Kim, Tatsuya Konishi
695Unifying Nesterov's Accelerated Gradient Methods for Convex and Strongly Convex Objective Functions0Insoon Yang, Jungbin Kim
696Denoising MCMC for Accelerating Diffusion-Based Generative Models0Beomsu Kim, Jong Chul Ye
697Structure Learning of Latent Factors via Clique Search on Correlation Thresholded Graphs0Dale Kim, Qing Zhou
698Fair and Robust Estimation of Heterogeneous Treatment Effects for Policy Learning0José R. Zubizarreta, Kwangho Kim
699Proper Losses for Discrete Generative Models0Bo Waggoner, Dhamma Kimpara, Rafael M. Frongillo
700Controlling Posterior Collapse by an Inverse Lipschitz Constraint on the Decoder Network0Kenji Fukumizu, Kenta Oono, Shinichi Maeda, Yuichi Yoshida, Yuri Kinoshita
701A Watermark for Large Language Models0Ian Miers, John Kirchenbauer, Jonas Geiping, Jonathan Katz, Tom Goldstein, Yuxin Wen
702Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs0Enkelejda Kasneci, Michael Kirchhof, Seong Joon Oh
703Training Normalizing Flows from Dependent Data0Christoph Lippert, Marius Kloft, Matthias Kirchler
704IncDSI: Incrementally Updatable Document Retrieval0Chao Wan, Justin Lovelace, Kilian Q. Weinberger, Varsha Kishore, Yoav Artzi
705Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice0Csaba Szepesvári, Jincheng Mei, Matthieu Geist, Michal Valko, Mohammad Gheshlaghi Azar, Nino Vieillard, Olivier Pietquin, Pierre Ménard, Rémi Munos, Tadashi Kozuno, Toshinori Kitamura, Wataru Kumagai, Wenhao Yang, Yunhao Tang, Yutaka Matsuo
706Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions0Charlotte M. Deane, Garrett M. Morris, Leo Klarner, Michael Reutlinger, Tim G. J. Rudner, Torsten Schindler, Yee Whye Teh
707Deep Laplacian-based Options for Temporally-Extended Exploration0Marlos C. Machado, Martin Klissarov
708Generalized Reductions: Making any Hierarchical Clustering Fair and Balanced with Low Cost0John P. Dickerson, Marina Knittel, Max Springer, MohammadTaghi Hajiaghayi
709Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?0Boris Knyazev, Doha Hwang, Simon LacosteJulien
710Online Learning with Feedback Graphs: The True Shape of Regret0Alexandra Carpentier, Tomás Kocák
711Grounding Language Models to Images for Multimodal Inputs and Outputs0Daniel Fried, Jing Yu Koh, Ruslan Salakhutdinov
712Rigid Body Flows for Sampling Molecular Crystal Structures0Frank Noé, Jonas Köhler, Michele Invernizzi, Pim de Haan
713Enabling First-Order Gradient-Based Learning for Equilibrium Computation in Markets0Fabian Raoul Pieroth, Martin Bichler, Nils Kohring
714Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees0Anastasia Koloskova, Hadrien Hendrikx, Sebastian U. Stich
715On Computing Optimal Tree Ensembles0Christian Komusiewicz, Frank Sommer, Manuel Sorge, Pascal Kunz
716GOAT: A Global Transformer on Large-scale Graphs0C. Bayan Bruss, Jiuhai Chen, John Kirchenbauer, Kezhi Kong, Renkun Ni, Tom Goldstein
717Autoregressive Diffusion Model for Graph Generation0B. Aditya Prakash, Chao Zhang, Haotian Sun, Jiaming Cui, Lingkai Kong, Yuchen Zhuang
718End-to-End Full-Atom Antibody Design0Wenbing Huang, Xiangzhe Kong, Yang Liu
719Covariate balancing using the integral probability metric for causal inference0Insung Kong, Joonhyuk Jung, Kwonsang Lee, Yongdai Kim, Yuha Park
720Masked Bayesian Neural Networks : Theoretical Guarantee and its Posterior Inference0Dongyoon Yang, Gyuseung Baek, Ilsang Ohn, Insung Kong, Jongjin Lee, Yongdai Kim
721Parameter-Level Soft-Masking for Continual Learning0Bing Liu, Chihiro Ono, Gyuhak Kim, Mori Kurokawa, Tatsuya Konishi, Zixuan Ke
722Pretraining Language Models with Human Preferences0Angelica Chen, Christopher L. Buckley, Ethan Perez, Jason Phang, Kejian Shi, Rasika Vinayak Bhalerao, Samuel R. Bowman, Tomasz Korbak
723Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions0Ezgi Korkmaz, Jonah BrownCohen
724Ewald-based Long-Range Message Passing for Molecular Graphs0Arthur Kosmala, Johannes Gasteiger, Nicholas Gao, Stephan Günnemann
725TabDDPM: Modelling Tabular Data with Diffusion Models0Akim Kotelnikov, Artem Babenko, Dmitry Baranchuk, Ivan Rubachev
726Randomized Schur Complement Views for Graph Contrastive Learning0Vignesh Kothapalli
727Benign Overfitting in Two-layer ReLU Convolutional Neural Networks0Quanquan Gu, Yiwen Kou, Yuanzhou Chen, Zixiang Chen
728Variational Mixture of HyperGenerators for Learning Distributions over Functions0Batuhan Koyuncu, Ignacio Peis, Isabel Valera, Pablo M. Olmos, Pablo SánchezMartín
729Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond0Daniel Soudry, Itai Kreisler, Mor Shpigel Nacson, Yair Carmon
730Estimation Beyond Data Reweighting: Kernel Method of Moments0Bernhard Schölkopf, Heiner Kremer, JiaJie Zhu, Yassine Nemmour
731Multi-Task Differential Privacy Under Distribution Skew0Abhradeep Guha Thakurta, Li Zhang, Mukund Sundararajan, Prateek Jain, Shuang Song, Walid Krichene
732Towards Bridging the Gaps between the Right to Explanation and the Right to be Forgotten0Himabindu Lakkaraju, Jiaqi Ma, Satyapriya Krishna
733Graph Neural Tangent Kernel: Convergence on Large Graphs0Luana Ruiz, Sanjukta Krishnagopal
734Diffusion Models for Black-Box Optimization0Aditya Grover, Satvik Mehul Mashkaria, Siddarth Krishnamoorthy
735Learning to Design Analog Circuits to Meet Threshold Specifications0Dmitrii Krylov, Hamidreza Aghasi, Hiba Ajmal, Junhan Ouyang, Pooya Khajeh, Roy Fox, Thomas Reeves, Tongkai Liu
736Variance Control for Distributional Reinforcement Learning0Fan Zhou, Liwen Zhang, Qi Kuang, Zhoufan Zhu
737Hierarchical Imitation Learning with Vector Quantized Models0Alexander Ilin, Joni Pajarinen, Kalle Kujanpää
738SinDDM: A Single Image Denoising Diffusion Model0Matan Kleiner, Shahar Yadin, Tomer Michaeli, Vladimir Kulikov
739Towards Explaining Distribution Shifts0David I. Inouye, Sean Kulinski
740Featured Graph Coarsening with Similarity Guarantees0Anurag Sharma, Manoj Kumar, Sandeep Kumar, Shashwat Saxena
741Modeling Dynamic Environments with Scene Graph Memory0Andrey Kurenkov, Chengshu Li, Emily Jin, Jiajun Wu, Li FeiFei, Michael Lingelbach, Roberto MartínMartín, Ruohan Zhang, Silvio Savarese, Tanmay Agarwal
742Tied-Augment: Controlling Representation Similarity Improves Data Augmentation0Ekin Dogus Cubuk, Emirhan Kurtulus, Yann N. Dauphin, Zichao Li
743Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders0Alexandra Hotti, Jens Lagergren, Oskar Kviman, Ricky Molén, Semih Kurt, Víctor Elvira
744GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency0Jiuhn Song, Minseop Kwak, Seungryong Kim
745Rotation and Translation Invariant Representation Learning with Implicit Neural Representations0Ernest K. Ryu, Joo Young Choi, Sehyun Kwon
746Reward-Mixing MDPs with Few Latent Contexts are Learnable0Constantine Caramanis, Jeongyeol Kwon, Shie Mannor, Yonathan Efroni
747A Fully First-Order Method for Stochastic Bilevel Optimization0Dohyun Kwon, Jeongyeol Kwon, Robert D. Nowak, Stephen Wright
748Complexity of Block Coordinate Descent with Proximal Regularization and Applications to Wasserstein CP-dictionary Learning0Dohyun Kwon, Hanbaek Lyu
749Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value0James Zou, Yongchan Kwon
750Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning0Aqeel Labash, Daniel Majoral, Florian Stelzer, Raul Vicente Zafra
751Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning0Divyat Mahajan, Ioannis Mitliagkas, Quentin Bertrand, Simon LacosteJulien, Sébastien Lachapelle, Tristan Deleu, Yoshua Bengio
752Nearly-Optimal Hierarchical Clustering for Well-Clustered Graphs0BogdanAdrian Manghiuc, He Sun, Steinar Laenen
753Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection0Clément Rambour, Elias Ramzi, Marc Lafon, Nicolas Thome
754A theory of continuous generative flow networks0Alex HernándezGarcía, Alexandra Volokhova, Dinghuai Zhang, Léna Néhale Ezzine, Nikolay Malkin, Pablo Lemos, Salem Lahlou, Tristan Deleu, Yoshua Bengio
755Automatically marginalized MCMC in probabilistic programming0Daniel Sheldon, Hui Guan, Javier Burroni, Jinlin Lai
756DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation0Chengxi Li, Daniel Fried, Luke Zettlemoyer, Ruiqi Zhong, Sida I. Wang, Tao Yu, Tianyi Zhang, WenTau Yih, Yiming Wang, Yuhang Lai
757ChiPFormer: Transferable Chip Placement via Offline Decision Transformer0Bin Wang, Jianye Hao, Jinxin Liu, Ping Luo, Yao Lai, Zhentao Tang
758FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation0ChiehHsin Lai, Naoki Murata, Stefano Ermon, Toshimitsu Uesaka, Yuhta Takida, Yuki Mitsufuji
759Private Statistical Estimation of Many Quantiles0Aurélien Garivier, Clément Lalanne, Rémi Gribonval
760Bootstrap in High Dimension with Low Computation0Henry Lam, Zhenyuan Liu
761LegendreTron: Uprising Proper Multiclass Loss Learning0Christian J. Walder, Kevin H. Lam, Richard Nock, Spiridon I. Penev
762Metagenomic Binning using Connectivity-constrained Variational Autoencoders0Alessandro Tibo, Andre Lamurias, Katja Hose, Mads Albertsen, Thomas Dyhre Nielsen
763Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback0Aviv Rosenberg, Dmitry Sotnikov, Tal Lancewicki
764Lottery Tickets in Evolutionary Optimization: On Sparse Backpropagation-Free Trainability0Henning Sprekeler, Robert Tjarko Lange
765On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs0Remi Tachet des Combes, Romain Laroche
766Minimalistic Predictions to Schedule Jobs with Online Precedence Constraints0Alexander Lindermayr, Alexandra Anna Lassota, Jens Schlöter, Nicole Megow
767Speeding Up Bellman Ford via Minimum Violation Permutations0Ola Svensson, Sergei Vassilvitskii, Silvio Lattanzi
768Who Needs to Know? Minimal Knowledge for Optimal Coordination0Ameesh Shah, Micah Carroll, Michael D. Dennis, Niklas Lauffer, Stuart Russell
769Target-based Surrogates for Stochastic Optimization0Jonathan Wilder Lavington, Mark Schmidt, Nicolas Le Roux, Reza Babanezhad Harikandeh, Sharan Vaswani
770Cluster Explanation via Polyhedral Descriptions0Connor Lawless, Oktay Günlük
771Pre-training for Speech Translation: CTC Meets Optimal Transport0Benjamin Lecouteux, Changhan Wang, Didier Schwab, Hongyu Gong, Juan Pino, PhuongHang Le
772Bootstrapped Representations in Reinforcement Learning0Anna Harutyunyan, Charline Le Lan, Marc G. Bellemare, Mark Rowland, Rishabh Agarwal, Stephen Tu, Will Dabney
773Strategic Classification with Unknown User Manipulations0Ruth Urner, Shai BenDavid, Tosca Lechner
774Learning in POMDPs is Sample-Efficient with Hindsight Observability0Alekh Agarwal, Christoph Dann, Jonathan Lee, Tong Zhang
775Towards Deep Attention in Graph Neural Networks: Problems and Remedies0Fanchen Bu, Jaemin Yoo, Kijung Shin, Soo Yong Lee
776InGram: Inductive Knowledge Graph Embedding via Relation Graphs0Chanyoung Chung, Jaejun Lee, Joyce Jiyoung Whang
777Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits0ChaoKai Chiang, Jongyeong Lee, Junya Honda, Masashi Sugiyama
778Conditional Graph Information Bottleneck for Molecular Relational Learning0Chanyoung Park, Dongmin Hyun, Gyoung S. Na, Junseok Lee, Namkyeong Lee, Sungwon Kim
779Exploring Chemical Space with Score-based Out-of-distribution Generation0Jaehyeong Jo, Seul Lee, Sung Ju Hwang
780Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding0Fangyu Liu, Hexiang Hu, Iulia Raluca Turc, Julian Martin Eisenschlos, Kenton Lee, Kristina Toutanova, Mandar Joshi, MingWei Chang, Peter Shaw, Urvashi Khandelwal
781FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization0Dongsoo Lee, Jeonghoon Kim, Jung Hyun Lee, Se Jung Kwon
782CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis0Chaejeong Lee, Jayoung Kim, Noseong Park
783Minimizing Trajectory Curvature of ODE-based Generative Models0Beomsu Kim, Jong Chul Ye, Sangyun Lee
784H-Likelihood Approach to Deep Neural Networks with Temporal-Spatial Random Effects for High-Cardinality Categorical Features0Hangbin Lee, Youngjo Lee
785On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning0Byungkun Lee, Dongyoon Hwang, Hojoon Lee, Hyunho Lee, Jaegul Choo, Koanho Lee
786HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption0Garam Lee, Junbum Shin, Jung Woo Kim, MunKyu Lee, Seewoo Lee
787QASA: Advanced Question Answering on Scientific Articles0Dasol Hwang, HongIn Lee, Jaehyeon Kim, Kyungjae Lee, Moontae Lee, Sunghyun Park, Yoonjoo Lee
788Demystifying Disagreement-on-the-Line in High Dimensions0Behrad Moniri, Donghwan Lee, Edgar Dobriban, Hamed Hassani, Xinmeng Huang
789On the Correctness of Automatic Differentiation for Neural Networks with Machine-Representable Parameters0Alex Aiken, Sejun Park, Wonyeol Lee
790Implicit Jacobian regularization weighted with impurity of probability output0Jaewook Lee, Jinseong Park, Sungyoon Lee
791Unsupervised Skill Discovery for Learning Shared Structures across Changing Environments0SangHyun Lee, SeungWoo Seo
792Generalization Analysis for Contrastive Representation Learning0DingXuan Zhou, Tianbao Yang, Yiming Ying, Yunwen Lei
793Learning Control by Iterative Inversion0Aviv Tamar, Gal Leibovich, Gal Novik, Guy Jacob, Or Avner
794Sampling-Based Accuracy Testing of Posterior Estimators for General Inference0Adam Coogan, Laurence Perreault Levasseur, Pablo Lemos, Yashar Hezaveh
795Fast Inference from Transformers via Speculative Decoding0Matan Kalman, Yaniv Leviathan, Yossi Matias
796Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation0Alon Cohen, Asaf B. Cassel, Orin Levy, Yishay Mansour
797GLOBE-CE: A Translation Based Approach for Global Counterfactual Explanations0Dan Ley, Daniele Magazzeni, Saumitra Mishra
798TIPS: Topologically Important Path Sampling for Anytime Neural Networks0Guihong Li, Kartikeya Bhardwaj, Radu Marculescu, Yuedong Yang
799MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations0Anqi Li, Byron Boots, ChingAn Cheng
800Internet Explorer: Targeted Representation Learning on the Open Web0Alexander Cong Li, Alexei A. Efros, Deepak Pathak, Ellis Langham Brown
801Prototype-oriented unsupervised anomaly detection for multivariate time series0Bo Chen, Dongsheng Wang, Long Tian, Mingyuan Zhou, Wenchao Chen, Yuxin Li
802Learning Preconditioners for Conjugate Gradient PDE Solvers0Peter Yichen Chen, Tao Du, Wojciech Matusik, Yichen Li
803Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation0Anurag Ajay, Pulkit Agrawal, Tao Chen, Zechu Li, ZhangWei Hong
804Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation0Guanghua Ji, Li'ang Li, Yifei Duan, Yongqiang Cai
805FAIRER: Fairness as Decision Rationale Alignment0Aishan Liu, Mengnan Du, Qing Guo, Tianlin Li, Yang Liu, Zhiming Li
806RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution0Hongyao Tang, Jianye Hao, Pengyi Li, Xian Fu, Yan Zheng
807Adversarial Collaborative Learning on Non-IID Features0Bingsheng He, Dawn Song, Qinbin Li
808Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints0Cong Shen, Donghao Li, Jing Yang, Ruiquan Huang
809Transformers as Algorithms: Generalization and Stability in In-context Learning0Dimitris Papailiopoulos, Muhammed Emrullah Ildiz, Samet Oymak, Yingcong Li
810Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models0Arno Solin, Rui Li, S. T. John
811Local Vertex Colouring Graph Neural Networks0Dongwoo Kim, Qing Wang, Shouheng Li
812Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Fast Convergence and Partial Participation0Ping Li, Xiaoyun Li
813How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding0Andrej Risteski, Yuanzhi Li, Yuchen Li
814BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models0Dongxu Li, Junnan Li, Silvio Savarese, Steven C. H. Hoi
815Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression0Junfan Li, Shizhong Liao
816Revisiting Weighted Aggregation in Federated Learning with Neural Networks0Chao Wu, Tao Lin, Xinyi Shang, Zexi Li
817Distribution-dependent McDiarmid-type Inequalities for Functions of Unbounded Interaction0Shaojie Li, Yong Liu
818Optimal Convergence Rates for Agnostic Nyström Kernel Learning0Jian Li, Weiping Wang, Yong Liu
819Reconstructive Neuron Pruning for Backdoor Defense0Bo Li, Lingjuan Lyu, Nodens Koren, Xingjun Ma, Xixiang Lyu, Yige Li, YuGang Jiang
820Meta Learning of Interface Conditions for Multi-Domain Physics-Informed Neural Networks0Akil Narayan, Conor Tillinghast, Michael Penwarden, Mike Kirby, Shandian Zhe, Shibo Li, Yiming Xu
821Deep Anomaly Detection under Labeling Budget Constraints0Aodong Li, Chen Qiu, Maja Rudolph, Marius Kloft, Padhraic Smyth, Stephan Mandt
822On the Initialization of Graph Neural Networks0David Wipf, Jiahang Li, Xiang Song, Yakun Song
823Federated Adversarial Learning: A Framework with Convergence Analysis0Jiaming Yang, Xiaoxiao Li, Zhao Song
824How Powerful are Shallow Neural Networks with Bandlimited Random Weights?0Feilong Cao, Jiye Liang, Ming Li, Sho Sonoda, Yu Guang Wang
825Efficient Quantum Algorithms for Quantum Optimal Control0Chunhao Wang, Xiantao Li
826Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling0Lin Yang, Yiran Wang, Yu Cheng, Yunfan Li
827Hierarchical Diffusion for Offline Decision Making0Bo Jin, Hongyuan Zha, Wenhao Li, Xiangfeng Wang
828Divide and Conquer Dynamic Programming: An Almost Linear Time Change Point Detection Methodology in High Dimensions0Alessandro Rinaldo, Daren Wang, Wanshan Li
829Architecture-Agnostic Masked Image Modeling - From ViT back to CNN0Di Wu, Fang Wu, Siyuan Li, Stan Z. Li, Zelin Zang
830Learning Antidote Data to Individual Unfairness0Ethan Xia, Hongfu Liu, Peizhao Li
831Propensity Matters: Measuring and Enhancing Balancing for Recommendation0Chunyuan Zheng, Haoxuan Li, Peng Cui, Peng Wu, Yanghao Xiao
832GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks0Bryan Hooi, Miao Xiong, Yuwen Li
833SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process0Chao Zhang, Haoming Jiang, Hongyuan Zha, Simiao Zuo, Tuo Zhao, Yanbo Xu, Zichong Li
834Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds0Lin Yang, Shengshi Li
835Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving0Weixin Li, Xiaodong Yang
836Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees0Jianyi Yang, Pengfei Li, Shaolei Ren
837FedVS: Straggler-Resilient and Privacy-Preserving Vertical Federated Learning for Split Models0Duanyi Yao, Jin Liu, Songze Li
838Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints0Boyi Liu, Jiayang Li, Jing Yu, Yu Marco Nie, Zhaoran Wang
839LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation0Chen Liang, Pengcheng He, Qingru Zhang, Tuo Zhao, Weizhu Chen, Yifan Yu, Yixiao Li
840Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization0Chris Junchi Li, Gauthier Gidel, Huizhuo Yuan, Michael I. Jordan, Quanquan Gu
841Alternating Local Enumeration (TnALE): Solving Tensor Network Structure Search with Fewer Evaluations0Cesar F. Caiafa, Chao Li, Chunmei Li, Junhua Zeng, Qibin Zhao
842Understanding the Complexity Gains of Single-Task RL with a Curriculum0Qiyang Li, Sergey Levine, Yi Ma, Yuexiang Zhai
843Does a Neural Network Really Encode Symbolic Concepts?0Mingjie Li, Quanshi Zhang
844Cooperative Open-ended Learning Framework for Zero-Shot Coordination0Jichen Sun, Shao Zhang, Wei Pan, Xinbing Wang, Yali Du, Yang Li, Ying Wen
845Offline Reinforcement Learning with Closed-Form Policy Improvement Operators0Edwin Zhang, Jiachen Li, Ming Yin, Qinxun Bai, William Yang Wang, YuXiang Wang
846Optimal Arms Identification with Knapsacks0Lan Zhang, Shaoang Li, Xiangyang Li, Yingqi Yu
847Internally Rewarded Reinforcement Learning0Cornelius Weber, Jae Hee Lee, Mengdi Li, Stefan Wermter, Xufeng Zhao
848Trustworthy Policy Learning under the Counterfactual No-Harm Criterion0Chunyuan Zheng, Haoxuan Li, Peng Wu, Yixiao Cao, Yue Liu, Zhi Geng
849Structured Cooperative Learning with Graphical Model Priors0Dacheng Tao, Shuangtong Li, Tianyi Zhou, Xinmei Tian
850Low Complexity Homeomorphic Projection to Ensure Neural-Network Solution Feasibility for Optimization over (Non-)Convex Set0Enming Liang, Minghua Chen, Steven H. Low
851Consistency of Multiple Kernel Clustering0Chuan Ma, En Zhu, Weixuan Liang, Xinwang Liu, Yong Liu, Yunping Zhao, Zhe Liu
852A Distribution Optimization Framework for Confidence Bounds of Risk Measures0Hao Liang, ZhiQuan Luo
853Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations0James Zou, Weixin Liang, Xinyu Yang, Yining Mao, Yongchan Kwon
854AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners0Fei Ni, Masayoshi Tomizuka, Mingyu Ding, Ping Luo, Yao Mu, Zhixuan Liang
855Learning Compiler Pass Orders using Coreset and Normalized Value Prediction0Ali Shameli, Benoit Steiner, Chris Cummins, Hugh James Leather, Jiadong Guo, Kevin Stone, Mostafa Elhoushi, Pengtao Xie, Xiaomeng Yang, Youwei Liang, Yuandong Tian
856Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples0Chumeng Liang, Haibing Guan, Jiaru Zhang, Ruhui Ma, Tao Song, Xiaoyu Wu, Yang Hua, Yiming Xue, Zhengui Xue
857CLUSTSEG: Clustering for Universal Segmentation0Dongfang Liu, James Chenhao Liang, Tianfei Zhou, Wenguan Wang
858Conformal Inference is (almost) Free for Neural Networks Trained with Early Stopping0Matteo Sesia, Yanfei Zhou, Ziyi Liang
859Less is More: Task-aware Layer-wise Distillation for Language Model Compression0Chen Liang, Pengcheng He, Qingru Zhang, Simiao Zuo, Tuo Zhao, Weizhu Chen
860Statistical Inference and A/B Testing for First-Price Pacing Equilibria0Christian Kroer, Luofeng Liao
861Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization0Brian Kulis, Christopher Liao, Theodoros Tsiligkaridis
862Revisiting Domain Randomization via Relaxed State-Adversarial Policy Optimization0PingChun Hsieh, YuShuen Wang, YunHsuan Lien
863Variational Open-Domain Question Answering0Andreas Geert Motzfeldt, Ida Riis Jensen, Ole Winther, Valentin Liévin
864Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds0Mohammed AlQuraishi, Yeqing Lin
865Hyperbolic Diffusion Embedding and Distance for Hierarchical Representation Learning0Gal Mishne, Ronald R. Coifman, Ronen Talmon, YaWei Eileen Lin
866Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning0Frank Nielsen, Mark Schmidt, Melvin Leok, Mohammad Emtiyaz Khan, Valentin Duruisseaux, Wu Lin
867Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise0Chen Lin, Nan Duan, Tong Wu, Weizhu Chen, Yelong Shen, Yeyun Gong, Zhenghao Lin, Zhihao Fan
868Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations0Chenhang He, ManWai Mak, Weiwei Lin, Youzhi Tu
869Theory on Forgetting and Generalization of Continual Learning0Ness B. Shroff, Peizhong Ju, Sen Lin, Yingbin Liang
870Accelerated Cyclic Coordinate Dual Averaging with Extrapolation for Composite Convex Optimization0Chaobing Song, Cheuk Yin Lin, Jelena Diakonikolas
871Safe Offline Reinforcement Learning with Real-Time Budget Constraints0Bo Tang, Chao Yu, Dong Wang, Qian Lin, Qianlong Xie, Shangqin Mao, Xingxing Wang, Zifan Wu
872Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models0Alexander Lin, Bahareh Tolooshams, Demba E. Ba, Yves F. Atchadé
873Fast Online Value-Maximizing Prediction Sets with Conformal Cost Control0Cao Xiao, Jimeng Sun, Shubhendu Trivedi, Zhen Lin
874Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features0Chieh Hubert Lin, HsinYing Lee, HungYu Tseng, Maneesh Kumar Singh, MingHsuan Yang
875Fair yet Asymptotically Equal Collaborative Learning0Bryan Kian Hsiang Low, ChuanSheng Foo, SeeKiong Ng, Xiaoqiang Lin, Xinyi Xu
876Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction0Keqiang Yan, Shuiwang Ji, Xiaoning Qian, Yi Liu, Youzhi Luo, Yuchao Lin
877Continuation Path Learning for Homotopy Optimization0Qingfu Zhang, Xi Lin, Xiaoyuan Zhang, Zhiyuan Yang
878Speed-Oblivious Online Scheduling: Knowing (Precise) Speeds is not Necessary0Alexander Lindermayr, Martin Rapp, Nicole Megow
879Graph Mixup with Soft Alignments0Hongyi Ling, Meng Liu, Na Zou, Shuiwang Ji, Zhimeng Jiang
880Deep Graph Representation Learning and Optimization for Influence Maximization0Chen Ling, James Song, Junji Jiang, Junxiang Wang, Liang Zhao, Meikang Qiu, My T. Thai, Renhao Xue
881Emergent Agentic Transformer from Chain of Hindsight Experience0Hao Liu, Pieter Abbeel
882Shapley Based Residual Decomposition for Instance Analysis0Amanda S. Barnard, Tommy Liu
883Learning Representations without Compositional Assumptions0Jeroen Berrevoets, Mihaela van der Schaar, Tennison Liu, Zhaozhi Qian
884Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting0Chen Chen, Fangzhao Wu, Gang Chen, Lingjuan Lyu, Sai Wu, Yuchen Liu
885Towards Constituting Mathematical Structures for Learning to Optimize0HanQin Cai, Jialin Liu, Wotao Yin, Xiaohan Chen, Zhangyang Wang
886AudioLDM: Text-to-Audio Generation with Latent Diffusion Models0Danilo P. Mandic, Haohe Liu, Mark D. Plumbley, Wenwu Wang, Xinhao Mei, Xubo Liu, Yi Yuan, Zehua Chen
887Identifiability of Label Noise Transition Matrix0Hao Cheng, Kun Zhang, Yang Liu
888A Group Symmetric Stochastic Differential Equation Model for Molecule Multi-modal Pretraining0Hongyu Guo, Jian Tang, Shengchao Liu, Weitao Du, ZhiMing Ma
889Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy0Andrew B. Duncan, Axel Gandy, Xing Liu
890Cones: Concept Neurons in Diffusion Models for Customized Generation0Deli Zhao, Jingren Zhou, Kai Zhu, Kecheng Zheng, Ruili Feng, Yang Cao, Yifei Zhang, Yu Liu, Zhiheng Liu
891Opponent-Limited Online Search for Imperfect Information Games0Haobo Fu, Qiang Fu, Wei Yang, Weiming Liu
892Towards Robust and Safe Reinforcement Learning with Benign Off-policy Data0Ding Zhao, Hanjiang Hu, Huan Zhang, Yihang Yao, Zhepeng Cen, Zijian Guo, Zuxin Liu
893Constrained Decision Transformer for Offline Safe Reinforcement Learning0Ding Zhao, Tingnan Zhang, Wenhao Yu, Yihang Yao, Zhepeng Cen, Zijian Guo, Zuxin Liu
894Understanding and Defending Patched-based Adversarial Attacks for Vision Transformer0Jun Yang, Liang Liu, Yanan Guo, Youtao Zhang
895NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data0Chengyang Ying, Hang Su, Jun Zhu, Songming Liu, Ze Cheng, Zhongkai Hao
896Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs0EnPei Hu, GuanTing Liu, HungYi Lee, PuJen Cheng, ShaoHua Sun
897Online Local Differential Private Quantile Inference via Self-normalization0Lei Ding, Linglong Kong, Qirui Hu, Yi Liu
898GFlowOut: Dropout with Generative Flow Networks0Anirudh Goyal, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Dianbo Liu, Dinghuai Zhang, Kenji Kawaguchi, Moksh Jain, Nadhir Hassen, Nikolay Malkin, Qianli Shen, Salem Lahlou, Xu Ji, Yoshua Bengio
8992D-Shapley: A Framework for Fragmented Data Valuation0Hoang Anh Just, Ruoxi Jia, Xi Chen, Xiangyu Chang, Zhihong Liu
900Causal Structure Learning for Latent Intervened Non-stationary Data0Chenxi Liu, Kun Kuang
901Structural Re-weighting Improves Graph Domain Adaptation0Han Zhao, Nhan Tran, Pan Li, Qiang Qiu, Shikun Liu, Tianchun Li, Yongbin Feng
902Dink-Net: Neural Clustering on Large Graphs0Jun Xia, Ke Liang, Sihang Zhou, Stan Z. Li, Xihong Yang, Xinwang Liu, Yue Liu
903Oscillation-free Quantization for Low-bit Vision Transformers0KwangTing Cheng, ShihYang Liu, Zechun Liu
904Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits0Anji Liu, Guy Van den Broeck, Xuejie Liu, Yitao Liang
905Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity0Jin Zhang, Risheng Liu, Shangzhi Zeng, Wei Yao, Yaohua Liu
906Graph Switching Dynamical Systems0Efstratios Gavves, Miltiadis Kofinas, Sara Magliacane, Yongtuo Liu
907High Probability Convergence of Stochastic Gradient Methods0Alina Ene, Huy Nguyen, Ta Duy Nguyen, Thien Hang Nguyen, Zijian Liu
908OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models0Enshu Liu, Huazhong Yang, Xuefei Ning, Yu Wang, Zinan Lin
909Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning0Boyin Liu, Du Zhang, Jianqiang Yi, Yanyan Liang, Yi Pan, Zhiqiang Pu
910RSC: Accelerate Graph Neural Networks Training via Randomized Sparse Computations0Daochen Zha, Kaixiong Zhou, Shengyuan Chen, Xia Hu, Xiao Huang, Zirui Liu
911Algorithms for bounding contribution for histogram estimation under user-level privacy0Ananda Theertha Suresh, Marco Gruteser, Peter Kairouz, Wennan Zhu, Yuhan Liu
912Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning0Allan Zhou, Chelsea Finn, Evan Zheran Liu, Sahaana Suri, Tong Mu
913Generating Private Synthetic Data with Genetic Algorithms0Giuseppe Vietri, Jingwu Tang, Steven Wu, Terrance Liu
914FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning0Dinghao Wu, Jian Tang, Lu Lin, Minkai Xu, Peilin Zhao, Rex Ying, Songtao Liu, Zhengkai Tu, Zuobai Zhang
915I2SB: Image-to-Image Schrödinger Bridge0Anima Anandkumar, Arash Vahdat, DeAn Huang, Evangelos A. Theodorou, GuanHorng Liu, Weili Nie
916What can online reinforcement learning with function approximation benefit from general coverage conditions?0Fanghui Liu, Luca Viano, Volkan Cevher
917TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation0Gabriel LoaizaGanem, Jimmy Ba, Noël Vouitsis, Satya Krishna Gorti, Zhaoyan Liu
918Global Optimization with Parametric Function Approximation0Chong Liu, YuXiang Wang
919Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time0Anshumali Shrivastava, Beidi Chen, Binhang Yuan, Ce Zhang, Christopher Ré, Jue Wang, Tianyi Zhou, Tri Dao, Yuandong Tian, Zhao Song, Zichang Liu
920Trapdoor Normalization with Irreversible Ownership Verification0Hanwen Liu, Yadong Mu, Yuesheng Zhu, Zhenyu Weng
921Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models0Hong Liu, Sang Michael Xie, Tengyu Ma, Zhiyuan Li
922Taxonomy-Structured Domain Adaptation0GuangHe Lee, GuangYuan Hao, Hao He, Hao Wang, Tianyi Liu, Zihao Xu
923Dropout Reduces Underfitting0Joseph Jin, Trevor Darrell, Zhiqiang Shen, Zhiqiu Xu, Zhuang Liu
924Revisiting Pseudo-Label for Single-Positive Multi-Label Learning0Biao Liu, Jiaqi Lv, Ning Xu, Xin Geng
925Retrosynthetic Planning with Dual Value Networks0Austin Tripp, Di Xue, Guoqing Liu, Krzysztof Maziarz, Marwin H. S. Segler, Shufang Xie, Tao Qin, TieYan Liu, Yingce Xia, Zongzhang Zhang
926Online Nonstochastic Control with Adversarial and Static Constraints0Lei Ying, Xin Liu, Zixian Yang
927Optimization for Amortized Inverse Problems0Qi Lei, Quan Zhang, Tianci Liu, Tong Yang
928Active Policy Improvement from Multiple Black-box Oracles0Chaoqi Wang, Matthew R. Walter, Takuma Yoneda, Xuefeng Liu, Yuxin Chen
929Gradient-based Wang-Landau Algorithm: A Novel Sampler for Output Distribution of Neural Networks over the Input Space0Jingbo Shang, Weitang Liu, YiZhuang You, YingWai Li
930VectorMapNet: End-to-end Vectorized HD Map Learning0Hang Zhao, Tianyuan Yuan, Yicheng Liu, Yilun Wang, Yue Wang
931Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing0Kaiqing Zhang, Xiangyu Liu
932Prometheus: Taming Sample and Communication Complexities in Constrained Decentralized Stochastic Bilevel Learning0Jia Liu, Prashant Khanduri, Songtao Lu, Xin Zhang, Zhuqing Liu
933D2Match: Leveraging Deep Learning and Degeneracy for Subgraph Matching0Haiqin Yang, Jiaqi Sun, Lin Zhang, Xuanzhou Liu, Yujiu Yang
934Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression0Martha A. Larson, Zhengyu Zhao, Zhuoran Liu
935Which Invariance Should We Transfer? A Causal Minimax Learning Approach0Fang Fang, Mingzhou Liu, Xiangyu Zheng, Xinwei Sun, Yizhou Wang
936Unsupervised Out-of-Distribution Detection with Diffusion Inpainting0Jin Peng Zhou, Kilian Q. Weinberger, Yufan Wang, Zhenzhen Liu
937NA2Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning0Chunlin Chen, Yuanyang Zhu, Zichuan Liu
938Contextual Combinatorial Bandits with Probabilistically Triggered Arms0Adam Wierman, Jinhang Zuo, John C. S. Lui, Mohammad Hajiesmaili, Siwei Wang, Wei Chen, Xutong Liu
939Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning0Akhil Bagaria, George Konidaris, Sam Lobel
940Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries0Akash Srivastava, Charlotte Loh, Florian Wenzel, Kai Xu, Marin Soljacic, Rumen Dangovski, Seungwook Han, Shivchander Sudalairaj
941The Flan Collection: Designing Data and Methods for Effective Instruction Tuning0Adam Roberts, Albert Webson, Barret Zoph, Denny Zhou, Hyung Won Chung, Jason Wei, Le Hou, Quoc V. Le, Shayne Longpre, Tu Vu, Yi Tay
942Dataset Distillation with Convexified Implicit Gradients0Daniela Rus, Mathias Lechner, Noel Loo, Ramin M. Hasani
943Reflected Diffusion Models0Aaron Lou, Stefano Ermon
944Never mind the metrics - what about the uncertainty? Visualising binary confusion matrix metric distributions to put performance in perspective0Andrew P. Bradley, David R. Lovell, Dimity Miller, Jaiden Capra
945Bilevel Optimization with Coupled Decision-Dependent Distributions0Songtao Lu
946Two-Scale Gradient Descent Ascent Dynamics Finds Mixed Nash Equilibria of Continuous Games: A Mean-Field Perspective0Yulong Lu
947STEP: Learning N: M Structured Sparsity Masks from Scratch with Precondition0Amir Yazdanbakhsh, Christopher De Sa, Oleg Rybakov, Shivani Agrawal, Suvinay Subramanian, Yucheng Lu
948Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning0Cheng Lu, Chongxuan Li, Hang Su, Huayu Chen, Jianfei Chen, Jun Zhu
949Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks0Gautam Kamath, Yaoliang Yu, Yiwei Lu
950QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark0Ge Yan, Jiaming Shan, Junchi Yan, Kaisen Pan, Wenjie Wu, Xudong Lu
951Learning Dense Correspondences between Photos and Sketches0Judith E. Fan, Xiaolong Wang, Xuanchen Lu
952Adversarial Cheap Talk0Alistair Letcher, Chris Lu, Jakob Nicolaus Foerster, Timon Willi
953Federated Conformal Predictors for Distributed Uncertainty Quantification0Charles Lu, Michael I. Jordan, Ramesh Raskar, Sai Praneeth Karimireddy, Yaodong Yu
954Mechanistic Mode Connectivity0David Scott Krueger, Ekdeep Singh Lubana, Eric J. Bigelow, Hidenori Tanaka, Robert P. Dick
955A Unifying Framework to the Analysis of Interaction Methods using Synergy Functions0Daniel Lundström, Meisam Razaviyayn
956SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation0Huaishao Luo, Junwei Bao, Tianrui Li, Xiaodong He, Youzheng Wu
957Image Restoration with Mean-Reverting Stochastic Differential Equations0Fredrik K. Gustafsson, Jens Sjölund, Thomas B. Schön, Zheng Zhao, Ziwei Luo
958Dimensionality Reduction for General KDE Mode Finding0Cas Widdershoven, Christopher Musco, Xinyu Luo
959Iterative Approximate Cross-Validation0Rina Barber, Yuetian Luo, Zhimei Ren
960A Closer Look at Few-shot Classification Again0Hao Wu, Ji Zhang, Jing Xu, Jingkuan Song, Lianli Gao, Xu Luo
961HOPE: High-order Graph ODE For Modeling Interacting Dynamics0Huiyu Jiang, Jingyang Yuan, Ming Zhang, Wei Ju, Xiao Luo, Yifang Qin, Yizhou Sun, Zijie Huang
962Stabilizing GANs' Training with Brownian Motion Controller0Jianfei Chen, Jun Zhu, Tianjiao Luo, Ziyu Zhu
963OCD: Learning to Overfit with Conditional Diffusion Models0Lior Wolf, Shahar Lutati
964DiscoBAX: Discovery of optimal intervention sets in genomic experiment design0Andrew Jesson, Arash Mehrjou, Clare Lyle, Pascal Notin, Patrick Schwab, Stefan Bauer, Yarin Gal
965Understanding Plasticity in Neural Networks0Bernardo Ávila Pires, Clare Lyle, Evgenii Nikishin, Razvan Pascanu, Will Dabney, Zeyu Zheng
966Bandits with Knapsacks: Advice on Time-Varying Demands0Lixing Lyu, Wang Chi Cheung
967Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions0Boxiang Lyu, Sanmi Koyejo, Zachary Robertson, Zhe Feng
968Which Tricks are Important for Learning to Rank?0Aleksei Ustimenko, Andrey Gulin, Ivan Lyzhin, Liudmila Prokhorenkova
969Learning Neural Constitutive Laws from Motion Observations for Generalizable PDE Dynamics0Bolei Deng, Chuang Gan, Joshua B. Tenenbaum, Peter Yichen Chen, Pingchuan Ma, Tao Du, Wojciech Matusik
970LIV: Language-Image Representations and Rewards for Robotic Control0Amy Zhang, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar, Yecheng Jason Ma
971Graph Inductive Biases in Transformers without Message Passing0Adriana RomeroSoriano, Chen Lin, Derek Lim, Liheng Ma, Mark Coates, Philip H. S. Torr, Puneet K. Dokania, SerNam Lim
972Learning Signed Distance Functions from Noisy 3D Point Clouds via Noise to Noise Mapping0Baorui Ma, YuShen Liu, Zhizhong Han
973Learning Intuitive Policies Using Action Features0Jakob Nicolaus Foerster, Jizhou Liu, Max KleimanWeiner, Mingwei Ma, Samuel Sokota
974Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points0Igor Molybog, Javad Lavaei, Somayeh Sojoudi, Ziye Ma
975Buying Information for Stochastic Optimization0Christos Tzamos, Mingchen Ma
976Generated Graph Detection0Michael Backes, Ning Yu, Xinlei He, Yang Zhang, Yihan Ma, Yun Shen, Zhikun Zhang
977Calibrating Multimodal Learning0Bingzhe Wu, Changqing Zhang, Huan Ma, Huazhu Fu, Joey Tianyi Zhou, Qinghua Hu, Qingyang Zhang
978AutoCoreset: An Automatic Practical Coreset Construction Framework0Alaa Maalouf, Daniela Rus, Murad Tukan, Vladimir Braverman
979Learning GFlowNets From Partial Episodes For Improved Convergence And Stability0Andrei Cristian Nica, Emmanuel Bengio, Jarrid RectorBrooks, Kanika Madan, Maksym Korablyov, Moksh Jain, Nikolay Malkin, Tom Bosc, Yoshua Bengio
980Applied Online Algorithms with Heterogeneous Predictors0Jessica Maghakian, Jian Li, Mohammad Hajiesmaili, Ramesh K. Sitaraman, Russell Lee, Zhenhua Liu
981CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations0Gengchen Mai, Jiaming Song, Ni Lao, Stefano Ermon, Yutong He
982Vertical Federated Graph Neural Network for Recommender System0Peihua Mai, Yan Pang
983Can Neural Network Memorization Be Localized?0Chiyuan Zhang, Hanie Sedghi, J. Zico Kolter, Michael Curtis Mozer, Pratyush Maini, Zachary Chase Lipton
984Fundamental Tradeoffs in Learning with Prior Information0Anirudha Majumdar
985Additive Causal Bandits with Unknown Graph0Alan Malek, Silvia Chiappa, Virginia Aglietti
986Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality0Aarti Singh, Conor Igoe, Dhruv Malik, Yuanzhi Li
987A Kernel-Based View of Language Model Fine-Tuning0Alexander Wettig, Danqi Chen, Dingli Yu, Sadhika Malladi, Sanjeev Arora
988Performative Reinforcement Learning0Debmalya Mandal, Goran Radanovic, Stelios Triantafyllou
989Differential Privacy has Bounded Impact on Fairness in Classification0Aurélien Bellet, Marc Tommasi, Michaël Perrot, Paul Mangold
990Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice0Richard Nock, Robert C. Williamson, Yishay Mansour
991H-Consistency Bounds for Pairwise Misranking Loss Surrogates0Anqi Mao, Mehryar Mohri, Yutao Zhong
992Cross-Entropy Loss Functions: Theoretical Analysis and Applications0Anqi Mao, Mehryar Mohri, Yutao Zhong
993Supported Trust Region Optimization for Offline Reinforcement Learning0Chen Chen, Hongchang Zhang, Xiangyang Ji, Yi Xu, Yixiu Mao
994Robust Perception through Equivariance0Abhishek Vaibhav Joshi, Carl Vondrick, Chengzhi Mao, Hao Wang, Junfeng Yang, Lingyu Zhang
995Reliable Measures of Spread in High Dimensional Latent Spaces0Anna C. Marbut, Katy McKinneyBock, Travis J. Wheeler
996SRATTA: Sample Re-ATTribution Attack of Secure Aggregation in Federated Learning0Arthur Pignet, Jean Ogier du Terrail, Regis Loeb, Tanguy Marchand, Ulysse MarteauFerey
997Neuro-Symbolic Continual Learning: Knowledge, Reasoning Shortcuts and Concept Rehearsal0Andrea Passerini, Elisa Ficarra, Emanuele Marconato, Gianpaolo Bontempo, Simone Calderara, Stefano Teso
998Evaluating Unsupervised Denoising Requires Unsupervised Metrics0Adria MarcosMorales, Carlos FernandezGranda, Joshua Lawrence Vincent, Mai Tan, Matan Leibovich, Peter A. Crozier, Piyush Haluai, Sreyas Mohan
999Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts0Alexandre Drouin, Nicolas Chapados, Valentina Zantedeschi, Étienne Marcotte
1000Analyzing Diffusion as Serial Reproduction0Ilia Sucholutsky, Nori Jacoby, Raja Marjieh, Thomas A. Langlois, Thomas L. Griffiths
1001Quantized Distributed Training of Large Models with Convergence Guarantees0Adrian Vladu, Dan Alistarh, Ilia Markov, Qi Guo
1002Efficient Transformed Gaussian Processes for Non-Stationary Dependent Multi-class Classification0Daniel HernándezLobato, Juan Maroñas
1003Computational Asymmetries in Robust Classification0Michele Lombardi, Samuele Marro
1004Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective0Andrej Risteski, Jianfeng Lu, Tanya Marwah, Zachary Chase Lipton
1005Generative Pretraining for Black-Box Optimization0Aditya Grover, Satvik Mehul Mashkaria, Siddarth Krishnamoorthy
1006Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation0Aditya Mate, Aparna Taneja, Bryan Wilder, Milind Tambe
1007Multi-Fidelity Covariance Estimation in the Log-Euclidean Geometry0Aimee Maurais, Benjamin Peherstorfer, Terrence Alsup, Youssef M. Marzouk
1008Communication-Constrained Bandits under Additive Gaussian Noise0Jonathan Scarlett, Prathamesh Mayekar, Vincent Y. F. Tan
1009Nonparametric Density Estimation under Distribution Drift0Alessio Mazzetto, Eli Upfal
1010PAC-Bayesian Generalization Bounds for Adversarial Generative Models0Florence Clerc, Pascal Germain, Sokhna Diarra Mbacke
1011Robustness in Multimodal Learning under Train-Test Modality Mismatch0Alexander T. Toshev, Brandon McKinzie, Joseph Yitan Cheng, Vaishaal Shankar, Yinfei Yang
1012A Model-free Closeness-of-influence Test for Features in Supervised Learning0Mohammad Mehrabi, Ryan A. Rossi
1013Stochastic Gradient Succeeds for Bandits0Alekh Agarwal, Bo Dai, Csaba Szepesvári, Dale Schuurmans, Jincheng Mei, Zixin Zhong
1014Normalizing Flows for Interventional Density Estimation0Dennis Frauen, Stefan Feuerriegel, Valentyn Melnychuk
1015Reprogramming Pretrained Language Models for Antibody Sequence Infilling0Amit Dhurandhar, Devleena Das, Igor Melnyk, Inkit Padhi, Payel Das, PinYu Chen, Vijil Chenthamarakshan
1016Superhuman Fairness0Brian D. Ziebart, Linh Vu, Omid Memarrast
1017A Model-Based Method for Minimizing CVaR and Beyond0Robert M. Gower, Si Yi Meng
1018Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning0Jiawei Han, Jiaxin Huang, Martin Michalski, Tarek F. Abdelzaher, Yu Meng, Yu Zhang
1019On Preemption and Learning in Stochastic Scheduling0Corentin Odic, Flore Sentenac, Hugo Richard, Mathieu Molina, Nadav Merlis, Vianney Perchet
1020Quantile Credit Assignment0Alaa Saade, Audrunas Gruslys, Clare Lyle, Eric Moulines, Georg Ostrovski, Mark Rowland, Michal Valko, Rémi Munos, Theophane Weber, Thomas Mesnard, Wenqi Chen, Will Dabney, Yunhao Tang
1021Is Consensus Acceleration Possible in Decentralized Optimization over Slowly Time-Varying Networks?0Alexander Rogozin, Alexander V. Gasnikov, Dmitry Kovalev, Dmitry Metelev
1022Towards Theoretical Understanding of Inverse Reinforcement Learning0Alberto Maria Metelli, Filippo Lazzati, Marcello Restelli
1023Quantum Policy Gradient Algorithm with Optimized Action Decoding0Axel Plinge, Christopher Mutschler, Daniel D. Scherer, Michael J. Hartmann, Nico Meyer
1024Training Deep Surrogate Models with Large Scale Online Learning0Alejandro Ribés, Bruno Raffin, Lucas Thibaut Meyer, Marc Schouler, Robert Alexander Caulk
1025MANSA: Learning Fast and Slow in Multi-Agent Systems0David Henry Mguni, Feifei Tong, Haojun Chen, Jianhong Wang, Jun Wang, Longfei Yue, Stephen Marcus McAleer, Taher Jafferjee, Xidong Feng, Yaodong Yang
1026Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL0Alexander Rakhlin, Dylan J. Foster, Zakaria Mhammedi
1027Single Point-Based Distributed Zeroth-Order Optimization with a Non-Convex Stochastic Objective Function0Elissa Mhanna, Mohamad Assaad
1028Learning Instance-Specific Augmentations by Capturing Local Invariances0Adam Foster, Emile Mathieu, Hyunjik Kim, Ning Miao, Tom Rainforth, Yann Dubois, Yee Whye Teh
1029Path Neural Networks: Expressive and Accurate Graph Neural Networks0Gaspard Michel, Giannis Nikolentzos, Johannes F. Lutzeyer, Michalis Vazirgiannis
1030Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning0Thomas Miconi
1031Generative Decoding of Visual Stimuli0Andrei Irimia, Eleni Miliotou, Jason D. Hinman, Panagiotis Kyriakis, Paul Bogdan
1032Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation0Jiafan He, Quanquan Gu, Tianhao Wang, Yifei Min
1033Directed Chain Generative Adversarial Networks0Ming Min, Ruimeng Hu, Tomoyuki Ichiba
1034An Information-Theoretic Analysis of Nonstationary Bandit Learning0Daniel Russo, Seungki Min
1035On the Convergence of Gradient Flow on Multi-layer Linear Models0Enrique Mallada, Hancheng Min, René Vidal
1036Optimal Sets and Solution Paths of ReLU Networks0Aaron Mishkin, Mert Pilanci
1037The Numerical Stability of Hyperbolic Representation Learning0Gal Mishne, Sheng Yang, Yusu Wang, Zhengchao Wan
1038DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature0Alexander Khazatsky, Chelsea Finn, Christopher D. Manning, Eric Mitchell, Yoonho Lee
1039Diffusion Based Representation Learning0Arash Mehrjou, Bernhard Schölkopf, Korbinian Abstreiter, Sarthak Mittal, Stefan Bauer
1040Disentangled Multiplex Graph Representation Learning0Heng Tao Shen, Jialie Shen, Xiaofeng Zhu, Xiaoshuang Shi, Yajie Lei, Yujie Mo
1041A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition0Pedro Morgado, Shentong Mo
1042Pruning via Sparsity-indexed ODE: a Continuous Sparsity Viewpoint0Haosen Shi, Sinno Jialin Pan, Zhanfeng Mo
1043Text-To-Concept (and Back) via Cross-Model Alignment0Keivan Rezaei, Mazda Moayeri, Maziar Sanjabi, Soheil Feizi
1044A Fast, Well-Founded Approximation to the Empirical Neural Tangent Kernel0Danica J. Sutherland, Mohamad Amin Mohamadi, Wonho Bae
1045Special Properties of Gradient Descent with Large Learning Rates0Amirkeivan Mohtashami, Martin Jaggi, Sebastian U. Stich
1046Neural Inverse Operators for Solving PDE Inverse Problems0Björn Engquist, Roberto Molinaro, Siddhartha Mishra, Yunan Yang
1047Input uncertainty propagation through trained neural networks0Erwan Le Pennec, Loic Coquelin, Nicolas Fischer, Paul Monchot, Sébastien Julien Petit, Sébastien Marmin
1048Compressing Tabular Data via Latent Variable Estimation0Andrea Montanari, Eric Weiner
1049An SDE for Modeling SAM: Theory and Insights0Antonio Orvieto, Aurélien Lucchi, Enea Monzio Compagnoni, Frank Norbert Proske, Hans Kersting, Luca Biggio
1050Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic0Atsuki Yamaguchi, Gaku Morio, Terufumi Morishita, Yasuhiro Sogawa
1051WL meet VC0Christopher Morris, Floris Geerts, Jan Tönshoff, Martin Grohe
1052ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs0Brendan O'Donoghue, Satinder Singh, Sebastian Flennerhag, Ted Moskovitz, Tom Zahavy, Vivek Veeriah
1053Optimistic Planning by Regularized Dynamic Programming0Antoine Moulin, Gergely Neu
1054Neural signature kernels as infinite-width-depth-limits of controlled ResNets0Cristopher Salvi, Maud Lemercier, Nicola Muca Cirone
1055Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models0Alaaeldin ElNouby, Hervé Jégou, Jakob Verbeek, Karen Ullrich, Matthew J. Muckley
1056PFNs4BO: In-Context Learning for Bayesian Optimization0Frank Hutter, Matthias Feurer, Noah Hollmann, Samuel Müller
1057Achieving High Accuracy with PINNs via Energy Natural Gradient Descent0Johannes Müller, Marius Zeinhofer
1058Uncertain Evidence in Probabilistic Models and Stochastic Simulators0Alexander Mead, Andreas Munk, Frank Wood
1059GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration0ChiehHsin Lai, Koichi Saito, Naoki Murata, Stefano Ermon, Toshimitsu Uesaka, Yuhta Takida, Yuki Mitsufuji
1060DIFF2: Differential Private Optimization via Gradient Differences for Nonconvex Distributed Learning0Taiji Suzuki, Tomoya Murata
1061Efficiently predicting high resolution mass spectra with graph neural networks0David Healey, Ernest Fraenkel, Michael Murphy, Stefanie Jegelka, Thomas Butler, Tobias Kind
1062Dynamical Linear Bandits0Alberto Maria Metelli, Marcello Restelli, Marco Mussi
1063Representation-Driven Reinforcement Learning0Guy Tennenholtz, Ofir Nabati, Shie Mannor
1064DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization0Adel Nabli, Edouard Oyallon
1065Multi-User Reinforcement Learning with Low Rank Rewards0Dheeraj Mysore Nagaraj, Naman Agarwal, Praneeth Netrapalli, Prateek Jain, Suhas S. Kowshik
1066Statistical Foundations of Prior-Data Fitted Networks0Thomas Nagler
1067Do Machine Learning Models Learn Statistical Rules Inferred from Data?0Aaditya Naik, Eric Wong, Mayur Naik, Yinjun Wu
1068Sample and Predict Your Latent: Modality-free Sequential Disentanglement via Contrastive Estimation0Ilan Naiman, Nimrod Berman, Omri Azencot
1069Effectively Using Public Data in Privacy Preserving Machine Learning0Amir Houmansadr, Milad Nasr, Prateek Mittal, Saeed Mahloujifar, Xinyu Tang
1070Counterfactual Identifiability of Bijective Causal Models0Arash NasrEsfahany, Devavrat Shah, Mohammad Alizadeh
1071Discovering Object-Centric Generalized Value Functions From Pixels0Gopeshh Raaj Subbaraj, Khimya Khetarpal, Samira Ebrahimi Kahou, Somjit Nath
1072On Many-Actions Policy Gradient0Marek Cygan, Michal Nauman
1073Equivariant Architectures for Learning in Deep Weight Spaces0Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Haggai Maron, Idan Achituve
1074Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation0Hamsa Balakrishnan, Karthik Gopalakrishnan, Kenneth Choi, Siddharth Nayak, Sydney Dolan, Wenqi Ding
1075Geometric Autoencoders - What You See is What You Decode0Fred A. Hamprecht, Philipp Nazari, Sebastian Damrich
1076Action Matching: Learning Stochastic Dynamics from Samples0Alireza Makhzani, Daniel Severo, Kirill Neklyudov, Rob Brekelmans
1077Extending Conformal Prediction to Hidden Markov Models with Exact Validity via de Finetti's Theorem for Markov Chains0Buddhika Nettasinghe, Mahantesh M. Halappanavar, Ramakrishna Tipireddy, Samrat Chatterjee
1078ClimaX: A foundation model for weather and climate0Aditya Grover, Ashish Kapoor, Jayesh K. Gupta, Johannes Brandstetter, Tung Nguyen
1079Provable Reset-free Reinforcement Learning by No-Regret Reduction0ChingAn Cheng, HoaiAn Nguyen
1080Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature0Khang Nguyen, Nhat Ho, Nong Minh Hieu, Stanley J. Osher, Tan Minh Nguyen, Vinh Duc Nguyen
1081Deep Clustering with Incomplete Noisy Pairwise Annotations: A Geometric Regularization Approach0Shahana Ibrahim, Tri Nguyen, Xiao Fu
1082Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction0Dang Nguyen, Khai Nguyen, Nhat Ho
1083Building Neural Networks on Matrix Manifolds: A Gyrovector Space Approach0Shuo Yang, Xuan Son Nguyen
1084Simple Disentanglement of Style and Content in Visual Representations0Alex Gittens, Lilian Ngweta, Mikhail Yurochkin, Subha Maity, Yuekai Sun
1085MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL0Bin Wang, Fei Ni, Jianye Hao, Yan Zheng, Yao Mu, Yifu Yuan, Zhixuan Liang
1086LEVER: Learning to Verify Language-to-Code Generation with Execution0Ansong Ni, Dragomir Radev, Sida I. Wang, Srini Iyer, Veselin Stoyanov, WenTau Yih, Xi Victoria Lin
1087Continual Vision-Language Representation Learning with Off-Diagonal Information0Longhui Wei, Qi Tian, Siliang Tang, Yueting Zhuang, Zixuan Ni
1088Attributing Image Generative Models using Latent Fingerprints0Changhoon Kim, Guangyu Nie, Yezhou Yang, Yi Ren
1089A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback0Christopher John Quinn, Guanyu Nie, Vaneet Aggarwal, Yanhui Zhu, Yididiya Y. Nadew
1090SinFusion: Training Diffusion Models on a Single Image or Video0Michal Irani, Niv Haim, Yaniv Nikankin
1091SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge0Dan Alistarh, Eldar Kurtic, Eugenia Iofinova, Mahdi Nikdan, Tommaso Pegolotti
1092Anti-Exploration by Random Network Distillation0Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov, Vladislav Kurenkov
1093Input Perturbation Reduces Exposure Bias in Diffusion Models0Angelo Porrello, Enver Sangineto, Mang Ning, Rita Cucchiara, Simone Calderara
1094Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems0Atsushi Nitanda, Denny Wu, Kazusato Oko, Nobuhito Takenouchi, Taiji Suzuki
1095The Statistical Scope of Multicalibration0Aaron Roth, Georgy Noarov
1096Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling0Alane Suhr, Hannaneh Hajishirzi, Kolby Nottingham, Prithviraj Ammanabrolu, Roy Fox, Sameer Singh, Yejin Choi
1097Gradient-Free Structured Pruning with Unlabeled Data0Azade Nova, Dale Schuurmans, Hanjun Dai
1098CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets0Julian J. McAuley, Saurabh Garg, Zachary Chase Lipton, Zachary Novack
1099Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction0Alex Shonenkov, Daniel Bershatsky, Denis Valerievich Dimitrov, Georgii Sergeevich Novikov, Ivan V. Oseledets, Julia Gusak
1100Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization0Brendan O'Donoghue
1101Provable Benefit of Mixup for Finding Optimal Decision Boundaries0Chulhee Yun, Junsoo Oh
1102Shedding a PAC-Bayesian Light on Adaptive Sliced-Wasserstein Distances0Alain Rakotomamonjy, Kimia Nadjahi, Liva Ralaivola, Ruben Ohana
1103Reasons for the Superiority of Stochastic Estimators over Deterministic Ones: Robustness, Consistency and Perceptual Quality0Guy Ohayon, Michael Elad, Theo Joseph Adrai, Tomer Michaeli
1104On the Within-Group Fairness of Screening Classifiers0Manuel Gomez Rodriguez, Nastaran Okati, Stratis Tsirtsis
1105Diffusion Models are Minimax Optimal Distribution Estimators0Kazusato Oko, Shunta Akiyama, Taiji Suzuki
1106How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy0Bhiksha Raj, Raphaël Olivier
1107B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding0Andrew Jesson, Jacob Dorn, Marah Ghoummaid, Miruna Oprescu, Nathan Kallus, Uri Shalit
1108Measuring the Impact of Programming Language Distribution0Gabriel Orlanski, Jacob Austin, Jeffrey Hui, Jonathan Malmaud, Joshua Howland, Kefan Xiao, Michele Catasta, Rishabh Singh, Xavier Garcia
1109When does Privileged information Explain Away Label Noise?0Alexander Nicholas D'Amour, Anant Nawalgaria, Efi Kokiopoulou, Guillermo OrtizJiménez, Jesse Berent, Mark Collier, Rodolphe Jenatton
1110Resurrecting Recurrent Neural Networks for Long Sequences0Albert Gu, Antonio Orvieto, Anushan Fernando, Razvan Pascanu, Samuel L. Smith, Soham De, Çaglar Gülçehre
1111Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process0Guang Cheng, Liyan Xie, Yidong Ouyang
1112On the Role of Attention in Prompt-tuning0Ankit Singh Rawat, Christos Thrampoulidis, Mahdi Soltanolkotabi, Samet Oymak
1113Revisiting the Linear-Programming Framework for Offline RL with General Function Approximation0Asuman E. Ozdaglar, Jiawei Zhang, Kaiqing Zhang, Sarath Pattathil
1114Extrapolative Controlled Sequence Generation via Iterative Refinement0Ankur P. Parikh, He He, Richard Yuanzhe Pang, Vishakh Padmakumar
1115Locally Regularized Neural Differential Equations: Some Black Boxes were meant to remain closed!0Alan Edelman, Avik Pal, Christopher Vincent Rackauckas
1116Controlled Differential Equations on Long Sequences via Non-standard Wavelets0Sathya N. Ravi, Sourav Pal, Vikas Singh, Zhanpeng Zeng
1117Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark0Alexander Pan, Andy Zou, Dan Hendrycks, Hanlin Zhang, Jun Shern Chan, Nathaniel Li, Scott Emmons, Steven Basart, Thomas Woodside
1118Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering0Erlin Pan, Zhao Kang
1119Better Training of GFlowNets with Local Credit and Incomplete Trajectories0Dinghuai Zhang, Ling Pan, Nikolay Malkin, Yoshua Bengio
1120A Hybrid Quantum-Classical Approach based on the Hadamard Transform for the Convolutional Layer0Ahmet Enis Çetin, Hongyi Pan, Salih Furkan Atici, Xin Zhu
1121Semi Bandit dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees0Ioannis Panageas, Luca Viano, Stratis Skoulakis, Volkan Cevher, Xiao Wang
1122Flash: Concept Drift Adaptation in Federated Learning0Hui Guan, Koyel Mukherjee, Kunjal Panchal, Saayan Mitra, Somdeb Sarkhel, Subrata Mitra, Sunav Choudhary
1123Learn to Accumulate Evidence from All Training Samples: Theory and Practice0Deep Shankar Pandey, Qi Yu
1124Secure Federated Correlation Test and Entropy Estimation0Dawn Song, Lun Wang, Qi Pang, Shuai Wang, Wenting Zheng
1125Task-Specific Skill Localization in Fine-tuned Language Models0Abhishek Panigrahi, Haoyu Zhao, Nikunj Saunshi, Sanjeev Arora
1126Kernel Sufficient Dimension Reduction and Variable Selection for Compositional Data via Amalgamation0Cheolwoo Park, Jeongyoun Ahn, Junyoung Park
1127Learning Affinity with Hyperbolic Representation for Spatial Propagation0HaeGon Jeon, Inhwan Bae, Jaesung Choe, JinHwi Park
1128TRAK: Attributing Model Behavior at Scale0Aleksander Madry, Andrew Ilyas, Guillaume Leclerc, Kristian Georgiev, Sung Min Park
1129Test-Time Style Shifting: Handling Arbitrary Styles in Domain Generalization0DongJun Han, Jaekyun Moon, Jungwuk Park, Soyeong Kim
1130Towards Understanding Ensemble Distillation in Federated Learning0Ganguk Hwang, Kihun Hong, Sejun Park
1131Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows0Dongjin Kim, Seobin Park, Sungyong Baik, Tae Hyun Kim
1132Differentially Private Sharpness-Aware Training0Hoki Kim, Jaewook Lee, Jinseong Park, Yujin Choi
1133Controllability-Aware Unsupervised Skill Discovery0Kimin Lee, Pieter Abbeel, Seohong Park, Youngwoon Lee
1134Predictable MDP Abstraction for Unsupervised Model-Based RL0Seohong Park, Sergey Levine
1135Neural Stochastic Differential Games for Time-series Analysis0Byoungwoo Park, Changhee Lee, Moontae Lee, Sungwoo Park
1136Accelerated Infeasibility Detection of Constrained Optimization and Fixed-Point Iterations0Ernest K. Ryu, Jisun Park
1137Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators0Paavo Parmas, Takuma Seno, Yuma Aoki
1138PAC Generalization via Invariant Representations0Advait U. Parulekar, Karthikeyan Shanmugam, Sanjay Shakkottai
1139Stochastic Gradient Descent-Induced Drift of Representation in a Two-Layer Neural Network0Alexei A. Koulakov, Farhad Pashakhanloo
1140Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs0C. Lawrence Zitnick, Saro Passaro
1141Federated Online and Bandit Convex Optimization0Aadirupa Saha, Kumar Kshitij Patel, Lingxiao Wang, Nathan Srebro
1142Brauer's Group Equivariant Neural Networks0Edward PearceCrump
1143How Jellyfish Characterise Alternating Group Equivariant Neural Networks0Edward PearceCrump
1144Can Large Language Models Reason about Program Invariants?0Charles Sutton, David Bieber, Kensen Shi, Kexin Pei, Pengcheng Yin
1145Dynamics-inspired Neuromorphic Visual Representation Learning0Shuhui Wang, Zhengqi Pei
1146Feature Directions Matter: Long-Tailed Learning via Rotated Balanced Representation0Huiyang Shao, Peifeng Gao, Peisong Wen, Qianqian Xu, Qingming Huang, Zhiyong Yang
1147Fair Neighbor Embedding0Jaakko Peltonen, Jyrki Nummenmaa, Timo Nummenmaa, Wen Xu
1148The Ideal Continual Learner: An Agent That Never Forgets0Liangzu Peng, Paris Giampouras, René Vidal
1149MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation0Jianzhu Ma, Jiaqi Guan, Qiang Liu, Xingang Peng
1150Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation0Andi Peng, Andreea Bobu, Aviv Netanyahu, Julie Shah, Mark K. Ho, Pulkit Agrawal, Tianmin Shu
1151Learning Hidden Markov Models When the Locations of Missing Observations are Unknown0Binyamin Perets, Mark Kozdoba, Shie Mannor
1152Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection0Arto Klami, Lorenzo Perini, PaulChristian Bürkner
1153Are Gaussian Data All You Need? The Extents and Limits of Universality in High-Dimensional Generalized Linear Estimation0Bruno Loureiro, Florent Krzakala, Luca Pesce, Ludovic Stephan
1154Certifying Ensembles: A General Certification Theory with S-Lipschitzness0Adel Bibi, Aleksandar Petrov, Amartya Sanyal, Francisco Eiras, Philip H. S. Torr
1155The Power of Learned Locally Linear Models for Nonlinear Policy Optimization0Daniel Pfrommer, Max Simchowitz, Nikolai Matni, Stephen Tu, Tyler Westenbroek
1156A Scalable Frank-Wolfe-Based Algorithm for the Max-Cut SDP0Chi Bach Pham, James Saunderson, Wynita M. Griggs
1157Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability0Claudia LinnhoffPopien, Fabian Ritz, Jonas Nüßlein, Maximilian Zorn, Michael Kölle, Philipp Altmann, Thomas Gabor, Thomy Phan
1158HyperTuning: Toward Adapting Large Language Models without Back-propagation0Jason Phang, Pengcheng He, Weizhu Chen, Yi Mao
1159Linear CNNs Discover the Statistical Structure of the Dataset Using Only the Most Dominant Frequencies0Hannah Pinson, Joeri Lenaerts, Vincent Ginis
1160Conformal Prediction for Federated Uncertainty Quantification Under Label Shift0Aleksandr Rubashevskii, Eric Moulines, Maxim Panov, Mehdi Makni, Vincent Plassier
1161Universal Physics-Informed Neural Networks: Symbolic Differential Operator Discovery with Sparse Data0Brydon Eastman, Lena Podina, Mohammad Kohandel
1162Sequential Kernelized Independence Testing0Aaditya Ramdas, Aleksandr Podkopaev, Patrick Blöbaum, Shiva Prasad Kasiviswanathan
1163Truncating Trajectories in Monte Carlo Reinforcement Learning0Alberto Maria Metelli, Marcello Restelli, Riccardo Poiani
1164Hyena Hierarchy: Towards Larger Convolutional Language Models0Christopher Ré, Daniel Y. Fu, Eric Nguyen, Michael Poli, Stefano Ermon, Stefano Massaroli, Stephen Baccus, Tri Dao, Yoshua Bengio
1165Spurious Valleys and Clustering Behavior of Neural Networks0Samuele Pollaci
1166Multisample Flow Matching: Straightening Flows with Minibatch Couplings0AramAlexandre Pooladian, Brandon Amos, Carles DomingoEnrich, Heli BenHamu, Ricky T. Q. Chen, Yaron Lipman
1167Minimax estimation of discontinuous optimal transport maps: The semi-discrete case0AramAlexandre Pooladian, Jonathan NilesWeed, Vincent Divol
1168Test-time Adaptation with Slot-Centric Models0Anirudh Goyal, Deepak Pathak, Gaurav Aggarwal, Katerina Fragkiadaki, Mehdi S. M. Sajjadi, Mihir Prabhudesai, Sjoerd van Steenkiste, Sujoy Paul, Thomas Kipf
1169JAWS-X: Addressing Efficiency Bottlenecks of Conformal Prediction Under Standard and Feedback Covariate Shift0Anqi Liu, Drew Prinster, Suchi Saria
1170Equivariant Polynomials for Graph Neural Networks0Bobak Toussi Kiani, Derek Lim, Haggai Maron, Omri Puny, Yaron Lipman
1171Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining0Guofan Fan, Kaisheng Ma, Li Yi, Runpei Dong, Xiangyu Zhang, Zekun Qi, Zheng Ge
1172An Effective Meaningful Way to Evaluate Survival Models0LiHao Kuan, Mahtab Farrokh, Neeraj Kumar, Rajesh Ranganath, Ricardo Henao, Russell Greiner, Shiang Qi, Weijie Sun
1173Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D0Bo Qiang, Bowen Gao, Hao Zhou, Jingjing Gong, Minkai Xu, WeiYing Ma, Yanyan Lan, Yuxuan Song
1174Collaborative Causal Inference with Fair Incentives0Bryan Kian Hsiang Low, Rui Qiao, Xinyi Xu
1175FREDIS: A Fusion Framework of Refinement and Disambiguation for Unreliable Partial Label Learning0Congyu Qiao, Jiaqi Lv, Ning Xu, Xin Geng, Yi Ren
1176Nugget: Neural Agglomerative Embeddings of Text0Benjamin Van Durme, Guanghui Qin
1177BiBench: Benchmarking and Analyzing Network Binarization0Aoyu Li, Fisher Yu, Haotong Qin, Mingyuan Zhang, Xianglong Liu, Yifu Ding, Zhongang Cai, Ziwei Liu
1178Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization0Denny Zhou, Lijun Zhang, Quanqi Hu, Tianbao Yang, Zhuoning Yuan, ZiHao Qiu
1179Shortest Edit Path Crossover: A Theory-driven Solution to the Permutation Problem in Evolutionary Neural Architecture Search0Risto Miikkulainen, Xin Qiu
1180Simple and Fast Group Robustness by Automatic Feature Reweighting0Andres Potapczynski, Andrew Gordon Wilson, Pavel Izmailov, Shikai Qiu
1181DRCFS: Doubly Robust Causal Feature Selection0Ashkan Soleymani, Cristian R. Rojas, Francesco Quinzan, Patrick Jaillet, Stefan Bauer
1182Robust Speech Recognition via Large-Scale Weak Supervision0Alec Radford, Christine McLeavey, Greg Brockman, Ilya Sutskever, Jong Wook Kim, Tao Xu
1183Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation0Drew Penney, Lizhong Chen, Matthew Raffel
1184Sequential Multi-Dimensional Self-Supervised Learning for Clinical Time Series0Aniruddh Raghu, Collin M. Stultz, John V. Guttag, Payal Chandak, Ridwan Alam
1185Recovery Bounds on Class-Based Optimal Transport: A Sum-of-Norms Regularization Framework0Arman Rahbar, Ashkan Panahi, Devdatt P. Dubhashi, Hamid Krim, Morteza Haghir Chehreghani
1186Algorithmic Stability of Heavy-Tailed SGD with General Loss Functions0Anant Raj, Lingjiong Zhu, Mert Gürbüzbalaban, Umut Simsekli
1187Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels0Aaron C. Courville, Alexandre Lacoste, Alexandre Piché, Bart Dhoedt, Pietro Mazzaglia, Sai Rajeswar, Tim Verbelen
1188SpotEM: Efficient Video Search for Episodic Memory0Kristen Grauman, Santhosh Kumar Ramakrishnan, Ziad AlHalah
1189How much does Initialization Affect Generalization?0Hemanth Saratchandran, Lachlan Ewen MacDonald, Moshiur R. Farazi, Sameera Ramasinghe, Simon Lucey
1190Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization0Alexandre Ramé, David LopezPaz, Jianyu Zhang, Kartik Ahuja, Léon Bottou, Matthieu Cord
1191A Picture of the Space of Typical Learnable Tasks0Han Kheng Teoh, Itay Griniasty, James P. Sethna, Jialin Mao, Mark K. Transtrum, Pratik Chaudhari, Rahul Ramesh, Rubing Yang
1192Policy Regularization with Dataset Constraint for Offline Reinforcement Learning0Fuxiang Zhang, Yang Yu, YiChen Li, Yuhang Ran, Zongzhang Zhang
1193SpENCNN: Orchestrating Encoding and Sparsity for Fast Homomorphically Encrypted Neural Network Inference0Caiwen Ding, Gang Quan, Ran Ran, Tao Liu, Wei Wang, Wujie Wen, Xiaolin Xu, Xinwei Luo
1194Feature learning in deep classifiers through Intermediate Neural Collapse0Akshay Rangamani, Marius Lindegaard, Tomaso A. Poggio, Tomer Galanti
1195The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning0Finale DoshiVelez, Sarah Rathnam, Sonali Parbhoo, Susan A. Murphy, Weiwei Pan
1196Beam Tree Recursive Cells0Cornelia Caragea, Jishnu Ray Chowdhury
1197Monotonic Location Attention for Length Generalization0Cornelia Caragea, Jishnu Ray Chowdhury
1198Automated Search for Conjectures on Mathematical Constants using Analysis of Integer Sequences0Dan Carmon, Ido Kaminer, Ofir David, Ofir Razon, Shahar Gottlieb, Yoav Harris
1199Neural networks trained with SGD learn distributions of increasing complexity0Alessandro Ingrosso, Maria Refinetti, Sebastian Goldt
1200Simplex Random Features0Adrian Weller, Isaac Reid, Krzysztof Marcin Choromanski, Valerii Likhosherstov
1201Bayesian Neural Networks Avoid Encoding Complex and Perturbation-Sensitive Concepts0Huiqi Deng, Qihan Ren, Quanshi Zhang, Siyu Lou, Yunuo Chen
1202Escaping saddle points in zeroth-order optimization: the power of two-point estimators0Na Li, Yujie Tang, Zhaolin Ren
1203Dimension-independent Certified Neural Network Watermarks via Mollifier Smoothing0Da Yan, Jiaxiang Ren, Jiayin Jin, Lingjuan Lyu, Yang Zhou
1204Feature Programming for Multivariate Time Series Prediction0Alex Daniel Reneau, Ammar Gilani, Han Liu, Jerry YaoChieh Hu
1205Run-off Election: Improved Provable Defense against Data Poisoning Attacks0Atoosa Malemir Chegini, Keivan Rezaei, Kiarash Banihashem, Soheil Feizi
1206Learning Control-Oriented Dynamical Structure from Data0JeanJacques E. Slotine, Marco Pavone, Navid Azizan, Spencer M. Richards
1207The Edge of Orthogonality: A Simple View of What Makes BYOL Tick0Allison C. Tam, Bilal Piot, Felix Hill, Florian Strub, Pierre Harvey Richemond, Yunhao Tang
1208Multi-Agent Best Arm Identification with Private Communications0Alexandre Rio, Igor Colin, Marta Soare, Merwan Barlier
1209A Two-Stage Active Learning Algorithm for k-Nearest Neighbors0Kamalika Chaudhuri, Nicholas Rittler
1210Lowering the Pre-training Tax for Gradient-based Subset Training: A Lightweight Distributed Pre-Training Toolkit0Aditya Akella, Vijay Chidambaram, Yeonju Ro, Zhangyang Wang
1211The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning0Adam Golinski, Arno Blaas, Borja Rodríguez Gálvez, Dan Busbridge, Jason Ramapuram, Luca Zappella, Pau Rodríguez, Xavier Suau
1212RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents0Benjamin Adin Spiegel, George Konidaris, Jennifer Wang, Rafael RodríguezSánchez, Roma Patel, Stefanie Tellex
1213Improving Fair Training under Correlation Shifts0Changho Suh, Kangwook Lee, Steven Euijong Whang, Yuji Roh
1214The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation0Clare Lyle, Marc G. Bellemare, Mark Rowland, Rémi Munos, Will Dabney, Yunhao Tang
1215Robust Satisficing MDPs0Chin Pang Ho, Haolin Ruan, Siyu Zhou, Zhi Chen
1216Infinite Action Contextual Bandits with Reusable Data Exhaust0Mark Rucker, Paul Mineiro, Yinglun Zhu
1217Function-Space Regularization in Neural Networks: A Probabilistic Perspective0Andrew Gordon Wilson, Sanyam Kapoor, Shikai Qiu, Tim G. J. Rudner
1218A New PHO-rmula for Improved Performance of Semi-Structured Networks0David Rügamer
1219Geometric Clifford Algebra Networks0David Ruhe, Jayesh K. Gupta, Johannes Brandstetter, Max Welling, Steven De Keninck
1220Constrained Monotonic Neural Networks0Davor Runje, Sharath M. Shankaranarayana
1221Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models0Anders Søgaard, Phillip Rust
1222Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs0Raif M. Rustamov, Subhabrata Majumdar
1223SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient0Alexander Borzunov, Max Ryabinin, Michael Diskin, Tim Dettmers
1224Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles0Arkabandhu Chowdhury, Chaitanya Ryali, Chen Wei, Christoph Feichtenhofer, Daniel Bolya, Haoqi Fan, Jitendra Malik, Judy Hoffman, Omid Poursaeed, PoYao Huang, Vaibhav Aggarwal, Yanghao Li, YuanTing Hu
1225End-to-End Learning for Stochastic Optimization: A Bayesian Perspective0Daniel Kuhn, Tobias Sutter, Yves Rychener
1226Sequential Monte Carlo Learning for Time Series Structure Discovery0Brian Patton, Feras Saad, Matthew Douglas Hoffman, Rif A. Saurous, Vikash Mansinghka
1227Active Ranking of Experts Based on their Performances in Many Tasks0Alexandra Carpentier, El Mehdi Saad, Nicolas Verzelen
1228Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes0Abolfazl S. Motahari, Amir Najafi, Babak H. Khalaj, Seyed Amir Hossein Saberi
1229Global Selection of Contrastive Batches via Optimization on Sample Permutations0Chenguang Zhu, Vin Sachidananda, Ziyi Yang
1230High-Probability Bounds for Stochastic Optimization and Variational Inequalities: the Case of Unbounded Variance0Abdurakhmon Sadiev, Alexander V. Gasnikov, Eduard Gorbunov, Gauthier Gidel, Marina Danilova, Pavel E. Dvurechensky, Peter Richtárik, Samuel Horváth
1231End-to-end Differentiable Clustering with Associative Memories0Bishwajit Saha, Dmitry Krotov, Mohammed J. Zaki, Parikshit Ram
1232Learning to Suggest Breaks: Sustainable Optimization of Long-Term User Engagement0Eden Saig, Nir Rosenfeld
1233Multi-class Graph Clustering via Approximated Effective p-Resistance0Mark Herbster, Shota Saito
1234Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling0Qingyang Ren, Thorsten Joachims, Yuta Saito
1235Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster L-/L♮-Convex Function Minimization0Shinsaku Sakaue, Taihei Oki
1236PAC-Bayesian Offline Contextual Bandits With Guarantees0Nicolas Chopin, Otmane Sakhi, Pierre Alquier
1237Provably and Practically Efficient Neural Contextual Bandits0Sudeep Salgia
1238Distributed Linear Bandits under Communication Constraints0Qing Zhao, Sudeep Salgia
1239Optimizing Hyperparameters with Conformal Quantile Regression0Aaron Klein, Cédric Archambeau, David Salinas, Jacek Golebiowski, Matthias W. Seeger
1240Raising the Cost of Malicious AI-Powered Image Editing0Alaa Khaddaj, Aleksander Madry, Andrew Ilyas, Guillaume Leclerc, Hadi Salman
1241Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective0Gabriel Peyré, Joan Puigcerver, Josip Djolonga, Mathieu Blondel, Michael Eli Sander
1242TAN Without a Burn: Scaling Laws of DP-SGD0Alexandre Sablayrolles, Pierre Stock, Tom Sander
1243Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models0Abir De, Arjun Shashank Kashettiwar, Bhuvan Reddy Gangula, Durga Sivasubramanian, Ganesh Ramakrishnan, Parth Vipul Sangani, Pritish Chakraborty, Rishabh K. Iyer
1244Whose Opinions Do Language Models Reflect?0Cinoo Lee, Esin Durmus, Faisal Ladhak, Percy Liang, Shibani Santurkar, Tatsunori Hashimoto
1245Streaming Active Learning with Deep Neural Networks0Akanksha Saran, Akshay Krishnamurthy, John Langford, Jordan T. Ash, Safoora Yousefi
1246Random Teachers are Good Teachers0Felix Sarnthein, Gregor Bachmann, Sotiris Anagnostidis, Thomas Hofmann
1247Posterior Sampling for Deep Reinforcement Learning0Michelangelo Conserva, Paulo E. Rauber, Remo Sasso
1248Graph Neural Networks can Recover the Hidden Features Solely from the Graph Structure0Ryoma Sato
1249Existence and Estimation of Critical Batch Size for Training Generative Adversarial Networks with Two Time-Scale Update Rule0Hideaki Iiduka, Naoki Sato
1250StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis0Andreas Geiger, Axel Sauer, Samuli Laine, Tero Karras, Timo Aila
1251Facial Expression Recognition with Adaptive Frame Rate based on Multiple Testing Correction0Andrey V. Savchenko
1252Off-Policy Average Reward Actor-Critic with Deterministic Policy Search0Naman Saxena, Shalabh Bhatnagar, Shishir Kolathaya, Subhojyoti Khastagir
1253Gibbsian Polar Slice Sampling0Daniel Rudolf, Michael Habeck, Philip Schär
1254Identifiability and Generalizability in Constrained Inverse Reinforcement Learning0Andreas Schlaginhaufen, Maryam Kamgarpour
1255Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks0Daniel Cremers, Dominik Schnaus, Jongseok Lee, Rudolph Triebel
1256Deterministic equivalent and error universality of deep random features learning0Bruno Loureiro, Daniil Dmitriev, Dominik Schröder, Hugo Cui
1257The Acquisition of Physical Knowledge in Generative Neural Networks0Eric Schulz, Luca M. Schulze Buschoff, Marcel Binz
1258Modality-Agnostic Variational Compression of Implicit Neural Representations0Jaeho Lee, Jihoon Tack, Jinwoo Shin, Jonathan Richard Schwarz, Yee Whye Teh
1259Bigger, Better, Faster: Human-level Atari with human-level efficiency0Aaron C. Courville, Johan S. ObandoCeron, Marc G. Bellemare, Max Schwarzer, Pablo Samuel Castro, Rishabh Agarwal
1260Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning0Antonio Sclocchi, Mario Geiger, Matthieu Wyart
1261A Fast Optimistic Method for Monotone Variational Inequalities0DangKhoa Nguyen, Michael Sedlmayer, Radu Ioan Bot
1262Double-Weighting for Covariate Shift Adaptation0Anqi Liu, José Ignacio SegoviaMartín, Santiago Mazuelas
1263Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language0Andreu Vall, Günter Klambauer, Philipp Seidl, Sepp Hochreiter
1264Variational Autoencoding Neural Operators0George J. Pappas, Georgios Kissas, Jacob H. Seidman, Paris Perdikaris
1265Neural Markov Jump Processes0Patrick Seifner, Ramsés J. Sánchez
1266Bayesian online change point detection with Hilbert space approximate Student-t process0Jeremy Sellier, Petros Dellaportas
1267Incentivizing Exploration with Linear Contexts and Combinatorial Actions0Mark Sellke
1268Explainability as statistical inference0Damien Garreau, Hugo Henri Joseph Senetaire, Jes Frellsen, PierreAlexandre Mattei
1269Multi-View Masked World Models for Visual Robotic Manipulation0Jinwoo Shin, Junsu Kim, Kimin Lee, Pieter Abbeel, Stephen James, Younggyo Seo
1270One-Shot Compression of Large Edge-Exchangeable Graphs using Bits-Back Coding0Alireza Makhzani, Ashish J. Khisti, Daniel Severo, James Townsend
1271ModelDiff: A Framework for Comparing Learning Algorithms0Aleksander Madry, Andrew Ilyas, Harshay Shah, Sung Min Park
1272Auxiliary Learning as an Asymmetric Bargaining Game0Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Kenji Kawaguchi, Neta Glazer
1273Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models0Minlie Huang, Nan Duan, Weizhu Chen, Yelong Shen, Yeyun Gong, Zhihong Shao
1274Complementary Attention for Multi-Agent Reinforcement Learning0Chang Liu, Hongchang Zhang, Jianzhun Shao, Shuncheng He, Xiangyang Ji, Yuhang Jiang, Yun Qu
1275Regularization-free Diffeomorphic Temporal Alignment Nets0Oren Freifeld, Ron Shapira Weber
1276Toward Efficient Gradient-Based Value Estimation0Arsalan Sharifnassab, Richard S. Sutton
1277Coin Sampling: Gradient-Based Bayesian Inference without Learning Rates0Christopher Nemeth, Louis Sharrock
1278On Kinetic Optimal Probability Paths for Generative Models0Matthew Le, Maximilian Nickel, Neta Shaul, Ricky T. Q. Chen, Yaron Lipman
1279Sequential Changepoint Detection via Backward Confidence Sequences0Aaditya Ramdas, Shubhanshu Shekhar
1280Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator0Alexander Shekhovtsov
1281Towards Understanding and Improving GFlowNet Training0Andreas Loukas, Ehsan Hajiramezanali, Emmanuel Bengio, Kyunghyun Cho, Max W. Shen, Tommaso Biancalani
1282On Balancing Bias and Variance in Unsupervised Multi-Source-Free Domain Adaptation0Gregory W. Wornell, Maohao Shen, Yuheng Bu
1283On Penalty-based Bilevel Gradient Descent Method0Han Shen, Tianyi Chen
1284Non-autoregressive Conditional Diffusion Models for Time Series Prediction0James T. Kwok, Lifeng Shen
1285Cross-Modal Fine-Tuning: Align then Refine0Ameet Talwalkar, Corey Staten, Graham Neubig, Junhong Shen, Liam Li, Lucio M. Dery, Mikhail Khodak
1286Auxiliary Modality Learning with Generalized Curriculum Distillation0Ming C. Lin, Peng Gao, Xijun Wang, Yu Shen
1287TGRL: An Algorithm for Teacher Guided Reinforcement Learning0Aviv Tamar, Idan Shenfeld, Pulkit Agrawal, ZhangWei Hong
1288FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU0Beidi Chen, Binhang Yuan, Ce Zhang, Christopher Ré, Ion Stoica, Lianmin Zheng, Max Ryabinin, Percy Liang, Ying Sheng, Zhuohan Li
1289Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation0Tomer Koren, Uri Sherman, Yishay Mansour
1290Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods0Aleksandr Shevchenko, Hamed Hassani, Kevin Kögler, Marco Mondelli
1291Large Language Models Can Be Easily Distracted by Irrelevant Context0David Dohan, Denny Zhou, Ed H. Chi, Freda Shi, Kanishka Misra, Nathan Scales, Nathanael Schärli, Xinyun Chen
1292Everyone's Preference Changes Differently: A Weighted Multi-Interest Model For Retrieval0Bo Zhao, Hui Shi, Jishen Zhao, Sicun Gao, Yitong Zhou, Yupeng Gu
1293A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints0Ming Shi, Ness B. Shroff, Yingbin Liang
1294Improving the Model Consistency of Decentralized Federated Learning0Bo Yuan, Dacheng Tao, Kang Wei, Li Shen, Xueqian Wang, Yan Sun, Yifan Shi
1295UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers0Chaofan Tao, Chun Yuan, Dachuan Shi, Jiaqi Wang, Ying Jin, Zhendong Yang
1296Sequence Modeling with Multiresolution Convolutional Memory0Emily B. Fox, Jiaxin Shi, Ke Alexander Wang
1297Statistical Inference on Multi-armed Bandits with Delayed Feedback0Jingshen Wang, Lei Shi, Tianhao Wu
1298Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources0Chengshuai Shi, Cong Shen, Jing Yang, Wei Xiong
1299On the Complexity of Bayesian Generalization0John E. Hopcroft, Joshua B. Tenenbaum, Kun He, Manjie Xu, SongChun Zhu, Wenjuan Han, Ying Nian Wu, Yixin Zhu, YuZhe Shi
1300Understanding and Generalizing Contrastive Learning from the Inverse Optimal Transport Perspective0Gu Zhang, Haoyu Zhen, Jintao Fan, Junchi Yan, Liangliang Shi
1301Long Horizon Temperature Scaling0Andy Shih, Dorsa Sadigh, Stefano Ermon
1302Gradient Descent in Neural Networks as Sequential Learning in Reproducing Kernel Banach Space0Alistair Shilton, Santu Rana, Sunil Gupta, Svetha Venkatesh
1303SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning0Dongseok Shim, H. Jin Kim, Seungjae Lee
1304A Closer Look at the Intervention Procedure of Concept Bottleneck Models0Namhoon Lee, Sungbin Shin, Sungsoo Ahn, Yohan Jo
1305MetricGAN-OKD: Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement0Byung Hoon Lee, Hyun Joon Park, Jin Sob Kim, Sung Won Han, Wooseok Shin
1306Improved Learning-Augmented Algorithms for the Multi-Option Ski Rental Problem via Best-Possible Competitive Analysis0Changyeol Lee, Gukryeol Lee, HyungChan An, Yongho Shin
1307One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill0Daehee Lee, Honguk Woo, Minjong Yoo, Sangwoo Shin, Woo Kyung Kim
1308Context Consistency Regularization for Label Sparsity in Time Series0Byung Suk Lee, Byunghyun Kim, Dongmin Park, Hwanjun Song, JaeGil Lee, Susik Yoon, Yooju Shin
1309Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting0Mark Crowley, Oliver Schulte, Shayan Shirahmad Gale Bagi, Zahra Gharaee
1310Exphormer: Sparse Transformers for Graphs0Ali Kemal Sinop, Ameya Velingker, Balaji Venkatachalam, Danica J. Sutherland, Hamed Shirzad
1311Synthetic data for model selection0Alon Shoshan, Gérard G. Medioni, Igor Kviatkovsky, Matan Fintz, Nadav Bhonker
1312Probabilistic Attention-to-Influence Neural Models for Event Sequences0Debarun Bhattacharjya, Dharmashankar Subramanian, Kristin P. Bennett, Oktie Hassanzadeh, Tian Gao, Xiao Shou
1313Causal Bounds in Quasi-Markovian Graphs0Garud Iyengar, Madhumitha Shridharan
1314Repository-Level Prompt Generation for Large Language Models of Code0Daniel Tarlow, Disha Shrivastava, Hugo Larochelle
1315CLIPood: Generalizing CLIP to Out-of-Distributions0Jialong Wu, Jianmin Wang, Mingsheng Long, Ximei Wang, Xingzhuo Guo, Yang Shu
1316Semi-Autoregressive Energy Flows: Exploring Likelihood-Free Training of Normalizing Flows0Phillip Si, Subham Sekhar Sahoo, Volodymyr Kuleshov, Yair Schiff, Zeyi Chen
1317Unearthing InSights into Mars: Unsupervised Source Separation with Limited Data0Ali Siahkoohi, Erwan Allys, Grégory Sainton, Maarten V. de Hoop, Rudy Morel, Taichi Kawamura
1318Quantitative Universal Approximation Bounds for Deep Belief Networks0Johann Gehringer, Julian Sieber
1319Pricing Experimental Design: Causal Effect, Expected Revenue and Tail Risk0Chonghuan Wang, David SimchiLevi
1320Statistical Learning under Heterogenous Distribution Shift0Akshay Krishnamurthy, Anurag Ajay, Max Simchowitz, Pulkit Agrawal
1321On the Stepwise Nature of Self-Supervised Learning0Abraham J. Fetterman, Daniel Geisz, James B. Simon, Joshua Albrecht, Liu Ziyin, Maksis Knutins
1322Hindsight Learning for MDPs with Exogenous Inputs0Adith Swaminathan, ChingAn Cheng, Felipe Vieira Frujeri, Hugo de Oliveira Barbalho, Ishai Menache, Jennifer Neville, Jingling Li, Luke Marshall, Sean R. Sinclair
1323Text-To-4D Dynamic Scene Generation0Adam Polyak, Andrea Vedaldi, Devi Parikh, Filippos Kokkinos, Iurii Makarov, Justin Johnson, Naman Goyal, Oron Ashual, Shelly Sheynin, Uriel Singer, Yaniv Taigman
1324The Hessian perspective into the Nature of Convolutional Neural Networks0Bernhard Schölkopf, Sidak Pal Singh, Thomas Hofmann
1325When do Minimax-fair Learning and Empirical Risk Minimization Coincide?0Chris Russell, Harvineet Singh, Matthäus Kleindessner, Rumi Chunara, Volkan Cevher
1326Differentiable Simulations for Enhanced Sampling of Rare Events0Johannes C. B. Dietschreit, Lukás Grajciar, Martin Sípka, Rafael GómezBombarelli
1327Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems0Chawin Sitawarin, Florian Tramèr, Nicholas Carlini
1328Invariance in Policy Optimisation and Partial Identifiability in Reward Learning0Adam Gleave, Alessandro Abate, Joar Max Viktor Skalse, Matthew FarrugiaRoberts, Stuart Russell
1329A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems0David Henry Mguni, Jun Wang, Oliver Slumbers, Stefano B. Blumberg, Stephen Marcus McAleer, Yaodong Yang
1330On the Effectiveness of Offline RL for Dialogue Response Generation0Ethan R. Elenberg, Felix Wu, Kilian Q. Weinberger, Paloma Sodhi, Ryan McDonald
1331Fair Densities via Boosting the Sufficient Statistics of Exponential Families0Alexander Soen, Hisham Husain, Richard Nock
1332The Dormant Neuron Phenomenon in Deep Reinforcement Learning0Ghada Sokar, Pablo Samuel Castro, Rishabh Agarwal, Utku Evci
1333Abstracting Imperfect Information Away from Two-Player Zero-Sum Games0Chun Kai Ling, David J. Wu, J. Zico Kolter, Noam Brown, Ryan D'Orazio, Samuel Sokota
1334Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization0Hyeonah Kim, Jinkyoo Park, Jiwoo Son, Minsu Kim
1335Consistency Models0Ilya Sutskever, Mark Chen, Prafulla Dhariwal, Yang Song
1336LipsNet: A Smooth and Robust Neural Network with Adaptive Lipschitz Constant for High Accuracy Optimal Control0Bo Cheng, Bo Zhang, Chen Chen, Jingliang Duan, Junqing Wei, Shengbo Eben Li, Wenxuan Wang, Xiaoming Simon Wang, Xujie Song
1337Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations0Cairong Zhao, Guosheng Hu, Xiao Gong, Zifan Song
1338Latent Traversals in Generative Models as Potential Flows0Max Welling, Nicu Sebe, T. Anderson Keller, Yue Song
1339FedAvg Converges to Zero Training Loss Linearly for Overparameterized Multi-Layer Neural Networks0Bingqing Song, Jinfeng Yi, Mingyi Hong, Prashant Khanduri, Xinwei Zhang
1340RGE: A Repulsive Graph Rectification for Node Classification via Influence0Eunho Yang, Jaeyun Song, Sungyub Kim
1341Importance Weighted Expectation-Maximization for Protein Sequence Design0Lei Li, Zhenqiao Song
1342Sketching for First Order Method: Efficient Algorithm for Low-Bandwidth Channel and Vulnerability0Lichen Zhang, Yitan Wang, Zhao Song, Zheng Yu
1343Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance0Lichen Zhang, Xin Yang, Yuanyuan Yang, Zhao Song
1344A Nearly-Optimal Bound for Fast Regression with ℓ∞ Guarantee0Junze Yin, Lichen Zhang, Mingquan Ye, Zhao Song
1345Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation0Arash Vahdat, Hongxu Yin, Jan Kautz, Jiaming Song, MingYu Liu, Morteza Mardani, Qinsheng Zhang, Yongxin Chen
1346Differentiable Tree Operations Promote Compositional Generalization0Edward J. Hu, Jianfeng Gao, Kate McCurdy, Paul Smolensky, Paul Soulos, Roland Fernandez, Yunmo Chen
1347Are labels informative in semi-supervised learning? Estimating and leveraging the missing-data mechanism0Aude Sportisse, Charles Bouveyron, Hugo Schmutz, Olivier Humbert, PierreAlexandre Mattei
1348Linear Causal Disentanglement via Interventions0Anna Seigal, Caroline Uhler, Chandler Squires, Salil S. Bhate
1349Generating Language Corrections for Teaching Physical Control Tasks0Dorsa Sadigh, Megha Srivastava, Noah D. Goodman
1350FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels0Alexandre Gramfort, Cédric Allain, Guillaume Staerman, Thomas Moreau
1351Partial Optimality in Cubic Correlation Clustering0Bjoern Andres, David Stein, Silvia Di Gregorio
1352MODeL: Memory Optimizations for Deep Learning0Benoit Steiner, Jacob Kahn, James Hegarty, Mostafa Elhoushi
1353Improving Expert Predictions with Conformal Prediction0Eleni Straitouri, Lequn Wang, Manuel Gomez Rodriguez, Nastaran Okati
1354Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers0Ariya Rastrow, Athanasios Mouchtaris, Brian John King, Grant P. Strimel, Martin Radfar, Yi Xie
1355Kernel QuantTree0Diego Stucchi, Giacomo Boracchi, Nicolò Folloni, Paolo Rizzo
1356Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes0Bjoern H. Menze, Johannes C. Paetzold, Nico Stucki, Suprosanna Shit, Ulrich Bauer
1357Towards Robust Graph Incremental Learning on Evolving Graphs0Chuan Wu, Difan Zou, Junwei Su, Zijun Zhang
1358DUET: 2D Structured and Approximately Equivariant Representations0Arno Blaas, Chen Huang, Dan Busbridge, Federico Danieli, Jason Ramapuram, Luca Zappella, T. Anderson Keller, Xavier Suau
1359Long-Tailed Recognition by Mutual Information Maximization between Latent Features and Ground-Truth Labels0MinKook Suh, SeungWoo Seo
1360Adversarial Learning of Distributional Reinforcement Learning0Fan Zhou, Hongtu Zhu, Yang Sui, Yukun Huang
1361Distilling Internet-Scale Vision-Language Models into Embodied Agents0Arun Ahuja, Ishita Dasgupta, Kenneth Marino, Rob Fergus, Theodore R. Sumers
1362Vector-Valued Control Variates0Alessandro Barp, FrançoisXavier Briol, Zhuo Sun
1363MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks0Cees G. M. Snoek, Fan Wang, Ling Wang, Wenfang Sun, Xiantong Zhen, Yingjun Du
1364Revisiting Sampling for Combinatorial Optimization0Azade Nova, Dale Schuurmans, Hanjun Dai, Haoran Sun, Katayoon Goshvadi
1365What Makes Entities Similar? A Similarity Flooding Perspective for Multi-sourced Knowledge Graph Embeddings0Jiacheng Huang, Qijin Chen, Wei Hu, Weijun Ren, Xiaozhou Xu, Zequn Sun
1366Maximum Optimality Margin: A Unified Approach for Contextual Linear Programming and Inverse Linear Programming0Chunlin Sun, Shang Liu, Xiaocheng Li
1367Tensor Gaussian Process with Contraction for Multi-Channel Imaging Analysis0Hu Sun, Meng Jin, Ward Manchester, Yang Chen, Yang Liu
1368MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior0Alice Robie, Andrew Wesley Ulmer, Ann Kennedy, Brian Geuther, Catherine E. Schretter, Chao Sun, Dipam Chakraborty, Edward Hayes, Erik Werner, Heng Jia, Jennifer J. Sun, Joseph Parker, Julian Morgan Wagner, Keith Sheppard, Kristin Branson, Markus Marks, Milan Peelman, Param Uttarwar, Pietro Perona, Sebastian Oleszko, Vivek Kumar, Yisong Yue, Zachary Partridge
1369Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape0Dacheng Tao, Li Shen, Liang Ding, Shixiang Chen, Yan Sun
1370When and How Does Known Class Help Discover Unknown Ones? Provable Understanding Through Spectral Analysis0Yingyu Liang, Yixuan Li, Yiyou Sun, Zhenmei Shi
1371Learning Prescriptive ReLU Networks0Asterios Tsiourvas, Wei Sun
1372All in a Row: Compressed Convolution Networks for Graphs0Junshu Sun, Qingming Huang, Shuhui Wang, Xinzhe Han, Zhe Xue
1373Momentum Ensures Convergence of SIGNSGD under Weaker Assumptions0Bao Wang, Dongsheng Li, Qingsong Wang, Tao Sun
1374A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification0Chaowei Xiao, Jiachen Sun, Jiongxiao Wang, Weili Nie, Zhiding Yu, Zhuoqing Mao
1375SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation0Jia Jia, Junliang Xing, Longhui Wei, Qi Tian, Shikun Sun
1376A Neural PDE Solver with Temporal Stencil Modeling0Shinjae Yoo, Yiming Yang, Zhiqing Sun
1377Feature Expansion for Graph Neural Networks0Guangyi Chen, Jiaqi Sun, Kun Zhang, Lin Zhang, Peng Xu, Yujiu Yang
1378Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning0Chengxing Jia, Haoxin Lin, Jiaji Zhang, Junyin Ye, Yang Yu, Yihao Sun
1379Inflow, Outflow, and Reciprocity in Machine Learning0Mukund Sundararajan, Walid Krichene
1380When Personalization Harms Performance: Reconsidering the Use of Group Attributes in Prediction0Berk Ustun, Marzyeh Ghassemi, Vinith Menon Suriyakumar
1381Tuning Computer Vision Models With Task Rewards0Alexander Kolesnikov, André Susano Pinto, Lucas Beyer, Xiaohua Zhai, Yuge Shi
1382Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic0Alec Koppel, Amrit S. Bedi, Bhrij Patel, Brian M. Sadler, Dinesh Manocha, Wesley A. Suttle
1383Tight and fast generalization error bound of graph embedding in metric space0Atsushi Nitanda, Atsushi Suzuki, Feng Tian, Jing Wang, Kenji Yamanishi, Taiji Suzuki
1384Proximal Causal Learning of Conditional Average Treatment Effects0Erik Sverdrup, Yifan Cui
1385Inverse Reinforcement Learning without Reinforcement Learning0David Wu, Drew Bagnell, Gokul Swamy, Sanjiban Choudhury, Zhiwei Steven Wu
1386Von Mises Mixture Distributions for Molecular Conformation Generation0Eric M. Jonas, Jake Lawrence Williams, Kirk Swanson
1387Optimal randomized multilevel Monte Carlo for repeatedly nested expectations0Guanyang Wang, Yasa Syed
1388Adaptive Coordination in Social Embodied Rearrangement0Akshara Rai, Andrew Szot, Dhruv Batra, Ruta Desai, Unnat Jain, Zsolt Kira
1389MG-GNN: Multigrid Graph Neural Networks for Learning Multilevel Domain Decomposition Methods0Ali Taghibakhshi, Luke N. Olson, Matthew West, Nicolas Nytko, Scott P. MacLachlan, Tareq Uz Zaman
1390Learning Mixtures of Gaussians with Censored Data0Bryon Aragam, Wai Ming Tai
1391Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input0Shokichi Takakura, Taiji Suzuki
1392Learning Neural PDE Solvers with Parameter-Guided Channel Attention0Francesco Alesiani, Makoto Takamoto, Mathias Niepert
1393Contextual Conservative Interleaving Bandits0Kei Takemura
1394Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian Regret Bounds0Masayuki Karasuyama, Shion Takeno, Yu Inatsu
1395Towards Practical Preferential Bayesian Optimization with Skew Gaussian Processes0Masahiro Nomura, Masayuki Karasuyama, Shion Takeno
1396Robust Explanation for Free or At the Cost of Faithfulness0Yang Tian, Zeren Tan
1397Provably Invariant Learning without Domain Information0Chao Qu, Lin Yong, Peng Cui, Shengyu Zhu, Xiaoyu Tan, Xihe Qiu, Yinghui Xu, Yuan Qi
1398Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning0Binhang Yuan, Chris Jermaine, Daniel Bourgeois, Dimitrije Jankov, Yuxin Tang, Zhimin Ding
1399Regret-Minimizing Double Oracle for Extensive-Form Games0Le Cong Dinh, Stephen Marcus McAleer, Xiaohang Tang, Yaodong Yang
1400From Perception to Programs: Regularize, Overparameterize, and Amortize0Hao Tang, Kevin Ellis
1401Understanding Self-Predictive Learning for Reinforcement Learning0András György, Bernardo Ávila Pires, Bilal Piot, Charline Le Lan, Clare Lyle, Daniele Calandriello, Mark Rowland, Michal Valko, Mohammad Gheshlaghi Azar, Pierre Harvey Richemond, Rémi Munos, Shantanu Thakoor, Will Dabney, Yash Chandak, Yunhao Tang, Zhaohan Daniel Guo
1402DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm0Anna Harutyunyan, Bernardo Ávila Pires, Mark Rowland, Michal Valko, Rémi Munos, Tadashi Kozuno, Yunhao Tang
1403Towards Understanding Generalization of Graph Neural Networks0Huayi Tang, Yong Liu
1404Towards a better understanding of representation dynamics under TD-learning0Rémi Munos, Yunhao Tang
1405VA-learning as a more efficient alternative to Q-learning0Mark Rowland, Michal Valko, Rémi Munos, Yunhao Tang
1406Defects of Convolutional Decoder Networks in Frequency Representation0Ling Tang, Quanshi Zhang, Wen Shen, Yuefeng Chen, Zhanpeng Zhou
1407Difference-in-Differences Meets Tree-based Methods: Heterogeneous Treatment Effects Estimation with Unmeasured Confounding0Caizhi Tang, Huiyuan Wang, Jun Zhou, Longfei Li, Qing Cui, Xinyu Li
1408End-to-end Training of Deep Boltzmann Machines by Unbiased Contrastive Divergence with Local Mode Initialization0Masahiro Suzuki, Shohei Taniguchi, Yusuke Iwasawa, Yutaka Matsuo
1409POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models0Huangjie Zheng, Korawat Tanwisuth, Mingyuan Zhou, Pengcheng He, Shujian Zhang
1410Dual Focal Loss for Calibration0Chang Xu, Linwei Tao, Minjing Dong
1411Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization0Hao Su, Stone Tao, Tongzhou Mu, Xiaochen Li, Yuzhe Qin, Zhiao Huang
1412Data Feedback Loops: Model-driven Amplification of Dataset Biases0Rohan Taori, Tatsunori Hashimoto
1413Deep Regression Unlearning0Ayush Kumar Tarun, Mohan S. Kankanhalli, Murari Mandal, Vikram Singh Chundawat
1414How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control0J. Webster Stayman, Jacopo Teneggi, Jeremias Sulam, Matthew Tivnan
1415Concurrent Shuffle Differential Privacy Under Continual Observation0Haim Kaplan, Jay Tenenbaum, Uri Stemmer, Yishay Mansour
1416Finding Generalization Measures by Contrasting Signal and Noise0Bohang Zhang, Haowei He, Jiaye Teng, Ruichen Li, Yan Tian, Yang Yuan, Yequan Wang
1417Reinforcement Learning with History Dependent Dynamic Contexts0Craig Boutilier, Guy Tennenholtz, Lior Shani, Martin Mladenov, Nadav Merlis
1418PWSHAP: A Path-Wise Explanation Model for Targeted Variables0Christopher C. Holmes, Karla DiazOrdaz, Lucile TerMinassian, Oscar Clivio, Robin J. Evans
1419On the Estimation of Gaussian Mixture Copula Models0Ashutosh Tewari
1420Target-Aware Generative Augmentations for Single-Shot Adaptation0Jayaraman J. Thiagarajan, Kowshik Thopalli, Pavan K. Turaga, Rakshith Subramanyam
1421ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models0Jiwei Zhao, Qinglong Tian, Xin Zhang
1422Spherical Inducing Features for Orthogonally-Decoupled Gaussian Processes0Louis C. Tiao, Victor Picheny, Vincent Dutordoir
1423Fast Rates for Maximum Entropy Exploration0Alexey Naumov, Daniele Calandriello, Daniil Tiapkin, Denis Belomestny, Eric Moulines, Michal Valko, Pierre Ménard, Pierre Perrault, Rémi Munos, Yunhao Tang
1424Margin-based sampling in high dimensions: When being active is less efficient than staying passive0Alexandru Tifrea, Fanny Yang, Jacob Clarysse
1425Differentiable Multi-Target Causal Bayesian Experimental Design0Adam Foster, Andrew Jesson, Desi R. Ivanova, Panagiotis Tigas, Stefan Bauer, Yarin Gal, Yashas Annadani
1426PCA-based Multi-Task Learning: a Random Matrix Approach0Frédéric Pascal, Malik Tiomoko, Romain Couillet
1427Perturbation Analysis of Neural Collapse0Haoxiang Huang, Jonathan NilesWeed, Tom Tirer
1428Overcoming Simplicity Bias in Deep Networks using a Feature Sieve0Pradeep Shenoy, Rishabh Tiwari
1429Beyond In-Domain Scenarios: Robust Density-Aware Calibration0Christian Tomani, Daniel Cremers, Futa Kai Waseda, Yuesong Shen
1430Distribution Free Domain Generalization0He Li, Jialin Ding, Peifeng Tong, Song Xi Chen, Wu Su, Zhan Haoxiang
1431Extending Kernel PCA through Dualization: Sparsity, Robustness and Fast Algorithms0Alex Lambert, Francesco Tonin, Johan A. K. Suykens, Panagiotis Patrinos
1432Robust Weak Supervision with Variational Auto-Encoders0Francesco Tonolini, Gabriella Kazai, Nikolaos Aletras, Yunlong Jiao
1433Fully Bayesian Autoencoders with Latent Sparse Gaussian Processes0BaHien Tran, Babak Shahbaba, Maurizio Filippone, Stephan Mandt
1434Discrete Key-Value Bottleneck0Anirudh Goyal, Bernhard Schölkopf, Frederik Träuble, Kenji Kawaguchi, Michael Curtis Mozer, Nasim Rahaman, Yoshua Bengio
1435Mimetic Initialization of Self-Attention Layers0Asher Trockman, J. Zico Kolter
1436Representer Point Selection for Explaining Regularized High-dimensional Models0ChePing Tsai, ChoJui Hsieh, Eli Chien, HsiangFu Yu, Jiong Zhang, Pradeep Kumar Ravikumar
1437Expected Gradients of Maxout Networks and Consequences to Parameter Initialization0Guido Montúfar, Hanna Tseran
1438Provable Data Subset Selection For Efficient Neural Networks Training0Alaa Maalouf, Dan Feldman, Daniela Rus, Murad Tukan, Samson Zhou, Vladimir Braverman
1439Jump-Start Reinforcement Learning0Banghua Zhu, Chuyuan Fu, Cong Ma, Ikechukwu Uchendu, Jiantao Jiao, Joséphine Simon, Karol Hausman, Matthew Bennice, Mengyuan Yan, Sergey Levine, Ted Xiao, Yao Lu
1440Submodular Order Functions and Assortment Optimization0Rajan Udwani
1441Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings0Ayush Sekhari, Jason D. Lee, Masatoshi Uehara, Nathan Kallus, Wen Sun
1442From Adaptive Query Release to Machine Unlearning0Enayat Ullah, Raman Arora
1443Private Federated Learning with Autotuned Compression0Christopher A. ChoquetteChoo, Enayat Ullah, Peter Kairouz, Sewoong Oh
1444The Monge Gap: A Regularizer to Learn All Transport Maps0Marco Cuturi, Théo Uscidda
1445Semi-Dual Unbalanced Quadratic Optimal Transport: fast statistical rates and convergent algorithm0Adrien Vacher, FrançoisXavier Vialard
1446Random Grid Neural Processes for Parametric Partial Differential Equations0Arnaud Vadeboncoeur, Fehmi Cirak, Ieva Kazlauskaite, Mark Girolami, Yanni Papandreou, Ömer Deniz Akyildiz
1447Delayed Feedback in Kernel Bandits0Alberto Bernacchia, Ciara PikeBurke, Danyal Ahmed, Sattar Vakili
1448Synthetic Data, Real Errors: How (Not) to Publish and Use Synthetic Data0Boris van Breugel, Mihaela van der Schaar, Zhaozhi Qian
1449Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts0Ciara PikeBurke, Dirk van der Hoeven, Hao Qiu, Nicolò CesaBianchi
1450Causal Isotonic Calibration for Heterogeneous Treatment Effects0Alex Luedtke, Ernesto UlloaPérez, Lars van der Laan, Marco Carone
1451Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time0Alicia Curth, Mihaela van der Schaar, Toon Vanderschueren, Wouter Verbeke
1452Best Arm Identification in Multi-Agent Multi-Armed Bandits0Alexandre Proutière, Filippo Vannella, Jaeseong Jeong
1453Conditional Tree Matching for Inference-Time Adaptation of Tree Prediction Models0Abhijeet Awasthi, Harshit Varma, Sunita Sarawagi
1454Optimal LP Rounding and Linear-Time Approximation Algorithms for Clustering Edge-Colored Hypergraphs0Nate Veldt
1455Fast (1+ε)-Approximation Algorithms for Binary Matrix Factorization0Ameya Velingker, David P. Woodruff, Maximilian Vötsch, Samson Zhou
1456The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms0Aarti Singh, Anirudh Vemula, Drew Bagnell, Sanjiban Choudhury, Yuda Song
1457Learning the Right Layers a Data-Driven Layer-Aggregation Strategy for Semi-Supervised Learning on Multilayer Graphs0Andrea Cristofari, Francesco Rinaldi, Francesco Tudisco, Sara Venturini
1458Multi-Environment Pretraining Enables Transfer to Action Limited Datasets0David Venuto, Doina Precup, Igor Mordatch, Ofir Nachum, Pieter Abbeel, Sherry Yang
1459AbODE: Ab initio antibody design using conjoined ODEs0Markus Heinonen, Vikas Garg, Yogesh Verma
1460TabLeak: Tabular Data Leakage in Federated Learning0Dimitar Iliev Dimitrov, Mark Vero, Martin T. Vechev, Mislav Balunovic
1461Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single0Paul Vicol
1462Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models0Alexandre Tachard Passos, Luke Vilnis, Patrick Murray, Sumit Sanghai, Yury Zemlyanskiy
1463Eventual Discounting Temporal Logic Counterfactual Experience Replay0Abhinav Verma, Cameron Voloshin, Yisong Yue
1464Transformers Learn In-Context by Gradient Descent0Alexander Mordvintsev, Andrey Zhmoginov, Ettore Randazzo, Eyvind Niklasson, Johannes von Oswald, João Sacramento, Max Vladymyrov
1465Topological Singularity Detection at Multiple Scales0Bastian Rieck, Julius von Rohrscheidt
1466Improving l1-Certified Robustness via Randomized Smoothing by Leveraging Box Constraints0Matthias Hein, Václav Vorácek
1467Vector Quantized Wasserstein Auto-Encoder0Chuanxia Zheng, Dinh Q. Phung, He Zhao, Jianfei Cai, Long Tung Vuong, Mehrtash Harandi, Trung Le
1468Competitive Gradient Optimization0Abhijeet Vyas, Brian Bullins, Kamyar Azizzadenesheli
1469On Provable Copyright Protection for Generative Models0Boaz Barak, Nikhil Vyas, Sham M. Kakade
1470Leveraging Offline Data in Online Reinforcement Learning0Aldo Pacchiano, Andrew Wagenmaker
1471Fast Private Kernel Density Estimation via Locality Sensitive Quantization0Nina Mishra, Tal Wagner, Yonatan Naamad
1472Investigating the Role of Model-Based Learning in Exploration and Transfer0Ankesh Anand, Eszter Vértes, Gabriel DulacArnold, Jacob C. Walker, Jessica B. Hamrick, Theophane Weber, Yazhe Li
1473UPSCALE: Unconstrained Channel Pruning0Alvin Wan, David Güera, Hanxiang Hao, Kaushik Patnaik, Omer Hadad, Qi Shan, Yueyang Xu, Zhile Ren
1474Poisoning Language Models During Instruction Tuning0Alexander Wan, Dan Klein, Eric Wallace, Sheng Shen
1475SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models0DeChuan Zhan, Minghao Shao, Ruying Chen, Shenghua Wan, Yucen Wang
1476Multiplier Bootstrap-based Exploration0Branislav Kveton, Haoyu Wei, Rui Song, Runzhe Wan
1477Bandit Multi-linear DR-Submodular Maximization and Its Applications on Adversarial Submodular Bandits0Jialin Zhang, Wei Chen, Xiaoming Sun, Zhijie Zhang, Zongqi Wan
1478Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits0Chen Wang
1479Improved Active Multi-Task Representation Learning via Lasso0Kevin Jamieson, Simon Shaolei Du, Yifang Chen, Yiping Wang
1480Tilted Sparse Additive Models0Dacheng Tao, Fengxiang He, Hong Chen, Tieliang Gong, Weifeng Liu, Yingjie Wang, Youcheng Fu
1481From Hypergraph Energy Functions to Hypergraph Neural Networks0David Wipf, Quan Gan, Xipeng Qiu, Xuanjing Huang, Yuxin Wang
1482A Closer Look at Self-Supervised Lightweight Vision Transformers0Jin Gao, Shaoru Wang, Weiming Hu, Xiaoqin Zhang, Zeming Li
1483PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search0Ce Ge, Haibin Wang, Hesen Chen, Xiuyu Sun
1484Adversarial Policies Beat Superhuman Go AIs0Adam Gleave, Joseph Miller, Kellin Pelrine, Michael D. Dennis, Nora Belrose, Sergey Levine, Stuart Russell, Tom Tseng, Tony Tong Wang, Viktor Pogrebniak, Yawen Duan
1485On Regularization and Inference with Label Constraints0Dan Roth, Hangfeng He, Kaifu Wang, Piyush Kumar, Tin D. Nguyen
1486Policy Gradient in Robust MDPs with Global Convergence Guarantee0Chin Pang Ho, Marek Petrik, Qiuhao Wang
1487Adaptive Smoothing Gradient Learning for Spiking Neural Networks0Huajin Tang, Rui Yan, Runhao Jiang, Shuang Lian, Ziming Wang
1488CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling0Caihua Shan, Dongqi Han, Dongsheng Li, Kaitao Song, Kan Ren, Xinyang Jiang, Xufang Luo, Yansen Wang, Yifei Shen
1489Generalized Polyak Step Size for First Order Optimization with Momentum0Mikael Johansson, Tong Zhang, Xiaoyu Wang
1490Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR0Kaiwen Wang, Nathan Kallus, Wen Sun
1491FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization0Bolin Ding, Ce Zhang, Weirui Kuang, Yaliang Li, Zhen Wang
1492A/B Testing in Network Data with Covariate-Adaptive Randomization0Feifang Hu, Jialu Wang, Ping Li
1493Learning Belief Representations for Partially Observable Deep RL0Andrew C. Li, Andrew Wang, Rodrigo Toro Icarte, Sheila A. McIlraith, Toryn Q. Klassen
1494Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap0Hang Wang, Junshan Zhang, Sen Lin
1495Slot-VAE: Object-Centric Scene Generation with Slot Attention0Justin Dauwels, Letao Liu, Yanbo Wang
1496DIVISION: Memory Efficient Training via Dual Activation Precision0Guanchu Wang, Na Zou, Ninghao Liu, Xia Ben Hu, Zhimeng Jiang, Zirui Liu
1497CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks0Beidi Chen, Binhang Yuan, Ce Zhang, Christopher De Sa, Christopher Ré, Jue Wang, Percy Liang, Yucheng Lu
1498Magneto: A Foundation Transformer0Alon Benhaim, Barun Patra, Furu Wei, Hongyu Wang, Li Dong, Payal Bajaj, Saksham Singhal, Shaohan Huang, Shuming Ma, Vishrav Chaudhary, Wenhui Wang, Xia Song, Yu Wu, Zhiliang Peng, Zhun Liu
1499Direct Parameterization of Lipschitz-Bounded Deep Networks0Ian R. Manchester, Ruigang Wang
1500Tighter Information-Theoretic Generalization Bounds from Supersamples0Yongyi Mao, Ziqiao Wang
1501NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation0Daniela Massiceti, Jianfeng Wang, Thomas Lukasiewicz, Vladimir Pavlovic, Xiaolin Hu
1502GC-Flow: A Graph-Based Flow Network for Effective Clustering0Farzaneh Mirzazadeh, Jie Chen, Tianchun Wang, Xiang Zhang
1503Curriculum Co-disentangled Representation Learning across Multiple Environments for Social Recommendation0Chendi Ge, Hong Chen, Wenwu Zhu, Xin Wang, Yuwei Zhou, Zirui Pan
1504Data Efficient Neural Scaling Law via Model Reusing0Peihao Wang, Rameswar Panda, Zhangyang Wang
1505Deep Temporal Sets with Evidential Reinforced Attentions for Unique Behavioral Pattern Discovery0Deep Shankar Pandey, Dingrong Wang, Ervine Zheng, Krishna Prasad Neupane, Qi Yu, Zhi Zheng, Zhiwei Yu
1506Active Learning based Structural Inference0Aoran Wang, Jun Pang
1507Better Diffusion Models Further Improve Adversarial Training0Chao Du, Min Lin, Shuicheng Yan, Tianyu Pang, Weiwei Liu, Zekai Wang
1508Polarity Is All You Need to Learn and Transfer Faster0Ali Geisa, Eric W. Bridgeford, Joshua T. Vogelstein, Michael Alan Powell, Qingyang Wang
1509Projected Tensor Power Method for Hypergraph Community Recovery0Anthony ManCho So, Jinxin Wang, Peng Wang, Xiaolu Wang, YuenMan Pun
1510Estimating Possible Causal Effects with Latent Variables via Adjustment0Tian Qin, TianZuo Wang, ZhiHua Zhou
1511InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models0Aaron Gokaslan, Christopher De Sa, Fei Wang, Volodymyr Kuleshov, Weishen Pan, Yair Schiff, Yingheng Wang
1512A Robust Test for the Stationarity Assumption in Sequential Decision Making0Chengchun Shi, Jitao Wang, Zhenke Wu
1513GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models0Congjie He, Hanjing Wang, Jun Wang, Luo Mai, ManKit Sit, Weinan Zhang, Yaodong Yang, Ying Wen
1514Effective and Efficient Structural Inference with Reservoir Computing0Aoran Wang, Jun Pang, Tsz Pan Tong
1515Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning0Amy Zhang, Antonio Torralba, Phillip Isola, Tongzhou Wang
1516Model-Free Robust Average-Reward Reinforcement Learning0Alvaro Velasquez, Ashley PraterBennette, George K. Atia, Shaofeng Zou, Yue Wang
1517Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy0Furong Huang, Ruonan Jia, Wichayaporn Wongkamjan, Xiyao Wang
1518Learning to Bid in Repeated First-Price Auctions with Budgets0Qian Wang, Xiaotie Deng, Yuqing Kong, Zongjun Yang
1519Network Effects in Performative Prediction Games0ChungYiu Yau, HoiTo Wai, Xiaolu Wang
1520Robustly Learning a Single Neuron via Sharpness0Ilias Diakonikolas, Jelena Diakonikolas, Nikos Zarifis, Puqian Wang
1521DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning0Jennifer G. Dy, Stratis Ioannidis, Yanzhi Wang, Yifan Gong, Yucai Shao, Zheng Zhan, Zifeng Wang
1522Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments0Chao Huang, Qi Zhu, Ruochen Jiao, Simon Sinong Zhan, Wanxin Jin, Yixuan Wang, Zhaoran Wang, Zhilu Wang, Zhuoran Yang
1523LinSATNet: The Positive Linear Satisfiability Neural Networks0Junchi Yan, Runzhong Wang, Tianyi Chen, Xiaokang Yang, Yunhao Zhang, Ziao Guo
1524Offline Meta Reinforcement Learning with In-Distribution Online Adaptation0Chongjie Zhang, Haozhe Jiang, Jianhao Wang, Jin Zhang, Junyu Zhang, Liwei Wang
1525Reachability-Aware Laplacian Representation in Reinforcement Learning0Bryan Hooi, Jiashi Feng, Kaixin Wang, Kuangqi Zhou, Xinchao Wang
1526PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient0Daquan Zhou, Jiashi Feng, Kaixin Wang, Shie Mannor
1527On Heterogeneous Treatment Effects in Heterogeneous Causal Graphs0Hengrui Cai, Richard A. Watson, Rui Song, Samuel A. McLean, Xinming An
1528Nonparametric Extensions of Randomized Response for Private Confidence Sets0Aaditya Ramdas, Ian WaudbySmith, Zhiwei Steven Wu
1529Global optimality for Euclidean CCCP under Riemannian convexity0Melanie Weber, Suvrit Sra
1530A Universal Unbiased Method for Classification from Aggregate Observations0Bo Han, Gang Niu, Heng Tao Shen, Lei Feng, Tongliang Liu, Xiaofeng Zhu, Zixi Wei
1531NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning0Jingrui He, Tianxin Wei, Yifan Chen, Zeming Guo
1532Boosting Graph Contrastive Learning via Graph Contrastive Saliency0Bing Bai, Chunyu Wei, David Brady, Kai Ni, Lu Fang, Yu Wang
1533Set-membership Belief State-based Reinforcement Learning for POMDPs0Huizhong Song, Jiye Liang, Lijun Zhang, Lin Li, Wei Wei
1534Mitigating Memorization of Noisy Labels by Clipping the Model Prediction0Bo An, Gang Niu, Hongxin Wei, Huiping Zhuang, Lei Feng, Renchunzi Xie, Yixuan Li
1535Graphically Structured Diffusion Models0Christian Dietrich Weilbach, Frank Wood, William Harvey
1536Expectation-Complete Graph Representations with Homomorphisms0Fabian Jogl, Maximilian Thiessen, Pascal Welke, Thomas Gärtner
1537A Conditional Normalizing Flow for Accelerated Multi-Coil MR Imaging0Jeffrey Wen, Philip Schniter, Rizwan Ahmad
1538Optimizing Mode Connectivity for Class Incremental Learning0Haitao Wen, Haoyang Cheng, Heqian Qiu, Hongliang Li, Lanxiao Wang, Lili Pan
1539Towards Learning Geometric Eigen-Lengths Crucial for Fitting Tasks0Kaichun Mo, Leonidas J. Guibas, Ruoxi Shi, Yanchao Yang, Yijia Weng
1540Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization0Ang Li, Xitong Yang, YuGang Jiang, Zejia Weng, Zuxuan Wu
1541Fully-Adaptive Composition in Differential Privacy0Aaditya Ramdas, Justin Whitehouse, Ryan Rogers, Steven Wu
1542Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation0Bruno Andreis, Jeffrey Willette, Juho Lee, Kenji Kawaguchi, Seanie Lee, Sung Ju Hwang
1543Flexible Phase Dynamics for Bio-Plausible Contrastive Learning0Colin Bredenberg, Ezekiel Williams, Guillaume Lajoie
1544Approximate Stein Classes for Truncated Density Estimation0Daniel J. Williams, Song Liu
1545Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables0Benedict Clark, Leo Kieslich, Rick Wilming, Stefan Haufe
1546Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations0David Wipf
1547Uncertainty Estimation for Molecules: Desiderata and Methods0Bertrand Charpentier, Mohamed Amine Ketata, Nicholas Gao, Stephan Günnemann, Tom Wollschläger
1548The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond0Gauri Joshi, Jiin Woo, Yuejie Chi
1549Learning Deep Time-index Models for Time Series Forecasting0Akshat Kumar, Chenghao Liu, Doyen Sahoo, Gerald Woo, Steven C. H. Hoi
1550Sharper Bounds for ℓp Sensitivity Sampling0David P. Woodruff, Taisuke Yasuda
1551Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy0Blake E. Woodworth, Francis R. Bach, Konstantin Mishchenko
1552SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning0Bowen Shi, Junran Wu, Ke Xu, Shangzhe Li, Xueyuan Chen
1553Causal Proxy Models for Concept-based Model Explanations0Amir Zur, Atticus Geiger, Christopher Potts, Karel D'Oosterlinck, Zhengxuan Wu
1554Effective Neural Topic Modeling with Embedding Clustering Regularization0Anh Tuan Luu, Thong Thanh Nguyen, Xiaobao Wu, Xinshuai Dong
1555Adaptive Compositional Continual Meta-Learning0Bin Wu, Jinyuan Fang, Qiang Zhang, Shangsong Liang, Xiangxiang Zeng
1556Anchor Sampling for Federated Learning with Partial Client Participation0Feijie Wu, Jing Gao, Shiqi He, Song Guo, Zhihao Qu, Ziming Liu
1557Solving High-Dimensional PDEs with Latent Spectral Models0Haixu Wu, Huakun Luo, Jianmin Wang, Mingsheng Long, Tengge Hu
1558A Law of Robustness beyond Isoperimetry0Heng Huang, Hongyang Zhang, Yihan Wu
1559Uncovering Adversarial Risks of Test-Time Adaptation0Feiran Jia, Jiachen T. Wang, Prateek Mittal, Saeed Mahloujifar, Tong Wu, Vikash Sehwag, Xiangyu Qi
1560Stable Estimation of Heterogeneous Treatment Effects0Anpeng Wu, Bo Li, Fei Wu, Kun Kuang, Ruoxuan Xiong
1561Rethinking Explaining Graph Neural Networks via Non-parametric Subgraph Matching0Dragomir Radev, Fang Wu, Siyuan Li, Stan Z. Li, Xurui Jin, Yinghui Jiang, Zhangming Niu
1562Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases0Cheng Li, Reza Yazdani Aminabadi, Xiaoxia Wu, Yuxiong He, Zhewei Yao
1563Towards Understanding Generalization of Macro-AUC in Multi-label Learning0Chongxuan Li, Guoqiang Wu, Yilong Yin
1564Quantifying the Knowledge in GNNs for Reliable Distillation into MLPs0Haitao Lin, Lirong Wu, Stan Z. Li, Yufei Huang
1565Delay-agnostic Asynchronous Coordinate Update Algorithm0Changxin Liu, Mikael Johansson, Sindri Magnússon, Xuyang Wu
1566Masked Trajectory Models for Prediction, Representation, and Control0Aravind Rajeswaran, Arjun Majumdar, Igor Mordatch, Kevin Stone, Philipp Wu, Pieter Abbeel, Yixin Lin
1567Disentangled Multi-Fidelity Deep Bayesian Active Learning0Dongxia Wu, Matteo Chinazzi, Rose Yu, Ruijia Niu, YiAn Ma
1568Tight Data Access Bounds for Private Top-k Selection0Anthony Wirth, Hao Wu, Olga Ohrimenko
1569The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent0Lei Wu, Weijie J. Su
1570Distributional Offline Policy Evaluation with Predictive Error Guarantees0Masatoshi Uehara, Runzhe Wu, Wen Sun
1571π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation0Chengyue Wu, Ping Luo, Ruisong Zhou, Teng Wang, Ying Shan, Yixiao Ge, Zeyu Lu
1572Learning Functional Distributions with Private Labels0Ananth Grama, Changlong Wu, Wojciech Szpankowski, Yifan Wang
1573QuantumDARTS: Differentiable Quantum Architecture Search for Variational Quantum Algorithms0Ge Yan, Junchi Yan, Kaisen Pan, Wenjie Wu, Xudong Lu
1574Discover and Cure: Concept-aware Mitigation of Spurious Correlation0James Zou, Linjun Zhang, Mert Yüksekgönül, Shirley Wu
1575On the Training Instability of Shuffling SGD with Batch Normalization0Chulhee Yun, David Xing Wu, Suvrit Sra
1576dugMatting: Decomposed-Uncertainty-Guided Matting0Changqing Zhang, Huazhu Fu, Jiawei Wu, Joey Tianyi Zhou, Xi Peng, Zuoyong Li
1577Personalized Federated Learning under Mixture of Distributions0Dawei Zhou, Haifeng Chen, Quanquan Gu, Shuaicheng Zhang, Wei Cheng, Wenchao Yu, Yanchi Liu, Yue Wu
1578Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards0Di Wang, Sayak Ray Chowdhury, Xingyu Zhou, Yulian Wu
1579Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron0Difan Zou, Jingfeng Wu, Quanquan Gu, Sham M. Kakade, Vladimir Braverman, Zixiang Chen
1580Understanding Backdoor Attacks through the Adaptability Hypothesis0Ashish Kundu, Ganghua Wang, Jayanth Srinivasa, Jie Ding, Mingyi Hong, Xuan Bi, Xun Xian
1581Fair and Optimal Classification via Post-Processing0Han Zhao, Lang Yin, Ruicheng Xian
1582UMD: Unsupervised Model Detection for X2X Backdoor Attacks0Bo Li, Zhen Xiang, Zidi Xiong
1583Random Shuffle Transformer for Image Restoration0Hongjian Liu, Jie Xiao, Man Zhou, Xueyang Fu, ZhengJun Zha
1584Communication-Efficient Federated Hypergradient Computation via Aggregated Iterative Differentiation0Kaiyi Ji, Peiyao Xiao
1585SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models0Guangxuan Xiao, Hao Wu, Ji Lin, Julien Demouth, Mickaël Seznec, Song Han
1586On the Forward Invariance of Neural ODEs0Chuang Gan, Daniela Rus, Mathias Lechner, Ramin M. Hasani, TsunHsuan Wang, Wei Xiao, Yutong Ban
1587COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models0Bo Yuan, Jian Ren, Jinqi Xiao, Miao Yin, Xiao Zang, Yu Gong
1588Improving Bi-level Optimization Based Methods with Inspiration from Humans' Classroom Study Techniques0Pengtao Xie
1589Future-conditioned Unsupervised Pretraining for Decision Transformer0Deheng Ye, Qiang Fu, Shuai Li, Yang Wei, Zhihui Xie, Zichuan Lin
1590DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models0Chao Dong, Gen Li, Jiantao Zhou, Liangbin Xie, Xiangyu Chen, Xintao Wang, Ying Shan
1591Semiparametrically Efficient Off-Policy Evaluation in Linear Markov Decision Processes0Chuhan Xie, Wenhao Yang, Zhihua Zhang
1592A Critical View of Vision-Based Long-Term Dynamics Prediction Under Environment Misalignment0Hanchen Xie, Jiageng Zhu, Jiazhi Li, Mahyar Khayatkhoei, Mohamed E. Hussein, Wael AbdAlmageed
1593Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification0Bo An, Dong Xing, Gang Pan, Longtao Zheng, Pengjie Gu, Qian Zheng, Shanqi Liu, Xinrun Wang
1594Universal Morphology Control via Contextual Modulation0Jacob Beck, Shimon Whiteson, Zheng Xiong
1595Relevant Walk Search for Explaining Graph Neural Networks0Grégoire Montavon, KlausRobert Müller, Michael Gastegger, Ping Xiong, Shinichi Nakajima, Thomas Schnake
1596Why do Nearest Neighbor Language Models Work?0Frank F. Xu, Graham Neubig, Uri Alon
1597MixFlows: principled variational inference via mixed flows0Naitong Chen, Trevor Campbell, Zuheng Xu
1598Bit Allocation using Optimization0Chenjian Gao, Dailan He, Han Gao, Hongwei Qin, Jingjing Liu, Jinyong Pi, Jixiang Luo, Mao Ye, Tongda Xu, YaQin Zhang, Yan Wang, Yuanyuan Wang, Ziyu Zhu
1599Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents0Wenhao Xu, Xuedong He, Xuefeng Gao
1600Probabilistic Categorical Adversarial Attack and Adversarial Training0Han Xu, Hui Liu, Jie Ren, Jiliang Tang, Pengfei He, Yuxuan Wan, Zitao Liu
1601Hierarchical Neural Coding for Controllable CAD Model Generation0Joseph George Lambourne, Karl D. D. Willis, Pradeep Kumar Jayaraman, Xiang Xu, Yasutaka Furukawa
1602Efficient Sequence Transduction by Jointly Predicting Tokens and Durations0Boris Ginsburg, Fei Jia, Hainan Xu, He Huang, Shinji Watanabe, Somshubra Majumdar
1603Constrained Efficient Global Optimization of Expensive Black-box Functions0Bratislav Svetozarevic, Colin N. Jones, Wenjie Xu, Yuning Jiang
1604Pareto Regret Analyses in Multi-objective Multi-armed Bandit0Diego Klabjan, Mengfan Xu
1605Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference0Bo Pang, Dehong Xu, Deqian Kong, Pascale Fung, Yan Xu, Ying Nian Wu, Ziwei Ji
1606Quantifying the Variability Collapse of Neural Networks0Haoxiong Liu, Jing Xu
1607Progressive Purification for Instance-Dependent Partial Label Learning0Biao Liu, Congyu Qiao, Jiaqi Lv, Ning Xu, Xin Geng
1608PFGM++: Unlocking the Potential of Physics-Inspired Generative Models0Max Tegmark, Shangyuan Tong, Tommi S. Jaakkola, Yilun Xu, Yonglong Tian, Ziming Liu
1609Geometric Latent Diffusion Models for 3D Molecule Generation0Alexander S. Powers, Jure Leskovec, Minkai Xu, Ron O. Dror, Stefano Ermon
1610The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing0Cong Ma, Xingyu Xu, Yandi Shen, Yuejie Chi
1611Fascinating Supervisory Signals and Where to Find Them: Deep Anomaly Detection with Scale Learning0Hongzuo Xu, Juhui Wei, Ning Liu, Songlei Jian, Yijie Wang, Yizhou Li
1612Competing for Shareable Arms in Multi-Player Multi-Armed Bandits0Bo Li, Haotian Wang, Peng Cui, Renzhe Xu, Xingxuan Zhang
1613Sequential Predictive Conformal Inference for Time Series0Chen Xu, Yao Xie
1614mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video0Bin Bi, Chenliang Li, Fei Huang, Guohai Xu, Haiyang Xu, Ji Zhang, Jiabo Ye, Jingren Zhou, Ming Yan, Qi Qian, Qinghao Ye, Songfang Huang, Wei Wang, Yaya Shi, Yuanhong Xu
1615ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts0Jian Tang, Minghao Xu, Santiago Miret, Xinyu Yuan
1616Bayesian Design Principles for Frequentist Sequential Learning0Assaf Zeevi, Yunbei Xu
1617SLAMB: Accelerated Large Batch Training with Sparse Communication0Hang Xu, Jiawei Fei, Jun Huang, Mohamed Elhoseiny, Panos Kalnis, Tingwen Xie, Wenxuan Zhang, Yuchen Xie, Yuzhe Wu
1618Do Not Train It: A Linear Neural Architecture Search of Graph Neural Networks0Bei Yu, Haiqin Yang, Jiaqi Sun, Lin Zhang, Peng Xu, Xuanzhou Liu, Yue Zhao
1619An Instrumental Variable Approach to Confounded Off-Policy Evaluation0Chengchun Shi, Jin Zhu, Rui Song, Shikai Luo, Yang Xu
1620Near-Optimal Quantum Coreset Construction Algorithms for Clustering0Shaofeng H.C. Jiang, Tongyang Li, Xiaoyu Chen, Yecheng Xue
1621A Study on Transformer Configuration and Training Objective0Aixin Sun, Fuzhao Xue, Jianghai Chen, Xiaoxin He, Xiaozhe Ren, Xin Jiang, Yang You, Yongming Chen, Zangwei Zheng
1622LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation0Haoyu Han, Jian Pei, MohamadAli Torkamani, Rui Xue, Xiaorui Liu
1623Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression0Baharan Mirzasoleiman, Eric Gan, PinYu Chen, Siddharth Joshi, Yihao Xue
1624Adaptive Computation with Elastic Input Sequence0Anurag Arnab, Fuzhao Xue, Mostafa Dehghani, Neil Houlsby, Valerii Likhosherstov, Yang You
1625Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL0Ahmed Khalil, Raúl SantosRodríguez, Taku Yamagata
1626Quantum Ridgelet Transform: Winning Lottery Ticket of Neural Networks with Quantum Computation0Hayata Yamasaki, Sathyawageeswar Subramanian, Satoshi Hayakawa, Sho Sonoda
1627Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data0Jie Chen, PinYu Chen, Songtao Lu, Xiaodong Cui, Yangyang Xu, Yonggui Yan
1628Temporally Consistent Transformers for Video Generation0Danijar Hafner, Pieter Abbeel, Stephen James, Wilson Yan
1629Distortion and Uncertainty Aware Loss for Panoramic Depth Completion0Jian Yang, Jun Li, Kun Wang, Shuo Chen, Xiang Li, Zhiqiang Yan
1630Self-Interpretable Time Series Prediction with Counterfactual Explanations0Hao Wang, Jingquan Yan
1631Quantum 3D Graph Learning with Applications to Molecule Embedding0Ge Yan, Huaijin Wu, Junchi Yan
1632Fast Rates in Time-Varying Strongly Monotone Games0Peng Zhao, YuHu Yan, ZhiHua Zhou
1633Proper Scoring Rules for Survival Analysis0Hiroki Yanagisawa
1634Behavior Contrastive Learning for Unsupervised Skill Discovery0Bin Zhao, Chenjia Bai, Hongyi Guo, Peng Liu, Rushuai Yang, Siyuan Li, Xuelong Li, Zhen Wang
1635Nested Elimination: A Simple Algorithm for Best-Item Identification From Choice-Based Feedback0Junwen Yang, Yifan Feng
1636Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering0Bryan Hooi, Mingqi Yang, Wenjie Feng, Yanming Shen
1637Weighted Flow Diffusion for Local Graph Clustering with Node Attributes: an Algorithm and Statistical Guarantees0Kimon Fountoulakis, Shenghao Yang
1638Chemically Transferable Generative Backmapping of Coarse-Grained Proteins0Rafael GómezBombarelli, Soojung Yang
1639Data Poisoning Attacks Against Multimodal Encoders0Mathias Humbert, Michael Backes, Pascal Berrang, Xinlei He, Yang Zhang, Zheng Li, Ziqing Yang
1640Towards Sustainable Learning: Coresets for Data-efficient Deep Learning0Baharan Mirzasoleiman, Hao Kang, Yu Yang
1641Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples0Dongyoon Yang, Insung Kong, Yongdai Kim
1642Improving Adversarial Robustness of Deep Equilibrium Models with Explicit Regulations Along the Neural Dynamics0Peng Li, Tianyu Pang, Yang Liu, Zonghan Yang
1643Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning0Baharan Mirzasoleiman, Besmira Nushi, Hamid Palangi, Yu Yang
1644A theory of representation learning gives a deep generalisation of kernel methods0Adam X. Yang, Ben Anson, Edward Milsom, Laurence Aitchison, Maxime Robeyns, Nandi Schoots
1645Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation0Dongpil Shin, Hye Won Chung, Joonhyuk Yang
1646Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations0Jacob Steinhardt, Wei Hu, Yongyi Yang
1647Generative Adversarial Symmetry Discovery0Jianke Yang, Nima Dehmamy, Robin Walters, Rose Yu
1648Boosting Offline Reinforcement Learning with Action Preference Query0Gao Huang, Matthieu Gaetan Lin, Qisen Yang, Shenzhi Wang, Shiji Song
1649Towards Controlled Data Augmentations for Active Learning0Gang Chen, Haobo Wang, Jianan Yang, Junbo Zhao, Sai Wu
1650What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?0Chongjie Zhang, Hao Hu, Lin Yong, Rui Yang, Tong Zhang, Xiaoteng Ma
1651Neural Prediction Errors enable Analogical Visual Reasoning in Human Standard Intelligence Tests0Dahui Wang, Hongzhi You, Lingxiao Yang, RuYuan Zhang, Xiaohong Wan, Xiaohua Xie, Zonglei Zhen
1652Change is Hard: A Closer Look at Subpopulation Shift0Dina Katabi, Haoran Zhang, Marzyeh Ghassemi, Yuzhe Yang
1653Continual Task Allocation in Meta-Policy Network via Sparse Prompting0Guodong Long, Jing Jiang, Tianyi Zhou, Yijun Yang, Yuhui Shi
1654Hyperbolic Representation Learning: Revisiting and Advancing0Irwin King, Menglin Yang, Min Zhou, Rex Ying, Yankai Chen
1655Which is Better for Learning with Noisy Labels: The Semi-supervised Method or Modeling Label Noise?0Bo Han, Jun Yu, Kun Zhang, Mingming Gong, Tongliang Liu, Yu Yao, Yuxuan Du
1656How Bad is Top-K Recommendation under Competing Content Creators?0Chuanhao Li, Denis Nekipelov, Fan Yao, Haifeng Xu, Hongning Wang
1657MultiAdam: Parameter-wise Scale-invariant Optimizer for Multiscale Training of Physics-informed Neural Networks0Chang Su, Hang Su, Jiachen Yao, Jun Zhu, Songming Liu, Zhongkai Hao
1658Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games0Batuhan Yardim, Matthieu Geist, Niao He, Semih Cayci
1659Retrieval-Augmented Multimodal Language Modeling0Armen Aghajanyan, Jure Leskovec, Luke Zettlemoyer, Michihiro Yasunaga, Mike Lewis, Percy Liang, Richard James, Weijia Shi, WenTau Yih
1660On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness0Haotian Ye, Liwei Wang, Simon Shaolei Du, Xiaoyu Chen
1661Personalized Federated Learning with Inferred Collaboration Graphs0Fangzhao Wu, Rui Ye, Siheng Chen, Yanfeng Wang, Zhenyang Ni
1662Compositional Exemplars for In-context Learning0Jiacheng Ye, Jiangtao Feng, Lingpeng Kong, Tao Yu, Zhiyong Wu
1663Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes0Chenlu Ye, Quanquan Gu, Tong Zhang, Wei Xiong
1664GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming0Chengming Wang, Hongyan Wang, Hua Xu, Huigen Ye, Yu Jiang
1665FedDisco: Federated Learning with Discrepancy-Aware Collaboration0Chenxin Xu, Jianyu Wang, Mingkai Xu, Rui Ye, Siheng Chen, Yanfeng Wang
1666Towards Quantum Machine Learning for Constrained Combinatorial Optimization: a Quantum QAP Solver0Ge Yan, Junchi Yan, Xinyu Ye
1667Temporal Label Smoothing for Early Event Prediction0Alizée Pace, Gunnar Rätsch, Hugo Yèche, Rita Kuznetsova
1668From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders0Gal Novik, Raanan Y. Rohekar, Shami Nisimov, Yaniv Gurwicz
1669Doubly Adversarial Federated Bandits0Jialin Yi, Milan Vojnovic
1670Online Prototype Alignment for Few-shot Policy Transfer0Jiaming Guo, Kaizhao Yuan, Qi Guo, Qi Yi, Rui Zhang, Ruizhi Chen, Shaohui Peng, Siming Lan, Xing Hu, Xishan Zhang, Yunji Chen, Yunkai Gao, Zidong Du
1671MonoFlow: Rethinking Divergence GANs via the Perspective of Wasserstein Gradient Flows0Mingxuan Yi, Song Liu, Zhanxing Zhu
1672SE(3) diffusion model with application to protein backbone generation0Arnaud Doucet, Brian L. Trippe, Emile Mathieu, Jason Yim, Regina Barzilay, Tommi S. Jaakkola, Valentin De Bortoli
1673CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification0Chong Chen, Li Shen, Long Lan, Mengzhu Wang, Nan Yin, XianSheng Hua, Xiao Luo, Zeyu Ma
1674Adaptive Estimation of Graphical Models under Total Positivity0Daniel P. Palomar, Jiaxi Ying, José Vinícius de Miranda Cardoso
1675Improving Visual Prompt Tuning for Self-supervised Vision Transformers0Dahuin Jung, Eunji Kim, Jungbeom Lee, Seungryong Yoo, Sungroh Yoon
1676End-to-End Multi-Object Detection with a Regularized Mixture Model0Hojun Lee, Inseop Chung, Jaeyoung Yoo, Nojun Kwak, Seunghyeon Seo
1677EM-Network: Oracle Guided Self-distillation for Sequence Learning0Hyeonseung Lee, Ji Won Yoon, Minchan Kim, Nam Soo Kim, Seok Min Kim, Sunghwan Ahn
1678Continual Learners are Incremental Model Generalizers0Jaehong Yoon, Sung Ju Hwang, Yue Cao
1679An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning0Heechul Bae, Jaesik Yoon, Sungjin Ahn, YiFu Wu
1680Graph Generative Model for Benchmarking Graph Neural Networks0Bryan Perozzi, John Palowitch, Minji Yoon, Russ Salakhutdinov, Yue Wu
1681Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels0Boyang Chen, Shouvanik Chakrabarti, Xiaodi Wu, Xuchen You
1682Entropy-driven Unsupervised Keypoint Representation Learning in Videos0Ali Younes, Georgia Chalvatzaki, Simone SchaubMeyer
1683The Benefits of Model-Based Generalization in Reinforcement Learning0Aditya A. Ramesh, Jürgen Schmidhuber, Kenny John Young, Louis Kirsch
1684COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects0Anlan Yu, Jieming Yin, Ning Lyu, Wujie Wen, Zhiyuan Yan
1685Delving into Noisy Label Detection with Clean Data0Chenglin Yu, Weiwei Liu, Xinsong Ma
1686Bag of Tricks for Training Data Extraction from Language Models0Bingyi Kang, Chao Du, Min Lin, Qian Liu, Shuicheng Yan, Tianyu Pang, Weichen Yu, Yan Huang
1687Discover-Then-Rank Unlabeled Support Vectors in the Dual Space for Multi-Class Active Learning0Dayou Yu, Qi Yu, Weishi Shi
1688Long-Term Rhythmic Video Soundtracker0Jiashuo Yu, Xiao Sun, Xinyuan Chen, Yaohui Wang, Yu Qiao
1689Adversarial Parameter Attack on Deep Neural Networks0Lijia Yu, XiaoShan Gao, Yihan Wang
1690CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models0Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik, Yuhao Wu, Zhiyuan Yu
1691SeedGNN: Graph Neural Network for Supervised Seeded Graph Matching0Jiaming Xu, Liren Yu, Xiaojun Lin
1692Efficient and Equivariant Graph Networks for Predicting Quantum Hamiltonian0Haiyang Yu, Shuiwang Ji, Xiaofeng Qian, Xiaoning Qian, Zhao Xu
1693On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures0Lei Ying, Xian Yu
1694Actor-Critic Alignment for Offline-to-Online Reinforcement Learning0Xinhua Zhang, Zishun Yu
1695Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning0Cheng Wan, Kaizhi Qian, Yang Zhang, Yingyan Celine Lin, Yongan Zhang, Yonggan Fu, Zhongzhi Yu
1696Coordinate Descent Methods for Fractional Minimization0Ganzhao Yuan
1697On the Power of Foundation Models0Yang Yuan
1698Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning0Bo Li, Mingqi Yuan, Wenjun Zeng, Xin Jin
1699Traversing Between Modes in Function Space for Fast Ensembling0Eunggu Yun, Giung Nam, Hyungi Lee, Juho Lee
1700Conformal Prediction with Missing Values0Aymeric Dieuleveut, Julie Josse, Margaux Zaffran, Yaniv Romano
1701KDEformer: Accelerating Transformers via Kernel Density Estimation0Amin Karbasi, Amir Zandieh, Insu Han, Majid Daliri
1702Bayesian Estimation of Differential Privacy0Ahmed Salem, Andrew Paverd, Boris Köpf, Daniel Jones, Lukas Wutschitz, Mohammad Naseri, Santiago ZanellaBéguelin, Shruti Tople, Victor Rühle
1703When is Realizability Sufficient for Off-Policy Reinforcement Learning?0Andrea Zanette
1704On Distribution Dependent Sub-Logarithmic Query Time of Learned Indexing0Cyrus Shahabi, Sepanta Zeighami
1705Sequential Counterfactual Risk Minimization0Eustache Diemert, Houssam Zenati, Julien Mairal, Matthieu Martin, Pierre Gaillard
1706LookupFFN: Making Transformers Compute-lite for CPU inference0Karthikeyan Sankaralingam, Michael Davies, Pranav Pulijala, Vikas Singh, Zhanpeng Zeng
1707Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise0Jie Shen, Shiwei Zeng
1708Generative Graph Dictionary Learning0Hanghang Tong, Hanqing Zeng, Ruike Zhu, Yinglong Xia, Zhichen Zeng
1709Stabilizing Transformer Training by Preventing Attention Entropy Collapse0Dan Busbridge, Etai Littwin, Jason Ramapuram, Jiatao Gu, Joshua M. Susskind, Shuangfei Zhai, Tatiana Likhomanenko, Yizhe Zhang
1710Offline Learning in Markov Games with General Function Approximation0Nan Jiang, Yu Bai, Yuheng Zhang
1711Learning useful representations for shifting tasks and distributions0Jianyu Zhang, Léon Bottou
1712Nonparametric Iterative Machine Teaching0Chen Zhang, Ivor W. Tsang, James T. Kwok, Weiyang Liu, Xiaofeng Cao
1713Matrix Estimation for Individual Fairness0Cindy Y. Zhang, Devavrat Shah, Sarah Huiyi Cen
1714Graph Contrastive Backdoor Attacks0Dinghao Wu, Hangfan Zhang, Jinghui Chen, Jinyuan Jia, Lu Lin
1715Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories0Mengdi Wang, Minshuo Chen, Tuo Zhao, Wenjing Liao, Zixuan Zhang
1716Tractable Control for Autoregressive Language Generation0Guy Van den Broeck, Honghua Zhang, Meihua Dang, Nanyun Peng
1717CataBEEM: Integrating Latent Interaction Categories in Node-wise Community Detection Models for Network Data0Walter H. Dempsey, Yuhua Zhang
1718Rethink DARTS Search Space and Renovate a New Benchmark0Jiuling Zhang, Zhiming Ding
1719Team Belief DAG: Generalizing the Sequence Form to Team Games for Fast Computation of Correlated Team Max-Min Equilibria via Regret Minimization0Brian Hu Zhang, Gabriele Farina, Tuomas Sandholm
1720A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests0Bohang Zhang, Di He, Guhao Feng, Liwei Wang, Yiheng Du
1721Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution0Chao Dong, Haoyu Chen, Jinjin Gu, Ruofan Zhang, Wenming Yang, Yulun Zhang
1722Prompting Large Language Model for Machine Translation: A Case Study0Alexandra Birch, Barry Haddow, Biao Zhang
1723On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits0Jiafan He, Quanquan Gu, Weitong Zhang, Zhiyuan Fan
1724When Sparsity Meets Contrastive Models: Less Graph Data Can Bring Better Class-Balanced Representations0Chao Huang, Chunhui Zhang, Chuxu Zhang, Qianlong Wen, Yanfang Ye, Yijun Tian, Youhuan Li, Zhongyu Ouyang
1725Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation0Chao Huang, Lianghao Xia, Qianru Zhang, Ruihua Han, Siu Ming Yiu, Zheng Wang
1726Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models0Guanhua Zhang, Jiabao Ji, Mo Yu, Shiyu Chang, Tommi S. Jaakkola, Yang Zhang
1727CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling0Jiangtao Feng, Jun Zhang, Lin Zheng, Lingpeng Kong, Shuyang Jiang
1728Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics0Shenao Zhang, Wanxin Jin, Zhaoran Wang
1729One-Step Estimator for Permuted Sparse Recovery0Hang Zhang, Ping Li
1730Quantum Lower Bounds for Finding Stationary Points of Nonconvex Functions0Chenyi Zhang, Tongyang Li
1731Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling0Linda Ruth Petzold, Shiyang Li, Xifeng Yan, Xinlu Zhang, Zhiyu Chen
1732FedCR: Personalized Federated Learning Based on Across-Client Common Representation with Conditional Mutual Information Regularization0Chenglin Li, Hao Zhang, Hongkai Xiong, Junni Zou, Wenrui Dai
1733On the Optimality of Misspecified Kernel Ridge Regression0Haobo Zhang, Qian Lin, Weihao Lu, Yicheng Li
1734Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction0Ang Li, Changyou Chen, Fan Zhang, Hai Li, Jianyi Zhang, Jingwei Sun, Minxue Tang, Xiang Chen, Yiran Chen
1735Learning Subpocket Prototypes for Generalizable Structure-based Drug Design0Qi Liu, Zaixi Zhang
1736No One Idles: Efficient Heterogeneous Federated Learning with Parallel Edge and Server Computation0Feilong Zhang, Gang Wu, Junjun Jiang, Shiyi Lin, Xiangyang Ji, Xianming Liu, Xiong Zhou
1737The Wisdom of Hindsight Makes Language Models Better Instruction Followers0Fangchen Liu, Joseph E. Gonzalez, Justin Wong, Pieter Abbeel, Tianjun Zhang
1738Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score0Bo Han, Changsheng Li, Feng Liu, Jiahao Yang, Mingkui Tan, Shuhai Zhang, Yifan Yang
1739On Enhancing Expressive Power via Compositions of Single Fixed-Size ReLU Network0Hongkai Zhao, Jianfeng Lu, Shijun Zhang
1740Bi-directional Masks for Efficient N: M Sparse Training0Fei Chao, Jingjing Xie, Mingbao Lin, Rongrong Ji, Yiting Luo, Yunshan Zhong, Yuxin Zhang
1741Towards Unbiased Training in Federated Open-world Semi-supervised Learning0Jie Zhang, Song Guo, Wenchao Xu, Xiaosong Ma
1742Interactive Object Placement with Reinforcement Learning0Bineng Zhong, Liqiang Nie, Qinglin Liu, Quanling Meng, Rongrong Ji, Shengping Zhang, Xiaopeng Fan
1743Optimal Shrinkage for Distributed Second-Order Optimization0Fangzhao Zhang, Mert Pilanci
1744"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts0Haoran Zhang, Harvineet Singh, Marzyeh Ghassemi, Shalmali Joshi
1745Learning Regions of Interest for Bayesian Optimization with Adaptive Level-Set Estimation0Alexander Ladd, Fengxue Zhang, James C. Bowden, Jialin Song, Thomas Desautels, Yisong Yue, Yuxin Chen
1746A Category-theoretical Meta-analysis of Definitions of Disentanglement0Masashi Sugiyama, Yivan Zhang
1747On the Convergence of SARSA with Linear Function Approximation0Remi Tachet des Combes, Romain Laroche, Shangtong Zhang
1748AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation0Kexin Jin, Kun Yuan, Liang Wang, Rong Jin, Tieniu Tan, Xue Wang, Yifan Zhang, Zhang Zhang
1749On the Generalization of Multi-modal Contrastive Learning0Qi Zhang, Yifei Wang, Yisen Wang
1750ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction0Alexandre Megretski, Lam M. Nguyen, Luca Daniel, Subhro Das, TsuiWei Weng, Wang Zhang
1751Towards Trustworthy Explanation: On Causal Rationalization0Hengrui Cai, Tong Wu, Wenbo Zhang, Yong Cai, Yunlong Wang
1752Demystifying Uneven Vulnerability of Link Stealing Attacks against Graph Neural Networks0Bang Wu, He Zhang, Minhui Xue, Shirui Pan, Shuo Wang, Xiangwen Yang, Xingliang Yuan
1753Provable Dynamic Fusion for Low-Quality Multimodal Data0Changqing Zhang, Haitao Wu, Huazhu Fu, Joey Tianyi Zhou, Qinghua Hu, Qingyang Zhang, Xi Peng
1754ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval0Kexun Zhang, Lei Li, William Yang Wang, Xianjun Yang
1755Nearly Optimal Competitive Ratio for Online Allocation Problems with Two-sided Resource Constraints and Finite Requests0Enhong Chen, Haoyuan Hu, Qixin Zhang, Wenbing Ye, Yu Yang, Zaiyi Chen
1756Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection0Chenglong Wang, Chu Yuan Zhang, Jiangyan Yi, Jianhua Tao, Xiaohui Zhang
1757Coder Reviewer Reranking for Code Generation0Daniel Fried, Mike Lewis, Sida Wang, Tao Yu, Tatsunori Hashimoto, Tianyi Zhang, WenTau Yih
1758DP-Fast MH: Private, Fast, and Accurate Metropolis-Hastings for Large-Scale Bayesian Inference0Ruqi Zhang, Wanrong Zhang
1759Nearly-tight Bounds for Deep Kernel Learning0MinLing Zhang, Yifan Zhang
1760OpenFE: Automated Feature Generation with Expert-level Performance0Fengyuan Liu, Haoyan Luo, Li Jian, Qian Liu, Tianping Zhang, Wei Cao, Zheyu Aqa Zhang, Zhiyuan Fan
1761Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs0Junkai Zhang, Quanquan Gu, Weitong Zhang
1762Unlocking Slot Attention by Changing Optimal Transport Costs0Cees G. M. Snoek, David W. Zhang, Gertjan J. Burghouts, Simon LacosteJulien, Yan Zhang
1763Towards a Persistence Diagram that is Robust to Noise and Varied Densities0Hang Zhang, Kai Ming Ting, Kaifeng Zhang, Ye Zhu
1764Robust Situational Reinforcement Learning in Face of Context Disturbances0Chuheng Zhang, Jiang Bian, Jinpeng Zhang, Lei Song, Li Zhao, Yuan Zhou, Yufeng Zheng
1765Patch-level Contrastive Learning via Positional Query for Visual Pre-training0Fan Wang, Junchi Yan, Qiang Zhou, Shaofeng Zhang, Zhibin Wang
1766Men Also Do Laundry: Multi-Attribute Bias Amplification0Alice Xiang, Dora Zhao, Jerone Theodore Alexander Andrews
1767Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch0Julia Gusak, Lionel EyraudDubois, Olivier Beaumont, Théotime Le Hellard, Xunyi Zhao
1768Revisiting Structured Variational Autoencoders0Scott W. Linderman, Yixiu Zhao
1769On Pitfalls of Test-Time Adaptation0Alexandre Alahi, Hao Zhao, Tao Lin, Yuejiang Liu
1770Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm0Boxiang Lyu, Boxin Zhao, Mladen Kolar, Raul Castro Fernandez
1771X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion0Ce Liu, Dianmo Sheng, Dong Chen, Dongdong Chen, Fang Wen, Hanqing Zhao, Jianmin Bao, Lu Yuan, Nenghai Yu, Qi Chu, Weiming Zhang, Wenbo Zhou
1772Revisiting Simple Regret: Fast Rates for Returning a Good Arm0Connor Stephens, Csaba Szepesvári, KwangSung Jun, Yao Zhao
1773Transformed Distribution Matching for Missing Value Imputation0Amir Dezfouli, Edwin V. Bonilla, He Zhao, Ke Sun
1774Protecting Language Generation Models via Invisible Watermarking0Lei Li, Xuandong Zhao, YuXiang Wang
1775Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning0Jason D. Lee, Yulai Zhao, Zhaoran Wang, Zhuoran Yang
1776Simplified Temporal Consistency Reinforcement Learning0Joni Pajarinen, Juho Kannala, Rinu Boney, Wenshuai Zhao, Yi Zhao
1777RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation0Deli Zhao, Jingren Zhou, Kecheng Zheng, Liming Zhao, Yun Zheng
1778Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits0Dongruo Zhou, Heyang Zhao, Jiafan He, Quanquan Gu
1779Does Continual Learning Equally Forget All Parameters?0Chengqi Zhang, Guodong Long, Haiyan Zhao, Jing Jiang, Tianyi Zhou
1780Online Learning in Stackelberg Games with an Omniscient Follower0Banghua Zhu, Geng Zhao, Jiantao Jiao, Michael I. Jordan
1781Structure-informed Language Models Are Protein Designers0Dongyu Xue, Fei Ye, Quanquan Gu, Yi Zhou, Yifan Deng, Zaixiang Zheng
1782Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories0Aditya Grover, Brandon Amos, Mikael Henaff, Qinqing Zheng
1783Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs0Cheng Lu, Jianfei Chen, Jun Zhu, Kaiwen Zheng
1784Fast Sampling of Diffusion Models via Operator Learning0Anima Anandkumar, Arash Vahdat, Hongkai Zheng, Kamyar Azizzadenesheli, Weili Nie
1785Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation0Ajay Kumar Jaiswal, Dejia Xu, Kevin Wang, S. P. Sharan, Wenqing Zheng, Yihan Xi, Zhangyang Wang
1786Revisiting Discriminative vs. Generative Classifiers: Theory and Implications0Chenyu Zheng, Chongxuan Li, Fan Bao, Guoqiang Wu, Jun Zhu, Yue Cao
1787Evidential Interactive Learning for Medical Image Captioning0Ervine Zheng, Qi Yu
1788Finding the Missing-half: Graph Complementary Learning for Homophily-prone and Heterophily-prone Graphs0He Zhang, Shirui Pan, Vincent ChengSiong Lee, Xiao Wang, Yizhen Zheng, Yu Zheng
1789Multi-agent Online Scheduling: MMS Allocations for Indivisible Items0Rufan Bai, Shengwei Zhou, Xiaowei Wu
1790Eliminating Adversarial Noise via Information Discard and Robust Representation Restoration0Dawei Zhou, Decheng Liu, Nannan Wang, Tongliang Liu, Xinbo Gao, Yukun Chen
1791Brainformers: Trading Simplicity for Efficiency0Andrew M. Dai, Chang Lan, Claire Cui, Da Huang, Daiyi Peng, David R. So, James Laudon, Jeff Dean, Nan Du, Quoc V. Le, Siamak Shakeri, Yanping Huang, Yanqi Zhou, Yifeng Lu, Zhifeng Chen
1792Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression0Mo Zhou, Rong Ge
1793ODS: Test-Time Adaptation in the Presence of Open-World Data Shift0Dingchu Zhang, LanZhe Guo, LinHan Jia, Yufeng Li, Zhi Zhou
1794Fourmer: An Efficient Global Modeling Paradigm for Image Restoration0Chongyi Li, ChunLe Guo, Jie Huang, Man Zhou
1795Controlled Text Generation with Natural Language Instructions0Ethan Wilcox, Mrinmaya Sachan, Ryan Cotterell, Wangchunshu Zhou, Yuchen Eleanor Jiang
1796NNSplitter: An Active Defense Solution for DNN Model via Automated Weight Obfuscation0Shaolei Ren, Tong Zhou, Xiaolin Xu, Yukui Luo
1797Deep Latent State Space Models for Time-Series Generation0Linqi Zhou, Michael Poli, Stefano Ermon, Stefano Massaroli, Winnie Xu
1798SlotGAT: Slot-based Message Passing for Heterogeneous Graphs0Jieming Shi, Qing Li, Renchi Yang, Yuanhang Zou, Ziang Zhou
1799Fast Online Node Labeling for Very Large Graphs0Baojian Zhou, Reza Babanezhad Harikandeh, Yifan Sun
1800Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes0Runlong Zhou, Ruosong Wang, Simon Shaolei Du
1801Phase-aware Adversarial Defense for Improving Adversarial Robustness0Dawei Zhou, Heng Yang, Nannan Wang, Tongliang Liu, Xinbo Gao
1802From Relational Pooling to Subgraph GNNs: A Universal Framework for More Expressive Graph Neural Networks0Cai Zhou, Muhan Zhang, Xiyuan Wang
1803Towards Omni-generalizable Neural Methods for Vehicle Routing Problems0Jianan Zhou, Jie Zhang, Wen Song, Yaoxin Wu, Zhiguang Cao
1804A Three-regime Model of Network Pruning0Arin Chang, Michael W. Mahoney, Yaoqing Yang, Yefan Zhou
1805Learning to Decouple Complex Systems0Tianshu Yu, Zihan Zhou
1806ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation0Connor Pryor, Hongxia Jin, Kaiwen Zhou, Kaizhi Zheng, Lise Getoor, Xin Eric Wang, Yilin Shen
1807On Strengthening and Defending Graph Reconstruction Attack with Markov Chain Approximation0Bo Han, Chenyu Zhou, Jiangchao Yao, Quanming Yao, Xuan Li, Zhanke Zhou
1808Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments0Runlong Zhou, Simon Shaolei Du, Zihan Zhang
1809Learning Unforeseen Robustness from Out-of-distribution Data Using Equivariant Domain Translator0Bang An, Furong Huang, Sanghyun Hong, Sicheng Zhu
1810Markovian Gaussian Process Variational Autoencoders0Carles Balsells Rodas, Harrison Zhu, Yingzhen Li
1811Mixture Proportion Estimation Beyond Irreducibility0Aaron Fjeldsted, Azaree Lintereur, Clayton Scott, Darren Holland, George Landon, Yilun Zhu
1812Exploring Model Dynamics for Accumulative Poisoning Discovery0Bo Han, Chao Du, Jiangchao Yao, Jianing Zhu, Li He, Liang Wang, Shuo Yuan, Tongliang Liu, Xiawei Guo
1813Decentralized SGD and Average-direction SAM are Asymptotically Equivalent0Dacheng Tao, Fengxiang He, Kaixuan Chen, Mingli Song, Tongtian Zhu
1814Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons0Banghua Zhu, Jiantao Jiao, Michael I. Jordan
1815Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability0Bo Han, Hengzhuang Li, Jiangchao Yao, Jianing Zhu, Jianliang Xu, Tongliang Liu
1816Benign Overfitting in Deep Neural Networks under Lazy Training0Fanghui Liu, Francesco Locatello, Grigorios Chrysos, Volkan Cevher, Zhenyu Zhu
1817Interpolation for Robust Learning: Data Augmentation on Wasserstein Geodesics0Aritra Guha, Bo Li, Ding Zhao, Jiacheng Zhu, Jielin Qiu, XuanLong Nguyen, Zhuolin Yang
1818LeadFL: Client Self-Defense against Model Poisoning in Federated Learning0Chaoyi Zhu, Lydia Y. Chen, Stefanie Roos
1819XTab: Cross-table Pretraining for Tabular Transformers0Bingzhao Zhu, George Karypis, Mahsa Shoaran, Mu Li, Nick Erickson, Xingjian Shi
1820Provable Multi-instance Deep AUC Maximization with Stochastic Pooling0Bokun Wang, Dixian Zhu, Milan Sonka, Tianbao Yang, Xiaodong Wu, Yaxing Wang, Zhi Chen
1821Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning0Junyi Zhu, Matthew B. Blaschko, Ruicong Yao
1822Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes0Hang Li, Jiankai Sun, Yang Liu, Yuanshun Yao, Zhaowei Zhu
1823Label Distributionally Robust Losses for Multi-class Classification: Consistency, Robustness and Adaptivity0Dixian Zhu, Tianbao Yang, Yiming Ying
1824Likelihood Adjusted Semidefinite Programs for Clustering Heterogeneous Data0Xiaohui Chen, Yubo Zhuang, Yun Yang
1825Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?0Haitham BouAmmar, Juliusz Krysztof Ziomek
1826Revisiting Bellman Errors for Offline Model Selection0Daniel de Marchi, Joshua P. Zitovsky, Michael Rene Kosorok, Rishabh Agarwal
1827spred: Solving L1 Penalty with SGD0Liu Ziyin, Zihao Wang
1828The Benefits of Mixup for Feature Learning0Difan Zou, Quanquan Gu, Yuan Cao, Yuanzhi Li