AAAI-2024-Papers

May 23, 2024 · View on GitHub

Application App

Safe, Robust and Responsible AI

Section Papers Preprint Papers Papers with Open Code Papers with Video

:id:TitleRepoPaperVideo
ImageCaptioner2: Image Captioner for Image Captioning Bias Amplification Assessment:heavy_minus_sign:ojs.aaai
A Framework for Data-Driven Explainability in Mathematical Optimization:heavy_minus_sign:ojs.aaaiYouVideo
On the Importance of Application-Grounded Experimental Design for Evaluating Explainable ML Methods:heavy_minus_sign:ojs.aaaiYouVideo
Risk-Aware Continuous Control with Neural Contextual Bandits:heavy_minus_sign:ojs.aaaiYouVideo
Robust Uncertainty Quantification Using Conformalised Monte Carlo Prediction:heavy_minus_sign:ojs.aaaiYouVideo
CCTR: Calibrating Trajectory Prediction for Uncertainty-Aware Motion Planning in Autonomous Driving:heavy_minus_sign:ojs.aaaiYouVideo
Rethinking the Development of Large Language Models from the Causal Perspective: A Legal Text Prediction Case Study:heavy_minus_sign:ojs.aaai
Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning:heavy_minus_sign:ojs.aaai
Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming:heavy_minus_sign:ojs.aaaiYouVideo
Conformal Prediction Regions for Time Series Using Linear Complementarity Programming:heavy_minus_sign:ojs.aaaiYouVideo
TTTS: Tree Test Time Simulation for Enhancing Decision Tree Robustness against Adversarial Examples:heavy_minus_sign:ojs.aaaiYouVideo
Find the Lady: Permutation and Re-synchronization of Deep Neural Networks:heavy_minus_sign:ojs.aaaiYouVideo
Stability Analysis of Switched Linear Systems with Neural Lyapunov Functions:heavy_minus_sign:ojs.aaaiYouVideo
Robustness Verification of Multi-Class Tree Ensembles:heavy_minus_sign:ojs.aaaiYouVideo
P2BPO: Permeable Penalty Barrier-Based Policy Optimization for Safe RL:heavy_minus_sign:ojs.aaaiYouVideo
Trade-Offs in Fine-Tuned Diffusion Models between Accuracy and Interpretability:heavy_minus_sign:ojs.aaaiYouVideo
From Hope to Safety: Unlearning Biases of Deep Models via Gradient Penalization in Latent Space:heavy_minus_sign:ojs.aaaiYouVideo
Automatically Testing Functional Properties of Code Translation Models:heavy_minus_sign:ojs.aaaiYouVideo
A Simple and Yet Fairly Effective Defense for Graph Neural Networks:heavy_minus_sign:ojs.aaaiYouVideo
Invisible Backdoor Attack against 3D Point Cloud Classifier in Graph Spectral Domain:heavy_minus_sign:ojs.aaaiYouVideo
CASE: Exploiting Intra-class Compactness and Inter-class Separability of Feature Embeddings for Out-of-Distribution Detection:heavy_minus_sign:ojs.aaaiYouVideo
Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization:heavy_minus_sign:ojs.aaaiYouVideo
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation:heavy_minus_sign:ojs.aaaiYouVideo
π-Light: Programmatic Interpretable Reinforcement Learning for Resource-Limited Traffic Signal Control:heavy_minus_sign:ojs.aaaiYouVideo
Generative Model for Decision Trees:heavy_minus_sign:ojs.aaaiYouVideo
Omega-Regular Decision Processes:heavy_minus_sign:ojs.aaaiYouVideo
Provable Robustness against a Union of L_0 Adversarial Attacks:heavy_minus_sign:ojs.aaaiYouVideo
All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models:heavy_minus_sign:ojs.aaaiYouVideo
Towards Efficient Verification of Quantized Neural Networks:heavy_minus_sign:ojs.aaaiYouVideo
On the Concept Trustworthiness in Concept Bottleneck Models:heavy_minus_sign:ojs.aaaiYouVideo
Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models:heavy_minus_sign:ojs.aaaiYouVideo
Stronger and Transferable Node Injection Attacks:heavy_minus_sign:ojs.aaaiYouVideo
Learning Fair Policies for Multi-Stage Selection Problems from Observational Data:heavy_minus_sign:ojs.aaaiYouVideo
NeRFail: Neural Radiance Fields-Based Multiview Adversarial Attack:heavy_minus_sign:ojs.aaaiYouVideo
Analysis of Differentially Private Synthetic Data: A Measurement Error Approach:heavy_minus_sign:ojs.aaaiYouVideo
Chasing Fairness in Graphs: A GNN Architecture Perspective:heavy_minus_sign:ojs.aaaiYouVideo
Assume-Guarantee Reinforcement Learning:heavy_minus_sign:ojs.aaaiYouVideo
DeepBern-Nets: Taming the Complexity of Certifying Neural Networks Using Bernstein Polynomial Activations and Precise Bound Propagation:heavy_minus_sign:ojs.aaaiYouVideo
Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation:heavy_minus_sign:ojs.aaaiYouVideo
Quilt: Robust Data Segment Selection against Concept Drifts:heavy_minus_sign:ojs.aaaiYouVideo
OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples:heavy_minus_sign:ojs.aaaiYouVideo
Accelerating Adversarially Robust Model Selection for Deep Neural Networks via Racing:heavy_minus_sign:ojs.aaaiYouVideo
Robust Active Measuring under Model Uncertainty:heavy_minus_sign:ojs.aaaiYouVideo
Towards Large Certified Radius in Randomized Smoothing Using Quasiconcave Optimization:heavy_minus_sign:ojs.aaaiYouVideo
Contrastive Credibility Propagation for Reliable Semi-supervised Learning:heavy_minus_sign:ojs.aaaiYouVideo
Exponent Relaxation of Polynomial Zonotopes and Its Applications in Formal Neural Network Verification:heavy_minus_sign:ojs.aaaiYouVideo
I Prefer Not to Say: Protecting User Consent in Models with Optional Personal Data:heavy_minus_sign:ojs.aaaiYouVideo
Promoting Counterfactual Robustness through Diversity:heavy_minus_sign:ojs.aaaiYouVideo
Revisiting the Information Capacity of Neural Network Watermarks: Upper Bound Estimation and Beyond:heavy_minus_sign:ojs.aaaiYouVideo
PointCVaR: Risk-Optimized Outlier Removal for Robust 3D Point Cloud Classification:heavy_minus_sign:ojs.aaaiYouVideo
Game-Theoretic Unlearnable Example Generator:heavy_minus_sign:ojs.aaaiYouVideo
Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning:heavy_minus_sign:ojs.aaaiYouVideo
Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning:heavy_minus_sign:ojs.aaai
Combining Graph Transformers Based Multi-Label Active Learning and Informative Data Augmentation for Chest Xray Classification:heavy_minus_sign:ojs.aaaiYouVideo
Enumerating Safe Regions in Deep Neural Networks with Provable Probabilistic Guarantees:heavy_minus_sign:ojs.aaaiYouVideo
Divide-and-Aggregate Learning for Evaluating Performance on Unlabeled Data:heavy_minus_sign:ojs.aaaiYouVideo
SentinelLMs: Encrypted Input Adaptation and Fine-Tuning of Language Models for Private and Secure Inference:heavy_minus_sign:ojs.aaaiYouVideo
Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis:heavy_minus_sign:ojs.aaaiYouVideo
Feature Unlearning for Pre-trained GANs and VAEs:heavy_minus_sign:ojs.aaaiYouVideo
Reward Certification for Policy Smoothed Reinforcement Learning:heavy_minus_sign:ojs.aaaiYouVideo
EncryIP: A Practical Encryption-Based Framework for Model Intellectual Property Protection:heavy_minus_sign:ojs.aaaiYouVideo
Neural Closure Certificates:heavy_minus_sign:ojs.aaai
SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models:heavy_minus_sign:ojs.aaaiYouVideo
MaxEnt Loss: Constrained Maximum Entropy for Calibration under Out-of-Distribution Shift:heavy_minus_sign:ojs.aaaiYouVideo
ORES: Open-Vocabulary Responsible Visual Synthesis:heavy_minus_sign:ojs.aaaiYouVideo
Q-SENN: Quantized Self-Explaining Neural Networks:heavy_minus_sign:ojs.aaaiYouVideo
Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection:heavy_minus_sign:ojs.aaaiYouVideo
Adversarial Initialization with Universal Adversarial Perturbation: A New Approach to Fast Adversarial Training:heavy_minus_sign:ojs.aaaiYouVideo
A PAC Learning Algorithm for LTL and Omega-Regular Objectives in MDPs:heavy_minus_sign:ojs.aaaiYouVideo
Robust Stochastic Graph Generator for Counterfactual Explanations:heavy_minus_sign:ojs.aaaiYouVideo
Visual Adversarial Examples Jailbreak Aligned Large Language Models:heavy_minus_sign:ojs.aaaiYouVideo
Dissenting Explanations: Leveraging Disagreement to Reduce Model Overreliance:heavy_minus_sign:ojs.aaaiYouVideo
I-CEE: Tailoring Explanations of Image Classification Models to User Expertise:heavy_minus_sign:ojs.aaaiYouVideo
A Simple and Practical Method for Reducing the Disparate Impact of Differential Privacy:heavy_minus_sign:ojs.aaaiYouVideo
Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations:heavy_minus_sign:ojs.aaaiYouVideo
Human-Guided Moral Decision Making in Text-Based Games:heavy_minus_sign:ojs.aaaiYouVideo
Towards Fairer Centroids in K-means Clustering:heavy_minus_sign:ojs.aaaiYouVideo
Toward Robustness in Multi-Label Classification: A Data Augmentation Strategy against Imbalance and Noise:heavy_minus_sign:ojs.aaaiYouVideo
Bidirectional Contrastive Split Learning for Visual Question Answering:heavy_minus_sign:ojs.aaaiYouVideo
Quantile-Based Maximum Likelihood Training for Outlier Detection:heavy_minus_sign:ojs.aaaiYouVideo
Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention:heavy_minus_sign:ojs.aaai
Toward More Generalized Malicious URL Detection Models:heavy_minus_sign:ojs.aaaiYouVideo
Self-Supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes:heavy_minus_sign:ojs.aaaiYouVideo
Pure-Past Action Masking:heavy_minus_sign:ojs.aaaiYouVideo
Long-Term Safe Reinforcement Learning with Binary Feedback:heavy_minus_sign:ojs.aaaiYouVideo
Identifying Reasons for Bias: An Argumentation-Based Approach:heavy_minus_sign:ojs.aaaiYouVideo
Would You Like Your Data to Be Trained? A User Controllable Recommendation Framework:heavy_minus_sign:ojs.aaaiYouVideo
Moderate Message Passing Improves Calibration: A Universal Way to Mitigate Confidence Bias in Graph Neural Networks:heavy_minus_sign:ojs.aaaiYouVideo
Generating Diagnostic and Actionable Explanations for Fair Graph Neural Networks:heavy_minus_sign:ojs.aaaiYouVideo
Physics-Informed Representation and Learning: Control and Risk Quantification:heavy_minus_sign:ojs.aaaiYouVideo
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration:heavy_minus_sign:ojs.aaai
Concealing Sensitive Samples against Gradient Leakage in Federated Learning:heavy_minus_sign:ojs.aaaiYouVideo
The Evidence Contraction Issue in Deep Evidential Regression: Discussion and Solution:heavy_minus_sign:ojs.aaaiYouVideo
Byzantine-Robust Decentralized Learning via Remove-then-Clip Aggregation:heavy_minus_sign:ojs.aaai
Hypothesis Testing for Class-Conditional Noise Using Local Maximum Likelihood:heavy_minus_sign:ojs.aaaiYouVideo
Providing Fair Recourse over Plausible Groups:heavy_minus_sign:ojs.aaaiYouVideo
Representation-Based Robustness in Goal-Conditioned Reinforcement Learning:heavy_minus_sign:ojs.aaaiYouVideo
Enhancing Off-Policy Constrained Reinforcement Learning through Adaptive Ensemble C Estimation:heavy_minus_sign:ojs.aaaiYouVideo
Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models:heavy_minus_sign:ojs.aaaiYouVideo
LR-XFL: Logical Reasoning-Based Explainable Federated Learning:heavy_minus_sign:ojs.aaai
GaLileo: General Linear Relaxation Framework for Tightening Robustness Certification of Transformers:heavy_minus_sign:ojs.aaai
A Huber Loss Minimization Approach to Byzantine Robust Federated Learning:heavy_minus_sign:ojs.aaaiYouVideo
Responsible Bandit Learning via Privacy-Protected Mean-Volatility Utility:heavy_minus_sign:ojs.aaaiYouVideo
UMA: Facilitating Backdoor Scanning via Unlearning-Based Model Ablation:heavy_minus_sign:ojs.aaaiYouVideo
AdvST: Revisiting Data Augmentations for Single Domain Generalization:heavy_minus_sign:ojs.aaaiYouVideo
Can LLM Replace Stack Overflow? A Study on Robustness and Reliability of Large Language Model Code Generation:heavy_minus_sign:ojs.aaaiYouVideo
DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models:heavy_minus_sign:ojs.aaaiYouVideo
Closing the Gap: Achieving Better Accuracy-Robustness Tradeoffs against Query-Based Attacks:heavy_minus_sign:ojs.aaaiYouVideo
Coevolutionary Algorithm for Building Robust Decision Trees under Minimax Regret:heavy_minus_sign:ojs.aaaiYouVideo