diff --git a/iclr2023_submissions.html b/iclr2023_submissions.html index 2ea2535..aa24142 100644 --- a/iclr2023_submissions.html +++ b/iclr2023_submissions.html @@ -1 +1 @@ -ICLR2023 Statistics
ICLR 2023 Statistics
Github
# (4753)TitleR1stdRatings
1Git Re-Basin: Merging Models modulo Permutation Symmetries8.670.9410, 8, 8
2Rethinking the Expressive Power of GNNs via Graph Biconnectivity8.670.9410, 8, 8
3Emergence of Maps in the Memories of Blind Navigation Agents8.500.878, 8, 8, 10
4DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems8.500.8710, 8, 8, 8
5Graph Neural Networks for Link Prediction with Subgraph Sketching8.500.878, 8, 8, 10
6Revisiting the Entropy Semiring for Neural Speech Recognition8.501.6610, 8, 6, 10
7Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning8.252.058, 10, 10, 5
8Learning a Data-Driven Policy Network for Pre-Training Automated Feature Engineering8.000.008, 8, 8
9Fast Nonlinear Vector Quantile Regression8.000.008, 8, 8
10Scaling Up Probabilistic Circuits by Latent Variable Distillation8.000.008, 8, 8
11​​What learning algorithm is in-context learning? Investigations with linear models8.000.008, 8, 8
12FedExP: Speeding up Federated Averaging via Extrapolation8.000.008, 8, 8
13DreamFusion: Text-to-3D using 2D Diffusion8.000.008, 8, 8, 8
14Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching8.001.6310, 8, 6
15ReAct: Synergizing Reasoning and Acting in Language Models8.000.008, 8, 8
16The Lie Derivative for Measuring Learned Equivariance8.000.008, 8, 8
17Agree to Disagree: Diversity through Disagreement for Better Transferability8.000.008, 8, 8, 8
18Can We Find Nash Equilibria at a Linear Rate in Markov Games?8.000.008, 8, 8, 8
19Aligning Model and Macaque Inferior Temporal Cortex Representations Improves Model-to-Human Behavioral Alignment and Adversarial Robustness8.000.008, 8, 8
20Robust Scheduling with GFlowNets8.000.008, 8, 8, 8
21Transformers Learn Shortcuts to Automata8.001.638, 10, 6
22Strong inductive biases provably prevent harmless interpolation8.000.008, 8, 8
23Confidential-PROFITT: Confidential PROof of FaIr Training of Trees8.000.008, 8, 8
24Minimum Variance Unbiased N:M Sparsity for the Neural Gradients8.000.008, 8, 8
25Asymptotic Instance-Optimal Algorithms for Interactive Decision Making8.001.268, 8, 10, 8, 6
26Targeted Hyperparameter Optimization with Lexicographic Preferences Over Multiple Objectives8.000.008, 8, 8
27Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning8.000.008, 8, 8
28Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability8.000.008, 8, 8
29Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness8.000.008, 8, 8, 8
30AudioGen: Textually Guided Audio Generation8.000.008, 8, 8, 8
31Geometric Networks Induced by Energy Constrained Diffusion8.001.418, 6, 8, 10
32A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification8.001.638, 10, 6
33Martingale Posterior Neural Processes8.000.008, 8, 8
34Relative representations enable zero-shot latent space communication8.001.6310, 6, 8
35Sign and Basis Invariant Networks for Spectral Graph Representation Learning8.000.008, 8, 8, 8
36Conditional Antibody Design as 3D Equivariant Graph Translation8.000.008, 8, 8, 8
37Evaluating Long-Term Memory in 3D Mazes8.000.008, 8, 8
38Generate rather than Retrieve: Large Language Models are Strong Context Generators8.001.418, 10, 8, 6
39Betty: An Automatic Differentiation Library for Multilevel Optimization8.001.418, 6, 10, 8
40Benchmarking Deformable Object Manipulation with Differentiable Physics8.000.008, 8, 8
41Generating Diverse Cooperative Agents by Learning Incompatible Policies8.000.008, 8, 8, 8
42On the duality between contrastive and non-contrastive self-supervised learning7.751.798, 5, 8, 10
43Flow Matching for Generative Modeling7.751.7910, 8, 8, 5
44DiffEdit: Diffusion-based semantic image editing with mask guidance7.751.798, 5, 8, 10
45GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation7.672.058, 5, 10
46Selection-Inference: Exploiting Large Language Models for Interpretable Logical Reasoning7.600.808, 8, 8, 6, 8
47BigVGAN: A Universal Neural Vocoder with Large-Scale Training7.600.808, 8, 8, 8, 6
48Exponential Generalization Bounds with Near-Optimal Rates for $L_q$-Stable Algorithms7.600.808, 6, 8, 8, 8
49CROM: Continuous Reduced-Order Modeling of PDEs Using Implicit Neural Representations7.600.808, 6, 8, 8, 8
50Concept-level Debugging of Part-Prototype Networks7.500.876, 8, 8, 8
51WikiWhy: Answering and Explaining Cause-and-Effect Questions7.500.878, 6, 8, 8
52GEASS: Neural causal feature selection for high-dimensional biological data7.500.878, 8, 6, 8
53Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions7.500.876, 8, 8, 8
54SMART: Self-supervised Multi-task pretrAining with contRol Transformers7.500.878, 8, 8, 6
55The Surprising Effectiveness of Equivariant Models in Domains with Latent Symmetry7.500.878, 8, 8, 6
56Provably Efficient Neural Offline Reinforcement Learning via Perturbed Rewards7.500.878, 8, 8, 6
57Near-optimal Coresets for Robust Clustering7.500.878, 8, 8, 6
58PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification7.500.876, 8, 8, 8
59GLM-130B: An Open Bilingual Pre-trained Model7.500.878, 8, 8, 6
60Provably Auditing Ordinary Least Squares in Low Dimensions7.500.878, 8, 6, 8
61Effects of Graph Convolutions in Multi-layer Networks7.500.878, 8, 8, 6
62Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?7.501.668, 6, 10, 6
63Few-shot Cross-domain Image Generation via Inference-time Latent-code Learning7.500.878, 8, 6, 8
64Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs7.500.878, 8, 8, 6
65Symbolic Physics Learner: Discovering governing equations via Monte Carlo tree search7.500.878, 8, 8, 6
66Prompt-to-Prompt Image Editing with Cross-Attention Control7.500.878, 8, 6, 8
67PV3D: A 3D Generative Model for Portrait Video Generation7.501.666, 8, 10, 6
68UNIFIED-IO: A Unified Model for Vision, Language, and Multi-modal Tasks7.500.878, 6, 8, 8
69Omnigrok: Grokking Beyond Algorithmic Data7.500.876, 8, 8, 8
70A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics7.500.878, 8, 8, 6
71Accurate Image Restoration with Attention Retractable Transformer7.500.878, 8, 8, 6
72Generalized structure-aware missing view completion network for incomplete multi-view clustering7.500.878, 8, 6, 8
73PEER: A Collaborative Language Model7.500.876, 8, 8, 8
74Empowering Networks With Scale and Rotation Equivariance Using A Similarity Convolution7.500.878, 8, 6, 8
75Token Merging: Your ViT But Faster7.500.876, 8, 8, 8
76Image as Set of Points7.500.878, 8, 6, 8
77H2RBox: Horizonal Box Annotation is All You Need for Oriented Object Detection7.501.668, 6, 6, 10
78Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore7.500.878, 8, 8, 6
79Minimax Optimal Kernel Operator Learning via Multilevel Training7.401.7410, 5, 8, 8, 6
80Few-Shot Domain Adaptation For End-to-End Communication7.330.948, 6, 8
81Improved Training of Physics-Informed Neural Networks Using Energy-Based Priors: a Study on Electrical Impedance Tomography7.331.8910, 6, 6
82Combinatorial Pure Exploration of Causal Bandits7.330.948, 8, 6
83The In-Sample Softmax for Offline Reinforcement Learning7.330.948, 6, 8
84Discrete Predictor-Corrector Diffusion Models for Image Synthesis7.330.948, 6, 8
85Binding Language Models in Symbolic Languages7.330.948, 8, 6
86Evolve Smoothly, Fit Consistently: Learning Smooth Latent Dynamics For Advection-Dominated Systems7.330.948, 8, 6
87Learning Language Representations with Logical Inductive Bias7.330.946, 8, 8
88Implicit Bias of Large Depth Networks: a Notion of Rank for Nonlinear Functions7.331.8010, 8, 5, 8, 5, 8
89Contrastive Corpus Attribution for Explaining Representations7.330.948, 8, 6
90SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments7.330.948, 6, 8
91Disentanglement of Correlated Factors via Hausdorff Factorized Support7.330.948, 6, 8
92Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping7.330.946, 8, 8
93DiffusER: Diffusion via Edit-based Reconstruction7.330.946, 8, 8
94Efficient recurrent architectures through activity sparsity and sparse back-propagation through time7.330.946, 8, 8
95Symmetric Pruning in Quantum Neural Networks7.330.948, 8, 6
96Incremental Learning of Structured Memory via Closed-Loop Transcription7.330.948, 6, 8
97Scaling Forward Gradient With Local Losses7.330.948, 6, 8
98Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning7.330.948, 6, 8
99Progress measures for grokking via mechanistic interpretability7.330.946, 8, 8
100Simplified State Space Layers for Sequence Modeling7.330.948, 6, 8
101Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms7.330.946, 8, 8
102Post-hoc Concept Bottleneck Models7.330.948, 6, 8
103Open-Vocabulary Object Detection upon Frozen Vision and Language Models7.330.948, 6, 8
104Temporal Dependencies in Feature Importance for Time Series Prediction7.330.946, 8, 8
105Pre-training via Denoising for Molecular Property Prediction7.330.946, 8, 8
106A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning7.330.946, 8, 8
107SCALE-UP: An Efficient Black-box Input-level Backdoor Detection via Analyzing Scaled Prediction Consistency7.330.948, 6, 8
108Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve7.330.946, 8, 8
109A framework for benchmarking Class-out-of-distribution detection and its application to ImageNet7.330.948, 8, 6
110SketchKnitter: Vectorized Sketch Generation with Diffusion Models7.330.946, 8, 8
111Tailoring Language Generation Models under Total Variation Distance7.330.948, 6, 8
112Bag of Tricks for Unsupervised Text-to-Speech7.330.948, 8, 6
113Statistical Efficiency of Score Matching: The View from Isoperimetry7.330.946, 8, 8
114Multifactor Sequential Disentanglement via Structured Koopman Autoencoders7.330.948, 6, 8
115View Synthesis with Sculpted Neural Points7.330.948, 6, 8
116AutoGT: Automated Graph Transformer Architecture Search7.330.948, 8, 6
117Neural Optimal Transport7.330.946, 8, 8
118Deep Ranking Ensembles for Hyperparameter Optimization7.330.948, 8, 6
119Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms7.330.948, 6, 8
120Measuring axiomatic identifiability of counterfactual image models7.330.948, 8, 6
121GFlowNets and variational inference7.331.8910, 6, 6
122Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes7.251.928, 6, 10, 5
123gDDIM: Generalized denoising diffusion implicit models7.251.308, 8, 8, 5
124A Theoretical Framework for Inference and Learning in Predictive Coding Networks7.252.598, 3, 10, 8
125The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich Regimes7.251.308, 8, 5, 8
126The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks7.251.928, 10, 5, 6
127Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation7.251.305, 8, 8, 8
128A probabilistic framework for task-aligned intra- and inter-area neural manifold estimation7.251.308, 5, 8, 8
129Neuromechanical Autoencoders: Learning to Couple Elastic and Neural Network Nonlinearity7.251.308, 8, 5, 8
130Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning7.251.305, 8, 8, 8
131Efficient Learning of Rationalizable Equilibria in General-Sum Games7.251.308, 8, 8, 5
132ExpressivE: A Spatio-Functional Embedding For Knowledge Graph Completion7.251.928, 5, 10, 6
133Fundamental Limits in Formal Verification of Message-Passing Neural Networks7.252.593, 8, 10, 8
134Learning on Large-scale Text-attributed Graphs via Variational Inference7.251.305, 8, 8, 8
135Extreme Q-Learning: MaxEnt RL without Entropy7.251.928, 5, 10, 6
136STaSy: Score-based Tabular data Synthesis7.251.305, 8, 8, 8
137BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS7.251.308, 5, 8, 8
138A Convergent Single-Loop Algorithm for Gromov-Wasserstein in Graph Data7.251.308, 8, 8, 5
139Provable Memorization Capacity of Transformers7.251.308, 5, 8, 8
140Mega: Moving Average Equipped Gated Attention7.251.308, 5, 8, 8
141Domain-Indexing Variational Bayes for Domain Adaptation7.251.308, 8, 5, 8
142Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?7.251.928, 6, 10, 5
143ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor7.251.308, 8, 8, 5
144Multi-skill Mobile Manipulation for Object Rearrangement7.251.928, 10, 6, 5
145MocoSFL: enabling cross-client collaborative self-supervised learning7.251.308, 8, 8, 5
146MECTA: Memory-Economic Continual Test-Time Model Adaptation7.251.308, 8, 8, 5
147Diversify and Disambiguate: Out-of-Distribution Robustness via Disagreement7.251.308, 8, 8, 5
148Depth Separation with Multilayer Mean-Field Networks7.200.986, 8, 6, 8, 8
149A Holistic View of Noise Transition Matrix in Deep Learning and Beyond7.200.988, 6, 8, 6, 8
150Masked Unsupervised Self-training for Label-free Image Classification7.171.218, 6, 8, 8, 5, 8
151Softened Symbol Grounding for Neuro-symbolic Systems7.002.125, 5, 8, 10
152Learning Group Importance using the Differentiable Hypergeometric Distribution7.001.008, 6, 8, 6
153A Message Passing Perspective on Learning Dynamics of Contrastive Learning7.001.418, 5, 8
154LiftedCL: Lifting Contrastive Learning for Human-Centric Perception7.001.418, 5, 8
155Learning with Logical Constraints but without Shortcut Satisfaction7.001.008, 8, 6, 6
156Automatically Answering and Generating Machine Learning Final Exams7.002.948, 10, 3
157A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias7.002.128, 10, 5, 5
158What Makes Convolutional Models Great on Long Sequence Modeling?7.001.008, 6, 8, 6
159The Role of Coverage in Online Reinforcement Learning7.001.418, 5, 8
160Diffusion-GAN: Training GANs with Diffusion7.001.006, 6, 8, 8
161Real-time variational method for learning neural trajectory and its dynamics7.001.008, 6, 6, 8
162When and why Vision-Language Models behave like Bags-of-Words, and what to do about it?7.001.006, 6, 8, 8
163Learning Iterative Neural Optimizers for Image Steganography7.001.006, 6, 8, 8
164Interpretable Geometric Deep Learning via Learnable Randomness Injection7.001.008, 8, 6, 6
165Is Reinforcement Learning (Not) for Natural Language Processing?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization7.001.006, 6, 8, 8
166Learning rigid dynamics with face interaction graph networks7.001.736, 10, 6, 6
167Why (and When) does Local SGD Generalize Better than SGD?7.001.415, 8, 8
168Do We Really Need Complicated Model Architectures For Temporal Networks?7.001.418, 8, 5
169Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization7.001.008, 8, 6, 6
170(Certified!!) Adversarial Robustness for Free!7.001.008, 6, 8, 6
171Efficient Conditionally Invariant Representation Learning7.001.418, 5, 8
172Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries7.001.418, 8, 5
173Learning Fair Graph Representations via Automated Data Augmentations7.001.008, 8, 6, 6
174Latent Neural ODEs with Sparse Bayesian Multiple Shooting7.001.008, 8, 6, 6
175Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games7.001.008, 8, 6, 6
176Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training7.001.008, 6, 8, 6
177A Higher Precision Algorithm for Computing the $1$-Wasserstein Distance7.001.415, 8, 8
178Imitating Human Behaviour with Diffusion Models7.001.008, 6, 6, 8
179LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval7.001.008, 8, 6, 6
180Sampling-based inference for large linear models, with application to linearised Laplace7.001.008, 8, 6, 6
181Dual Algorithmic Reasoning7.001.415, 8, 8
182Almost Linear Constant-Factor Sketching for $ell_1$ and Logistic Regression7.001.418, 8, 5
183Spectral Subgraph Localization7.001.418, 8, 5
184FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation7.002.1210, 8, 5, 5
185On Compositional Uncertainty Quantification for Seq2seq Graph Parsing7.002.948, 3, 10
186Efficient Attention via Control Variates7.001.006, 8, 6, 8
187Augmented Lagrangian is Enough for Optimal Offline RL with General Function Approximation and Partial Coverage7.001.006, 6, 8, 8
188DocPrompting: Generating Code by Retrieving the Docs7.001.008, 6, 8, 6
189Words are all you need? Language as an approximation for representational similarity7.002.125, 8, 5, 10
190FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning7.001.418, 5, 8
191Spectral Decomposition Representation for Reinforcement Learning7.001.418, 8, 5
192Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication7.001.418, 8, 5
193Learning Sparse Group Models Through Boolean Relaxation7.001.006, 8, 6, 8
194Deconstructing Distributions: A Pointwise Framework of Learning7.001.008, 6, 6, 8
195Parametrizing Product Shape Manifolds by Composite Networks7.001.418, 8, 5
196Learning Hyper Label Model for Programmatic Weak Supervision7.001.008, 6, 6, 8
197STOCHASTIC NO-REGRET LEARNING FOR GENERAL GAMES WITH VARIANCE REDUCTION7.001.008, 6, 8, 6
198TAN without a burn: Scaling laws of DP-SGD7.001.008, 8, 6, 6
199Pink Noise Is All You Need: Colored Noise Exploration in Deep Reinforcement Learning7.001.415, 8, 8
200A Unified Algebraic Perspective on Lipschitz Neural Networks7.001.006, 6, 8, 8
201Sparsity-Constrained Optimal Transport7.001.7910, 8, 5, 6, 6
202Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement7.001.006, 8, 8, 6
203HT-Net: Hierarchical Transformer based Operator Learning Model for Multiscale PDEs7.002.125, 10, 8, 5
204On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation7.001.006, 8, 8, 6
205Accurate Bayesian Meta-Learning by Accurate Task Posterior Inference7.001.008, 8, 6, 6
206Context-enriched molecule representations improve few-shot drug discovery7.001.008, 8, 6, 6
207A Universal 3D Molecular Representation Learning Framework7.002.943, 8, 10
208The Generalized Eigenvalue Problem as a Nash Equilibrium7.001.008, 6, 6, 8
209Language Modelling with Pixels7.001.008, 6, 6, 8
210Faster Gradient-Free Methods for Escaping Saddle Points7.001.008, 6, 8, 6
211Classically Approximating Variational Quantum Machine Learning with Random Fourier Features7.001.415, 8, 8
212Self-supervision through Random Segments with Autoregressive Coding (RandSAC)7.001.415, 8, 8
213Exploring Temporally Dynamic Data Augmentation for Video Recognition7.001.006, 6, 8, 8
214Meta-Learning in Games7.001.006, 8, 8, 6
215Continuized Acceleration for Quasar Convex Functions in Non-Convex Optimization7.001.008, 6, 6, 8
216InCoder: A Generative Model for Code Infilling and Synthesis7.001.006, 6, 8, 8
217Benchmarking Offline Reinforcement Learning on Real-Robot Hardware7.001.008, 8, 6, 6
218Transformers are Sample-Efficient World Models7.001.008, 6, 6, 8
219Scalable Subset Sampling with Neural Conditional Poisson Networks7.001.008, 6, 6, 8
220Diffusion Posterior Sampling for General Noisy Inverse Problems7.001.006, 8, 6, 8
221Learning the Positions in CountSketch7.001.008, 6, 8, 6
222DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection7.001.268, 8, 5, 8, 6
223Provable Sim-to-real Transfer in Continuous Domain with Partial Observations7.001.418, 5, 8
224Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation7.001.418, 8, 5
225Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning7.001.006, 8, 8, 6
226NeRN: Learning Neural Representations for Neural Networks7.001.008, 6, 6, 8
227Rank Preserving Framework for Asymmetric Image Retrieval7.001.006, 8, 8, 6
228Closing the gap: Exact maximum likelihood training of generative autoencoders using invertible layers7.001.006, 8, 8, 6
229Switch-NeRF: Learning Scene Decomposition with Mixture of Experts for Large-scale Neural Radiance Fields7.001.008, 6, 6, 8
230Plateau in Monotonic Linear Interpolation --- A 'Biased' View of Loss Landscape for Deep Networks7.001.006, 8, 8, 6
231Automated Data Augmentations for Graph Classification7.001.415, 8, 8
232Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance7.001.7310, 6, 6, 6
233Human Motion Diffusion Model7.001.006, 8, 8, 6
234More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity6.801.945, 8, 10, 6, 5
235Understanding Edge-of-Stability Training Dynamics with a Minimalist Example6.801.478, 5, 5, 8, 8
236Self-Distillation for Further Pre-training of Transformers6.800.986, 8, 6, 6, 8
237Neural Networks and the Chomsky Hierarchy6.800.986, 8, 8, 6, 6
238Implicit Bias in Leaky ReLU Networks Trained on High-Dimensional Data6.752.5910, 6, 3, 8
239Certified Training: Small Boxes are All You Need6.751.306, 5, 8, 8
240A Kernel Perspective of Skip Connections in Convolutional Networks6.751.305, 8, 8, 6
241Chasing All-Round Graph Representation Robustness: Model, Training, and Optimization6.752.178, 3, 8, 8
242Robust Algorithms on Adaptive Inputs from Bounded Adversaries6.751.308, 6, 5, 8
243Simple initialization and parametrization of sinusoidal networks via their kernel bandwidth6.751.308, 6, 8, 5
244Reparameterization through Spatial Gradient Scaling6.751.305, 8, 6, 8
245Guiding Energy-based Models via Contrastive Latent Variables6.751.306, 8, 5, 8
246Gradient Descent Converges Linearly for Logistic Regression on Separable Data6.751.308, 5, 8, 6
247Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport6.751.926, 5, 6, 10
248On the Sensitivity of Reward Inference to Misspecified Human Models6.752.178, 8, 3, 8
249Promptagator: Few-shot Dense Retrieval From 8 Examples6.751.305, 6, 8, 8
250Label Propagation with Weak Supervision6.751.308, 8, 6, 5
251Learning MLPs on Graphs: A Unified View of Effectiveness, Robustness, and Efficiency6.751.306, 8, 8, 5
252Disentangling with Biological Constraints: A Theory of Functional Cell Types6.751.308, 6, 5, 8
253DINO as a von Mises-Fisher mixture model6.751.308, 5, 6, 8
254Scalable Batch-Mode Deep Bayesian Active Learning via Equivalence Class Annealing6.751.308, 8, 6, 5
255Provable Defense Against Geometric Transformations6.751.306, 5, 8, 8
256Taking a Step Back with KCal: Multi-Class Kernel-Based Calibration for Deep Neural Networks6.751.306, 5, 8, 8
257Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints6.751.305, 8, 8, 6
258Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics6.751.308, 6, 5, 8
259In-Situ Text-Only Adaptation of Speech Models with Low-Overhead Speech Imputations6.751.305, 6, 8, 8
260Choreographer: Learning and Adapting Skills in Imagination6.751.305, 8, 8, 6
261In-context Reinforcement Learning with Algorithm Distillation6.751.308, 8, 6, 5
262User-Interactive Offline Reinforcement Learning6.752.598, 3, 6, 10
263Lower Bounds on the Depth of Integral ReLU Neural Networks via Lattice Polytopes6.751.308, 6, 5, 8
264Learning Vortex Dynamics for Fluid Inference and Prediction6.751.305, 8, 8, 6
265Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data6.751.308, 5, 6, 8
266Unsupervised Semantic Segmentation with Self-supervised Object-centric Representations6.751.305, 8, 6, 8
267Decompositional Generation Process for Instance-Dependent Partial Label Learning6.752.173, 8, 8, 8
268Building a Subspace of Policies for Scalable Continual Learning6.751.306, 8, 8, 5
269Visually-Augmented Language Modeling6.751.926, 5, 10, 6
270Understanding the Role of Nonlinearity in Training Dynamics of Contrastive Learning6.751.305, 6, 8, 8
271CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis6.751.308, 5, 8, 6
272SAM as an Optimal Relaxation of Bayes6.751.308, 8, 5, 6
273Partial Label Unsupervised Domain Adaptation with Class-Prototype Alignment6.751.305, 8, 8, 6
274Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics6.751.306, 5, 8, 8
275Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification6.751.308, 8, 6, 5
276Sampling with Mollified Interaction Energy Descent6.751.308, 6, 8, 5
277Does Zero-Shot Reinforcement Learning Exist?6.752.596, 3, 8, 10
278PaLI: A Jointly-Scaled Multilingual Language-Image Model6.751.305, 8, 8, 6
279Learning with Stochastic Orders6.751.308, 6, 5, 8
280Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement6.751.308, 6, 8, 5
281Powderworld: A Platform for Understanding Generalization via Rich Task Distributions6.752.173, 8, 8, 8
282Is Attention All That NeRF Needs?6.751.308, 6, 5, 8
283The Influence of Learning Rule on Representation Dynamics in Wide Neural Networks6.751.306, 5, 8, 8
284RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch6.751.305, 6, 8, 8
285Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!6.751.306, 8, 8, 5
286Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search6.751.308, 5, 6, 8
287Does Deep Learning Learn to Abstract? A Systematic Probing Framework6.751.308, 5, 6, 8
288Variance-Aware Sparse Linear Bandits6.751.305, 8, 6, 8
289Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction6.751.306, 8, 5, 8
290Self-Consistency Improves Chain of Thought Reasoning in Language Models6.751.925, 6, 6, 10
291Combinatorial-Probabilistic Trade-Off: P-Values of Community Properties Test in the Stochastic Block Models6.751.308, 5, 6, 8
292Improving Deep Regression with Ordinal Entropy6.752.178, 8, 3, 8
293Clifford Neural Layers for PDE Modeling6.751.305, 8, 8, 6
294Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning6.751.306, 8, 8, 5
295A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning6.751.305, 8, 8, 6
296Contextual bandits with concave rewards, and an application to fair ranking6.751.308, 6, 5, 8
297When to Make and Break Commitments?6.751.305, 6, 8, 8
298Advancing Radiograph Representation Learning with Masked Record Modeling6.751.308, 6, 5, 8
299Quadratic models for understanding neural network dynamics6.751.308, 8, 6, 5
300Hidden Markov Transformer for Simultaneous Machine Translation6.751.308, 6, 5, 8
301Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model6.751.305, 8, 6, 8
302Masked Visual-Textual Prediction for Document Image Representation Pretraining6.751.308, 8, 6, 5
303Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting6.751.306, 8, 5, 8
304Linear Connectivity Reveals Generalization Strategies6.751.308, 5, 8, 6
305ViT-Adapter: Exploring Plain Vision Transformer for Accurate Dense Predictions6.751.306, 5, 8, 8
306Collaborative Pure Exploration in Kernel Bandit6.751.308, 8, 6, 5
307LAVA: Data Valuation without Pre-Specified Learning Algorithms6.751.305, 6, 8, 8
308Generative Augmented Flow Networks6.751.306, 5, 8, 8
309Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language6.751.308, 6, 5, 8
310Automating Nearest Neighbor Search Configuration with Constrained Optimization6.751.308, 8, 6, 5
311Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders6.751.308, 8, 5, 6
312Can discrete information extraction prompts generalize across language models?6.751.308, 8, 6, 5
313Contextual Convolutional Networks6.751.308, 5, 8, 6
314Easy Differentially Private Linear Regression6.751.306, 8, 8, 5
315Towards Stable Test-time Adaptation in Dynamic Wild World6.752.178, 8, 8, 3
316Neural ePDOs: Spatially Adaptive Equivariant Partial Differential Operator Based Networks6.751.305, 8, 6, 8
317An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion6.751.306, 8, 5, 8
318PatchDCT: Patch Refinement for High Quality Instance Segmentation6.751.306, 5, 8, 8
319Representation Learning for Low-rank General-sum Markov Games6.751.306, 5, 8, 8
320DFPC: Data flow driven pruning of coupled channels without data.6.670.946, 6, 8
321Transformer-based model for symbolic regression via joint supervised learning6.670.946, 6, 8
322Curriculum-based Co-design of Morphology and Control of Voxel-based Soft Robots6.670.946, 8, 6
323Modeling content creator incentives on algorithm-curated platforms6.670.948, 6, 6
324Efficient Deep Reinforcement Learning Requires Regulating Statistical Overfitting6.670.946, 6, 8
325The Tilted Variational Autoencoder: Improving Out-of-Distribution Detection6.670.946, 8, 6
326Mind the Pool: Convolutional Neural Networks Can Overfit Input Size6.670.948, 6, 6
327Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection6.670.946, 6, 8
328On Achieving Optimal Adversarial Test Error6.670.946, 8, 6
329KwikBucks: Correlation Clustering with Cheap-Weak and Expensive-Strong Signals6.670.946, 6, 8
330Integrating Symmetry into Differentiable Planning with Steerable Convolutions6.670.948, 6, 6
331Revisiting Populations in multi-agent Communication6.670.946, 6, 8
332Sublinear Algorithms for Kernel Matrices via Kernel Density Estimation6.670.946, 6, 8
333Representational Dissimilarity Metric Spaces for Stochastic Neural Networks6.670.946, 6, 8
334Guess the Instruction! Making Language Models Stronger Zero-Shot Learners6.670.946, 6, 8
335TDR-CL: Targeted Doubly Robust Collaborative Learning for Debiased Recommendations6.670.946, 8, 6
336Scaffolding a Student to Instill Knowledge6.670.946, 8, 6
337The Implicit Bias of Minima Stability in Multivariate Shallow ReLU Networks6.670.946, 8, 6
338MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning6.670.946, 8, 6
339Capturing the Motion of Every Joint: 3D Human Pose and Shape Estimation with Independent Tokens6.670.946, 6, 8
340Quality-Similar Diversity via Population Based Reinforcement Learning6.670.946, 8, 6
341Mind's Eye: Grounded Language Model Reasoning through Simulation6.670.946, 8, 6
342Understanding Embodied Reference with Touch-Line Transformer6.670.946, 8, 6
343Domain Generalization via Heckman-type Selection Models6.670.946, 6, 8
344Hyperbolic Deep Reinforcement Learning6.670.946, 8, 6
345Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated6.670.946, 8, 6
346Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier6.670.946, 6, 8
347AutoTransfer: AutoML with Knowledge Transfer - An Application to Graph Neural Networks6.670.948, 6, 6
348Text Summarization with Oracle Expectation6.670.946, 6, 8
349Out-of-Distribution Detection and Selective Generation for Conditional Language Models6.670.946, 6, 8
350Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions6.670.946, 8, 6
351Active Image Indexing6.670.946, 6, 8
352Efficient Model Updates for Approximate Unlearning of Graph-Structured Data6.670.946, 6, 8
353DiGress: Discrete Denoising diffusion for graph generation6.670.948, 6, 6
354Differentially private Bias-Term Only Fine-tuning of Foundation Models6.670.946, 6, 8
355Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats6.670.946, 6, 8
356KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP6.670.948, 6, 6
357MARS: Meta-learning as Score Matching in the Function Space6.670.948, 6, 6
358Simplicial Hopfield networks6.670.946, 8, 6
359MICN: Multi-scale Local and Global Context Modeling for Long-term Series Forecasting6.670.946, 8, 6
360Progressive Voronoi Diagram Subdivision Enables Accurate Data-free Class-Incremental Learning6.670.946, 8, 6
361Hungry Hungry Hippos: Towards Language Modeling with State Space Models6.670.946, 8, 6
362Near-optimal Policy Identification in Active Reinforcement Learning6.670.946, 8, 6
363Generative Modeling Helps Weak Supervision (and Vice Versa)6.670.946, 6, 8
364AIM: Adapting Image Models for Efficient Video Understanding6.670.946, 6, 8
365GAIN: On the Generalization of Instructional Action Understanding6.670.948, 6, 6
366Efficient Federated Domain Translation6.670.948, 6, 6
367Improved Convergence of Differential Private SGD with Gradient Clipping6.670.946, 8, 6
368Learning QUBO Forms in Quantum Annealing6.670.948, 6, 6
369Backstepping Temporal Difference Learning6.670.946, 6, 8
370Learning to Jointly Share and Prune Weights for Grounding Based Vision and Language Models6.670.946, 6, 8
371TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis6.670.948, 6, 6
372Active Learning in Bayesian Neural Networks with Balanced Entropy Learning Principle6.670.946, 8, 6
373Robust Active Distillation6.670.946, 8, 6
374Neural Episodic Control with State Abstraction6.670.948, 6, 6
375Learning to Generate Columns with Application to Vertex Coloring6.670.946, 6, 8
376EVA3D: Compositional 3D Human Generation from 2D Image Collections6.670.948, 6, 6
377Alternating Differentiation for Optimization Layers6.670.946, 6, 8
378MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction6.670.946, 6, 8
379Learning Domain-Agnostic Representation for Disease Diagnosis6.670.948, 6, 6
380Object Tracking by Hierarchical Part-Whole Attention6.670.946, 6, 8
381Boosting the Cycle Counting Power of Graph Neural Networks with I$^2$-GNNs6.601.208, 5, 6, 6, 8
382Pitfalls of Gaussians as a noise distribution in NCE6.601.208, 6, 6, 5, 8
383Theoretical Characterization of Neural Network Generalization with Group Imbalance6.602.0610, 5, 8, 5, 5
384Flow Annealed Importance Sampling Bootstrap6.601.206, 5, 6, 8, 8
385FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification6.601.206, 6, 8, 5, 8
386Sub-Task Decomposition Enables Learning in Sequence to Sequence Tasks6.601.205, 8, 8, 6, 6
387Adversarial Training descends without descent: Finding actual descent directions based on Danskin's theorem6.500.876, 8, 6, 6
388Generating Intuitive Fairness Specifications for Natural Language Processing6.500.876, 6, 8, 6
389LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning6.501.505, 8, 5, 8
390Selective Frequency Network for Image Restoration6.501.508, 8, 5, 5
391Multi-Objective Online Learning6.501.505, 8, 5, 8
392Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient6.500.876, 6, 8, 6
393Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks6.501.505, 8, 5, 8
394On the Importance and Applicability of Pre-Training for Federated Learning6.501.505, 8, 5, 8
395Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward6.501.508, 8, 5, 5
396Weighted Clock Logic Point Process6.501.508, 8, 5, 5
397Diffusion-based Image Translation using disentangled style and content representation6.500.878, 6, 6, 6
398How Much Data Are Augmentations Worth? An Investigation into Scaling Laws, Invariance, and Implicit Regularization6.501.505, 8, 5, 8
399Artificial Neuronal Ensembles with Learned Context Dependent Gating6.501.505, 8, 5, 8
400Backpropagation at the Infinitesimal Inference Limit of Energy-Based Models: Unifying Predictive Coding, Equilibrium Propagation, and Contrastive Hebbian Learning6.501.505, 8, 5, 8
401Dichotomy of Control: Separating What You Can Control from What You Cannot6.501.508, 5, 8, 5
402Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization6.500.876, 8, 6, 6
403Associative Memory Augmented Asynchronous Spatiotemporal Representation Learning for Event-based Perception6.500.876, 8, 6, 6
404Semi Parametric Inducing Point Networks6.500.878, 6, 6, 6
405Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation6.500.876, 8, 6, 6
406Transfer Learning with Deep Tabular Models6.501.505, 8, 8, 5
407Interneurons accelerate learning dynamics in recurrent neural networks for statistical adaptation6.501.505, 5, 8, 8
408HypeR: Multitask Hyper-Prompted Training Enables Large-Scale Retrieval Generalization6.500.878, 6, 6, 6
409On the Trade-Off between Actionable Explanations and the Right to be Forgotten6.500.876, 6, 6, 8
410Learning What and Where - Unsupervised Disentangling Location and Identity Tracking6.501.505, 5, 8, 8
411CANIFE: Crafting Canaries for Empirical Privacy Measurement in Federated Learning6.501.508, 8, 5, 5
412Training language models for deeper understanding improves brain alignment6.501.505, 8, 5, 8
413Sampling-free Inference for Ab-Initio Potential Energy Surface Networks6.501.508, 8, 5, 5
414Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees6.501.505, 5, 8, 8
415Solving Constrained Variational Inequalities via a First-order Interior Point-based Method6.500.876, 6, 8, 6
416Calibration Matters: Tackling Maximization Bias in Large-scale Advertising Recommendation Systems6.500.878, 6, 6, 6
417Data Continuity Matters: Improving Sequence Modeling with Lipschitz Regularizer6.500.876, 6, 6, 8
418Control Graph as Unified IO for Morphology-Task Generalization6.501.505, 8, 8, 5
419Restricted Strong Convexity of Deep Learning Models with Smooth Activations6.500.878, 6, 6, 6
420Koopman Neural Operator Forecaster for Time-series with Temporal Distributional Shifts6.501.505, 8, 5, 8
421The Surprising Computational Power of Nondeterministic Stack RNNs6.500.878, 6, 6, 6
422A Non-monotonic Self-terminating Language Model6.500.876, 6, 6, 8
423Differentially Private $L_2$-Heavy Hitters in the Sliding Window Model6.501.508, 8, 5, 5
424Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning6.501.505, 8, 8, 5
425EA-HAS-Bench: Energy-aware Hyperparameter and Architecture Search Benchmark6.500.876, 6, 8, 6
426Versatile Neural Processes for Learning Implicit Neural Representations6.501.508, 5, 5, 8
427Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning6.500.876, 6, 8, 6
428Characterizing the Influence of Graph Elements6.500.876, 6, 8, 6
429Personalized Federated Learning with Feature Alignment and Classifier Collaboration6.501.508, 5, 5, 8
430Simple Yet Effective Graph Contrastive Learning for Recommendation6.501.505, 8, 5, 8
431Dual Diffusion Implicit Bridges for Image-to-Image Translation6.502.065, 5, 10, 6
432Learning to Grow Pretrained Models for Efficient Transformer Training6.500.878, 6, 6, 6
433Learning to Estimate Shapley Values with Vision Transformers6.501.505, 8, 8, 5
434Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning6.500.878, 6, 6, 6
435Code Translation with Compiler Representations6.502.0610, 6, 5, 5
436AnyDA: Anytime Domain Adaptation6.500.876, 6, 8, 6
437Differentiable Mathematical Programming for Object-Centric Representation Learning6.501.508, 5, 8, 5
438Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding6.500.878, 6, 6, 6
439Mass-Editing Memory in a Transformer6.500.876, 6, 6, 8
440On the Saturation Effect of Kernel Ridge Regression6.500.876, 6, 8, 6
441AANG : Automating Auxiliary Learning6.501.508, 8, 5, 5
442Private Federated Learning Without a Trusted Server: Optimal Algorithms for Convex Losses6.500.876, 6, 6, 8
443Robust Fair Clustering: A Novel Fairness Attack and Defense Framework6.500.876, 8, 6, 6
444Dynamic Historical Adaptation for Continual Image-Text Modeling6.501.508, 5, 8, 5
445Learning without Prejudices: Continual Unbiased Learning via Benign and Malignant Forgetting6.501.508, 8, 5, 5
446Spherical Sliced-Wasserstein6.500.876, 8, 6, 6
447Causal Representation Learning for Instantaneous and Temporal Effects6.501.508, 8, 5, 5
448The Role of ImageNet Classes in Fréchet Inception Distance6.501.508, 5, 5, 8
449Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks6.500.876, 8, 6, 6
450Prompt Learning with Optimal Transport for Vision-Language Models6.500.876, 6, 6, 8
451DASHA: Distributed Nonconvex Optimization with Communication Compression and Optimal Oracle Complexity6.500.876, 6, 8, 6
452LDMIC: Learning-based Distributed Multi-view Image Coding6.500.876, 6, 6, 8
453Causal Balancing for Domain Generalization6.500.876, 6, 6, 8
454Multi-lingual Evaluation of Code Generation Models6.500.876, 6, 6, 8
455ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure6.500.878, 6, 6, 6
456Digging into Backbone Design on Face Detection6.500.878, 6, 6, 6
457Sparse Mixture-of-Experts are Domain Generalizable Learners6.501.508, 5, 8, 5
458STREET: A MULTI-TASK STRUCTURED REASONING AND EXPLANATION BENCHMARK6.501.508, 5, 8, 5
459Fairness-aware Contrastive Learning with Partially Annotated Sensitive Attributes6.501.505, 8, 8, 5
460Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning6.500.876, 6, 8, 6
461Excess Risk of Two-Layer ReLU Neural Networks in Teacher-Student Settings and its Superiority to Kernel Methods6.402.068, 3, 5, 8, 8
462Fundamental limits on the robustness of image classifiers6.401.368, 6, 5, 8, 5
463ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning6.401.365, 6, 8, 5, 8
464RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data6.402.068, 3, 8, 8, 5
465On Emergence of Activation Sparsity in Trained Transformers6.401.368, 5, 8, 5, 6
466ManyDG: Many-domain Generalization for Healthcare Applications6.402.068, 5, 8, 8, 3
467Neuro-Symbolic Procedural Planning with Commonsense Prompting6.401.366, 5, 8, 5, 8
468Direct Embedding of Temporal Network Edges via Time-Decayed Line Graphs6.382.0610, 8, 5, 3, 8, 6, 6, 5
469Quantifying and Mitigating the Impact of Label Errors on Model Disparity Metrics6.331.258, 6, 5
470Learning Continuous Normalizing Flows For Faster Convergence To Target Distribution via Ascent Regularizations6.331.256, 8, 5
471Learning Uncertainty for Unknown Domains with Zero-Target-Assumption6.331.258, 5, 6
472Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples6.331.255, 8, 6
473Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation6.331.255, 8, 6
474Ordered GNN: Ordering Message Passing to Deal with Heterophily and Over-smoothing6.331.256, 5, 8
475Masked Distillation with Receptive Tokens6.331.255, 6, 8
476On Representing Linear Programs by Graph Neural Networks6.331.258, 6, 5
477Implicit Regularization for Group Sparsity6.331.258, 6, 5
478Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems6.331.256, 8, 5
479Supervision Complexity and its Role in Knowledge Distillation6.331.258, 5, 6
480Neural Causal Models for Counterfactual Identification and Estimation6.331.256, 5, 8
481How I Learned to Stop Worrying and Love Retraining6.331.256, 8, 5
482Systematic Rectification of Language Models via Dead-end Analysis6.331.258, 5, 6
483f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation6.331.256, 8, 5
484Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation6.331.258, 6, 5
485Bispectral Neural Networks6.331.255, 6, 8
486Where to Diffuse, How to Diffuse and How to get back: Learning in Multivariate Diffusions6.332.363, 8, 8
487Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences6.331.255, 6, 8
488Explicitly Minimizing the Blur Error of Variational Autoencoders6.331.258, 5, 6
489Error Sensitivity Modulation based Experience Replay: Mitigating Abrupt Representation Drift in Continual Learning6.331.256, 8, 5
490Bayes-MIL: A New Probabilistic Perspective on Attention-based Multiple Instance Learning for Whole Slide Images6.331.258, 5, 6
491Using Language to Extend to Unseen Domains6.331.258, 5, 6
492Explainability as statistical inference6.331.255, 8, 6
493Bringing robotics taxonomies to continuous domains via GPLVM on hyperbolic manifolds6.331.256, 8, 5
494A Theory of Dynamic Benchmarks6.331.258, 5, 6
495Computing all Optimal Partial Transports6.331.258, 6, 5
496A View From Somewhere: Human-Centric Face Representations6.331.258, 6, 5
497Efficient Planning in a Compact Latent Action Space6.331.255, 6, 8
498Localized Randomized Smoothing for Collective Robustness Certification6.331.258, 6, 5
499Unbiased Supervised Contrastive Learning6.331.255, 8, 6
500Compressing multidimensional weather and climate data into neural networks6.331.255, 8, 6
501That Label's got Style: Handling Label Style Bias for Uncertain Image Segmentation6.331.255, 8, 6
502StableDR: Stabilized Doubly Robust Learning for Recommendation on Data Missing Not at Random6.331.256, 5, 8
503Learnable Graph Convolutional Attention Networks6.331.255, 6, 8
504How Sharpness-Aware Minimization Minimizes Sharpness?6.331.255, 8, 6
505Quantized Compressed Sensing with Score-Based Generative Models6.331.255, 8, 6
506On The Relative Error of Random Fourier Features for Preserving Kernel Distance6.332.368, 8, 3
507Weakly Supervised Neuro-Symbolic Image Manipulation via Multi-Hop Complex Instructions6.331.256, 5, 8
508Pushing the Accuracy-Fairness Tradeoff Frontier with Introspective Self-play6.331.258, 6, 5
509Imbalanced Semi-supervised Learning with Bias Adaptive Classifier6.331.258, 6, 5
510Excess risk analysis for epistemic uncertainty with application to variational inference6.332.363, 8, 8
511Meta-Learning General-Purpose Learning Algorithms with Transformers6.331.255, 8, 6
5123D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation6.332.368, 8, 3
513Re-calibrating Feature Attributions for Model Interpretation6.332.368, 8, 3
514Offline RL for Natural Language Generation with Implicit Language Q Learning6.332.368, 8, 3
515Fairness and Accuracy under Domain Generalization6.331.256, 5, 8
516Iteratively Learning Novel Strategies with Diversity Measured in State Distances6.331.255, 8, 6
517Contrastive Learning Can Find An Optimal Basis For Approximately View-Invariant Functions6.331.258, 6, 5
518Efficiently Computing Nash Equilibria in Adversarial Team Markov Games6.331.256, 8, 5
519SimPer: Simple Self-Supervised Learning of Periodic Targets6.332.368, 3, 8
520Causal Imitation Learning via Inverse Reinforcement Learning6.331.256, 8, 5
521Efficient Discrete Multi Marginal Optimal Transport Regularization6.331.255, 8, 6
522Human-level Atari 200x faster6.332.363, 8, 8
523Temporal Domain Generalization with Drift-Aware Dynamic Neural Networks6.331.256, 8, 5
524Matching receptor to odorant with protein language and graph neural networks6.331.256, 8, 5
525PGrad: Learning Principal Gradients For Domain Generalization6.332.368, 3, 8
526Statistical Guarantees for Consensus Clustering6.331.258, 5, 6
527Expressive Monotonic Neural Networks6.332.368, 8, 3
528Learning to CROSS exchange to solve min-max vehicle routing problems6.332.363, 8, 8
529Mitigating Dataset Bias by Using Per-Sample Gradient6.331.258, 5, 6
530Multiple Modes for Continual Learning6.332.873, 6, 10
531REVISITING PRUNING AT INITIALIZATION THROUGH THE LENS OF RAMANUJAN GRAPH6.331.256, 8, 5
532Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model6.331.255, 8, 6
533ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency6.332.368, 8, 3
534Neural Architecture Design and Robustness: A Dataset6.331.256, 8, 5
535Learning to Decompose Visual Features with Latent Textual Prompts6.331.258, 6, 5
536MATS: Memory Attention for Time-Series forecasting6.331.256, 5, 8
537MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer6.331.255, 6, 8
538Text-Driven Generative Domain Adaptation with Spectral Consistency Regularization6.331.258, 6, 5
539Transfer Learning with Pre-trained Conditional Generative Models6.331.255, 6, 8
540Treeformer: Dense Gradient Trees for Efficient Attention Computation6.331.256, 5, 8
541Cycle to Clique (Cy2C) Graph Neural Network: A Sight to See beyond Neighborhood Aggregation6.331.258, 6, 5
5423D Molecular Generation by Virtual Dynamics6.331.255, 6, 8
543Adversarial Attacks on Adversarial Bandits6.331.258, 5, 6
544On the Perils of Cascading Robust Classifiers6.331.255, 8, 6
545Diving into Unified Data-Model Sparsity for Class-Imbalanced Graph Representation Learning6.332.363, 8, 8
546Sparse tree-based Initialization for Neural Networks6.331.258, 6, 5
547On the Performance of Temporal Difference Learning With Neural Networks6.331.258, 6, 5
548Calibrating Sequence likelihood Improves Conditional Language Generation6.331.258, 6, 5
549SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric Models6.331.255, 6, 8
550Fuzzy Alignments in Directed Acyclic Graph for Non-Autoregressive Machine Translation6.331.256, 5, 8
551On the complexity of nonsmooth automatic differentiation6.331.256, 5, 8
552Masked Image Modeling with Denoising Contrast6.331.258, 5, 6
553HiViT: A Simpler and More Efficient Design of Hierarchical Vision Transformer6.331.258, 6, 5
554Risk-Aware Reinforcement Learning with Coherent Risk Measures and Non-linear Function Approximation6.331.256, 8, 5
555Learning Proximal Operators to Discover Multiple Optima6.331.258, 6, 5
556Formal Mathematics Statement Curriculum Learning6.332.368, 3, 8
557POPGym: Benchmarking Partially Observable Reinforcement Learning6.332.368, 8, 3
558Learning Sparse and Low-Rank Priors for Image Recovery via Iterative Reweighted Least Squares Minimization6.331.256, 5, 8
559Truthful Self-Play6.331.258, 5, 6
560Continual Transformers: Redundancy-Free Attention for Online Inference6.331.256, 5, 8
561Dirichlet-based Uncertainty Calibration for Active Domain Adaptation6.331.258, 6, 5
562Robustness to corruption in pre-trained Bayesian neural networks6.331.256, 5, 8
563Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction6.331.255, 8, 6
564Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint6.331.256, 5, 8
565A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta.6.331.258, 5, 6
566ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills6.331.255, 8, 6
567Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching6.331.258, 6, 5
568GANet: Graph-Aware Network for Point Cloud Completion with Displacement-Aware Point Augmentor6.332.8710, 6, 3
569Out-of-distribution Detection with Implicit Outlier Transformation6.331.256, 5, 8
570MCAL: Minimum Cost Human-Machine Active Labeling6.331.255, 6, 8
571Learnable Topological Features For Phylogenetic Inference via Graph Neural Networks6.332.363, 8, 8
572Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection6.332.363, 8, 8
573Surgical Fine-Tuning Improves Adaptation to Distribution Shifts6.331.256, 8, 5
574DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation6.331.255, 8, 6
575Understanding and Adopting Rational Behavior by Bellman Score Estimation6.291.166, 5, 8, 5, 8, 6, 6
576Solving stochastic weak Minty variational inequalities without increasing batch size6.251.096, 5, 6, 8
577WiNeRT: Towards Neural Ray Tracing for Wireless Channel Modelling and Differentiable Simulations6.251.096, 6, 5, 8
578On the Certification of Classifiers for Outperforming Human Annotators6.251.095, 6, 6, 8
579Don’t fear the unlabelled: safe semi-supervised learning via debiasing6.252.056, 3, 8, 8
580Boosting Causal Discovery via Adaptive Sample Reweighting6.251.098, 6, 5, 6
581Mole-BERT: Rethinking Pre-training Graph Neural Networks for Molecules6.251.096, 8, 6, 5
582Learning in temporally structured environments6.251.098, 6, 5, 6
583Efficient Certified Training and Robustness Verification of Neural ODEs6.251.096, 8, 5, 6
584UL2: Unifying Language Learning Paradigms6.252.058, 3, 8, 6
585Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts6.251.096, 6, 8, 5
586FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning6.252.053, 8, 6, 8
587Structured World Representations via Block-Slot Attention6.251.095, 6, 8, 6
588CktGNN: Circuit Graph Neural Network for Electronic Design Automation6.251.095, 8, 6, 6
589Linearly Mapping from Image to Text Space6.252.058, 8, 3, 6
590Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification6.251.096, 5, 8, 6
591Memorization Capacity of Neural Networks with Conditional Computation6.252.053, 6, 8, 8
592Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling6.252.058, 3, 6, 8
593Compositional Task Representations for Large Language Models6.251.096, 8, 5, 6
594Unsupervised Learning for Combinatorial Optimization Needs Meta Learning6.251.096, 8, 5, 6
595Unsupervised Meta-learning via Few-shot Pseudo-supervised Contrastive Learning6.252.056, 8, 3, 8
596Decepticons: Corrupted Transformers Breach Privacy in Federated Learning for Language Models6.253.038, 1, 8, 8
597Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descent6.252.053, 8, 6, 8
598Pruning Deep Neural Networks from a Sparsity Perspective6.251.096, 6, 8, 5
599Composite Slice Transformer: An Efficient Transformer with Composition of Multi-Scale Multi-Range Attentions6.251.096, 6, 8, 5
600Information-Theoretic Diffusion6.251.095, 6, 6, 8
601Robust Graph Dictionary Learning6.251.098, 6, 5, 6
602Understanding Influence Functions and Datamodels via Harmonic Analysis6.251.098, 6, 6, 5
603TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization6.251.096, 6, 8, 5
604Dynamical systems embedding with a physics-informed convolutional network6.251.095, 8, 6, 6
605Learning Interpretable Dynamics from Images of a Freely Rotating 3D Rigid Body6.251.096, 5, 6, 8
606Characteristic Neural Ordinary Differential Equation6.251.096, 5, 6, 8
607Forget Unlearning: Towards True Data-Deletion in Machine Learning6.251.098, 6, 5, 6
608Serving Graph Compression for Graph Neural Networks6.252.056, 3, 8, 8
609Learning where and when to reason in neuro-symbolic inference6.251.096, 5, 6, 8
610FIGARO: Controllable Music Generation using Learned and Expert Features6.251.095, 6, 6, 8
611Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function6.252.058, 3, 8, 6
612Hyper-Decision Transformer for Efficient Online Policy Adaptation6.252.056, 3, 8, 8
613Solving Continuous Control via Q-learning6.251.098, 5, 6, 6
614Rhino: Deep Causal Temporal Relationship Learning with History-dependent Noise6.251.098, 5, 6, 6
615Pseudoinverse-Guided Diffusion Models for Inverse Problems6.251.095, 6, 6, 8
616Sequential Gradient Coding For Straggler Mitigation6.251.098, 6, 6, 5
617Understanding DDPM Latent Codes Through Optimal Transport6.251.095, 6, 6, 8
618Self-supervised learning with rotation-invariant kernels6.251.096, 8, 5, 6
619Bidirectional Language Models Are Also Few-shot Learners6.251.096, 5, 8, 6
620EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data6.251.098, 6, 5, 6
621Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse6.251.096, 8, 6, 5
622Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning6.251.096, 8, 6, 5
623Contrastive Learning for Unsupervised Domain Adaptation of Time Series6.252.058, 8, 3, 6
624Fisher-Legendre (FishLeg) optimization of deep neural networks6.251.096, 5, 8, 6
625A law of adversarial risk, interpolation, and label noise6.251.098, 8, 5, 6, 6, 5, 6, 6
626Revisiting Dense Retrieval with Unaswerable Counterfactuals6.251.098, 6, 6, 5
627Pareto-Efficient Decision Agents for Offline Multi-Objective Reinforcement Learning6.251.098, 5, 6, 6
628Language Models are Realistic Tabular Data Generators6.251.096, 8, 6, 5
629CRISP: Curriculum based Sequential neural decoders for Polar code family6.251.095, 6, 6, 8
630Learning Diffusion Bridges on Constrained Domains6.251.098, 5, 6, 6
631Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models6.251.096, 8, 6, 5
632PartAfford: Part-level Affordance Discovery6.252.053, 6, 8, 8
633NewModel: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing6.251.096, 8, 6, 5
634Max-Margin Works while Large Margin Fails: Generalization without Uniform Convergence6.251.096, 8, 6, 5
635Preference Transformer: Modeling Human Preferences using Transformers for RL6.251.095, 6, 6, 8
636MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations6.251.096, 5, 6, 8
637PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm6.252.058, 8, 6, 3
638Language Models Can Teach Themselves to Program Better6.251.098, 6, 6, 5
639Learning Kernelized Contextual Bandits in a Distributed and Asynchronous Environment6.251.098, 6, 5, 6
640Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep Learning6.251.096, 5, 6, 8
641Diffusion Models for Causal Discovery via Topological Ordering6.252.056, 8, 3, 8
642MetaMD: Principled Optimiser Meta-Learning for Deep Learning6.252.056, 8, 8, 3
643When Source-Free Domain Adaptation Meets Learning with Noisy Labels6.251.096, 5, 6, 8
644Concept Gradient: Concept-based Interpretation Without Linear Assumption6.251.096, 5, 8, 6
645MetaGL: Evaluation-Free Selection of Graph Learning Models via Meta-Learning6.251.096, 6, 5, 8
646Critical Initialization of Wide and Deep Neural Networks through Partial Jacobians: General Theory and Applications6.252.056, 8, 3, 8
647MaskViT: Masked Visual Pre-Training for Video Prediction6.251.096, 6, 8, 5
648How to Train your HIPPO: State Space Models with Generalized Orthogonal Basis Projections6.251.098, 6, 6, 5
649Generalization and Estimation Error Bounds for Model-based Neural Networks6.251.098, 5, 6, 6
650SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization6.251.096, 5, 8, 6
651LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification6.251.096, 5, 6, 8
652Liquid Structural State-Space Models6.252.053, 8, 6, 8
653Ollivier-Ricci Curvature for Hypergraphs: A Unified Framework6.251.096, 8, 5, 6
654TiAda: A Time-scale Adaptive Algorithm For Nonconvex Minimax Optimization6.251.096, 5, 8, 6
655Teacher Guided Training: An Efficient Framework for Knowledge Transfer6.251.096, 6, 5, 8
656Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks6.251.098, 5, 6, 6
657Self-supervised Geometric Correspondence for Category-level 6D Object Pose Estimation in the Wild6.251.096, 6, 5, 8
658A Simple Yet Powerful Deep Active Learning With Snapshots Ensembles6.252.058, 6, 8, 3
659Towards Open Temporal Graph Neural Networks6.251.096, 5, 6, 8
660Batch Multivalid Conformal Prediction6.251.098, 6, 6, 5
661Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design6.252.058, 3, 8, 6
662UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer6.252.058, 6, 3, 8
663Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation6.251.098, 5, 6, 6
664Unsupervised visualization of image datasets using contrastive learning6.252.496, 10, 3, 6
665A Differential Geometric View and Explainability of GNN on Evolving Graphs6.251.098, 6, 6, 5
666Generative Modelling with Inverse Heat Dissipation6.251.095, 6, 8, 6
667Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images6.251.095, 6, 8, 6
668Recon: Reducing Conflicting Gradients From the Root For Multi-Task Learning6.252.058, 6, 8, 3
669Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework6.251.096, 5, 6, 8
670Hierarchical Sliced Wasserstein Distance6.251.096, 8, 5, 6
671Prototypical Calibration for Few-shot Learning of Language Models6.251.095, 8, 6, 6
672Your Contrastive Learning Is Secretly Doing Stochastic Neighbor Embedding6.252.053, 8, 6, 8
673Distributionally Robust Recourse Action6.251.098, 6, 5, 6
674Visual Classification via Description from Large Language Models6.251.095, 6, 6, 8
675The World is Changing: Improving Fair Training under Correlation Shifts6.252.058, 3, 6, 8
676Relational Attention: Generalizing Transformers for Graph-Structured Tasks6.251.096, 8, 6, 5
677Distilling Model Failures as Directions in Latent Space6.252.053, 6, 8, 8
678Countinuous pseudo-labeling from the start6.251.096, 6, 5, 8
679FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging6.251.096, 8, 5, 6
680FoSR: First-order spectral rewiring for addressing oversquashing in GNNs6.251.095, 8, 6, 6
681Deep Generative Symbolic Regression6.251.095, 6, 8, 6
682Diffusion Probabilistic Fields6.251.096, 5, 8, 6
683Novel View Synthesis with Diffusion Models6.251.098, 6, 6, 5
684LMC: Fast Training of GNNs via Subgraph Sampling with Provable Convergence6.252.058, 8, 6, 3
685How to Exploit Hyperspherical Embeddings for Out-of-Distribution Detection?6.251.095, 6, 8, 6
686Emergent world representations: Exploring a sequence model trained on a synthetic task6.252.056, 3, 8, 8
687Programmatically Grounded, Compositionally Generalizable Robotic Manipulation6.252.056, 8, 8, 3
688Anisotropic Message Passing: Graph Neural Networks with Directional and Long-Range Interactions6.251.096, 6, 8, 5
689Planckian Jitter: countering the color-crippling effects of color jitter on self-supervised training6.252.053, 8, 6, 8
690GAMR: A Guided Attention Model for (visual) Reasoning6.251.096, 6, 8, 5
691Monocular Scene Reconstruction with 3D SDF Transformers6.251.095, 8, 6, 6
692Re-parameterizing Your Optimizers rather than Architectures6.252.053, 8, 8, 6
693Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models6.251.098, 6, 5, 6
694Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation6.251.095, 6, 8, 6
695NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes6.251.095, 6, 8, 6
696Analyzing Tree Architectures in Ensembles via Neural Tangent Kernel6.251.098, 6, 5, 6
697Proactive Multi-Camera Collaboration for 3D Human Pose Estimation6.251.095, 8, 6, 6
698Become a Proficient Player with Limited Data through Watching Pure Videos6.251.098, 5, 6, 6
699Multi-domain image generation and translation with identifiability guarantees6.251.095, 6, 8, 6
700Information-Theoretic Analysis of Unsupervised Domain Adaptation6.252.056, 8, 8, 3
701Understanding Zero-shot Adversarial Robustness for Large-Scale Models6.252.058, 3, 8, 6
702Continual evaluation for lifelong learning: Identifying the stability gap6.251.095, 8, 6, 6
703A General Framework For Proving The Equivariant Strong Lottery Ticket Hypothesis6.251.096, 5, 6, 8
704CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning6.252.056, 8, 8, 3
705Everybody Needs Good Neighbours: An Unsupervised Locality-based Method for Bias Mitigation6.251.096, 8, 6, 5
706Towards Robust Object Detection Invariant to Real-World Domain Shifts6.251.098, 6, 6, 5
707Light Sampling Field and BRDF Representation for Physically-based Neural Rendering6.252.056, 8, 8, 3
708Bidirectional Propagation for Cross-Modal 3D Object Detection6.251.095, 6, 8, 6
709Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling6.251.096, 5, 8, 6
710EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data6.251.096, 5, 6, 8
711FINDE: Neural Differential Equations for Finding and Preserving Invariant Quantities6.252.058, 6, 3, 8
712Near-Optimal Adversarial Reinforcement Learning with Switching Costs6.252.058, 8, 6, 3
713Sparse Token Transformer with Attention Back Tracking6.251.095, 6, 6, 8
714Kernel Neural Optimal Transport6.251.098, 5, 6, 6
715Iterative $alpha$-(de)Blending: Learning a Deterministic Mapping Between Arbitrary Densities6.251.098, 6, 5, 6
716Diffusion Models Already Have A Semantic Latent Space6.251.096, 8, 6, 5
717Towards Real-Time Neural Image Compression With Mask Decay6.252.056, 3, 8, 8
718Predicting Cellular Responses with Variational Causal Inference and Refined Relational Information6.251.095, 6, 8, 6
719BrainBERT: Self-supervised representation learning for Intracranial Electrodes6.251.095, 6, 8, 6
720Nonlinear Reconstruction for Operator Learning of PDEs with Discontinuities6.252.058, 3, 6, 8
721Sound Randomized Smoothing in Floating-Point Arithmetic6.251.096, 6, 8, 5
722Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path6.252.056, 3, 8, 8
723Test-Time Robust Personalization for Federated Learning6.251.098, 6, 5, 6
724The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning6.252.056, 8, 8, 3
725MPCFORMER: FAST, PERFORMANT AND PRIVATE TRANSFORMER INFERENCE WITH MPC6.252.058, 8, 6, 3
726Disparate Impact in Differential Privacy from Gradient Misalignment6.251.096, 6, 5, 8
727Interactive Portrait Harmonization6.251.098, 5, 6, 6
728Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction6.251.095, 6, 8, 6
729Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning6.251.095, 8, 6, 6
730WaGI: Wavelet-based GAN Inversion for Preserving High-Frequency Image Details6.251.098, 6, 5, 6
731Continuous-Discrete Convolution for (3+1)D Geometry-Sequence Modeling in Proteins6.251.095, 8, 6, 6
732Uniform-in-time propagation of chaos for the mean field gradient Langevin dynamics6.200.988, 5, 6, 6, 6
733SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing6.201.478, 5, 5, 5, 8
734A Mixture-of-Expert Approach to RL-based Dialogue Management6.201.838, 6, 3, 6, 8
735Can Neural Networks Learn Implicit Logic from Physical Reasoning?6.200.986, 6, 6, 5, 8
736Quantitative Universal Approximation Bounds for Deep Belief Networks6.201.838, 6, 3, 8, 6
737Compositional Law Parsing with Latent Random Functions6.200.988, 6, 5, 6, 6
738StyleMorph: Disentangling Shape, Pose and Appearance through 3D Morphable Image and Geometry Generation6.201.833, 8, 8, 6, 6
739Multi-Prompt Alignment for Multi-source Unsupervised Domain Adaptation6.201.475, 8, 5, 5, 8
740Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning6.200.985, 6, 8, 6, 6
741GRACE-C: Generalized Rate Agnostic Causal Estimation via Constraints6.200.985, 6, 8, 6, 6
742TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding6.201.836, 3, 8, 6, 8
743Learning ReLU networks to high uniform accuracy is intractable6.171.678, 6, 3, 6, 8, 6
744Sharper Bounds for Uniformly Stable Algorithms with Stationary $varphi$-mixing Process6.170.906, 6, 5, 8, 6, 6
745FARE: Provably Fair Representation Learning6.002.453, 8, 8, 3, 8
746Encoding Recurrence into Transformers6.001.415, 8, 5
747Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS6.002.128, 5, 3, 8
748CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code6.002.128, 8, 3, 5
749Cross-Layer Retrospective Retrieving via Layer Attention6.001.225, 5, 8, 6
750xTrimoDock: Cross-Modal Transformer for Multi-Chain Protein Docking6.001.415, 8, 5
751RandProx: Primal-Dual Optimization Algorithms with Randomized Proximal Updates6.002.943, 10, 5
752Guarded Policy Optimization with Imperfect Online Demonstrations6.002.128, 3, 5, 8
753Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement6.001.415, 5, 8
754Arbitrary Virtual Try-On Network: Characteristics Representation and Trade-off between Body and Clothing6.002.128, 3, 8, 5
755Feature selection and low test error in shallow low-rotation ReLU networks6.001.225, 5, 8, 6
756Coupled Multiwavelet Operator Learning for Coupled Differential Equations6.000.006, 6, 6
757Mechanistic Mode Connectivity6.000.006, 6, 6, 6
758ADELT: Unsupervised Transpilation Between Deep Learning Frameworks6.001.225, 6, 5, 8
759Recursive Time Series Data Augmentation6.002.556, 3, 5, 10
760Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms6.001.226, 5, 5, 8
761Ask Me Anything: A simple strategy for prompting language models6.000.006, 6, 6, 6
762The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge Distillation6.001.225, 6, 8, 5
763Over-Training with Mixup May Hurt Generalization6.001.225, 5, 8, 6
764Principal Trade-off Analysis6.002.128, 3, 5, 8
765Federated Neural Bandits6.001.225, 8, 5, 6
766Contextual Subspace Approximation with Neural Householder Transforms6.001.418, 5, 5
767A second order regression model shows edge of stability behavior6.001.105, 8, 6, 6, 5
768Broken Neural Scaling Laws6.001.415, 8, 5
769LEARNING CONTEXT-AWARE ADAPTIVE SOLVERS TO ACCELERATE QUADRATIC PROGRAMMING6.001.415, 5, 8
770$mathrm{SE}(3)$-Equivariant Attention Networks for Shape Reconstruction in Function Space6.001.225, 5, 8, 6
771How Can GANs Learn Hierarchical Generative Models for Real-World Distributions6.000.006, 6, 6
772BiAdam: Fast Adaptive Bilevel Optimization Methods6.002.128, 8, 5, 3
773Lovasz Theta Contrastive Learning6.002.555, 10, 6, 3
774Information Plane Analysis for Dropout Neural Networks6.002.125, 8, 8, 3
775Learning Harmonic Molecular Representations on Riemannian Manifold6.001.228, 6, 5, 5
776Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement6.001.415, 8, 5
777STay-On-the-Ridge (STON'R): Guaranteed Convergence to Local Minimax Equilibrium in Nonconvex-Nonconcave Games6.001.415, 8, 5
778Understanding Multi-Task Scaling in Machine Translation6.001.228, 6, 5, 5
779A Simple Approach for Visual Room Rearrangement: 3D Mapping and Semantic Search6.000.006, 6, 6
780Neural Compositional Rule Learning for Knowledge Graph Reasoning6.002.123, 8, 5, 8
781Efficient approximation of neural population structure and correlations with probabilistic circuits6.001.228, 6, 5, 5
782AGRO: Adversarial discovery of error-prone Groups for Robust Optimization6.001.226, 5, 5, 8
783On The Specialization of Neural Modules6.001.415, 5, 8
784Language models are multilingual chain-of-thought reasoners6.001.006, 8, 5, 6, 6, 5
785Subsampling in Large Graphs Using Ricci Curvature6.001.225, 5, 6, 8
786Score-based Continuous-time Discrete Diffusion Models6.002.555, 6, 10, 3
787SurCo: Learning Linear Surrogates for Combinatorial Nonlinear Optimization Problems6.001.415, 8, 5
788Analogical Networks for Memory-Modulated 3D Parsing6.001.225, 8, 5, 6
789DySR: Adaptive Super-Resolution via Algorithm and System Co-design6.001.225, 6, 5, 8
790Synergies Between Disentanglement and Sparsity: a Multi-Task Learning Perspective6.000.006, 6, 6, 6
791Causal Reasoning in the Presence of Latent Confounders via Neural ADMG Learning6.001.226, 5, 8, 5
792Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD6.001.228, 6, 5, 5
793Pushing the limits of self-supervised learning: Can we outperform supervised learning without labels?6.001.225, 6, 8, 5
794DensePure: Understanding Diffusion Models towards Adversarial Robustness6.001.228, 6, 5, 5
795Automatically Auditing Large Language Models via Discrete Optimization6.001.225, 5, 6, 8
796How gradient estimator variance and bias impact learning in neural networks6.001.225, 5, 8, 6
797Distributed Extra-gradient with Optimal Complexity and Communication Guarantees6.001.415, 8, 5
798FIT: A Metric for Model Sensitivity6.001.908, 8, 3, 5, 6
799Revisiting Robustness in Graph Machine Learning6.000.006, 6, 6
800Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation6.001.226, 5, 8, 5
801Logical Message Passing Networks with One-hop Inference on Atomic Formulas6.000.006, 6, 6
802Symmetries, Flat Minima and the Conserved Quantities of Gradient Flow6.001.225, 8, 6, 5
803Synaptic Dynamics Realize First-order Adaptive Learning and Weight Symmetry6.001.415, 8, 5
804Order Matters: Agent-by-agent Policy Optimization6.001.105, 6, 5, 6, 8
805On the Convergence of AdaGrad on $mathbb{R}^d$: Beyond Convexity, Non-Asymptotic Rate and Acceleration6.001.415, 5, 8
806Large language models are not zero-shot communicators6.001.225, 8, 5, 6
807ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations6.001.415, 8, 5
808Improved Learning-augmented Algorithms for k-means and k-medians Clustering6.000.006, 6, 6
809DIFFUSION GENERATIVE MODELS ON SO(3)6.001.418, 5, 5
810Learning About Progress From Experts6.000.006, 6, 6
811Distributed Graph Neural Network Training with Periodic Stale Representation Synchronization6.001.226, 5, 8, 5
812Obtaining More Generalizable Fair Classifiers on Imbalanced Datasets6.000.006, 6, 6
813Understanding The Robustness of Self-supervised Learning Through Topic Modeling6.000.006, 6, 6
814Adversarial Cheap Talk6.001.228, 5, 5, 6
815Achieve Near-Optimal Individual Regret & Low Communications in Multi-Agent Bandits6.000.006, 6, 6
816Online Boundary-Free Continual Learning by Scheduled Data Prior6.001.105, 6, 8, 5, 6
817Revisiting adapters with adversarial training6.001.228, 6, 5, 5
818A Self-Attention Ansatz for Ab-initio Quantum Chemistry6.001.228, 6, 5, 5
819Multi-Behavior Dynamic Contrastive Learning for Recommendation6.001.228, 5, 5, 6
820HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork6.000.006, 6, 6
821Towards the Detection of Diffusion Model Deepfakes6.001.106, 5, 8, 5, 6
822Identifiability Results for Multimodal Contrastive Learning6.001.228, 6, 5, 5
823Causal Attention to Exploit Transient Emergence of Causal Effect6.001.418, 5, 5
824Correlative Information Maximization Based Biologically Plausible Neural Networks for Correlated Source Separation6.001.415, 8, 5
825Copy is All You Need6.001.226, 5, 5, 8
826Why adversarial training can hurt robust accuracy6.002.128, 3, 5, 8
827Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection6.000.006, 6, 6, 6
828TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization6.001.415, 8, 5
829Improving the imputation of missing data with Markov Blanket discovery6.001.225, 8, 6, 5
830Understanding Neural Coding on Latent Manifolds by Sharing Features and Dividing Ensembles6.000.006, 6, 6
831Defending against Adversarial Audio via Diffusion Model6.001.226, 5, 8, 5
832Theoretical Characterization of the Generalization Performance of Overfitted Meta-Learning6.001.225, 8, 5, 6
833Towards graph-level anomaly detection via deep evolutionary mapping6.001.415, 8, 5
834Global Explainability of GNNs via Logic Combination of Learned Concepts6.001.415, 8, 5
835Instance-Specific Augmentation: Capturing Local Invariances6.000.006, 6, 6
836$Lambda$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among Cells6.000.006, 6, 6, 6
837Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation6.001.418, 5, 5
838Inequality phenomenon in $l_{infty}$-adversarial training, and its unrealized threats6.002.123, 8, 5, 8
839Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow6.000.006, 6, 6
840Complexity-Based Prompting for Multi-step Reasoning6.002.128, 5, 3, 8
841Not All Tasks Are Born Equal: Understanding Zero-Shot Generalization6.001.226, 5, 5, 8
842What Do Self-Supervised Vision Transformers Learn?6.002.125, 3, 8, 8
843Sampled Transformer for Point Sets6.001.225, 5, 8, 6
844Squeeze Training for Adversarial Robustness6.000.006, 6, 6, 6
845Provably efficient multi-task Reinforcement Learning in large state spaces6.001.415, 5, 8
846Learning Multi-Object Positional Relationships via Emergent Communication6.002.128, 5, 3, 8
847The Dark Side of Invariance: Revisiting the Role of Augmentations in Contrastive Learning6.001.225, 5, 6, 8
848Long-Tailed Partial Label Learning via Dynamic Rebalancing6.001.226, 8, 5, 5
849How hard are computer vision datasets? Calibrating dataset difficulty to viewing time6.001.225, 8, 5, 6
850Do We Always Need to Penalize Variance of Losses for Learning with Label Noise?6.001.418, 5, 5
851Causal Estimation for Text Data with (Apparent) Overlap Violations6.000.006, 6, 6, 6
852Adversarial Diversity in Hanabi6.000.006, 6, 6
853CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos6.000.006, 6, 6, 6, 6
854CAREER: Transfer Learning for Economic Prediction of Labor Data6.001.415, 5, 8
855Federated Nearest Neighbor Machine Translation6.000.006, 6, 6, 6
856ROCO: A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs6.001.225, 5, 6, 8
857PiFold: Toward effective and efficient protein inverse folding6.001.418, 5, 5
858Distributional Signals for Node Classification in Graph Neural Networks6.001.415, 8, 5
859Planning Goals for Exploration6.001.903, 5, 6, 8, 8
860Scalable and Equivariant Spherical CNNs by Discrete-Continuous (DISCO) Convolutions6.001.226, 8, 5, 5
861Learning Efficient Hybrid Particle-continuum Representations of Non-equilibrium N-body Systems6.001.415, 8, 5
862Neural Network Approximation of Lipschitz Functions in High Dimensions with Applications to Inverse Problems6.001.226, 5, 5, 8
863Minimum Description Length Control6.001.225, 8, 5, 6
864Tuning Frequency Bias in Neural Network Training with Nonuniform Data6.001.226, 5, 8, 5
865Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?6.002.553, 6, 10, 5
866Does Decentralized Learning with Non-IID Unlabeled Data Benefit from Self Supervision?6.001.228, 5, 5, 6
867MEDFAIR: BENCHMARKING FAIRNESS FOR MEDICAL IMAGEING6.002.123, 5, 8, 8
868Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness6.001.105, 5, 8, 6, 6
869SMART: Sentences as Basic Units for Text Evaluation6.001.225, 8, 5, 6
870Neural Design for Genetic Perturbation Experiments6.001.226, 8, 5, 5
871Quantifying Memorization Across Neural Language Models6.001.225, 5, 8, 6
872Diffusion Adversarial Representation Learning for Self-supervised Vessel Segmentation6.000.006, 6, 6, 6
873A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games6.002.125, 8, 8, 3
874The Dark Side of AutoML: Towards Architectural Backdoor Search6.001.228, 5, 5, 6
875On the Data-Efficiency with Contrastive Image Transformation in Reinforcement Learning6.001.226, 5, 5, 8
876Energy-based Out-of-Distribution Detection for Graph Neural Networks6.001.225, 5, 8, 6
877Compositional Semantic Parsing with Large Language Models6.001.225, 5, 6, 8
878MEDICAL IMAGE UNDERSTANDING WITH PRETRAINED VISION LANGUAGE MODELS: A COMPREHENSIVE STUDY6.001.225, 6, 8, 5
879Adversarial Attack Detection Through Network Transport Dynamics6.001.418, 5, 5
880Knowledge-Driven Active Learning6.001.105, 5, 6, 6, 8
881CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment6.001.105, 5, 6, 8, 6
882Transferring Pretrained Diffusion Probabilistic Models6.001.225, 5, 6, 8
883Test-Time Adaptation via Self-Training with Nearest Neighbor Information6.001.225, 8, 5, 6
884Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting6.001.415, 8, 5
885Massively Scaling Heteroscedastic Classifiers6.001.735, 8, 3, 6, 8, 6
886Blurring Diffusion Models6.001.225, 5, 6, 8
887Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations6.001.226, 5, 5, 8
888On Uni-modal Feature Learning in Multi-modal Learning6.001.225, 6, 8, 5
889VA-DepthNet: A Variational Approach to Single Image Depth Prediction6.001.225, 5, 8, 6
890E-Forcing: Improving Autoregressive Models by Treating it as an Energy-Based One6.001.415, 8, 5
891TRANSFORMER-PATCHER: ONE MISTAKE WORTH ONE NEURON6.001.225, 6, 5, 8
892On the Edge of Benign Overfitting: Label Noise and Overparameterization Level6.000.006, 6, 6
893Measure the Predictive Heterogeneity6.001.225, 6, 8, 5
894In-sample Actor Critic for Offline Reinforcement Learning6.001.228, 5, 6, 5
895Joint Gaussian Mixture Model for Versatile Deep Visual Model Explanation6.002.128, 8, 3, 5
896Localized Graph Contrastive Learning6.001.225, 8, 6, 5
897CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling6.000.006, 6, 6
898Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting6.001.226, 5, 5, 8
899Ensuring DNN Solution Feasibility for Optimization Problems with Linear Constraints6.001.225, 8, 6, 5
900AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE6.001.415, 8, 5
901From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data6.002.125, 3, 8, 8
902FINE: Future-Aware Inference for Streaming Speech Translation6.001.106, 8, 5, 5, 6
903Stable Target Field for Reduced Variance Score Estimation6.001.415, 8, 5
904Dynamic Embeddings of Temporal High-Order Interactions via Neural Diffusion-Reaction Processes6.001.225, 5, 8, 6
905DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking6.003.083, 8, 10, 3
906Certified Defences Against Adversarial Patch Attacks on Semantic Segmentation6.001.228, 5, 5, 6
907How Much Space Has Been Explored? Measuring the Chemical Space Covered by Databases and Machine-Generated Molecules6.001.226, 8, 5, 5
908Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective6.001.105, 6, 8, 6, 5
909DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases6.001.228, 5, 6, 5
910NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis6.001.225, 5, 8, 6
911Iterative Patch Selection for High-Resolution Image Recognition6.002.128, 8, 5, 3
9123D Segmenter: 3D Transformer based Semantic Segmentation via 2D Panoramic Distillation6.001.225, 6, 5, 8
913GOOD: Exploring geometric cues for detecting objects in an open world6.001.226, 8, 5, 5
914TabCaps: A Capsule Neural Network for Tabular Data Classification with BoW Routing6.001.415, 5, 8
915Koopman neural operator for learning non-linear partial differential equations6.001.415, 5, 8
916CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling6.001.225, 5, 6, 8
917Toeplitz Neural Network for Sequence Modeling6.002.123, 8, 5, 8
918Deep Learning on Implicit Neural Representations of Shapes6.001.228, 5, 6, 5
919Learning Counterfactually Invariant Predictors6.001.228, 5, 6, 5
920ImaginaryNet: Learning Object Detectors without Real Images and Annotations6.001.225, 8, 6, 5
921Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased6.000.006, 6, 6, 6
922From $t$-SNE to UMAP with contrastive learning6.001.908, 5, 8, 3, 6
923Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning6.001.008, 5, 6, 6, 5, 6
924Generalize Learned Heuristics to Solve Large-scale Vehicle Routing Problems in Real-time6.001.226, 5, 8, 5
925Towards the Generalization of Contrastive Self-Supervised Learning6.002.285, 3, 6, 10, 6
926Do We Need Neural Collapse? Learning Diverse Features for Fine-grained and Long-tail Classification6.001.415, 8, 5
927DepthFL : Depthwise Federated Learning for Heterogeneous Clients6.001.225, 6, 5, 8
928BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers6.001.226, 5, 8, 5
929CooPredict : Cooperative Differential Games For Time Series Prediction6.001.415, 8, 5
930Molecule Generation For Target Protein Binding with Structural Motifs6.001.226, 5, 5, 8
931Towards Robustness Certification Against Universal Perturbations6.002.128, 8, 5, 3
932Multimodal Federated Learning via Contrastive Representation Ensemble6.001.225, 8, 5, 6
933Adversarial perturbation based latent reconstruction for domain-agnostic self-supervised learning6.001.225, 6, 8, 5
934Protein Representation Learning by Geometric Structure Pretraining6.001.225, 8, 5, 6
935Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation6.001.226, 8, 5, 5
936Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning6.001.228, 6, 5, 5
937Reversible Column Networks6.000.006, 6, 6
938What Is Missing in IRM Training and Evaluation? Challenges and Solutions6.000.006, 6, 6
939Multi-task Self-supervised Graph Neural Networks Enable Stronger Task Generalization6.000.006, 6, 6
940Hierarchies of Reward Machines6.001.418, 5, 5
941LatentAugment: Dynamically Optimized Latent Probabilities of Data Augmentation6.001.225, 8, 5, 6
942Policy Contrastive Imitation Learning6.001.415, 5, 8
943Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes6.000.006, 6, 6, 6
944Dataless Knowledge Fusion by Merging Weights of Language Models6.001.225, 6, 8, 5
945GReTo: Remedying dynamic graph topology-task discordance via target homophily6.001.106, 6, 8, 5, 5
946Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning6.000.006, 6, 6
947Particle-based Variational Inference with Preconditioned Functional Gradient Flow6.000.006, 6, 6
948Selective Annotation Makes Language Models Better Few-Shot Learners6.001.225, 5, 6, 8
949Adaptive Client Sampling in Federated Learning via Online Learning with Bandit Feedback6.001.225, 5, 6, 8
950SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation6.002.128, 3, 8, 5
951Learning Symbolic Models for Graph-structured Physical Mechanism6.001.415, 5, 8
952AdaDQH Optimizer: Evolving from Stochastic to Adaptive by Auto Switch of Precondition Matrix6.001.418, 5, 5
953Dataset Pruning: Reducing Training Data by Examining Generalization Influence6.001.225, 8, 6, 5
954Expected Gradients of Maxout Networks and Consequences to Parameter Initialization6.001.108, 6, 5, 5, 6
955Online Continual Learning for Progressive Distribution Shift (OCL-PDS): A Practitioner's Perspective6.002.555, 3, 10, 6
956Understanding Why Generalized Reweighting Does Not Improve Over ERM6.001.226, 5, 5, 8
957Composing Ensembles of Pre-trained Models via Iterative Consensus6.001.226, 8, 5, 5
958Learning Label Encodings for Deep Regression6.000.006, 6, 6, 6
959Riemannian Metric Learning via Optimal Transport6.001.225, 6, 5, 8
960Deep Variational Implicit Processes6.001.225, 6, 5, 8
961Estimating individual treatment effects under unobserved confounding using binary instruments6.000.006, 6, 6, 6
962Denoising Diffusion Error Correction Codes6.000.006, 6, 6
963Exploring Active 3D Object Detection from a Generalization Perspective6.000.006, 6, 6, 6
964Learning Object-Language Alignments for Open-Vocabulary Object Detection6.001.225, 8, 6, 5
965Inferring Fluid Dynamics via Inverse Rendering6.001.418, 5, 5
966Exploring Low-Rank Property in Multiple Instance Learning for Whole Slide Image Classification6.001.228, 6, 5, 5
967Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs6.001.225, 5, 6, 8
968IDP: Iterative Differentiable Pruning based on Attention for Deep Neural Networks6.001.228, 5, 6, 5
969OTOv2: Automatic, Generic, User-Friendly6.001.415, 5, 8
970Sparse Q-Learning: Offline Reinforcement Learning with Implicit Value Regularization6.001.415, 5, 8
971Admeta: A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers with Bidirectional Looking6.000.006, 6, 6, 6
972Statistical Inference for Fisher Market Equilibrium6.000.006, 6, 6
973Scenario-based Question Answering with Interacting Contextual Properties6.000.006, 6, 6
974Visual Recognition with Deep Nearest Centroids6.001.225, 6, 8, 5
975Continuous PDE Dynamics Forecasting with Implicit Neural Representations6.000.006, 6, 6, 6
976Towards Inferential Reproducibility of Machine Learning Research6.001.418, 5, 5
977Graph Contrastive Learning for Skeleton-based Action Recognition6.002.125, 8, 3, 8
978Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation6.001.108, 6, 5, 6, 5
979Spikformer: When Spiking Neural Network Meets Transformer6.002.555, 10, 3, 6
980Multimodal Analogical Reasoning over Knowledge Graphs6.001.415, 5, 8
981What shapes the loss landscape of self supervised learning?6.000.006, 6, 6
982Conditional Positional Encodings for Vision Transformers6.001.226, 8, 5, 5
983Label Distribution Learning via Implicit Distribution Representation6.001.228, 5, 6, 5
984Learning to Compose Soft Prompts for Compositional Zero-Shot Learning6.001.228, 6, 5, 5
985SQA3D: Situated Question Answering in 3D Scenes6.000.006, 6, 6, 6
986The Benefits of Model-Based Generalization in Reinforcement Learning6.001.225, 5, 6, 8
987Extracting Robust Models with Uncertain Examples6.001.225, 5, 6, 8
988Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks6.001.226, 5, 8, 5
989DifFace: Blind Face Restoration with Diffused Error Contraction6.001.226, 5, 8, 5
990ChiroDiff: Modelling chirographic data with Diffusion Models6.000.006, 6, 6
991Real-Time Image Demoir$acute{e}$ing on Mobile Devices6.002.123, 8, 5, 8
992Steering Prototypes with Prompt Tuning for Rehearsal-free Continual Learning6.000.006, 6, 6, 6
993Decompose to Generalize: Species-Generalized Animal Pose Estimation6.001.225, 5, 8, 6
994Learning Implicit Scale Conditioned Memory Compensation for Talking Head Generation6.000.006, 6, 6
995Logical Entity Representation in Knowledge-Graphs for Differentiable Rule Learning6.001.228, 5, 6, 5
996Suppressing the Heterogeneity: A Strong Feature Extractor for Few-shot Segmentation6.001.226, 5, 5, 8
997Sparse and Hierarchical Masked Modeling for Convolutional Representation Learning6.001.415, 8, 5
998On amortizing convex conjugates for optimal transport6.000.006, 6, 6, 6
999ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training6.001.228, 6, 5, 5
1000Generalization Bounds for Federated Learning: Fast Rates, Unparticipating Clients and Unbounded Losses5.831.075, 6, 5, 6, 8, 5
1001Corrupted Image Modeling for Self-Supervised Visual Pre-Training5.831.076, 5, 8, 6, 5, 5
1002Neural Probabilistic Logic Programming in Discrete-Continuous Domains5.801.175, 5, 5, 8, 6
1003Substructure-Atom Cross Attention for Molecular Representation Learning5.801.175, 5, 8, 5, 6
1004Language Models Can (kind of) Reason: A Systematic Formal Analysis of Chain-of-Thought5.801.178, 5, 5, 5, 6
1005Evaluation of Active Feature Acquisition Methods under Missing Data5.801.606, 8, 6, 6, 3
1006Learning to Induce Causal Structure5.801.176, 5, 5, 5, 8
1007Energy Transformer5.801.175, 5, 8, 6, 5
1008Sample Relationships through the Lens of Learning Dynamics with Label Information5.801.178, 5, 5, 6, 5
1009CUDA: Curriculum of Data Augmentation for Long-tailed Recognition5.801.176, 5, 8, 5, 5
1010Transport with Support: Data-Conditional Diffusion Bridges5.750.436, 6, 5, 6
1011FairGBM: Gradient Boosting with Fairness Constraints5.751.793, 6, 8, 6
1012Robust Training through Adversarially Selected Data Subsets5.750.436, 5, 6, 6
1013Face reconstruction from facial templates by learning latent space of a generator network5.750.435, 6, 6, 6
1014Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery5.751.793, 6, 8, 6
1015Gray-Box Gaussian Processes for Automated Reinforcement Learning5.751.305, 5, 5, 8
1016One-Step Estimator for Permuted Sparse Recovery5.750.436, 6, 6, 5
1017Leveraging Large Language Models for Multiple Choice Question Answering5.751.308, 5, 5, 5
1018Transfer NAS with Meta-learned Bayesian Surrogates5.750.436, 6, 5, 6
1019Mitigating the Limitations of Multimodal VAEs with Coordination-Based Approach5.751.305, 5, 5, 8
1020Sharp Convergence Analysis of Gradient Descent for Deep Linear Neural Networks5.751.305, 5, 8, 5
1021Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation5.750.436, 6, 6, 5
1022Sparse Distributed Memory is a Continual Learner5.751.305, 8, 5, 5
1023Hyper-parameter Tuning for Fair Classification without Sensitive Attribute Access5.751.308, 5, 5, 5
1024Rethinking Symbolic Regression: Morphology and Adaptability in the Context of Evolutionary Algorithms5.751.798, 6, 6, 3
1025Imitating Graph-Based Planning with Goal-Conditioned Policies5.751.796, 3, 8, 6
1026Computational Language Acquisition with Theory of Mind5.751.798, 6, 3, 6
1027Pareto Invariant Risk Minimization5.751.308, 5, 5, 5
1028Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories5.750.436, 6, 6, 5
1029STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables5.750.436, 5, 6, 6
1030Compressed Predictive Information Coding5.751.796, 6, 3, 8
1031WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus5.751.793, 6, 8, 6
1032Reinforcement Learning-Based Estimation for Partial Differential Equations5.750.436, 5, 6, 6
1033Heterogeneous-Agent Mirror Learning5.751.798, 3, 6, 6
1034TextShield: Beyond Successfully Detecting Adversarial Sentences in NLP5.751.305, 5, 8, 5
1035Minimalistic Unsupervised Learning with the Sparse Manifold Transform5.750.436, 6, 5, 6
1036Quantile Risk Control: A Flexible Framework for Bounding the Probability of High-Loss Predictions5.750.436, 5, 6, 6
1037HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention5.750.436, 5, 6, 6
1038Return Augmentation gives Supervised RL Temporal Compositionality5.750.436, 6, 5, 6
1039Characterizing intrinsic compositionality in transformers with Tree Projections5.751.796, 3, 6, 8
1040Open-Set 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning5.750.436, 6, 6, 5
1041Interaction-Based Disentanglement of Entities for Object-Centric World Models5.750.436, 6, 5, 6
1042PromptBoosting: Black-Box Text Classification with Ten Forward Passes5.750.436, 6, 6, 5
1043Adaptive Optimization in the $infty$-Width Limit5.751.305, 5, 5, 8
1044A Control-Centric Benchmark for Video Prediction5.751.796, 3, 8, 6
1045Data-Efficient Finetuning Using Cross-Task Nearest Neighbors5.751.796, 3, 8, 6
1046Unveiling Transformers with LEGO: A Synthetic Reasoning Task5.751.798, 3, 6, 6
1047Efficiently Controlling Multiple Risks with Pareto Testing5.751.796, 8, 6, 3
1048Learning Structured Representations by Embedding Class Hierarchy5.751.308, 5, 5, 5
1049FunkNN: Neural Interpolation for Functional Generation5.750.435, 6, 6, 6
1050Approximating any Function via Coreset for Radial Basis Functions: Towards Provable Data Subset Selection For Efficient Neural Networks training5.750.435, 6, 6, 6
1051Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation5.751.796, 6, 8, 3
1052A Statistical Framework for Personalized Federated Learning and Estimation: Theory, Algorithms, and Privacy5.750.435, 6, 6, 6
1053Learning Low Dimensional State Spaces with Overparameterized Recurrent Neural Networks5.750.436, 6, 5, 6
1054DT+GNN: A Fully Explainable Graph Neural Network using Decision Trees5.750.436, 6, 6, 5
1055Spatio-temporal point processes with deep non-stationary kernels5.750.435, 6, 6, 6
1056DAG Learning via Sparse Relaxations5.750.436, 5, 6, 6
1057Autoregressive Diffusion Model for Graph Generation5.750.436, 5, 6, 6
1058Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations5.750.436, 6, 6, 5
1059Don’t forget the nullspace! Nullspace occupancy as a mechanism for out of distribution failure5.751.305, 5, 8, 5
1060Towards Interpretable Deep Reinforcement Learning with Human-Friendly Prototypes5.750.435, 6, 6, 6
1061Compositional Task Generalization with Discovered Successor Feature Modules5.751.796, 6, 8, 3
1062Phenaki: Variable Length Video Generation from Open Domain Textual Descriptions5.751.793, 6, 8, 6
1063On the (Non-)Robustness of Two-Layer Neural Networks in Different Learning Regimes5.751.796, 3, 8, 6
1064CrAM: A Compression-Aware Minimizer5.751.798, 6, 3, 6
1065Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees5.751.796, 3, 8, 6
1066Hebbian Deep Learning Without Feedback5.750.435, 6, 6, 6
1067Learning to Abstain from Uninformative Data5.751.308, 5, 5, 5
1068Know Your Boundaries: The Advantage of Explicit Behavior Cloning in Offline RL5.751.793, 6, 8, 6
1069Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning5.751.793, 6, 8, 6
1070Maximum Entropy Information Bottleneck for Confidence-aware Stochastic Embedding5.751.305, 8, 5, 5
1071Certifiably Robust Transformers with 1-Lipschitz Self-Attention5.750.435, 6, 6, 6
1072$k$NN Prompting: Learning Beyond the Context with Nearest Neighbor Inference5.751.796, 6, 8, 3
1073Uncovering Directions of Instability via Quadratic Approximation of Deep Neural Loss in Reinforcement Learning5.751.308, 5, 5, 5
1074This Looks Like It Rather Than That: ProtoKNN For Similarity-Based Classifiers5.750.436, 5, 6, 6
1075Leveraging Importance Weights in Subset Selection5.751.798, 6, 6, 3
1076Gradient flow in the gaussian covariate model: exact solution of learning curves and multiple descent structures5.750.436, 6, 6, 5
1077MILAN: Masked Image Pretraining on Language Assisted Representation5.751.305, 8, 5, 5
1078Learning topology-preserving data representations5.751.796, 8, 6, 3
1079The Curious Case of Benign Memorization5.751.796, 3, 6, 8
1080Can Wikipedia Help Offline Reinforcement Learning?5.751.798, 6, 3, 6
1081Modeling Temporal Data as Continuous Functions with Process Diffusion5.750.435, 6, 6, 6
1082Model-based Causal Bayesian Optimization5.751.305, 8, 5, 5
1083Probabilistic Imputation for Time-series Classification with Missing Data5.751.305, 5, 5, 8
1084Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints5.751.796, 6, 8, 3
1085Statistical Theory of Differentially Private Marginal-based Data Synthesis Algorithms5.750.436, 5, 6, 6
1086A Primal-Dual Framework for Transformers and Neural Networks5.751.796, 3, 6, 8
1087Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization5.750.436, 5, 6, 6
1088MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors5.751.798, 6, 3, 6
1089Quantum Vision Transformers5.752.595, 10, 3, 5
1090Pre-training Protein Structure Encoder via Siamese Diffusion Trajectory Prediction5.751.305, 8, 5, 5
1091Scaling Laws in Mean-Field Games5.751.796, 6, 3, 8
1092Clustering for directed graphs using parametrized random walk diffusion kernels5.750.435, 6, 6, 6
1093ProsodyBERT: Self-Supervised Prosody Representation for Style-Controllable TTS5.752.595, 10, 3, 5
1094Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation5.750.436, 6, 5, 6
1095The hidden uniform cluster prior in self-supervised learning5.750.435, 6, 6, 6
1096Spacetime Representation Learning5.751.798, 6, 3, 6
1097CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks5.750.435, 6, 6, 6
1098LipsFormer: Introducing Lipschitz Continuity to Vision Transformers5.751.793, 8, 6, 6
1099Automatic Chain of Thought Prompting in Large Language Models5.751.793, 6, 6, 8
1100Latent Variable Representation for Reinforcement Learning5.751.793, 6, 8, 6
1101SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning5.751.798, 6, 3, 6
1102Attention-Guided Backdoor Attacks against Transformers5.751.305, 5, 8, 5
1103Overthinking the Truth: Understanding how Language Models process False Demonstrations5.751.305, 8, 5, 5
1104Re-Imagen: Retrieval-Augmented Text-to-Image Generator5.750.435, 6, 6, 6
1105Implicit regularization via Spectral Neural Networks and non-linear matrix sensing5.751.796, 6, 3, 8
1106A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning5.751.305, 8, 5, 5
1107Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning5.750.436, 6, 5, 6
1108Graph Convolutional Normalizing Flows for Semi-Supervised Classification and Clustering5.751.308, 5, 5, 5
1109Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic5.750.436, 6, 6, 5
1110Weighted Ensemble Self-Supervised Learning5.751.793, 6, 8, 6
1111TILP: Differentiable Learning of Temporal Logical Rules on Knowledge Graphs5.751.305, 5, 5, 8
1112Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP5.751.305, 5, 8, 5
1113CURE: A Pre-training Framework on Large-scale Patient Data for Treatment Effect Estimation5.751.305, 5, 8, 5
1114Bridging the Gap between Semi-supervised and Supervised Continual Learning via Data Programming5.751.305, 8, 5, 5
1115Measuring Forgetting of Memorized Training Examples5.750.436, 6, 5, 6
1116Efficient Edge Inference by Selective Query5.751.796, 8, 6, 3
1117Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments5.751.305, 8, 5, 5
1118Model Transferability with Responsive Decision Subjects5.751.305, 5, 5, 8
1119NTFields: Neural Time Fields for Physics-Informed Robot Motion Planning5.750.436, 6, 5, 6
1120ZiCo: Zero-shot NAS via inverse Coefficient of Variation on Gradients5.750.436, 6, 5, 6
1121Learning Simultaneous Navigation and Construction in Grid Worlds5.750.435, 6, 6, 6
1122PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs5.750.435, 6, 6, 6
1123Towards Minimax Optimal Reward-free Reinforcement Learning in Linear MDPs5.750.436, 5, 6, 6
1124Which Layer is Learning Faster? A Systematic Exploration of Layer-wise Convergence Rate for Deep Neural Networks5.750.436, 6, 6, 5
1125Scaleformer: Iterative Multi-scale Refining Transformers for Time Series Forecasting5.750.436, 6, 6, 5
1126Sparse MoE with Random Routing as the New Dropout: Training Bigger and Self-Scalable Models5.750.436, 5, 6, 6
1127Jump-Start Reinforcement Learning5.751.796, 8, 6, 3
1128Sequence to sequence text generation with diffusion models5.751.793, 6, 6, 8
1129BSTT: A Bayesian Spatial-Temporal Transformer for Sleep Staging5.751.308, 5, 5, 5
1130Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation5.750.436, 6, 5, 6
1131Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition5.750.436, 6, 6, 5
1132Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning5.751.305, 8, 5, 5
1133Equivariant Energy-Guided SDE for Inverse Molecular Design5.751.308, 5, 5, 5
1134Demystifying Approximate RL with $epsilon$-greedy Exploration: A Differential Inclusion View5.751.308, 5, 5, 5
1135Delving into the Openness of CLIP5.751.305, 5, 5, 8
1136Unsupervised Manifold Alignment with Joint Multidimensional Scaling5.751.798, 3, 6, 6
1137Learning with Auxiliary Activation for Memory-Efficient Training5.751.793, 6, 6, 8
1138Finding the global semantic representation in GAN through Fréchet Mean5.751.798, 3, 6, 6
1139E3Bind: An End-to-End Equivariant Network for Protein-Ligand Docking5.750.435, 6, 6, 6
1140Joint Generator-Ranker Learning for Natural Language Generation5.750.436, 5, 6, 6
1141Gromov-Wasserstein Autoencoders5.750.436, 6, 5, 6
1142Learning to Learn with Generative Models of Neural Network Checkpoints5.751.305, 8, 5, 5
1143Optimal Activation Functions for the Random Features Regression Model5.751.308, 5, 5, 5
1144Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap5.751.798, 3, 6, 6
1145Hierarchical Protein Representations via Complete 3D Graph Networks5.751.798, 6, 6, 3
1146Write and Paint: Generative Vision-Language Models are Unified Modal Learners5.750.436, 5, 6, 6
1147Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing5.751.796, 8, 3, 6
1148Contrastive Novelty Learning: Anticipating Outliers with Large Language Models5.750.436, 6, 5, 6
1149Adaptive Robust Evidential Optimization For Open Set Detection from Imbalanced Data5.750.435, 6, 6, 6
1150Learning Soft Constraints From Constrained Expert Demonstrations5.751.305, 5, 5, 8
1151Bridge the Inference Gaps of Neural Processes via Expectation Maximization5.751.793, 6, 6, 8
1152Masked Vision and Language Modeling for Multi-modal Representation Learning5.751.305, 5, 5, 8
1153Markup-to-Image Diffusion Models with Scheduled Sampling5.751.796, 6, 8, 3
1154Posterior Sampling Model-based Policy Optimization under Approximate Inference5.751.793, 8, 6, 6
1155What Can we Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers?5.750.436, 6, 6, 5
1156Transformer Meets Boundary Value Inverse Problems5.751.308, 5, 5, 5
1157Landscape Learning for Neural Network Inversion5.750.436, 5, 6, 6
1158Stochastic Multi-Person 3D Motion Forecasting5.751.798, 6, 6, 3
1159Multi-Objective Reinforcement Learning: Convexity, Stationarity and Pareto Optimality5.751.798, 6, 3, 6
1160Continual Unsupervised Disentangling of Self-Organizing Representations5.751.793, 8, 6, 6
1161Learning Human-Compatible Representations for Case-Based Decision Support5.750.436, 5, 6, 6
1162Unified Discrete Diffusion for Simultaneous Vision-Language Generation5.751.305, 8, 5, 5
1163Evidential Uncertainty and Diversity Guided Active Learning for Scene Graph Generation5.750.436, 6, 6, 5
1164Approximate Nearest Neighbor Search through Modern Error-Correcting Codes5.751.796, 8, 6, 3
1165DENSE RGB SLAM WITH NEURAL IMPLICIT MAPS5.750.436, 6, 6, 5
1166Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval5.751.796, 6, 8, 3
1167Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths5.751.793, 6, 8, 6
1168Understanding Rare Spurious Correlations in Neural Networks5.751.305, 8, 5, 5
1169Neural Diffusion Processes5.751.796, 8, 3, 6
1170Learning Locality and Isotropy in Dialogue Modeling5.751.796, 6, 3, 8
1171Adaptive Update Direction Rectification for Unsupervised Continual Learning5.750.436, 6, 6, 5
1172NORM: Knowledge Distillation via N-to-One Representation Matching5.751.305, 5, 5, 8
1173CroMA: Cross-Modality Adaptation for Monocular BEV Perception5.751.305, 5, 5, 8
1174Robust Multi-Agent Reinforcement Learning with State Uncertainties5.750.436, 6, 5, 6
1175Neural Optimal Transport with General Cost Functionals5.751.796, 3, 6, 8
1176Strategic Classification on Graphs5.751.793, 6, 8, 6
1177Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning5.751.308, 5, 5, 5
1178Visual Imitation Learning with Patch Rewards5.751.793, 6, 8, 6
1179Discovering Informative and Robust Positives for Video Domain Adaptation5.750.435, 6, 6, 6
1180Gradient-Guided Importance Sampling for Learning Binary Energy-Based Models5.750.435, 6, 6, 6
1181Single-shot General Hyper-parameter Optimization for Federated Learning5.751.796, 3, 6, 8
1182ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation5.751.798, 6, 6, 3
1183SCoMoE: Efficient Mixtures of Experts with Structured Communication5.750.436, 5, 6, 6
1184Uncertainty-Aware Self-Supervised Learning with Independent Sub-networks5.751.308, 5, 5, 5
1185Towards Semi-Supervised Learning with Non-Random Missing Labels5.750.435, 6, 6, 6
1186Masked Frequency Modeling for Self-Supervised Visual Pre-Training5.751.305, 5, 5, 8
1187S-NeRF: Neural Radiance Fields for Street Views5.751.796, 6, 8, 3
1188Differentiable Gaussianization Layers for Inverse Problems Regularized by Deep Generative Models5.751.793, 8, 6, 6
1189Evaluating and Inducing Personality in Pre-trained Language Models5.750.436, 5, 6, 6
1190Block and Subword-Scaling Floating-Point (BSFP) : An Efficient Non-Uniform Quantization For Low Precision Inference5.750.436, 6, 5, 6
1191CAST: Concurrent Recognition and Segmentation with Adaptive Segment Tokens5.750.436, 6, 5, 6
1192Effective Self-supervised Pre-training on Low-compute networks without Distillation5.751.308, 5, 5, 5
1193CoRTX: Contrastive Framework for Real-time Explanation5.751.308, 5, 5, 5
1194Networks are Slacking Off: Understanding Generalization Problem in Image Deraining5.750.436, 6, 6, 5
1195Towards Smooth Video Composition5.750.436, 5, 6, 6
1196GAIN: Enhancing Byzantine Robustness in Federated Learning with Gradient Decomposition5.751.796, 6, 3, 8
1197No Reason for No Supervision: Improved Generalization in Supervised Models5.751.798, 3, 6, 6
1198Clustering Structure Identification With Ordering Graph5.751.798, 3, 6, 6
1199Robust and Controllable Object-Centric Learning through Energy-based Models5.751.793, 6, 8, 6
1200Limitless Stability for Graph Convolutional Networks5.751.798, 3, 6, 6
1201Rethinking skip connection model as a learnable Markov chain5.750.436, 5, 6, 6
1202Neural Groundplans: Persistent Neural Scene Representations from a Single Image5.750.436, 5, 6, 6
1203Global Prototype Encoding for Incremental Video Highlights Detection5.751.798, 3, 6, 6
1204Neural-Symbolic Recursive Machine for Systematic Generalization5.750.436, 6, 6, 5
1205DrML: Diagnosing and Rectifying Vision Models using Language5.750.436, 6, 5, 6
1206MaSS: Multi-attribute Selective Suppression5.750.436, 6, 6, 5
1207Trust-consistent Visual Semantic Embedding for Image-Text Matching5.751.798, 3, 6, 6
1208Delving into Semantic Scale Imbalance5.751.305, 5, 5, 8
1209DAG Matters! GFlowNets Enhanced Explainer for Graph Neural Networks5.751.308, 5, 5, 5
1210Set-Level Self-Supervised Learning from Noisily-Labeled Data5.711.678, 3, 5, 5, 8, 5, 6
1211Distributed Least Square Ranking with Random Features5.672.058, 3, 6
1212EquiMod: An Equivariance Module to Improve Self-Supervised Learning5.672.056, 3, 8
1213Task-Aware Information Routing from Common Representation Space in Lifelong Learning5.670.475, 6, 6
1214Decision S4: Efficient Sequence-Based RL via State Spaces Layers5.670.476, 6, 5
1215Actionable Neural Representations: Grid Cells from Minimal Constraints5.672.053, 6, 8
1216A sparse, fast, and stable representation for multiparameter topological data analysis5.670.476, 6, 5
1217Causal Explanations of Structural Causal Models5.672.056, 8, 3
1218CASR: Generating Complex Sequences with Autoregressive Self-Boost Refinement5.670.475, 6, 6
1219SciRepEval: A Multi-Format Benchmark for Scientific Document Representations5.672.056, 8, 3
1220Seeing Differently, Acting Similarly: Heterogeneously Observable Imitation Learning5.672.056, 3, 8
1221Learning Globally Smooth Functions on Manifolds5.670.476, 6, 5
1222UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question Answering Over Knowledge Graph5.670.476, 6, 5
1223Large Language Models are Human-Level Prompt Engineers5.670.475, 6, 6
1224Enhancing Meta Learning via Multi-Objective Soft Improvement Functions5.672.053, 8, 6
1225Transferable Unlearnable Examples5.670.476, 5, 6
1226Random Laplacian Features for Learning with Hyperbolic Space5.672.056, 8, 3
1227Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding5.670.475, 6, 6
1228GOGGLE: Generative Modelling for Tabular Data by Learning Relational Structure5.672.058, 3, 6
1229Optimal Data Sampling for Training Neural Surrogates of Programs5.673.308, 8, 1
1230HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers5.670.476, 5, 6
1231Learning multi-scale local conditional probability models of images5.670.476, 5, 6
1232Adversarial Imitation Learning with Preferences5.670.476, 5, 6
1233Synthetic Data Generation of Many-to-Many Datasets via Random Graph Generation5.670.476, 6, 5
1234Function-space regularized Rényi divergences5.672.058, 3, 6
1235Constant-Factor Approximation Algorithms for Socially Fair $k$-Clustering5.670.475, 6, 6
1236Personalized Reward Learning with Interaction-Grounded Learning (IGL)5.670.476, 5, 6
1237Grounding Graph Network Simulators using Physical Sensor Observations5.672.053, 8, 6
1238Performance Bounds for Model and Policy Transfer in Hidden-parameter MDPs5.672.053, 8, 6
1239DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics5.670.475, 6, 6
1240Effective passive membership inference attacks in federated learning against overparameterized models5.672.056, 3, 8
1241Gaussian-Bernoulli RBMs Without Tears5.672.056, 8, 3
1242Proposal-Contrastive Pretraining for Object Detection from Fewer Data5.672.056, 8, 3
1243Neural Network Differential Equation Solvers allow unsupervised error estimation and correction5.672.056, 8, 3
1244Spectral Augmentation for Self-Supervised Learning on Graphs5.672.058, 6, 3
1245PAC Reinforcement Learning for Predictive State Representations5.670.476, 5, 6
1246Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning5.670.476, 6, 5
1247Active Learning based Structural Inference5.672.056, 8, 3
1248No-Regret Learning in Strongly Monotone Games Converges to a Nash Equilibrium5.670.476, 6, 5
1249Latent Graph Inference using Product Manifolds5.672.053, 8, 6
1250Representation Balancing with Decomposed Patterns for Treatment Effect Estimation5.670.476, 5, 6
1251Learning Probabilistic Topological Representations Using Discrete Morse Theory5.672.058, 6, 3
1252Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption5.672.058, 6, 3
1253Heterogeneous Loss Function with Aggressive Rejection for Contaminated data in anomaly detection5.670.476, 6, 5
1254Local KL Convergence Rate for Stein Variational Gradient Descent with Reweighted Kernel5.672.058, 6, 3
1255Learning Discrete Representation with Optimal Transport Quantized Autoencoders5.670.475, 6, 6
1256MonoFlow: A Unified Generative Modeling Framework for GAN Variants5.672.053, 8, 6
1257Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization Problems5.672.056, 8, 3
1258Coordination Scheme Probing for Generalizable Multi-Agent Reinforcement Learning5.672.053, 8, 6
1259Neural-based classification rule learning for sequential data5.672.056, 3, 8
1260Shifts 2.0: Extending The Dataset of Real Distributional Shifts5.670.476, 6, 5
1261Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning5.670.476, 5, 6
1262Budgeted Training for Vision Transformer5.670.476, 5, 6
1263Mosaic Representation Learning for Self-supervised Visual Pre-training5.670.476, 5, 6
1264Language model with Plug-in Knowldge Memory5.670.476, 6, 5
1265Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning5.670.475, 6, 6
1266Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic5.670.476, 6, 5
1267More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization5.670.476, 5, 6
1268Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks5.670.476, 6, 5
1269Any-scale Balanced Samplers for Discrete Space5.672.053, 8, 6
1270Pre-trained Language Models can be Fully Zero-Shot Learners5.670.476, 6, 5
1271Certified Robustness on Structural Graph Matching5.670.476, 6, 5
1272Explaining Temporal Graph Models through an Explorer-Navigator Framework5.670.476, 5, 6
1273On the Soft-Subnetwork for Few-Shot Class Incremental Learning5.672.053, 6, 8
1274Distributed Differential Privacy in Multi-Armed Bandits5.670.476, 6, 5
1275Prometheus: Endowing Low Sample and Communication Complexities to Constrained Decentralized Stochastic Bilevel Learning5.670.476, 6, 5
1276Mutual Partial Label Learning with Competitive Label Noise5.672.053, 8, 6
1277simpleKT: A Simple But Tough-to-Beat Baseline for Knowledge Tracing5.672.053, 8, 6
1278An Extensible Multi-modal Multi-task Object Dataset with Materials5.670.476, 6, 5
1279Revisiting the Assumption of Latent Separability for Backdoor Defenses5.670.475, 6, 6
1280Characterizing the spectrum of the NTK via a power series expansion5.672.053, 6, 8
1281ChordMixer: A Scalable Neural Attention Model for Sequences with Different Length5.672.056, 3, 8
1282A non-asymptotic analysis of oversmoothing in Graph Neural Networks5.672.058, 6, 3
1283Class-Incremental Learning with Repetition5.672.056, 3, 8
1284Imitation Learning for Mean Field Games with Correlated Equilibria5.670.476, 5, 6
1285Graph Neural Networks are Inherently Good Generalizers: Insights by Bridging GNNs and Multi-Layer Perceptrons5.670.476, 5, 6
1286Approximation and non-parametric estimation of functions over high-dimensional spheres via deep ReLU networks5.672.053, 6, 8
1287TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation5.670.476, 5, 6
1288Learning to Reason and Act in Cascading Processes5.672.053, 8, 6
1289PMixUp: Simultaneous Utilization of Part-of-Speech Replacement and Feature Space Interpolation for Text Data Augmentation5.672.056, 8, 3
1290Efficient Offline Policy Optimization with a Learned Model5.670.476, 6, 5
1291PowerQuant: Automorphism Search for Non-Uniform Quantization5.670.475, 6, 6
1292Leveraging Future Relationship Reasoning for Vehicle Trajectory Prediction5.672.056, 3, 8
1293Toward Adversarial Training on Contextualized Language Representation5.672.056, 3, 8
1294Learned Index with Dynamic $epsilon$5.670.475, 6, 6
1295Test-Time Adaptation for Visual Document Understanding5.670.476, 6, 5
1296Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation5.670.476, 5, 6
1297MemoNav: Working Memory Model for Visual Navigation5.670.476, 5, 6
1298The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation5.670.476, 5, 6
1299Algorithmic Determination of the Combinatorial Structure of the Linear Regions of ReLU Neural Networks5.670.476, 6, 5
1300Understanding new tasks through the lens of training data via exponential tilting5.670.476, 6, 5
1301Data Poisoning Attacks Against Multimodal Encoders5.670.475, 6, 6
1302InfoOT: Information Maximizing Optimal Transport5.670.476, 5, 6
1303Impossibly Good Experts and How to Follow Them5.670.476, 6, 5
1304Beyond calibration: estimating the grouping loss of modern neural networks5.672.058, 6, 3
1305Asynchronous Gradient Play in Zero-Sum Multi-agent Games5.670.476, 5, 6
1306An Exact Poly-Time Membership-Queries Algorithm for Extracting a Three-Layer ReLU Network5.670.476, 6, 5
1307SAAL: Sharpness-Aware Active Learning5.670.475, 6, 6
1308An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning5.672.053, 8, 6
1309Gradient Boosting Performs Gaussian Process Inference5.670.475, 6, 6
1310Distribution Shift Detection for Deep Neural Networks5.670.476, 5, 6
1311Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective5.670.476, 5, 6
1312FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy5.670.476, 6, 5
1313Globally Optimal Training of Neural Networks with Threshold Activation Functions5.670.475, 6, 6
1314A Laplace-inspired Distribution on SO(3) for Probabilistic Rotation Estimation5.672.056, 3, 8
1315Measuring and Narrowing the Compositionality Gap in Language Models5.670.476, 5, 6
1316Guiding continuous operator learning through Physics-based boundary constraints5.672.056, 8, 3
1317Human MotionFormer: Transferring Human Motions with Vision Transformers5.672.058, 3, 6
1318Can We Faithfully Represent Absence States to Compute Shapley Values on a DNN?5.670.476, 6, 5
1319One-Pixel Shortcut: On the Learning Preference of Deep Neural Networks5.670.475, 6, 6
1320Combating Exacerbated Heterogeneity for Robust Decentralized Models5.670.476, 6, 5
1321Offline Reinforcement Learning with Closed-Form Policy Improvement Operators5.670.475, 6, 6
1322Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam5.670.476, 5, 6
1323An Additive Instance-Wise Approach to Multi-class Model Interpretation5.672.058, 6, 3
1324Knowledge-Consistent Dialogue Generation with Language Models and Knowledge Graphs5.672.056, 6, 3, 8, 8, 3
1325Meta Knowledge Condensation for Federated Learning5.672.053, 6, 8
1326Cycle-consistent Masked AutoEncoder for Unsupervised Domain Generalization5.670.475, 6, 6
1327Towards Addressing Label Skews in One-shot Federated Learning5.670.476, 6, 5
1328Relaxed Combinatorial Optimization Networks with Self-Supervision: Theoretical and Empirical Notes on the Cardinality-Constrained Case5.670.476, 5, 6
1329Rethinking the Effect of Data Augmentation in Adversarial Contrastive Learning5.670.476, 6, 5
1330Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization5.670.476, 6, 5
1331DeepPipe: Deep, Modular and Extendable Representations of Machine Learning Pipelines5.670.475, 6, 6
1332TIB: Detecting Unknown Objects via Two-Stream Information Bottleneck5.670.475, 6, 6
1333Hidden Poison: Machine unlearning enables camouflaged poisoning attacks5.670.475, 6, 6
1334Adversarial Collaborative Learning on Non-IID Features5.670.476, 5, 6
1335D2Match: Leveraging Deep Learning and Degeneracy for Subgraph Matching5.670.475, 6, 6
1336Topologically faithful image segmentation via induced matching of persistence barcodes5.670.476, 5, 6
1337On the Lower Bound of Minimizing Polyak-Łojasiewicz functions5.670.475, 6, 6
1338Rotamer Density Estimators are Unsupervised Learners of the Effect of Mutations on Protein-Protein Interaction5.670.475, 6, 6
1339Cross-Level Distillation and Feature Denoising for Cross-Domain Few-Shot Classification5.672.058, 6, 3
1340Individual Privacy Accounting for Differentially Private Stochastic Gradient Descent5.672.058, 3, 6
1341Attention De-sparsification Matters: Inducing Diversity in Digital Pathology Representation Learning5.670.476, 6, 5
1342Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving5.670.476, 6, 5
1343The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image5.670.476, 5, 6
1344Contextual Image Masking Modeling via Synergized Contrasting without View Augmentation for Faster and Better Visual Pretraining5.670.476, 5, 6
1345Factorized Fourier Neural Operators5.602.243, 8, 3, 6, 8
1346INSPIRE: A Framework for Integrating Individual User Preferences in Recourse5.601.623, 5, 6, 6, 8
1347TypeT5: Seq2seq Type Inference using Static Analysis5.600.495, 6, 6, 5, 6
1348Contrastive Audio-Visual Masked Autoencoder5.601.625, 6, 3, 6, 8
1349SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations5.600.496, 6, 5, 5, 6
1350CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers5.601.626, 3, 8, 5, 6
1351Malign Overfitting: Interpolation and Invariance are Fundamentally at Odds5.601.628, 5, 6, 3, 6
1352How to prepare your task head for finetuning5.600.496, 6, 5, 6, 5
1353Searching Lottery Tickets in Graph Neural Networks: A Dual Perspective5.601.626, 3, 8, 5, 6
1354Out-of-distribution Representation Learning for Time Series Classification5.601.205, 8, 5, 5, 5
1355Early Stopping for Deep Image Prior5.600.495, 6, 5, 6, 6
1356Agent-based Graph Neural Networks5.601.628, 6, 3, 6, 5
1357GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis5.601.625, 6, 8, 3, 6
1358The KFIoU Loss for Rotated Object Detection5.601.628, 6, 6, 5, 3
1359Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning5.601.626, 5, 6, 3, 8
1360On the Word Boundaries of Emergent Languages Based on Harris's Articulation Scheme5.601.626, 3, 6, 5, 8
1361SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network5.601.626, 6, 3, 5, 8
1362SGD Through the Lens of Kolmogorov Complexity5.571.405, 6, 6, 6, 3, 5, 8
1363TVSPrune - Pruning Non-discriminative filters via Total Variation separability of intermediate representations without fine tuning5.501.803, 5, 6, 8
1364Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flow5.500.505, 5, 6, 6
1365Adaptive Block-wise Learning for Knowledge Distillation5.501.803, 8, 5, 6
1366Share Your Representation Only: Guaranteed Improvement of the Privacy-Utility Tradeoff in Federated Learning5.501.808, 5, 3, 6
1367Cross-utterance Conditioned Coherent Speech Editing via Biased Training and Entire Inference5.501.805, 8, 3, 6
1368Learning Geometric Representations of Interactive Objects5.501.803, 5, 6, 8
1369Online Bias Correction for Task-Free Continual Learning5.501.805, 3, 8, 6
1370Meta-Learning the Inductive Biases of Simple Neural Circuits5.501.808, 3, 6, 5
1371Iterative Circuit Repair Against Formal Specifications5.500.506, 6, 5, 5
1372Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples5.501.803, 5, 8, 6
1373Toward Learning Geometric Eigen-Lengths Crucial for Robotic Fitting Tasks5.501.803, 8, 6, 5
1374Individual Privacy Accounting with Gaussian Differential Privacy5.500.506, 5, 5, 6
1375Improving Differentiable Neural Architecture Search by Encouraging Transferability5.500.506, 5, 6, 5
1376Cross-Window Self-Training via Context Variations from Sparsely-Labeled Time Series5.500.505, 6, 5, 6
1377A theoretical study of inductive biases in contrastive learning5.500.506, 6, 5, 5
1378M$^3$SAT: A Sparsely Activated Transformer for Efficient Multi-Task Learning from Multiple Modalities5.501.805, 6, 8, 3
1379Importance of Class Selectivity in Early Epochs of Training5.500.505, 6, 5, 6
1380Conservative Exploration in Linear MDPs under Episode-wise Constraints5.500.505, 5, 6, 6
1381Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation5.500.506, 6, 5, 5
1382Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel5.500.506, 5, 6, 5
1383Game Theoretic Mixed Experts for Combinational Adversarial Machine Learning5.501.805, 3, 6, 8
1384Reproducible Bandits5.501.805, 8, 3, 6
1385Solving Continual Learning via Problem Decomposition5.501.805, 8, 3, 6
1386How Useful are Gradients for OOD Detection Really?5.501.805, 3, 8, 6
1387Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games5.501.803, 5, 6, 8
1388Simple Emergent Action Representations from Multi-Task Policy Training5.500.506, 5, 5, 6
1389Avoiding spurious correlations via logit correction5.500.506, 6, 5, 5
1390HesScale: Scalable Computation of Hessian Diagonals5.502.508, 3, 3, 8
1391Building Normalizing Flows with Stochastic Interpolants5.501.808, 5, 6, 3
1392Does progress on ImageNet transfer to real world datasets?5.501.803, 8, 6, 5
1393Competitive Physics Informed Networks5.501.805, 6, 8, 3
1394Decomposed Prompting: A Modular Approach for Solving Complex Tasks5.500.506, 5, 5, 6
1395Energy-Inspired Self-Supervised Pretraining for Vision Models5.500.505, 5, 6, 5, 6, 6
1396A Time Series is Worth 64 Words: Long-term Forecasting with Transformers5.500.505, 6, 5, 6
1397Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay5.500.506, 5, 5, 6
1398Confidence-Conditioned Value Functions for Offline Reinforcement Learning5.501.806, 8, 5, 3
1399Stochastic Constrained DRO with a Complexity Independent of Sample Size5.501.803, 5, 8, 6
1400Kernel Regression with Infinite-Width Neural Networks on Millions of Examples5.501.808, 3, 5, 6
1401Evaluating Unsupervised Denoising Requires Unsupervised Metrics5.500.505, 5, 6, 6
1402The Value of Out-of-distribution Data5.502.8710, 3, 6, 3
1403First Steps Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains5.500.506, 5, 5, 6
1404LogicDP: Creating Labels for Graph Data via Inductive Logic Programming5.501.806, 5, 3, 8
1405A VAE for Transformers with Nonparametric Variational Information Bottleneck5.500.505, 6, 6, 5
1406Information-Theoretic Underpinnings of Generalization and Translation in Emergent Communication5.501.806, 3, 8, 5
1407The Brainy Student: Scalable Unlearning by Selectively Disobeying the Teacher5.500.506, 5, 5, 6
1408A Neural PDE Solver with Temporal Stencil Modeling5.501.805, 8, 6, 3
1409Recitation-Augmented Language Models5.500.505, 5, 6, 6
1410Credible, Sealed-bid, Optimal Repeated Auctions With Differentiable Economics5.502.503, 8, 8, 3
1411Towards Efficient Gradient-Based Meta-Learning in Heterogenous Environments5.501.805, 6, 8, 3
1412Optimal Transport for Offline Imitation Learning5.500.506, 5, 6, 5
1413FedorAS: Federated Architecture Search under system heterogeneity5.500.505, 6, 6, 5
1414Towards A Unified View of Sparse Feed-Forward Network in Transformer5.501.803, 5, 6, 8
1415SuperFed: Weight Shared Federated Learning5.500.505, 5, 6, 6
1416Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules5.500.506, 6, 5, 5
1417SGD with large step sizes learns sparse features5.501.803, 5, 8, 6
1418ProSampler: Improving Contrastive Learning by Better Mini-batch Sampling5.501.808, 6, 5, 3
1419Make-A-Video: Text-to-Video Generation without Text-Video Data5.500.506, 5, 6, 5
1420In-distribution and Out-of-distribution Generalization for Graph Neural Networks5.500.506, 6, 5, 5
1421Effectively using public data in privacy preserving Machine learning5.500.505, 5, 6, 6
1422CADet: Fully Self-Supervised Anomaly Detection With Contrastive Learning5.500.505, 6, 5, 6
1423On the System-Level Effectiveness of Physical Object-Hiding Adversarial Attack in Autonomous Driving5.500.505, 6, 6, 5
1424Is Conditional Generative Modeling all you need for Decision Making?5.501.806, 8, 5, 3
1425META-STORM: Generalized Fully-Adaptive Variance Reduced SGD for Unbounded Functions5.500.505, 6, 5, 6
1426TEMPERA: Test-Time Prompt Editing via Reinforcement Learning5.500.505, 5, 6, 6
1427What Matters In The Structured Pruning of Generative Language Models?5.500.505, 6, 5, 6
1428Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning5.501.805, 8, 3, 6
1429Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning5.501.805, 6, 8, 3
1430Differentially Private Adaptive Optimization with Delayed Preconditioners5.501.803, 8, 6, 5
1431Long Range Language Modeling via Gated State Spaces5.500.505, 5, 6, 6
1432Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts5.500.506, 5, 5, 6
1433Investigating Multi-task Pretraining and Generalization in Reinforcement Learning5.501.805, 6, 8, 3
1434Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models5.500.506, 6, 5, 5
1435Noise-Robust De-Duplication at Scale5.500.506, 6, 5, 5
1436Hyperparameter Optimization through Neural Network Partitioning5.501.808, 5, 6, 3
1437Concept-based Explanations for Out-of-Distribution Detectors5.500.505, 6, 5, 6
1438Architectural optimization over subgroups of equivariant neural networks5.500.505, 6, 5, 6
1439Accelerating Hamiltonian Monte Carlo via Chebyshev Integration Time5.501.808, 6, 5, 3
1440Revisiting Structured Dropout5.500.505, 6, 5, 6
1441HiT-MDP: Learning the SMDP option framework on MDPs with Hidden Temporal Variables5.501.806, 8, 3, 5
1442Fusion over the Grassmann Manifold for Incomplete-Data Clustering5.502.875, 8, 8, 1
1443Unsupervised Model-based Pre-training for Data-efficient Control from Pixels5.501.808, 3, 5, 6
1444Fine-grain Inference on Out-of-Distribution Data with Hierarchical Classification5.501.803, 8, 6, 5
1445TTN: A Domain-Shift Aware Batch Normalization in Test-Time Adaptation5.500.505, 6, 6, 5
1446Repository-Level Prompt Generation for Large Language Models of Code5.501.808, 6, 3, 5
1447Variational Prompt Tuning Improves Generalization of Vision-Language Models5.500.506, 6, 5, 5
1448Bridging the Gap to Real-World Object-Centric Learning5.501.803, 8, 6, 5
1449Energy-Based Test Sample Adaptation for Domain Generalization5.500.505, 6, 5, 6
1450A GENERAL SCENARIO-AGNOSTIC REINFORCEMENT LEARNING FOR TRAFFIC SIGNAL CONTROL5.500.505, 6, 6, 5
1451BALTO: efficient tensor program optimization with diversity-based active learning5.501.806, 3, 8, 5
1452Autoregressive Generative Modeling with Noise Conditional Maximum Likelihood Estimation5.502.508, 8, 3, 3
1453How robust is unsupervised representation learning to distribution shift?5.501.803, 5, 8, 6
1454Affinity-Aware Graph Networks5.500.505, 6, 6, 5
1455Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis5.501.803, 5, 6, 8
1456Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach5.500.506, 5, 5, 6
1457Enhancing the Inductive Biases of Graph Neural ODE for Modeling Dynamical Systems5.500.506, 5, 5, 6
1458Mastering Spatial Graph Prediction of Road Networks5.501.805, 8, 6, 3
1459A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning5.501.803, 5, 8, 6
1460Multi-objective optimization via equivariant deep hypervolume approximation5.500.506, 5, 6, 5
1461Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems5.501.808, 3, 6, 5
1462On Explaining Neural Network Robustness with Activation Path5.500.505, 6, 5, 6
1463Structure by Architecture: Structured Representations without Regularization5.501.806, 8, 5, 3
1464DECAP: Decoding CLIP Latents for Zero-shot Captioning5.500.505, 6, 6, 5, 5, 6
1465Robust Explanation Constraints for Neural Networks5.501.803, 6, 5, 8
1466Hidden Schema Networks5.502.503, 3, 8, 8
1467Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance5.500.506, 5, 6, 5
1468Domain Generalisation via Domain Adaptation: An Adversarial Fourier Amplitude Approach5.500.505, 5, 6, 6
1469Anti-Symmetric DGN: a stable architecture for Deep Graph Networks5.501.805, 3, 6, 8
1470FastFill: Efficient Compatible Model Update5.501.803, 6, 5, 8
1471SLTUNET: A Simple Unified Model for Sign Language Translation5.500.505, 6, 5, 6
1472DetectBench: An Object Detection Benchmark for OOD Generalization Algorithms5.501.805, 3, 8, 6
1473Leveraging Unlabeled Data to Track Memorization5.500.505, 5, 6, 6
1474Efficient Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy5.500.506, 5, 6, 5
1475NAGphormer: A Tokenized Graph Transformer for Node Classification in Large Graphs5.500.506, 5, 6, 5
1476Near Optimal Private and Robust Linear Regression5.500.506, 6, 5, 5
1477Tensor-Based Sketching Method for the Low-Rank Approximation of Data Streams.5.500.505, 5, 6, 6
1478Data augmentation alone can improve adversarial training5.500.505, 6, 6, 5
1479Valid P-Value for Deep Learning-driven Salient Region5.500.505, 6, 5, 6
1480Learning from conflicting data with hidden contexts5.502.503, 8, 8, 3
1481MeGraph: Graph Representation Learning on Connected Multi-scale Graphs5.502.503, 8, 8, 3
1482Self-supervised debiasing using low rank regularization5.501.803, 6, 5, 8
1483Multi-Vector Retrieval as Sparse Alignment5.500.505, 6, 5, 6
1484Knowledge Unlearning for Mitigating Privacy Risks in Language Models5.500.506, 5, 6, 5
1485Open-domain Visual Entity Linking5.501.805, 3, 6, 8
1486The Final Ascent: When Bigger Models Generalize Worse on Noisy-Labeled Data5.501.805, 3, 8, 6
1487Proportional Amplitude Spectrum Training Augmentation for Synthetic-to-Real Domain Generalization5.501.803, 5, 8, 6
1488Equivariant Shape-Conditioned Generation of 3D Molecules for Ligand-Based Drug Design5.500.506, 5, 6, 5
1489Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach5.500.506, 5, 5, 6
1490Memorization-Dilation: Modeling Neural Collapse Under Noise5.500.505, 6, 5, 6
1491Multi-level Protein Structure Pre-training via Prompt Learning5.500.506, 6, 5, 5
1492Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small5.502.503, 3, 8, 8
1493FedMT: Federated Learning with Mixed-type Labels5.501.806, 8, 5, 3
1494Denoising MCMC for Accelerating Diffusion-Based Generative Models5.500.506, 6, 5, 5
1495Confidence Estimation Using Unlabeled Data5.501.808, 5, 6, 3
1496Sequential Attention for Feature Selection5.501.803, 6, 5, 8
1497Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning5.500.506, 5, 6, 5
1498Learning Listwise Domain-Invariant Representations for Ranking5.500.505, 6, 5, 6
1499Exp-$alpha$: Beyond Proportional Aggregation in Federated Learning5.500.505, 6, 5, 6
1500Guiding Safe Exploration with Weakest Preconditions5.501.803, 8, 6, 5
1501Gated Neural ODEs: Trainability, Expressivity and Interpretability5.501.803, 8, 6, 5
1502Learning Multimodal Data Augmentation in Feature Space5.501.805, 3, 8, 6
1503Achieving Sub-linear Regret in Infinite Horizon Average Reward Constrained MDP with Linear Function Approximation5.501.806, 8, 3, 5
1504FedFA: Federated Feature Augmentation5.500.506, 5, 6, 5
1505A critical look at evaluation of GNNs under heterophily: Are we really making progress?5.500.505, 6, 5, 6
1506Interpretable Debiasing of Vectorized Language Representations with Iterative Orthogonalization5.500.506, 6, 5, 5
1507Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations5.500.506, 5, 6, 5
1508VIMA: General Robot Manipulation with Multimodal Prompts5.501.803, 6, 5, 8
1509AUTOJOIN: EFFICIENT ADVERSARIAL TRAINING FOR ROBUST MANEUVERING VIA DENOISING AUTOEN- CODER AND JOINT LEARNING5.500.505, 6, 6, 5
1510The power of choices in decision tree learning5.501.806, 3, 8, 5
1511Boosting Adversarial Transferability using Dynamic Cues5.500.506, 5, 5, 6
1512MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models5.500.506, 5, 6, 5
1513Part-Based Models Improve Adversarial Robustness5.500.506, 5, 6, 5
1514Extremely Simple Activation Shaping for Out-of-Distribution Detection5.501.805, 8, 6, 3
1515Hebbian and Gradient-based Plasticity Enables Robust Memory and Rapid Learning in RNNs5.500.505, 6, 5, 6
1516Equivariant Hypergraph Diffusion Neural Operators5.500.506, 5, 6, 5
1517Biases in Evaluation of Molecular Optimization Methods and Bias Reduction Strategies5.501.803, 5, 6, 8
1518Neural Agents Struggle to Take Turns in Bidirectional Emergent Communication5.501.808, 6, 3, 5
1519Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives5.501.505, 3, 8, 5, 6, 6
1520Prompting GPT-3 To Be Reliable5.500.505, 6, 5, 6
1521Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection5.501.806, 3, 5, 8
1522Neural Lagrangian Schr'{o}dinger Bridge: Diffusion Modeling for Population Dynamics5.500.505, 6, 5, 6
1523Warping the Space: Weight Space Rotation for Class-Incremental Few-Shot Learning5.501.805, 3, 6, 8
1524Jointly Learning Visual and Auditory Speech Representations from Raw Data5.501.808, 5, 3, 6
1525On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning5.500.505, 6, 6, 5
1526Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC5.500.505, 6, 5, 6
1527Discovering Policies with DOMiNO5.500.505, 6, 6, 5
1528Improving Out-of-distribution Generalization with Indirection Representations5.501.806, 5, 3, 8
1529SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient5.502.068, 3, 5, 6, 8, 3
1530Sinkhorn Discrepancy for Counterfactual Generalization5.500.506, 5, 6, 5
1531Distributional Meta-Gradient Reinforcement Learning5.501.805, 8, 6, 3
1532Interval-based Offline Policy Evaluation without Sufficient Exploration or Realizability5.501.808, 3, 5, 6
1533Dense Correlation Fields for Motion Modeling in Action Recognition5.501.808, 3, 6, 5
1534CBLab: Scalable Traffic Simulation with Enriched Data Supporting5.501.808, 5, 6, 3
1535Time to augment visual self-supervised learning5.501.805, 3, 6, 8
1536Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection5.501.805, 8, 3, 6
1537Switching One-Versus-the-Rest Loss to Increase Logit Margins for Adversarial Robustness5.500.506, 5, 5, 6
1538Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots5.500.506, 5, 6, 5
1539Learning Invariant Features for Online Continual Learning5.501.808, 5, 3, 6
1540ODAM: Gradient-based Instance-Specific Visual Explanations for Object Detection5.500.506, 5, 5, 6
1541Unsupervised Object-Centric Learning with Bi-level Optimized Query Slot Attention5.501.808, 6, 3, 5
1542EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model5.500.505, 6, 6, 5
1543Smoothed-SGDmax: A Stability-Inspired Algorithm to Improve Adversarial Generalization5.500.506, 5, 5, 6
1544Learning to Generate All Feasible Actions5.501.808, 5, 6, 3
1545Empirical Study of Pre-training a Backbone for 3D Human Pose and Shape Estimation5.500.506, 5, 6, 5
1546Class Prototype-based Cleaner for Label Noise Learning5.502.503, 3, 8, 8
1547AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection5.501.803, 8, 6, 5
1548ILA-DA: Improving Transferability of Intermediate Level Attack with Data Augmentation5.501.806, 3, 8, 5
1549A Closer Look at the Calibration of Differentially Private Learners5.500.506, 5, 6, 5
1550Schema Inference for Interpretable Image Classification5.500.506, 5, 6, 5
1551Covariance-Robust Minimax Probability Machines for Algorithmic Recourse5.502.503, 8, 3, 8
1552Spiking Convolutional Neural Networks for Text Classification5.501.806, 8, 3, 5
1553Improving Language Model Pretraining with Text Structure Information5.501.803, 5, 8, 6
1554Cluster and Landmark Attributes Infused Graph Neural Networks for Link prediction5.500.506, 6, 5, 5
1555Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions5.500.506, 5, 5, 6
1556Average Sensitivity of Decision Tree Learning5.500.506, 6, 5, 5
1557Bridging the Gap Between Cascade and End-to-End Cross-modal Translation Models: A Zero-Shot Approach5.501.803, 6, 8, 5
1558Learning by Distilling Context5.501.803, 5, 6, 8
1559Structured Pruning of CNNs at Initialization5.500.506, 5, 5, 6
1560Generating Adversarial Examples with Task Oriented Multi-Objective Optimization5.501.803, 8, 5, 6
1561Neural Network Approximations of PDEs Beyond Linearity: Representational Perspective5.501.803, 5, 6, 8
1562Analytical Composition of Differential Privacy via the Edgeworth Accountant5.500.505, 5, 6, 6
1563Predictor-corrector algorithms for stochastic optimization under gradual distribution shift5.500.506, 5, 5, 6
1564Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation5.500.506, 5, 5, 6
1565Unicom: Universal and Compact Representation Learning for Image Retrieval5.500.506, 5, 5, 6
1566A unified optimization framework of ANN-SNN Conversion: towards optimal mapping from activation values to firing rates5.502.878, 5, 8, 1
1567Trading Information between Latents in Hierarchical Variational Autoencoders5.501.808, 5, 6, 3
1568Towards Skilled Population Curriculum for MARL5.500.505, 6, 5, 6
1569Bringing Saccades and Fixations into Self-supervised Video Representation Learning5.500.506, 6, 5, 5
1570Improve learning combining crowdsourced labels by weighting Areas Under the Margin5.500.505, 6, 5, 6
1571Neural Field Discovery Disentangles Equivariance in Interacting Dynamical Systems5.500.506, 6, 5, 5
1572An Optimal Transport Perspective on Unpaired Image Super-Resolution5.501.808, 6, 5, 3
1573Learning Heterogeneous Interaction Strengths by Trajectory Prediction with Graph Neural Network5.500.506, 5, 6, 5
1574Neural Volumetric Mesh Generator5.501.806, 3, 8, 5
1575Protein Representation Learning via Knowledge Enhanced Primary Structure Reasoning5.500.506, 5, 5, 6
1576LPMARL: Linear Programming based Implicit Task Assignment for Hierarchical Multi-agent Reinforcement Learning5.500.505, 5, 6, 6
1577Neural Frailty Machine: Beyond proportional hazard assumption in neural survival regressions5.500.505, 6, 5, 6
1578Robust Learning with Decoupled Meta Label Purifier5.501.806, 3, 5, 8
1579Basic Binary Convolution Unit for Binarized Image Restoration Network5.501.805, 8, 3, 6
1580Sweet Gradient Matters: Designing Consistent and Efficient Estimator for Zero-Shot Neural Architecture Search5.500.505, 6, 6, 5
1581Constrained Hierarchical Deep Reinforcement Learning with Differentiable Formal Specifications5.501.803, 5, 6, 8
1582Limitations of the NTK for Understanding Generalization in Deep Learning5.501.806, 8, 3, 5
1583Scalable Estimation of Nonparametric Markov Networks with Mixed-Type Data5.500.506, 5, 5, 6
1584Diffusion Probabilistic Modeling of Protein Backbones in 3D for the motif-scaffolding problem5.500.505, 6, 6, 5
1585Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V45.501.805, 8, 6, 3
1586A Unified Causal View of Domain Invariant Representation Learning5.500.506, 6, 5, 5
1587On the Robustness of Safe Reinforcement Learning under Observational Perturbations5.500.505, 6, 5, 6
1588Behind the Scenes of Gradient Descent: A Trajectory Analysis via Basis Function Decomposition5.500.505, 5, 6, 6
1589T2D: Spatiotemporal Feature Learning Based on Triple 2D Decomposition5.501.803, 5, 8, 6
1590Data-Free One-Shot Federated Learning Under Very High Statistical Heterogeneity5.500.506, 5, 5, 6
1591An Efficient Mean-field Approach to High-Order Markov Logic5.501.803, 6, 5, 8
1592Downstream Datasets Make Surprisingly Good Pretraining Corpora5.501.805, 6, 3, 8
1593Unleashing Mask: Explore the Intrinsic Out-of-distribution Detection Capability5.501.806, 8, 5, 3
1594Universal Speech Enhancement with Score-based Diffusion5.500.505, 6, 6, 5
1595CodeT: Code Generation with Generated Tests5.502.508, 3, 3, 8
1596AdaStride: Using Adaptive Strides in Sequential Data for Effective Downsampling5.500.506, 5, 5, 6
1597On the Universality of Langevin Diffusion for Private Euclidean (Convex) Optimization5.500.505, 5, 6, 6
1598What Knowledge gets Distilled in Knowledge Distillation?5.501.806, 8, 5, 3
1599Simplicial Embeddings in Self-Supervised Learning and Downstream Classification5.500.506, 5, 5, 6
1600Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representations5.500.505, 5, 6, 6
1601Context Autoencoder for Self-Supervised Representation Learning5.500.505, 5, 6, 6
1602Progressive Purification for Instance-Dependent Partial Label Learning5.501.803, 8, 5, 6
1603CFlowNets: Continuous control with Generative Flow Networks5.500.506, 5, 5, 6
1604Neural Radiance Fields with Geometric Consistency for Few-Shot Novel View Synthesis5.501.806, 3, 5, 8
1605Semi-supervised Community Detection via Structural Similarity Metrics5.501.808, 3, 5, 6
1606Multivariate Time-series Imputation with Disentangled Temporal Representations5.500.506, 6, 5, 5
1607LPT: Long-tailed Prompt Tuning for Image Classification5.500.506, 5, 6, 5
1608TopoZero: Digging into Topology Alignment on Zero-Shot Learning5.501.803, 6, 8, 5
1609Knowledge Distillation based Degradation Estimation for Blind Super-Resolution5.500.505, 5, 6, 6
1610Temporary feature collapse phenomenon in early learning of MLPs5.501.806, 8, 5, 3
1611Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer5.501.808, 5, 6, 3
1612Learning Lightweight Object Detectors via Progressive Knowledge Distillation5.500.506, 5, 5, 6
1613Twofer: Tackling Continual Domain Shift with Simultaneous Domain Generalization and Adaptation5.501.806, 5, 3, 8
1614VectorMapNet: End-to-end Vectorized HD Map Learning5.501.803, 8, 5, 6
1615Domain Generalization with Small Data5.501.808, 3, 5, 6
1616Hierarchical Prompting Improves Visual Recognition On Accuracy, Data Efficiency and Explainability5.500.506, 6, 5, 5
1617Decomposing Texture and Semantics for Out-of-distribution Detection5.500.506, 5, 5, 6
1618One Transformer Can Understand Both 2D & 3D Molecular Data5.501.805, 8, 3, 6
1619An Analysis of Information Bottlenecks5.501.808, 6, 3, 5
1620Everyone's Preference Changes Differently: Weighted Multi-Interest Retrieval Model5.501.806, 5, 8, 3
1621Hierarchical Relational Learning for Few-Shot Knowledge Graph Completion5.501.803, 5, 8, 6
1622Function-Consistent Feature Distillation5.501.806, 3, 8, 5
1623The Devil is in the Wrongly-classified Samples: Towards Unified Open-set Recognition5.501.808, 6, 5, 3
1624Domain Generalization via Independent Regularization from Early-branching Networks5.501.808, 6, 3, 5
1625DELTA: DEBIASED FULLY TEST-TIME ADAPTATION5.500.505, 6, 5, 6
1626Bit-Pruning: A Sparse Multiplication-Less Dot-Product5.501.803, 5, 8, 6
1627KNN-Diffusion: Image Generation via Large-Scale Retrieval5.500.505, 5, 6, 6
1628IS SYNTHETIC DATA FROM GENERATIVE MODELS READY FOR IMAGE RECOGNITION?5.500.505, 5, 6, 6
1629IDEAL: Query-Efficient Data-Free Learning from Black-Box Models5.501.808, 5, 6, 3
1630Succinct Compression: Lossless Compression for Fast and Memory-Efficient Deep Neural Network Inference5.502.503, 8, 3, 8
1631BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection5.500.505, 5, 6, 6
1632MaPLe: Multi-modal Prompt Learning5.501.805, 6, 8, 3
1633Achieve the Minimum Width of Neural Networks for Universal Approximation5.501.806, 3, 5, 8
1634Example-based Planning via Dual Gradient Fields5.501.803, 8, 5, 6
1635Protein structure generation via folding diffusion5.501.808, 3, 5, 6
1636MBrain: A Multi-channel Self-Supervised Learning Framework for Brain Signals5.401.623, 8, 6, 5, 5
1637KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding5.400.496, 5, 6, 5, 5
1638Infusing Lattice Symmetry Priors in Neural Networks Using Soft Attention Masks5.400.495, 6, 5, 5, 6
1639Panning for Gold in Federated Learning: Targeted Text Extraction under Arbitrarily Large-Scale Aggregation5.401.203, 6, 6, 6, 6
1640Empowering Graph Representation Learning with Test-Time Graph Transformation5.401.625, 6, 3, 8, 5
1641Maximum Likelihood Learning of Energy-Based Models for Simulation-Based Inference5.401.623, 8, 5, 5, 6
1642Prompt Tuning with Prompt-aligned Gradient for Vision-Language Models5.401.206, 6, 3, 6, 6
1643Evaluating Representations with Readout Model Switching5.401.628, 5, 6, 5, 3
1644Scaling Laws For Deep Learning Based Image Reconstruction5.401.626, 3, 5, 5, 8
1645PASHA: Efficient HPO and NAS with Progressive Resource Allocation5.401.628, 5, 6, 3, 5
1646Tackling Diverse Tasks via Cross-Modal Transfer Learning5.401.625, 5, 3, 6, 8
1647On the Interplay Between Misspecification and Sub-optimality Gap: From Linear Contextual Bandits to Linear MDPs5.400.495, 5, 6, 5, 6
1648LT-SNN: Self-Adaptive Spiking Neural Network for Event-based Classification and Object Detection5.402.248, 5, 3, 8, 3
1649Scaling Convex Neural Networks with Burer-Monteiro Factorization5.401.626, 5, 8, 3, 5
1650$rm A^2Q$: Aggregation-Aware Quantization for Graph Neural Networks5.401.626, 8, 5, 5, 3
1651Learning Dynamical Characteristics with Neural Operators for Data Assimilation5.401.628, 5, 3, 5, 6
1652Wav2Tok: Deep Sequence Tokenizer for Audio Retrieval5.401.625, 5, 3, 8, 6
1653Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information5.401.628, 5, 3, 5, 6
1654GNNDelete: A General Unlearning Strategy for Graph Neural Networks5.401.626, 3, 5, 8, 5
1655General Neural Gauge Fields5.400.495, 6, 5, 6, 5
1656Deep Dynamic AutoEncoder for Vision BERT Pretraining5.400.495, 6, 5, 5, 6
1657DiffMimic: Efficient Motion Mimicking with Differentiable Physics5.401.203, 6, 6, 6, 6
1658Do Not Train It: A Linear Neural Architecture Search of Graph Neural Networks5.400.495, 5, 6, 6, 5
1659ModelAngelo: Automated Model Building for Cryo-EM Maps5.401.626, 5, 3, 8, 5
1660UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers5.330.476, 5, 5
1661Convergence is Not Enough: Average-Case Performance of No-Regret Learning Dynamics5.332.058, 5, 3
1662Simple Spectral Graph Convolution from an Optimization Perspective5.330.476, 5, 5
1663Doing Fast Adaptation Fast: Conditionally Independent Deep Ensembles for Distribution Shifts5.330.475, 5, 6
1664RuDar: Weather Radar Dataset for Precipitation Nowcasting with Geographical and Seasonal Variability5.330.475, 6, 5
1665HyPHEN: A Hybrid Packing Method and Optimizations for Homomorphic Encryption-Based Neural Network5.330.476, 5, 5
1666Unveiling the sampling density in non-uniform geometric graphs5.330.475, 6, 5
1667Geometrically regularized autoencoders for non-Euclidean data5.330.476, 5, 5
1668Evolving Populations of Diverse RL Agents with MAP-Elites5.330.476, 5, 5
1669Mid-Vision Feedback for Convolutional Neural Networks5.332.058, 3, 5
1670Prefer to Classify: Improving Text Classifier via Pair-wise Preference Learning5.332.055, 8, 3
1671Editing models with task arithmetic5.330.475, 6, 5
1672Context-Aware Image Completion5.330.476, 5, 5
1673Architecture Matters in Continual Learning5.332.053, 8, 5
1674Efficient Data Subset Selection to Generalize Training Across Models: Transductive and Inductive Networks5.330.475, 6, 5
1675Raisin: Residual Algorithms for Versatile Offline Reinforcement Learning5.330.475, 5, 6
1676Learning Shareable Bases for Personalized Federated Image Classification5.330.476, 5, 5
1677Learning Mixture Models with Simultaneous Data Partitioning and Parameter Estimation5.330.475, 5, 6
1678Neural Bregman Divergences for Distance Learning5.332.055, 8, 3
1679Offline Reinforcement Learning from Heteroskedastic Data Via Support Constraints5.330.476, 5, 5
1680Bias Propagation in Federated Learning5.330.476, 5, 5
1681LUNA: Language as Continuing Anchors for Referring Expression Comprehension5.330.475, 6, 5
1682Many-Body Approximation for Tensors5.332.058, 3, 5
1683What do large networks memorize?5.330.475, 5, 6
1684Linear Mode Connectivity of Deep Neural Networks via Permutation Invariance and Renormalization5.332.055, 3, 8
1685Differentially Private Diffusion Models5.332.058, 5, 3
1686Teaching Algorithmic Reasoning via In-context Learning5.332.055, 3, 8
1687Instruction-Following Agents with Jointly Pre-Trained Vision-Language Models5.330.475, 6, 5
1688GPTQ: Accurate Quantization for Generative Pre-trained Transformers5.330.475, 5, 6
1689A new characterization of the edge of stability based on a sharpness measure aware of batch gradient distribution5.330.476, 5, 5
1690Continual Post-Training of Language Models5.332.058, 3, 5
1691Min-Max Multi-objective Bilevel Optimization with Applications in Robust Machine Learning5.330.475, 6, 5
1692Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus5.330.475, 6, 5
1693Data Subset Selection via Machine Teaching5.330.475, 6, 5
1694Elicitation Inference Optimization for Multi-Principal-Agent Alignment5.330.475, 6, 5
1695Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors5.330.476, 5, 5
1696Probability flow solution of the Fokker-Planck equation5.330.475, 6, 5
1697Recycling Scraps: Improving Private Learning by Leveraging Intermediate Checkpoints5.330.475, 6, 5
1698BC-IRL: Learning Generalizable Reward Functions from Demonstrations5.332.053, 5, 8
1699Provable Robustness against Wasserstein Distribution Shifts via Input Randomization5.330.475, 6, 5
1700Deep Learning From Crowdsourced Labels: Coupled Cross-Entropy Minimization, Identifiability, and Regularization5.330.476, 5, 5
1701A Kernel-Based View of Language Model Fine-Tuning5.330.476, 5, 5
1702Learning Multiobjective Program Through Online Learning5.332.053, 5, 8
1703ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret5.330.475, 5, 6
1704The Challenges of Exploration for Offline Reinforcement Learning5.330.475, 6, 5
1705Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach5.332.058, 5, 3
1706Accelerated Single-Call Methods for Constrained Min-Max Optimization5.332.053, 8, 5
1707Understanding the Complexity Gains of Contextual Multi-task RL with Curricula5.330.475, 6, 5
1708Expected Probabilistic Hierarchies5.330.475, 6, 5
1709SP2 : A Second Order Stochastic Polyak Method5.330.475, 6, 5
1710Improved Group Robustness via Classifier Retraining on Independent Splits5.330.475, 6, 5
1711Density Sketches for Sampling and Estimation5.330.475, 5, 6
1712Beyond Link Prediction: On Pre-Training Knowledge Graph Embeddings5.330.475, 6, 5
1713Univariate vs Multivariate Time Series Forecasting with Transformers5.330.476, 5, 5
1714On the optimization and generalization of overparameterized implicit neural networks5.330.475, 5, 6
1715Learning to Unlearn: Instance-wise Unlearning for Pre-trained Classifiers5.332.058, 5, 3
17163D Neural Embedding Likelihood for Robust Sim-to-Real Transfer in Inverse Graphics5.330.476, 5, 5
1717MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection5.330.476, 5, 5
1718Trimsformer: Trimming Transformer via Searching for Low-Rank Structure5.330.475, 6, 5
1719Towards a Unified Theoretical Understanding of Non-contrastive Learning via Rank Differential Mechanism5.330.475, 6, 5
1720AE-FLOW: Autoencoders with Normalizing Flows for Medical Images Anomaly Detection5.332.053, 5, 8
1721Causal Mean Field Multi-Agent Reinforcement Learning5.330.475, 5, 6
1722Towards Robust Model Watermark via Reducing Parametric Vulnerability5.332.053, 5, 8
1723On the Robustness of Dataset Inference5.332.053, 8, 5
1724Towards Conditionally Dependent Masked Language Models5.330.475, 6, 5
1725DAVA: Disentangling Adversarial Variational Autoencoder5.330.475, 6, 5
1726Online Low Rank Matrix Completion5.332.053, 8, 5
1727Keypoint Matching via Random Network Consensus5.332.053, 5, 8
1728Private and Efficient Meta-Learning with Low Rank and Sparse decomposition5.330.475, 5, 6
1729On discrete symmetries of robotics systems: A group-theoretic and data-driven analysis5.330.475, 5, 6
1730BO-Muse: A Human expert and AI teaming framework for accelerated experimental design5.330.476, 5, 5
1731Policy-Based Self-Competition for Planning Problems5.332.053, 5, 8
1732Bayesian Oracle for bounding information gain in neural encoding models5.330.475, 5, 6
1733Unsupervised Performance Predictor for Architecture Search5.330.475, 5, 6
1734Learning Reduced Fluid Dynamics5.332.053, 5, 8
1735Confident Sinkhorn Allocation for Pseudo-Labeling5.330.476, 5, 5
1736UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction5.332.053, 5, 8
1737UNDERSTANDING PURE CLIP GUIDANCE FOR VOXEL GRID NERF MODELS5.330.476, 5, 5
1738Learning to Predict Parameter for Unseen Data5.330.475, 5, 6
1739BinSGDM: Extreme One-Bit Quantization for Communication Efficient Large-Scale Distributed Training5.330.476, 5, 5
1740Free Lunch for Domain Adversarial Training: Environment Label Smoothing5.330.475, 6, 5
1741One-Vs-All AUC Maximization: an effective solution to the low-resource named entity recognition problem5.332.053, 5, 8
1742Learning to Extrapolate: A Transductive Approach5.332.055, 8, 3
1743Detecting and Mitigating Indirect Stereotypes in Word Embeddings5.330.475, 5, 6
1744ASGNN: Graph Neural Networks with Adaptive Structure5.330.475, 5, 6
1745Spatial reasoning as Object Graph Energy Minimization5.330.475, 5, 6
1746BAT-Chain: Bayesian-Aware Transport Chain for Topic Hierarchies Discovery5.330.476, 5, 5
1747Exact Representation of Sparse Networks with Symmetric Nonnegative Embeddings5.330.476, 5, 5
1748Neural DAG Scheduling via One-Shot Priority Sampling5.330.475, 6, 5
1749Bias Amplification Improves Worst-Group Accuracy without Group Information5.330.475, 5, 6
1750A CMDP-within-online framework for Meta-Safe Reinforcement Learning5.332.053, 5, 8
1751Conditional Permutation Invariant Flows5.330.475, 5, 6
1752Learned Neural Network Representations are Spread Diffusely with Redundancy5.330.475, 5, 6
1753Multi-Segmental Informational Coding for Self-Supervised Representation Learning5.330.476, 5, 5
1754Learning to Segment from Noisy Annotations: A Spatial Correction Approach5.330.476, 5, 5
1755DiP-GNN: Discriminative Pre-Training of Graph Neural Networks5.330.476, 5, 5
1756Faster Reinforcement Learning with Value Target Lower Bounding5.330.475, 6, 5
1757Quasi-optimal Learning with Continuous Treatments5.330.475, 6, 5
1758On Structural Expressive Power of Graph Transformers5.332.058, 5, 3
1759Learning Critically in Federated Learning with Noisy and Heterogeneous Clients5.330.475, 6, 5
1760Deep Evidential Reinforcement Learning for Dynamic Recommendations5.332.053, 8, 5
1761SuperWeight Ensembles: Automated Compositional Parameter Sharing Across Diverse Architechtures5.330.476, 5, 5
1762Robust Self-Supervised Learning with Lie Groups5.332.055, 3, 8
1763D4FT: A Deep Learning Approach to Kohn-Sham Density Functional Theory5.330.476, 5, 5
1764Differentially Private Optimization on Large Model at Small Cost5.330.475, 6, 5
1765Contrastive Value Learning: Implicit Models for Simple Offline RL5.332.053, 8, 5
1766Normalizing Flows for Interventional Density Estimation5.330.476, 5, 5
1767GuoFeng: A Discourse-aware Evaluation Benchmark for Language Understanding, Translation and Generation5.332.058, 3, 5
1768SpectraNet: multivariate forecasting and imputation under distribution shifts and missing data5.332.058, 5, 3
1769Benchmarking Constraint Inference in Inverse Reinforcement Learning5.330.475, 5, 6
1770Forward and Backward Lifelong Learning with Time-dependent Tasks5.330.475, 6, 5
1771Homeomorphism Alignment in Two Spaces for Unsupervised Domain Adaptation5.330.475, 5, 6
1772Warped Convolutional Networks: Bridge Homography to $mathfrak{sl}(3)$ algebra by Group Convolution5.332.053, 5, 8
1773FEAT: A general framework for Feature-aware Multivariate Time-series Representation Learning5.330.475, 5, 6
1774RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank5.330.475, 6, 5
1775Label-distribution-agnostic Ensemble Learning on Federated Long-tailed Data5.330.476, 5, 5
1776Masked Vector Quantization5.333.303, 3, 10
1777Measuring Image Complexity as a Discrete Hierarchy using MDL Clustering5.330.475, 5, 6
1778Agent Prioritization with Interpretable Relation for Trajectory Prediction5.330.475, 5, 6
1779Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition5.332.053, 5, 8
1780Latent State Marginalization as a Low-cost Approach to Improving Exploration5.330.475, 5, 6
1781Supernet Training for Federated Image Classification Under System Heterogeneity5.330.475, 6, 5
1782Generalizable Person Re-identification Without Demographics5.330.476, 5, 5
1783Behavior Prior Representation learning for Offline Reinforcement Learning5.332.053, 5, 8
1784How Does Adaptive Optimization Impact Local Neural Network Geometry?5.330.475, 6, 5
1785Concentric Ring Loss for Face Forgery Detection5.332.058, 3, 5
1786Representational Task Bias in Zero-shot Recognition at Scale5.330.476, 5, 5
1787Relational Curriculum Learning for Graph Neural Networks5.330.475, 6, 5
1788ACMP: Allen-Cahn Message Passing with Attractive and Repulsive Forces for Graph Neural Networks5.330.475, 6, 5
1789An Upper Bound for the Distribution Overlap Index and Its Applications5.330.476, 5, 5
1790Retrieval-based Controllable Molecule Generation5.330.476, 5, 5
1791Data Drift Correction via Time-varying Importance Weight Estimator5.330.475, 6, 5
1792Solving and Learning non-Markovian Stochastic Control problems in continuous-time with Neural RDEs5.330.476, 5, 5
1793Sequential Latent Variable Models for Few-Shot High-Dimensional Time-Series Forecasting5.330.475, 5, 6
1794On the Fast Convergence of Unstable Reinforcement Learning Problems5.330.475, 6, 5
1795Universal approximation and model compression for radial neural networks5.330.476, 5, 5
1796Learn Low-dimensional Shortest-path Representation of Large-scale and Complex Graphs5.330.475, 5, 6
1797Generalized Sum Pooling for Metric Learning5.330.476, 5, 5
1798Learning to Estimate Single-View Volumetric Flow Motions without 3D Supervision5.330.475, 5, 6
1799$Delta$-PINNs: physics-informed neural networks on complex geometries5.332.058, 5, 3
1800Temperature Schedules for self-supervised contrastive methods on long-tail data5.330.476, 5, 5
1801SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification5.332.053, 8, 5
1802Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup5.332.058, 3, 5
1803Identifying Weight-Variant Latent Causal Models5.331.495, 5, 8, 3, 6, 5
1804Can CNNs Be More Robust Than Transformers?5.332.058, 5, 3
1805Rethinking Graph Lottery Tickets: Graph Sparsity Matters5.330.476, 5, 5
1806On the Universal Approximation Property of Deep Fully Convolutional Neural Networks5.330.475, 5, 6
1807Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval5.330.476, 5, 5
1808Continual Learning In Low-coherence Subspace: A Strategy To Mitigate Learning Capacity Degradation5.330.475, 6, 5
1809GSCA: Global Spatial Correlation Attention5.330.476, 5, 5
1810Understanding Incremental Learning of Gradient Descent: A Fine-grained analysis of Matrix Sensing5.332.053, 5, 8
1811Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models5.330.476, 5, 5
1812Critical Sampling for Robust Evolution Behavior Learning of Unknown Dynamical Systems5.332.053, 8, 5
1813Effective Cross-instance Positive Relations for Generalized Category Discovery5.330.475, 5, 6
1814Assessing Model Out-of-distribution Generalization with Softmax Prediction Probability Baselines and A Correlation Method5.330.476, 5, 5
1815Progressive Compressed Auto-Encoder for Self-supervised Representation Learning5.331.116, 6, 6, 6, 3, 5
1816Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation5.330.475, 6, 5
1817Distribution Aware Metrics for Conditional Natural Language Generation5.330.475, 5, 6
1818Recommender Transformers with Behavior Pathways5.330.475, 6, 5
1819HNeRV: A Hybrid Neural Representation for Videos5.330.476, 5, 5
1820Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation5.330.476, 5, 5
1821Deep Physics-based Deformable Models for Efficient Shape Abstractions5.330.476, 5, 5
1822Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies5.330.476, 5, 5
1823Active Learning with Controllable Augmentation Induced Acquisition5.332.055, 8, 3
1824Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game5.330.475, 5, 6
1825Mind the Gap: Offline Policy Optimizaiton for Imperfect Rewards5.332.058, 5, 3
1826Time Series are Images: Vision Transformer for Irregularly Sampled Time Series5.332.058, 5, 3
1827Understanding Self-Supervised Pretraining with Part-Aware Representation Learning5.330.476, 5, 5
1828Volumetric Optimal Transportation by Fast Fourier Transform5.332.053, 8, 5
1829Robustness Exploration of Semantic Information in Adversarial Training5.330.475, 6, 5
1830Learning GFlowNets from partial episodes for improved convergence and stability5.330.475, 6, 5
1831Boosting Out-of-Distribution Detection with Multiple Pre-trained Models5.330.475, 6, 5
1832Confidence and Dispersity Speak: Characterising Prediction Matrix for Unsupervised Accuracy Estimation5.332.053, 5, 8
1833Molecular Geometry Pretraining with SE(3)-Invariant Denoising Distance Matching5.330.475, 5, 6
1834Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking5.330.475, 6, 5
1835Understanding the Training Dynamics in Federated Deep Learning via Aggregation Weight Optimization5.330.475, 5, 6
1836ONLINE RESTLESS BANDITS WITH UNOBSERVED STATES5.250.435, 5, 6, 5
1837Learning Representations for Reinforcement Learning with Hierarchical Forward Models5.251.303, 6, 6, 6
1838Randomized Sharpness-Aware Training for Boosting Computational Efficiency in Deep Learning5.251.795, 3, 5, 8
1839Light and Accurate: Neural Architecture Search via Two Constant Shared Weights Initialisations5.250.435, 6, 5, 5
1840Protein Sequence and Structure Co-Design with Equivariant Translation5.251.306, 6, 3, 6
1841Efficiently Meta-Learning for Robust Deep Networks without Prior Unbiased Set5.251.795, 8, 5, 3
1842Regression with Label Differential Privacy5.252.591, 6, 8, 6
1843Theoretical Study of Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward5.250.436, 5, 5, 5
1844Backpropagation through Combinatorial Algorithms: Identity with Projection Works5.251.793, 5, 5, 8
1845GradientMix: A Simple yet Effective Regularization for Large Batch Training5.250.435, 6, 5, 5
1846Towards Learning Implicit Symbolic Representation for Visual Reasoning5.250.435, 5, 6, 5
1847SKTformer: A Skeleton Transformer for Long Sequence Data5.251.306, 3, 6, 6
1848Specformer: Spectral Graph Neural Networks Meet Transformers5.250.435, 6, 5, 5
1849MetaP: How to Transfer Your Knowledge on Learning Hidden Physics5.250.435, 5, 6, 5
1850CommsVAE: Learning the brain's macroscale communication dynamics using coupled sequential VAEs5.250.435, 5, 5, 6
1851Long Term Fairness via Performative Distributionally Robust Optimization5.251.795, 3, 8, 5
1852Multi-View Masked Autoencoders for Visual Control5.250.435, 5, 6, 5
1853Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-Free RL5.251.798, 3, 5, 5
18543D-IntPhys: Learning 3D Visual Intuitive Physics for Fluids, Rigid Bodies, and Granular Materials5.252.8610, 3, 5, 3
1855Benchmarking Algorithms for Domain Generalization in Federated Learning5.250.436, 5, 5, 5
1856Continual Learning Based on Sub-Networks and Task Similarity5.250.435, 6, 5, 5
1857Heavy-tailed Noise Does Not Explain the Gap Between SGD and Adam, but Sign Descent Might5.251.306, 6, 3, 6
1858Efficient parametric approximations of neural net function space distance5.251.798, 5, 3, 5
1859What Spurious Features Can Pretrained Language Models Combat?5.250.435, 5, 6, 5
1860Cramming: Training a language model on a single GPU in one day5.250.435, 5, 5, 6
1861Probabilistic Categorical Adversarial Attack and Adversarial Training5.251.798, 5, 5, 3
1862Dissecting adaptive methods in GANs5.251.798, 5, 5, 3
1863Robustness for Free: Adversarially Robust Anomaly Detection Through Diffusion Model5.250.435, 6, 5, 5
1864ErrorAug: Making Errors to Find Errors in Semantic Segmentation5.250.436, 5, 5, 5
1865When is Offline Hyperparameter Selection Feasible for Reinforcement Learning?5.250.435, 5, 5, 6
1866Denoising Diffusion Samplers5.250.435, 6, 5, 5
1867Model-free Reinforcement Learning that Transfers Using Random Reward Features5.251.795, 3, 5, 8
1868Progressive Mix-Up for Few-Shot Supervised Multi-Source Domain Transfer5.250.435, 5, 6, 5
1869Brain-like representational straightening of natural movies in robust feedforward neural networks5.251.306, 3, 6, 6
1870Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks5.251.795, 5, 3, 8
1871Calibrating the Rigged Lottery: Making All Tickets Reliable5.251.798, 3, 5, 5
1872Open-Vocabulary Panoptic Segmentation MaskCLIP5.250.435, 6, 5, 5
1873Laser: Latent Set Representations for 3D Generative Modeling5.250.435, 5, 6, 5
1874Finding and only finding local Nash equilibria by both pretending to be a follower5.250.435, 6, 5, 5
1875Fake It Until You Make It : Towards Accurate Near-Distribution Novelty Detection5.251.306, 3, 6, 6
1876Generative Pretraining for Black-Box Optimization5.250.435, 6, 5, 5
1877The ethical ambiguity of AI data enrichment: Measuring gaps in research ethics norms and practices5.252.863, 5, 3, 10
1878Neural multi-event forecasting on spatio-temporal point processes using probabilistically enriched transformers5.251.795, 5, 3, 8
1879Detecting Small Query Graphs in A Large Graph via Neural Subgraph Search5.250.436, 5, 5, 5
1880Planning with Language Models through Iterative Energy Minimization5.251.306, 6, 3, 6
1881Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction5.250.435, 6, 5, 5
1882Joint-Predictive Representations for Multi-Agent Reinforcement Learning5.251.306, 6, 6, 3
1883PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Category Discovery5.250.435, 5, 5, 6
1884Learning implicit hidden Markov models using neural likelihood-free inference5.251.793, 5, 8, 5
1885Making Better Decision by Directly Planning in Continuous Control5.251.306, 6, 3, 6
1886Heterogeneous Neuronal and Synaptic Dynamics for Spike-Efficient Unsupervised Learning: Theory and Design Principles5.251.795, 8, 3, 5
1887Shuffled Transformers for Blind Training5.251.793, 5, 8, 5
1888Hardware-aware compression with Random Operation Access Specific Tile (ROAST) hashing5.250.435, 5, 6, 5
1889Neural Implicit Shape Editing using Boundary Sensitivity5.250.435, 5, 5, 6
1890Amortised Invariance Learning for Contrastive Self-Supervision5.251.795, 5, 3, 8
1891Generating Sequences by Learning to Self-Correct5.250.435, 5, 6, 5
1892An ensemble view on mixup5.251.793, 5, 8, 5
1893ULF: UNSUPERVISED LABELING FUNCTION CORRECTION USING CROSS-VALIDATION FOR WEAK SUPERVISION5.250.436, 5, 5, 5
1894Continual Zero-shot Learning through Semantically Guided Generative Random Walks5.251.795, 8, 3, 5
1895Self-Guided Diffusion Models5.250.436, 5, 5, 5
1896Stay Moral and Explore: Learn to Behave Morally in Text-based Games5.250.436, 5, 5, 5
1897Memory-Efficient Reinforcement Learning with Priority based on Surprise and On-policyness5.251.795, 5, 8, 3
1898Uncertainty-aware off policy learning5.251.793, 5, 8, 5
1899Analyzing diffusion as serial reproduction5.251.793, 5, 8, 5
1900Pseudo-label Training and Model Inertia in Neural Machine Translation5.251.795, 5, 8, 3
1901Understanding weight-magnitude hyperparameters in training binary networks5.250.435, 5, 6, 5
1902Graph Backup: Data Efficient Backup Exploiting Markovian Transitions5.250.435, 5, 6, 5
1903Adversarial Driving Policy Learning by Misunderstanding the Traffic Flow5.250.435, 5, 6, 5
1904Sequential Learning of Neural Networks for Prequential MDL5.250.436, 5, 5, 5
1905ReaKE: Contrastive Molecular Representation Learning with Chemical Synthetic Knowledge Graph5.250.436, 5, 5, 5
1906Parameter Averaging for SGD Stabilizes the Implicit Bias towards Flat Regions5.251.798, 5, 3, 5
1907A New Hierarchy of Expressivity for Graph Neural Networks5.250.435, 6, 5, 5
1908Lmser-pix2seq: Learning Stable Sketch Representations For Sketch Healing5.251.798, 5, 5, 3
1909Consolidator: Mergable Adapter with Group Connections for Vision Transformer5.250.435, 5, 6, 5
1910Explaining RL Decisions with Trajectories5.250.435, 5, 6, 5
1911ProtoGNN: Prototype-Assisted Message Passing Framework for Non-Homophilous Graphs5.250.435, 5, 6, 5
1912Two Birds, One Stone: An Equivalent Transformation for Hyper-relational Knowledge Graph Modeling5.251.798, 3, 5, 5
1913Generalization Bounds with Arbitrary Complexity Measures5.250.435, 5, 6, 5
1914On student-teacher deviations in distillation: does it pay to disobey?5.251.795, 8, 5, 3
1915Merging Models Pre-Trained on Different Features with Consensus Graph5.251.795, 5, 8, 3
1916CUTS: Neural Causal Discovery from Unstructured Time-Series Data5.250.435, 5, 5, 6
1917On the Importance of In-distribution Class Prior for Out-of-distribution Detection5.251.306, 3, 6, 6
1918Curved Data Representations in Deep Learning5.251.798, 5, 5, 3
1919Learning Binary Networks on Long-Tailed Distributions5.251.798, 5, 5, 3
1920Concealing Sensitive Samples for Enhanced Privacy in Federated Learning5.251.793, 5, 8, 5
1921Understanding Graph Contrastive Learning From A Statistical Perspective5.250.435, 5, 5, 6
1922Stochastic Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity5.251.306, 6, 3, 6
1923Label-free Concept Bottleneck Models5.250.435, 5, 5, 6
1924Push and Pull: Competing Feature-Prototype Interactions Improve Semi-supervised Semantic Segmentation5.250.435, 5, 5, 6
1925A computational framework to unify representation similarity and function in biological and artificial neural networks5.251.793, 8, 5, 5
1926Temporally Consistent Video Transformer for Long-Term Video Prediction5.250.435, 5, 5, 6
1927DITTO: Offline Imitation Learning with World Models5.250.436, 5, 5, 5
1928Disentangling the Mechanisms Behind Implicit Regularization in SGD5.251.303, 6, 6, 6
1929Provably Efficient Lifelong Reinforcement Learning with Linear Representation5.250.436, 5, 5, 5
1930Copula Conformal Prediction for Multi-step Time Series Forecasting5.251.303, 6, 6, 6
1931Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy5.250.435, 5, 5, 6
1932TrajGRU-Attention-ODE: Novel Spatiotemporal Predictive Models5.250.436, 5, 5, 5
1933Is a Caption Worth a Thousand Images? A Study on Representation Learning5.251.798, 5, 5, 3
1934Parameter-Efficient Fine-Tuning Design Spaces5.251.793, 8, 5, 5
1935Variational Latent Branching Model for Off-Policy Evaluation5.250.435, 5, 5, 6
1936Polarity is all you need to learn and transfer faster5.251.793, 5, 5, 8
1937On the Geometry of Reinforcement Learning in Continuous State and Action Spaces5.250.436, 5, 5, 5
1938AUGMENTING ZERO-SHOT DENSE RETRIEVERS WITH PLUG-IN MIXTURE-OF-MEMORIES5.250.436, 5, 5, 5
1939Perfectly Secure Steganography Using Minimum Entropy Coupling5.252.596, 8, 1, 6
1940Identifiability of Label Noise Transition Matrix5.250.435, 5, 6, 5
1941Towards Explaining Distribution Shifts5.250.436, 5, 5, 5
1942CAMA: A New Framework for Safe Multi-Agent Reinforcement Learning Using Constraint Augmentation5.250.435, 5, 5, 6
1943Visual Prompt Tuning For Test-time Domain Adaptation5.250.435, 5, 5, 6
1944ReD-GCN: Revisit the Depth of Graph Convolutional Network5.250.436, 5, 5, 5
1945Rethinking Positive Sampling for Contrastive Learning with Kernel5.250.435, 5, 5, 6
1946FaiREE: fair classification with finite-sample and distribution-free guarantee5.251.798, 5, 3, 5
1947Unbiased Stochastic Proximal Solver for Graph Neural Networks with Equilibrium States5.250.436, 5, 5, 5
1948On The Implicit Bias of Weight Decay in Shallow Univariate ReLU Networks5.251.798, 3, 5, 5
1949Improving Deep Policy Gradients with Value Function Search5.250.435, 5, 6, 5
1950Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection5.252.596, 8, 6, 1
1951Over-parameterized Model Optimization with Polyak-{L}ojasiewicz Condition5.251.795, 5, 3, 8
1952DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning5.250.435, 5, 6, 5
1953A Curriculum Perspective to Robust Loss Functions5.251.303, 6, 6, 6
1954Decoupled Training for Long-Tailed Classification With Stochastic Representations5.250.436, 5, 5, 5
1955IT-NAS: Integrating Lite-Transformer into NAS for Architecture Seletion5.251.306, 3, 6, 6
1956Simplicity bias in $1$-hidden layer neural networks5.250.435, 5, 5, 6
1957Memory Gym: Partially Observable Challenges to Memory-Based Agents5.251.795, 8, 5, 3
1958On the effectiveness of out-of-distribution data in self-supervised long-tail learning.5.250.435, 5, 6, 5
1959Vera Verto: Multimodal Hijacking Attack5.250.436, 5, 5, 5
1960Joint Attention-Driven Domain Fusion and Noise-Tolerant Learning for Multi-Source Domain Adaptation5.251.798, 3, 5, 5
1961Model Obfuscation for Securing Deployed Neural Networks5.251.795, 8, 3, 5
1962MultiViz: Towards Visualizing and Understanding Multimodal Models5.252.591, 6, 6, 8
1963Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN5.251.795, 8, 3, 5
1964New Insights for the Stability-Plasticity Dilemma in Online Continual Learning5.251.795, 8, 3, 5
1965Ti-MAE: Self-Supervised Masked Time Series Autoencoders5.250.435, 5, 5, 6
1966Are More Layers Beneficial to Graph Transformers?5.251.306, 6, 3, 6
1967Clean-image Backdoor: Attacking Multi-label Models with Poisoned Labels Only5.251.306, 6, 3, 6
1968Bandit Learning in Many-to-one Matching Markets with Uniqueness Conditions5.250.435, 6, 5, 5
1969Predictive Inference with Feature Conformal Prediction5.250.435, 5, 5, 6
1970OrthoReg: Improving Graph-regularized MLPs via Orthogonality Regularization5.250.435, 5, 6, 5
1971Personalized Semantics Excitation for Federated Image Classification5.251.798, 5, 5, 3
1972Intrinsic Motivation via Surprise Memory5.251.798, 3, 5, 5
1973TensorVAE: A Direct Generative Model for Molecular Conformation Generation driven by Novel Feature Engineering5.251.793, 5, 8, 5
1974MaskFusion: Feature Augmentation for Click-Through Rate Prediction via Input-adaptive Mask Fusion5.251.795, 8, 3, 5
1975NERDS: A General Framework to Train Camera Denoisers from Single Noisy Images5.251.303, 6, 6, 6
1976Coverage-centric Coreset Selection for High Pruning Rates5.250.435, 6, 5, 5
1977Chasing Better Deep Image Priors Between Over- and Under-parameterization5.250.436, 5, 5, 5
1978Data Valuation Without Training of a Model5.251.303, 6, 6, 6
1979RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning5.250.435, 5, 6, 5
1980Speculative Decoding: Lossless Speedup of Autoregressive Translation5.250.435, 6, 5, 5
1981Transformer Module Networks for Systematic Generalization in Visual Question Answering5.250.435, 5, 5, 6
1982Constructive TT-representation of the tensors given as index interaction functions with applications5.251.306, 6, 6, 3
1983VoGE: A Differentiable Volume Renderer using Gaussian Ellipsoids for Analysis-by-Synthesis5.251.795, 8, 3, 5
1984Unravel Structured Heterogeneity of Tasks in Meta-Reinforcement Learning via Exploratory Clustering5.250.436, 5, 5, 5
1985Find Your Friends: Personalized Federated Learning with the Right Collaborators5.251.306, 6, 6, 3
1986Equilibrium-finding via exploitability descent with learned best-response functions5.251.795, 8, 5, 3
1987Masked inverse folding with sequence transfer for protein representation learning5.250.436, 5, 5, 5
1988FedDAR: Federated Domain-Aware Representation Learning5.251.306, 6, 6, 3
1989Interval Bound Interpolation for Few-shot Learning with Few Tasks5.250.435, 5, 5, 6
1990ELRT: Towards Efficient Low-Rank Training for Compact Neural Networks5.250.435, 5, 5, 6
1991Tangential Wasserstein Projections5.251.303, 6, 6, 6
1992SYNG4ME: Model Evaluation using Synthetic Test Data5.250.436, 5, 5, 5
1993Long-Tailed Learning Requires Feature Learning5.250.435, 6, 5, 5
1994Revisiting Pretraining Objectives for Tabular Deep Learning5.251.795, 3, 5, 8
1995Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization5.251.798, 5, 5, 3
1996Relative Positional Encoding Family via Unitary Transformation5.251.303, 6, 6, 6
1997Continual Vision-Language Representaion Learning with Off-Diagonal Information5.251.795, 5, 3, 8
1998COFS: COntrollable Furniture layout Synthesis5.250.435, 6, 5, 5
1999A Functional Perspective on Multi-Layer Out-of-Distribution Detection5.250.435, 6, 5, 5
2000Active Learning with Partial Labels5.251.795, 8, 3, 5
2001Fed-CBS: Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction5.251.795, 8, 5, 3
2002Delving into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling5.251.795, 3, 8, 5
2003Enabling Probabilistic Inference on Large-Scale Spiking Neural Networks5.251.798, 5, 3, 5
2004A Closer Look at Dual Batch Normalization and Two-domain Hypothesis In Adversarial Training With Hybrid Samples5.250.435, 5, 5, 6
2005Communication-Efficient Federated Learning with Accelerated Client Gradient5.250.435, 6, 5, 5
2006Ranking-Enhanced Unsupervised Sentence Representation Learning5.251.793, 5, 8, 5
2007Revisiting Graph Adversarial Attack and Defense From a Data Distribution Perspective5.250.435, 5, 6, 5
2008On Fairness Measurement for Generative Models5.250.436, 5, 5, 5
2009Analyzing the Latent Space of GAN through Local Dimension Estimation5.251.303, 6, 6, 6
2010Neural Collaborative Filtering Bandits via Meta Learning5.251.798, 5, 5, 3
2011Decoupled Mixup for Data-efficient Learning5.250.435, 5, 5, 6
2012FAIRER: Fairness as Decision Rationale Alignment5.250.435, 5, 5, 6
2013Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients5.250.435, 6, 5, 5
2014Learning Continuous Grasping Function with a Dexterous Hand from Human Demonstrations5.251.795, 8, 5, 3
2015When Do Models Generalize? A Perspective From Data-Algorithm Compatibility5.251.303, 6, 6, 6
2016Learning PDE Solution Operator for Continuous Modeling of Time-Series5.250.435, 5, 5, 6
2017Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions5.251.793, 5, 5, 8
2018Neural Radiance Field Codebooks5.250.435, 5, 5, 6
2019Data-Efficient and Interpretable Tabular Anomaly Detection5.250.435, 6, 5, 5
2020The Impact of Approximation Errors on Warm-Start Reinforcement Learning: A Finite-time Analysis5.251.306, 6, 3, 6
20213D-Aware Video Generation5.251.795, 3, 8, 5
2022Correcting Data Distribution Mismatch in Offline Meta-Reinforcement Learning with Few-Shot Online Adaptation5.250.435, 5, 6, 5
2023Online Placebos for Class-incremental Learning5.251.798, 3, 5, 5
2024Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning5.251.306, 6, 6, 3
2025IEDR: A Context-aware Intrinsic and Extrinsic Disentangled Recommender System5.251.306, 6, 3, 6
2026Exploring Chemical Space with Score-based Out-of-distribution Generation5.251.798, 3, 5, 5
2027DFlow: Learning to Synthesize Better Optical Flow Datasets via a Differentiable Pipeline5.250.435, 5, 6, 5
2028NormSoftmax: Normalize the Input of Softmax to Accelerate and Stabilize Training5.250.436, 5, 5, 5
2029TimelyFL: Heterogeneity-aware Asynchronous Federated Learning with Adaptive Partial Training5.251.306, 6, 3, 6
2030Graph Domain Adaptation via Theory-Grounded Spectral Regularization5.251.306, 6, 3, 6
2031Cross Modal Domain Generalization for Query-based Video Segmentation5.251.793, 8, 5, 5
2032Language Model Pre-training with Linguistically Motivated Curriculum Learning5.250.435, 5, 5, 6
2033Your Denoising Implicit Model is a Sub-optimal Ensemble of Denoising Predictions5.250.435, 6, 5, 5
2034NOAH: A New Head Structure To Improve Deep Neural Networks For Image Classification5.250.436, 5, 5, 5
2035InPL: Pseudo-labeling the Inliers First for Imbalanced Semi-supervised Learning5.251.306, 3, 6, 6
2036Imitate Your Own Refinement: Knowledge Distillation Sheds Light on Efficient Image-to-Image Translation5.250.435, 6, 5, 5
2037Self-Supervised Set Representation Learning for Unsupervised Meta-Learning5.250.435, 6, 5, 5
2038Learning Specialized Activation Functions for Physics-informed Neural Networks5.251.793, 8, 5, 5
2039Dateformer: Transformer Extends Look-back Horizon to Predict Longer-term Time Series5.251.306, 6, 3, 6
2040Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Split Target Guided DeNoising5.250.436, 5, 5, 5
2041Reliability of CKA as a Similarity Measure in Deep Learning5.251.795, 5, 8, 3
2042Comfort Zone: A Vicinal Distribution for Regression Problems5.251.303, 6, 6, 6
2043Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant Learning5.250.435, 5, 6, 5
2044Self-Organizing Pathway Expansion for Non-Exemplar Incremental Learning5.250.436, 5, 5, 5
2045DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection5.252.598, 6, 1, 6
2046DDM$^2$: Self-Supervised Diffusion MRI Denoising with Generative Diffusion Models5.252.591, 6, 6, 8
2047Pareto Automatic Multi-Task Graph Representation Learning5.251.795, 8, 5, 3
2048Outlier Robust Adversarial Training5.251.798, 5, 3, 5
2049Sparse Tokens for Dense Prediction - The Medical Image Segmentation Case5.250.435, 5, 6, 5
2050NTK-SAP: Improving neural network pruning by aligning training dynamics5.251.306, 3, 6, 6
2051Discovering Distinctive ``Semantics'' in Super-Resolution Networks5.251.795, 8, 3, 5
2052BQ-NCO: Bisimulation Quotienting for Generalizable Neural Combinatorial Optimization5.251.793, 5, 5, 8
2053Distilling Cognitive Backdoor within an Image5.251.798, 5, 3, 5
20543D generation on ImageNet5.251.306, 3, 6, 6
2055Revisiting Higher-Order Gradient Methods for Multi-Agent Reinforcement Learning5.250.435, 5, 6, 5
2056DIVISION: Memory Efficient Training via Dual Activation Precision5.251.793, 5, 8, 5
2057CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable and Controllable Text-Guided Image Manipulation5.250.435, 5, 6, 5
2058Provable Adaptivity in Adam5.251.795, 3, 5, 8
2059De Novo Molecular Generation via Connection-aware Motif Mining5.251.795, 3, 5, 8
2060Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models5.250.436, 5, 5, 5
2061Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling5.251.306, 6, 3, 6
2062E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic Segmentation5.250.435, 5, 6, 5
2063CAN: A simple, efficient and scalable contrastive masked autoencoder framework for learning visual representations5.251.795, 5, 8, 3
2064Self-conditioned Embedding Diffusion for Text Generation5.250.435, 5, 5, 6
2065Towards a Unified View on Visual Parameter-Efficient Transfer Learning5.250.435, 5, 5, 6
2066BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation5.250.435, 5, 5, 6
2067Towards Sustainable Self-supervised Learning5.250.436, 5, 5, 5
2068Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features5.251.795, 3, 8, 5
2069Efficient Automatic Machine Learning via Design Graphs5.251.795, 5, 8, 3
2070Motion-inductive Self-supervised Object Discovery in Videos5.251.793, 5, 5, 8
2071SIMPLE: Specialized Model-Sample Matching for Domain Generalization5.251.798, 5, 3, 5
2072A Study of Causal Confusion in Preference-Based Reward Learning5.201.608, 5, 5, 5, 3
2073CodeT5Mix: A Pretrained Mixture of Encoder-decoder Transformers for Code Understanding and Generation5.201.176, 6, 6, 3, 5
2074TILDE-Q: a Transformation Invariant Loss Function for Time-Series Forecasting5.202.793, 6, 8, 8, 1
2075Efficient neural representation in the cognitive neuroscience domain: Manifold Capacity in One-vs-rest Recognition Limit5.201.946, 8, 3, 6, 3
2076Revisit Finetuning strategy for Few-Shot Learning to Strengthen the Equivariance of Emdeddings5.201.176, 6, 6, 3, 5
2077Lossy Image Compression with Conditional Diffusion Models5.200.405, 5, 6, 5, 5
2078Active Learning for Object Detection with Evidential Deep Learning and Hierarchical Uncertainty Aggregation5.201.176, 3, 6, 6, 5
2079Understanding and Mitigating Robust Overfitting through the Lens of Feature Dynamics5.201.176, 6, 3, 6, 5
2080Synchronized Contrastive Pruning for Efficient Self-Supervised Learning5.201.605, 8, 5, 3, 5
2081Faster federated optimization under second-order similarity5.200.405, 5, 6, 5, 5
2082Where to Go Next for Recommender Systems? ID- vs. Modality-based recommender models revisited5.201.603, 8, 5, 5, 5
2083Optimising 2D Pose Representation: Improving Accuracy, Stability and Generalisability inUnsupervised 2D-3D Human Pose Estimation5.201.603, 8, 5, 5, 5
2084Test-time Adaptation for Better Adversarial Robustness5.200.405, 5, 5, 5, 6
2085RGI: robust GAN-inversion for mask-free image inpainting and unsupervised pixel-wise anomaly detection5.201.173, 6, 6, 5, 6
2086MIMT: Masked Image Modeling Transformer for Video Compression5.200.405, 5, 5, 6, 5
2087On the Necessity of Disentangled Representations for Downstream Tasks5.201.176, 5, 6, 6, 3
2088Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient for Out-of-Distribution Generalization5.201.943, 6, 6, 3, 8
2089Edge-Varying Fourier Graph Network for Multivariate Time Series Forecasting5.200.405, 5, 6, 5, 5
2090How do Variational Autoencoders Learn? Insights from Representational Similarity5.201.608, 3, 5, 5, 5
2091Dilated convolution with learnable spacings5.201.176, 6, 3, 5, 6
2092Grassmannian Class Representation in Deep Learning5.201.173, 6, 5, 6, 6
2093SPI-GAN: Denoising Diffusion GANs with Straight-Path Interpolations5.171.775, 3, 8, 6, 3, 6
2094The Reward Hypothesis is False5.171.463, 5, 5, 8, 5, 5
2095A Study of Biologically Plausible Neural Network: the Role and Interactions of Brain-Inspired Mechanisms in Continual Learning5.002.128, 3, 6, 3
2096Proper Scoring Rules for Survival Analysis5.000.005, 5, 5
2097PPAT: Progressive Graph Pairwise Attention Network for Event Causality Identification5.000.005, 5, 5
2098Disentangled Feature Swapping Augmentation for Weakly Supervised Semantic Segmentation5.001.226, 3, 5, 6
2099Kinship Representation Learning with Face Componential Relation5.002.123, 6, 8, 3
2100Improved Training of Physics-Informed Neural Networks with Model Ensembles5.002.128, 6, 3, 3
2101RephraseTTS: Dynamic Length Text based Speech Insertion with Speaker Style Transfer5.001.225, 6, 6, 3
2102Beyond Reward: Offline Preference-guided Policy Optimization5.002.128, 3, 3, 6
2103Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study5.001.416, 6, 3
2104UiTTa: Online Test-Time Adaptation by User Interaction5.000.005, 5, 5, 5
2105Compression-aware Training of Neural Networks using Frank-Wolfe5.002.126, 3, 3, 8
2106MEDOE: A Multi-Expert Decoder and Output Ensemble Framework for Long-tailed Semantic Segmentation5.000.005, 5, 5, 5
2107TransFool: An Adversarial Attack against Neural Machine Translation Models5.001.223, 6, 6, 5
2108Denoising Differential Privacy in Split Learning5.001.223, 5, 6, 6
2109Extracting Meaningful Attention on Source Code: An Empirical Study of Developer and Neural Model Code Exploration5.001.106, 3, 5, 6, 5
2110Asynchronous Distributed Bilevel Optimization5.000.005, 5, 5
2111Confidence-Based Feature Imputation for Graphs with Partially Known Features5.001.416, 3, 6
2112Offline imitation learning by controlling the effective planning horizon5.001.226, 3, 5, 6
2113A Hierarchical Bayesian Approach to Federated Learning5.001.226, 6, 5, 3
2114On the Existence of a Trojaned Twin Model5.001.226, 3, 6, 5
2115Counterfactual Generation Under Confounding5.000.005, 5, 5, 5
2116FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation5.001.416, 3, 6
2117MA-BERT: Towards Matrix Arithmetic-only BERT Inference by Eliminating Complex Non-linear Functions5.000.005, 5, 5
2118Offline Reinforcement Learning via Weighted $f$-divergence5.000.005, 5, 5, 5
2119Revisiting and Improving FGSM Adversarial Training5.000.005, 5, 5, 5
2120TrojText: Test-time Invisible Textual Trojan Insertion5.001.226, 5, 6, 3
2121Robustness Guarantees for Adversarially Trained Neural Networks5.001.226, 5, 6, 3
2122Fast-PINN for Complex Geometry: Solving PDEs with Boundary Connectivity Loss5.001.223, 6, 6, 5
2123UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining5.001.226, 3, 5, 6
2124GNNInterpreter: A Probabilistic Generative Model-Level Explanation for Graph Neural Networks5.001.226, 3, 6, 5
2125On Pre-training Language Model for Antibody5.001.223, 6, 6, 5
2126L2B: Learning to Bootstrap for Combating Label Noise5.000.005, 5, 5
2127Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis5.001.225, 6, 6, 3
2128Differentially Private Algorithms for Smooth Nonconvex ERM5.001.226, 3, 6, 5
2129Answer Me if You Can: Debiasing Video Question Answering via Answering Unanswerable Questions5.001.226, 6, 3, 5
2130Learning Rewards and Skills to Follow Commands with a Data Efficient Visual-Audio Representation5.000.005, 5, 5
2131Auto-Encoding Goodness of Fit5.001.226, 6, 5, 3
2132Understanding the Covariance Structure of Convolutional Filters5.001.225, 6, 6, 3
2133Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation5.000.005, 5, 5, 5
2134Do We Really Need Graph Models for Skeleton-Based Action Recognition? A Topology-Agnostic Approach with Fully-Connected Networks5.000.005, 5, 5
2135On Representing Mixed-Integer Linear Programs by Graph Neural Networks5.002.556, 8, 1, 5
2136Dynamic Neural Network is All You Need: Understanding the Robustness of Dynamic Mechanisms in Neural Networks5.002.948, 1, 6
2137Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning5.001.226, 5, 3, 6
2138PINTO: Faithful Language Reasoning Using Prompted-Generated Rationales5.001.416, 3, 6
2139Unsupervised 3D Scene Representation Learning via Movable Object Inference5.001.225, 3, 6, 6
2140Similarity-Based Cooperation5.000.005, 5, 5, 5
2141Augmentation Component Analysis: Modeling Similarity via the Augmentation Overlaps5.001.225, 6, 3, 6
2142On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness5.000.005, 5, 5
2143A Picture of the Space of Typical Learning Tasks5.001.416, 3, 6
2144UNICO: Efficient Unified Hardware-Software Co-Optimization For Deep Neural Networks5.000.005, 5, 5, 5
2145DyG2Vec: Representation Learning for Dynamic Graphs With Self-supervision5.001.223, 6, 6, 5
2146Deep Watermarks for Attributing Generative Models5.001.416, 6, 3
2147Learning Latent Structural Causal Models5.002.458, 3, 3, 8, 3
2148S$^6$-DAMON: Bridging Self-Supervised Speech Models and Real-time Speech Recognition5.000.005, 5, 5
2149ORCA: Interpreting Prompted Language Models via Locating Supporting Evidence in the Ocean of Pretraining Data5.001.223, 6, 6, 5
2150FedTiny: Pruned Federated Learning Towards Specialized Tiny Models5.000.005, 5, 5, 5
2151Learning to represent and predict evolving visual signals via polar straightening5.000.005, 5, 5
2152Interpretable (meta)factorization of clinical questionnaires to identify general dimensions of psychopathology5.001.903, 3, 8, 6, 5
2153Attentive MLP for Non-Autoregressive Generation5.000.005, 5, 5
2154The Plug and Play of Language Models for Text-to-image Generation5.001.225, 6, 3, 6
2155A Score-Based Model for Learning Neural Wavefunctions5.001.226, 3, 5, 6
2156Multi-Grid Tensorized Fourier Neural Operator for High Resolution PDEs5.000.005, 5, 5
2157Dual Student Networks for Data-Free Model Stealing5.002.128, 3, 3, 6
2158Equal Improvability: A New Fairness Notion Considering the Long-term Impact5.001.225, 6, 3, 6
2159Target Conditioned Representation Independence (TCRI); from Domain-Invariant to Domain-General Representations5.001.225, 3, 6, 6
2160Multi-Task Option Learning and Discovery for Stochastic Path Planning5.001.225, 3, 6, 6
2161Bandwith Enables Generalization in Quantum Kernel Models5.002.123, 6, 8, 3
2162SpENCNN: Orchestrating Encoding and Sparsity for Fast Homomorphically Encrypted Neural Network Inference5.000.005, 5, 5
2163Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning5.001.223, 6, 6, 5
2164Transformers Implement First-Order Logic with Majority Quantifiers5.001.908, 3, 6, 5, 3
2165FedX: Federated Learning for Compositional Pairwise Risk Optimization5.001.413, 6, 6
2166Multi-Sample Contrastive Neural Topic Model as Multi-Task Learning5.002.123, 8, 3, 6
2167Towards Fair Classification against Poisoning Attacks5.000.005, 5, 5
2168Fed-Cor: Federated Correlation Test with Secure Aggregation5.001.413, 6, 6
2169Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments5.002.123, 3, 6, 8
2170Plansformer: Generating Multi-Domain Symbolic Plans using Transformers5.001.223, 6, 6, 5
2171Multi-Environment Pretraining Enables Transfer to Action Limited Datasets5.001.906, 3, 5, 3, 8
2172Fast Sampling of Diffusion Models with Exponential Integrator5.001.226, 6, 5, 3
2173Movement-to-Action Transformer Networks for Temporal Action Proposal Generation5.002.123, 3, 6, 8
2174Interpretations of Domain Adaptations via Layer Variational Analysis5.000.005, 5, 5
2175Progressive Prompts: Continual Learning for Language Models without Forgetting5.001.225, 6, 3, 6
2176Multiple sequence alignment as a sequence-to-sequence learning problem5.001.416, 3, 6
2177Semi-Supervised Single Domain Generalization with Label-Free Adversarial Data Augmentation5.000.005, 5, 5, 5
2178Mitigating Propagation Failures in PINNs using Evolutionary Sampling5.001.416, 3, 6
2179Exploring perceptual straightness in learned visual representations5.000.005, 5, 5
2180Is Forgetting Less a Good Inductive Bias for Forward Transfer?5.000.005, 5, 5, 5
2181Simulating Environments for Evaluating Scarce Resource Allocation Policies5.002.558, 6, 5, 1
2182Revisiting Curiosity for Exploration in Procedurally Generated Environments5.002.453, 8, 3, 3, 8
2183The Power of Feel-Good Thompson Sampling: A Unified Framework for Linear Bandits5.000.005, 5, 5
2184Reward Design with Language Models5.001.226, 6, 3, 5
2185DSI++: Updating Transformer Memory with New Documents5.001.226, 5, 6, 3
2186The Game of Hidden Rules: A New Challenge for Machine Learning5.001.416, 6, 3
2187In-Time Refining Optimization Trajectories Toward Improved Robust Generalization5.000.005, 5, 5
2188Speed Up Iterative Non-Autoregressive Transformers by Distilling Multiple Steps5.000.005, 5, 5
2189When Rigid Coherency Hurts: Distributional Coherency Regularization for Probabilistic Hierarchical Time Series Forecasting5.002.558, 6, 1, 5
2190MolJET: Multimodal Joint Embedding Transformer for Conditional de novo Molecular Design and Multi-Property Optimization5.002.453, 3, 3, 8, 8
2191$O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games5.001.416, 3, 6
2192Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise5.001.223, 6, 6, 5
2193Explainable Machine Learning Predictions for the Long-term Performance of Brain-Computer Interfaces5.002.128, 3, 6, 3
2194Federated Learning from Small Datasets5.001.105, 6, 5, 6, 3
2195Contrastive introspection (ConSpec) to rapidly identify invariant steps for success5.001.223, 6, 5, 6
2196Panoptically guided Image Inpainting with Image-level and Object-level Semantic Discriminators5.001.225, 6, 3, 6
2197REM: Routing Entropy Minimization for Capsule Networks5.001.223, 6, 6, 5
2198Variational Classification5.000.005, 5, 5
2199ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond5.001.225, 6, 6, 3
2200Learning Robust Representations via Nuisance-extended Information Bottleneck5.000.005, 5, 5
2201Understanding Train-Validation Split in Meta-Learning with Neural Networks5.001.226, 3, 5, 6
2202Blessing from Experts: Super Reinforcement Learning in Confounded Environments5.001.416, 6, 3
2203DP-SGD-LF: Improving Utility under Differentially Private Learning via Layer Freezing5.001.416, 3, 6
2204A Simulation-based Framework for Robust Federated Learning to Training-time Attacks5.000.005, 5, 5, 5
2205PALM: Preference-based Adversarial Manipulation against Deep Reinforcement Learning5.001.106, 5, 3, 6, 5
2206Multi-Hypothesis 3D human pose estimation metrics favor miscalibrated distributions5.001.226, 6, 3, 5
2207Flatter, Faster: Scaling Momentum for Optimal Speedup of SGD5.001.413, 6, 6
2208SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration5.002.123, 6, 8, 3
2209AlphaFold Distillation for Improved Inverse Protein Folding5.002.126, 3, 8, 3
2210A Cognitive-inspired Multi-Module Architecture for Continual Learning5.000.005, 5, 5, 5
2211Masked Siamese ConvNets: Towards an Effective Masking Strategy for General-purpose Siamese Networks5.000.005, 5, 5
2212Training Normalizing Flows from Dependent Data5.001.416, 6, 3
2213Autoregressive Conditional Neural Processes5.001.416, 3, 6
2214Islands of Confidence: Robust Neural Network Classification with Uncertainty Quantification5.000.005, 5, 5
2215Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics5.001.413, 6, 6
2216Renamer: A Transformer Architecture In-variant to Variable Renaming5.001.413, 6, 6
2217Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer5.001.416, 3, 6
2218Enforcing Delayed-Impact Fairness Guarantees5.000.005, 5, 5
2219Towards Reliable Link Prediction with Robust Graph Information Bottleneck5.001.226, 6, 5, 3
2220UNICORN: A Unified Backdoor Trigger Inversion Framework5.001.413, 6, 6
2221Contrastive Meta-Learning for Partially Observable Few-Shot Learning5.001.226, 3, 6, 5
2222Analyzing Transformers in Embedding Space5.002.128, 3, 3, 6
2223Simplicity bias leads to amplified performance disparities5.000.005, 5, 5, 5
2224Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection5.001.225, 3, 6, 6
2225Distributed Inference and Fine-tuning of Large Language Models Over The Internet5.000.005, 5, 5, 5
2226Irregularity Reflection Neural Network for Time Series Forecasting5.001.416, 6, 3
2227Interpreting Class Conditional GANs with Channel Awareness5.000.005, 5, 5
2228Graph MLP-Mixer5.000.005, 5, 5, 5
2229Fine-grained Few-shot Recognition by Deep Object Parsing5.001.226, 3, 5, 6
2230Learning to Solve Constraint Satisfaction Problems with Recurrent Transformers5.002.123, 3, 8, 6
2231Learning Fast and Slow for Time Series Forecasting5.001.416, 3, 6
2232Holistic Adversarially Robust Pruning5.001.225, 6, 3, 6
2233Text-Guided Diffusion Image Style Transfer with Contrastive Loss Fine-tuning5.000.005, 5, 5
2234Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling5.000.005, 5, 5
2235Prescribed Safety Performance Imitation Learning from A Single Expert Dataset5.000.005, 5, 5, 5
2236Modality Complementariness: Towards Understanding Multi-modal Robustness5.002.126, 3, 3, 8
2237No-regret Learning in Repeated First-Price Auctions with Budget Constraints5.001.733, 5, 5, 6, 3, 8
2238Robustness of Unsupervised Representation Learning without Labels5.001.226, 3, 6, 5
2239Generative Spoken Language Model based on continuous word-sized audio tokens5.000.005, 5, 5, 5
2240Better with Less: Data-Active Pre-training of Graph Neural Networks5.002.123, 6, 8, 3
2241Generalization error bounds for Neural Networks with ReLU activation5.000.005, 5, 5, 5
2242Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL5.001.413, 6, 6
2243Graphics Capsule: Learning hierarchical 3D representations from 2D images and its application on human faces5.001.005, 6, 5, 6, 5, 3
2244Group-wise Verifiable Distributed Computing for Machine Learning under Adversarial Attacks5.002.126, 3, 8, 3
2245Uncertainty-oriented Order Learning for Facial Beauty Prediction5.001.223, 5, 6, 6
2246Revisiting Uncertainty Estimation for Node Classification: New Benchmark and Insights5.000.005, 5, 5
2247SoTeacher: Toward Student-oriented Teacher Network Training for Knowledge Distillation5.001.225, 6, 6, 3
2248GuardHFL: Privacy Guardian for Heterogeneous Federated Learning5.001.413, 6, 6
2249Unsupervised 3d object learning through neuron activity aware plasticity5.001.416, 3, 6
2250Unsupervised Learning of Structured Representations via Closed-Loop Transcription5.001.226, 6, 3, 5
2251DETRDistill: A Simple Knowledge Distillation Framework for DETR-Families5.001.226, 3, 6, 5
2252Multi-Layered 3D Garments Animation5.000.005, 5, 5
2253When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning5.001.226, 6, 5, 3
2254Efficient debiasing with contrastive weight pruning5.000.005, 5, 5
2255Global Nash Equilibrium in a Class of Nonconvex N-player Games5.000.005, 5, 5, 5
2256Task-Agnostic Online Meta-Learning in Non-stationary Environments5.001.105, 5, 3, 6, 6
2257Task Ambiguity in Humans and Language Models5.001.416, 3, 6
2258Tensor Decompositions For Temporal Knowledge Graph Completion with Time Perspective5.000.005, 5, 5
2259Restoration based Generative Models5.001.226, 5, 3, 6
2260GAPS: Few-Shot Incremental Semantic Segmentation via Guided Copy-Paste Synthesis5.000.005, 5, 5
2261The Eigenlearning Framework: A Conservation Law Perspective on Kernel Ridge Regression and Wide Neural Networks5.001.226, 6, 5, 3
2262Generative Gradual Domain Adaptation with Optimal Transport5.001.226, 3, 5, 6
2263Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery5.000.005, 5, 5
2264Precautionary Unfairness in Self-Supervised Contrastive Pre-training5.000.005, 5, 5, 5
2265VEHICLE-INFRASTRUCTURE COOPERATIVE 3D DETECTION VIA FEATURE FLOW PREDICTION5.001.223, 6, 5, 6
2266Mesh-Independent Operator Learning for PDEs using Set Representations5.000.005, 5, 5
2267FlexRound: Learnable Rounding by Element-wise Division for Post-Training Quantization5.000.005, 5, 5, 5
2268LA-BALD: An Information-Theoretic Image Labeling Task Sampler5.001.226, 3, 5, 6
2269Anchor Sampling for Federated Learning with Partial Client Participation5.001.416, 3, 6
2270What do Vision Transformers Learn? A Visual Exploration5.000.005, 5, 5, 5
2271Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency5.001.223, 6, 5, 6
2272An efficient encoder-decoder architecture with top-down attention for speech separation5.001.413, 6, 6
2273Rethinking Identity in Knowledge Graph Embedding5.001.226, 6, 5, 3
2274Energy-based Predictive Representation for Reinforcement Learning5.002.123, 6, 8, 3
2275Exclusive Supermask Subnetwork Training for Continual Learning5.001.223, 6, 6, 5
2276Dual personalization for federated recommendation on devices5.001.226, 3, 6, 5
2277Time-Transformer AAE: Connecting Temporal Convolutional Networks and Transformer for Time Series Generation5.001.223, 5, 6, 6
2278Autoencoding Hyperbolic Representation for Adversarial Generation5.001.416, 6, 3
2279RLSBench: A Large-Scale Empirical Study of Domain Adaptation Under Relaxed Label Shift5.001.226, 5, 6, 3
2280Deep Bayesian Active Learning for Accelerating Stochastic Simulation5.001.413, 6, 6
2281On $mathcal{O}(1/K)$ Convergence and Low Sample Complexity for Single-Timescale Policy Evaluation with Nonlinear Function Approximation5.001.226, 3, 5, 6
2282Generating Features with Increased Crop-Related Diversity for Few-shot Object Detection5.001.226, 6, 3, 5
2283A Theoretical Understanding of Vision Transformers: Learning, Generalization, and Sample Complexity5.001.223, 5, 6, 6
2284Skill-Based Reinforcement Learning with Intrinsic Reward Matching5.001.223, 6, 6, 5
2285Assessing Neural Network Robustness via Adversarial Pivotal Tuning of Real Images5.000.005, 5, 5
2286Actionable Recourse Guided by User Preference5.001.413, 6, 6
2287Lipschitz regularized gradient flows and latent generative particles5.001.226, 3, 5, 6
2288Constraining Representations Yields Models That Know What They Don't Know5.001.416, 3, 6
2289Learning Controllable Adaptive Simulation for Multi-scale Physics5.001.223, 5, 6, 6
2290Posthoc Privacy guarantees for neural network queries5.001.416, 3, 6
2291Discretization Invariant Learning on Neural Fields5.001.226, 3, 5, 6
2292Global Counterfactual Explanations Are Reliable Or Efficient, But Not Both5.002.285, 1, 8, 6, 5
2293Agnostic Learning of General ReLU Activation Using Gradient Descent5.001.413, 6, 6
2294SlenderGNN: Accurate, Robust, and Interpretable GNN, and the Reasons for its Success5.001.223, 5, 6, 6
2295Noise$^+$2Noise: Co-taught De-noising Autoencoders for Time-Series Data5.001.226, 6, 5, 3
2296Neural Constraint Inference: Inferring Energy Constraints in Interacting Systems5.001.226, 3, 6, 5
2297Cortically motivated recurrence enables task extrapolation5.001.226, 5, 3, 6
2298Countering the Attack-Defense Complexity Gap for Robust Classifiers5.001.416, 6, 3
2299Tier Balancing: Towards Dynamic Fairness over Underlying Causal Factors5.001.226, 6, 3, 5
2300Peaks2Image: Reconstructing fMRI Statistical Maps from Peaks5.000.005, 5, 5
2301ContraSim -- A Similarity Measure Based on Contrastive Learning5.002.128, 6, 3, 3
2302Prefix Conditioning Unifies Language and Label Supervision5.001.226, 3, 5, 6
2303Discovering Latent Knowledge in Language Models Without Supervision5.001.225, 6, 3, 6
2304Learning Intuitive Policies Using Action Features5.001.416, 3, 6
2305Private Data Stream Analysis for Universal Symmetric Norm Estimation5.002.123, 8, 6, 3
2306Leveraging Incompatibility to Defend Against Backdoor Poisoning5.001.226, 5, 3, 6
2307Scaling Laws for a Multi-Agent Reinforcement Learning Model5.001.226, 6, 3, 5
2308Federated Learning with Openset Noisy Labels5.000.005, 5, 5, 5
2309Bi-Stride Multi-Scale Graph Neural Network for Mesh-Based Physical Simulation5.001.226, 3, 6, 5
2310Offline Policy Comparison with Confidence: Benchmarks and Baselines5.001.226, 6, 5, 3
2311Asymmetric Certified Robustness via Feature-Convex Neural Networks5.001.226, 3, 6, 5
2312Learning Efficient Models From Few Labels By Distillation From Multiple Tasks5.000.005, 5, 5
2313Do Perceptually Aligned Gradients Imply Robustness?5.001.106, 5, 3, 5, 6
2314Hard-Meta-Dataset++: Towards Understanding Few-Shot Performance on Difficult Tasks5.001.223, 6, 6, 5
2315Sharper Analysis of Sparsely Activated Wide Neural Networks with Trainable Biases5.001.223, 5, 6, 6
2316Generalization Properties of Retrieval-based Models5.001.226, 3, 6, 5
2317Semi-Variance Reduction for Fair Federated Learning5.001.226, 5, 6, 3
2318Siamese DETR5.000.005, 5, 5, 5
2319How Predictors Affect Search Strategies in Neural Architecture Search?5.000.005, 5, 5, 5
2320Incomplete to complete multiphysics forecasting - a hybrid approach for learning unknown phenomena5.002.123, 6, 8, 3
2321Gradient-based optimization is not necessary for generalization in neural networks5.001.416, 3, 6
2322Mitigating Memorization of Noisy Labels via Regularization between Representations5.001.906, 3, 3, 8, 5
2323Temporal Coherent Test Time Optimization for Robust Video Classification5.001.416, 3, 6
2324Non-parametric Outlier Synthesis5.001.413, 6, 6
2325Population-Based Reinforcement Learning for Combinatorial Optimization Problems5.000.005, 5, 5
2326Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations5.001.226, 6, 3, 5
2327Data Pricing Mechanism Based on Property Rights Compensation Distribution5.000.005, 5, 5
2328Traversing Between Modes in Function Space for Fast Ensembling5.000.005, 5, 5, 5
2329Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning5.000.005, 5, 5, 5
2330When are smooth-ReLUs ReLU-like?5.000.005, 5, 5
2331Learning to mine approximate network motifs5.000.005, 5, 5, 5
2332Accelerating Guided Diffusion Sampling with Splitting Numerical Methods5.001.225, 6, 3, 6
2333oViT: An Accurate Second-Order Pruning Framework for Vision Transformers5.000.005, 5, 5
2334TOAST: Topological Algorithm for Singularity Tracking5.001.416, 6, 3
2335Simple and Scalable Nearest Neighbor Machine Translation5.001.225, 6, 3, 6
2336Topic and Hyperbolic Transformer to Handle Multi-modal Dependencies5.000.005, 5, 5
2337Name Your Colour For the Task: Artificially Discover Colour Naming via Colour Quantisation Transformer5.001.223, 5, 6, 6
2338Symmetrical SyncMap for Imbalanced General Chunking Problems5.000.005, 5, 5, 5
2339Optimising Event-Driven Spiking Neural Network with Regularisation and Cutoff5.001.105, 6, 5, 6, 3
2340How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?5.001.226, 5, 3, 6
2341On the Expressive Equivalence Between Graph Convolution and Attention Models5.003.088, 3, 8, 1
2342Continual Learning via Adaptive Neuron Selection5.002.123, 3, 6, 8
2343Exact Group Fairness Regularization via Classwise Robust Optimization5.001.225, 6, 6, 3
2344Pairwise Confidence Difference on Unlabeled Data is Sufficient for Binary Classification5.001.416, 6, 3
2345Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning5.001.223, 6, 6, 5
2346Variance Reduction is an Antidote to Byzantines: Better Rates, Weaker Assumptions and Communication Compression as a Cherry on the Top5.002.285, 1, 5, 6, 8
2347Open Set Recognition by Mitigating Prompt Bias5.001.226, 6, 5, 3
2348Momentum Tracking: Momentum Acceleration for Decentralized Deep Learning on Heterogeneous Data5.000.005, 5, 5, 5
2349Deep Graph-Level Orthogonal Hypersphere Compression for Anomaly Detection5.001.226, 6, 3, 5
2350Gradient Deconfliction via Orthogonal Projections onto Subspaces For Multi-task Learning5.001.106, 3, 5, 5, 6
2351On the Importance of the Policy Structure in Offline Reinforcement Learning5.001.226, 3, 6, 5
2352Exact manifold Gaussian Variational Bayes5.001.226, 3, 6, 5
2353LMSeg: Language-guided Multi-dataset Segmentation5.001.416, 3, 6
2354In Search of Smooth Minima for Purifying Backdoor in Deep Neural Networks5.000.005, 5, 5
2355Deep Learning-based Source Code Complexity Prediction5.001.226, 5, 6, 3
2356Improving Explanation Reliability through Group Attribution5.001.226, 3, 6, 5
2357Finite-time Analysis of Single-timescale Actor-Critic on Linear Quadratic Regulator5.001.416, 6, 3
2358Towards Boosting the Open-Domain Chatbot with Human Feedback5.001.103, 5, 6, 5, 6
2359SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication5.001.225, 6, 3, 6
2360Multiscale Multimodal Transformer for Multimodal Action Recognition5.000.005, 5, 5
23613EF: Class-Incremental Learning via Efficient Energy-Based Expansion and Fusion5.001.106, 5, 3, 5, 6
2362Important Channel Tuning5.001.225, 3, 6, 6
2363Rethinking the Structure of Stochastic Gradients: Empirical and Statistical Evidence5.000.005, 5, 5
2364Clustering-Assisted Foreground and Background Separation for Weakly-supervised Temporal Action Localization5.001.223, 6, 6, 5
2365Offline Reinforcement Learning with Differential Privacy5.001.416, 6, 3
2366Policy Architectures for Compositional Generalization in Control5.002.123, 8, 6, 3
2367Lower Bounds for Differentially Private ERM: Unconstrained and Non-Euclidean5.000.005, 5, 5
2368Explainable Recommender with Geometric Information Bottleneck5.000.005, 5, 5
2369In-Context Policy Iteration5.001.226, 5, 3, 6
2370Learning Control Policies for Region Stabilization in Stochastic Systems5.000.005, 5, 5, 5
2371Semantic Video Synthesis from Video Scene Graphs5.001.223, 6, 5, 6
2372Convolutions are competitive with transformers for protein sequence pretraining5.001.416, 3, 6
2373Learning differentiable solvers for systems with hard constraints5.002.128, 3, 3, 6
2374CEPD: Co-Exploring Pruning and Decomposition for Compact DNN Models5.000.005, 5, 5, 5, 5
2375Causal discovery from conditionally stationary time series5.001.225, 3, 6, 6
2376Spatio-temporal Self-Attention for Egocentric 3D Pose Estimation5.001.416, 3, 6
2377RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation5.000.005, 5, 5
2378Multi-Agent Policy Transfer via Task Relationship Modeling5.001.225, 6, 3, 6
2379Distributionally Robust Post-hoc Classifiers under Prior Shifts5.001.416, 6, 3
2380Cross-Quality Few-Shot Transfer for Alloy Yield Strength Prediction: A New Material Science Benchmark and An Integrated Optimization Framework5.001.413, 6, 6
2381LEARNING THE SPECTROGRAM TEMPORAL RESOLUTION FOR AUDIO CLASSIFICATION5.001.413, 6, 6
2382Inducing Gaussian Process Networks5.000.005, 5, 5
2383DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images5.002.123, 3, 6, 8
2384Take One Gram of Neural Features, Get Enhanced Group Robustness5.001.223, 6, 6, 5
2385FaceMAE: Privacy-Preserving Face Recognition via Masked Autoencoders5.001.105, 6, 5, 3, 6
2386What can be learnt with wide convolutional neural networks?5.001.416, 6, 3
2387Logit Clipping for Robust Learning against Label Noise5.002.123, 8, 6, 3
2388FedCL: Critical Learning Periods-aware Adaptive Client Selection in Federated Learning5.000.005, 5, 5, 5
2389Decentralized Online Bandit Optimization on Directed Graphs with Regret Bounds5.002.123, 3, 8, 6
2390BED: Boundary-Enhanced Decoder for Chinese Word Segmentation5.000.005, 5, 5, 5
2391SYNC: SAFETY-AWARE NEURAL CONTROL FOR STABILIZING STOCHASTIC DELAY-DIFFERENTIAL EQUATIONS5.000.005, 5, 5
2392Reinforcement learning for instance segmentation with high-level priors5.000.005, 5, 5
2393Mutual Information-guided Knowledge Transfer for Open-World Semi-Supervised Learning5.001.226, 3, 6, 5
2394DIMENSION-REDUCED ADAPTIVE GRADIENT METHOD5.000.005, 5, 5, 5
2395Online Policy Optimization for Robust MDP5.001.223, 6, 5, 6
2396Revisiting Feature Acquisition Bias for Few-Shot Fine-Grained Image Classification5.001.223, 6, 5, 6
2397Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias5.001.225, 6, 6, 3
2398Generalization bounds and algorithms for estimating the effect of multiple treatments and dosage5.000.005, 5, 5, 5
2399On the optimal precision of GANs5.001.103, 5, 5, 6, 6
2400Prompt Generation Networks for Efficient Adaptation of Frozen Vision Transformers5.000.005, 5, 5, 5
2401How Normalization and Weight Decay Can Affect SGD? Insights from a Simple Normalized Model5.000.005, 5, 5, 5
2402DCAPS: Dual Cross-Attention Coupled with Stabilizer for Few-Shot Common Action Localization5.001.226, 6, 3, 5
2403Adapting Pre-trained Language Models for Quantum Natural Language Processing5.000.005, 5, 5
2404CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving5.002.128, 3, 6, 3
2405PathFusion: Path-consistent Lidar-Camera Deep Feature Fusion5.000.005, 5, 5
2406Less is More: Identifying the Cherry on the Cake for Dynamic Networks5.000.005, 5, 5, 5
2407HRBP: Hardware-friendly Regrouping towards Block-wise Pruning for Sparse Training5.000.005, 5, 5, 5
2408HIVE: HIerarchical Volume Encoding for Neural Implicit Surface Reconstruction5.001.226, 3, 5, 6
2409Improving Adversarial Transferability with Worst-case Aware Attacks5.000.005, 5, 5, 5
2410Curved Representation Space of Vision Transformers5.001.225, 6, 6, 3
2411Self-Architectural Knowledge Distillation for Spiking Neural Networks5.000.005, 5, 5, 5
2412Federated Semi-supervised Learning with Dual Regulator5.001.413, 6, 6
2413Cross-modal Graph Contrastive Learning with Cellular Images5.002.123, 3, 8, 6
2414ContraGen: Effective Contrastive Learning For Causal Language Model5.001.225, 3, 6, 6
2415Beyond Single Path Integrated Gradients for Reliable Input Attribution via Randomized Path Sampling5.001.225, 3, 6, 6
2416Decoupled and Patch-based Contrastive Learning for Long-tailed Visual Recognition5.001.106, 5, 6, 5, 3
2417The Geometry of Self-supervised Learning Models and its Impact on Transfer Learning5.001.413, 6, 6
2418Rethink Depth Separation with Intra-layer Links5.001.225, 6, 3, 6
2419Unsupervised Model Selection for Time Series Anomaly Detection5.001.225, 3, 6, 6
2420Deep Active Anomaly Detection With Diverse Queries5.001.416, 3, 6
2421Augmentation Backdoors5.000.005, 5, 5
2422Compact Bilinear Pooling via General Bilinear Projection5.001.416, 3, 6
2423Stochastic Gradient Methods with Preconditioned Updates5.000.005, 5, 5
2424Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts5.001.413, 6, 6
2425Neural Decoding of Visual Imagery via Hierarchical Variational Autoencoders5.003.393, 6, 1, 10
2426Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders5.001.225, 6, 3, 6
2427Revisiting Domain Randomization Via Relaxed State-Adversarial Policy Optimization5.001.226, 6, 3, 5
2428Consistent Targets Provide Better Supervision in Semi-supervised Object Detection5.001.226, 5, 6, 3
2429Multi-Agent Sequential Decision-Making via Communication5.001.226, 6, 3, 5
2430EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion5.000.005, 5, 5
2431Single-level Adversarial Data Synthesis based on Neural Tangent Kernels5.002.123, 3, 8, 6
2432Unified Algorithms for RL with Decision-Estimation Coefficients: No-Regret, PAC, and Reward-Free Learning5.000.005, 5, 5
2433Evaluating Fairness Without Sensitive Attributes: A Framework Using Only Auxiliary Models5.001.226, 6, 5, 3
2434Parallel Deep Neural Networks Have Zero Duality Gap5.002.123, 8, 6, 3
2435Causal RL Agents for Out-of-distribution Generalization5.001.416, 6, 3
2436Make Memory Buffer Stronger in Continual Learning: A Continuous Neural Transformation Approach5.000.005, 5, 5, 5
2437Initial Value Problem Enhanced Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks5.001.416, 6, 3
2438Global Context Vision Transformers5.001.225, 6, 3, 6
2439Highway Reinforcement Learning5.001.226, 3, 6, 5
2440Rememory-Based SimSiam for Unsupervised Continual Learning5.001.226, 3, 5, 6
2441Pruning with Output Error Minimization for Producing Efficient Neural Networks5.000.005, 5, 5, 5
2442DREAM: Domain-free Reverse Engineering Attributes of Black-box Model5.001.226, 6, 3, 5
2443Approximate Vanishing Ideal Computations at Scale5.001.416, 6, 3
2444Exploiting Spatial Separability for Deep Learning Multichannel Speech Enhancement with an Align-and-Filter Network5.001.226, 5, 3, 6
2445CausalAgents: A Robustness Benchmark for Motion Forecasting Using Causal Relationships5.001.105, 3, 6, 5, 6
2446Critic Sequential Monte Carlo5.001.226, 5, 3, 6
2447Learning to Take a Break: Sustainable Optimization of Long-Term User Engagement5.001.416, 6, 3
2448Laziness, Barren Plateau, and Noises in Machine Learning5.001.226, 6, 3, 5
2449Towards Online Real-Time Memory-based Video Inpainting Transformers5.001.223, 6, 6, 5
2450Gated Class-Attention with Cascaded Feature Drift Compensation for Exemplar-free Continual Learning of Vision Transformers5.001.225, 6, 6, 3
2451Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training5.001.416, 3, 6
2452TPC-NAS: Sub-Five-Minute Neural Architecture Search for Image Classification, Object-Detection, and Super-Resolution5.000.005, 5, 5, 5
2453Mutual Information Regularized Offline Reinforcement Learning5.001.223, 5, 6, 6
2454Visual Timing For Sound Source Depth Estimation in the Wild5.001.226, 3, 6, 5
2455Subclass-balancing Contrastive Learning for Long-tailed Recognition5.001.226, 5, 3, 6
2456Learning Robust Goal Space with Hypothetical Analogy-Making5.001.226, 6, 3, 5
2457Learning Disentanglement in Autoencoders through Euler Encoding5.001.223, 6, 5, 6
2458$mathrm{R}^2$-VOS: Robust Referring Video Object Segmentation via Relational Cycle Consistency5.000.005, 5, 5, 5
2459Lossless Filter Pruning via Adaptive Clustering for Convolutional Neural Networks5.000.005, 5, 5, 5
2460Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors5.001.105, 5, 6, 6, 3
2461Denoising Masked Autoencoders are Certifiable Robust Vision Learners5.002.126, 8, 3, 3
2462Neural Prompt Search5.000.005, 5, 5, 5
2463Few-Shot Transferable Robust Representation Learning via Bilevel Attacks5.001.225, 6, 3, 6
2464Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation5.000.005, 5, 5
2465Learning to Linearize Deep Neural Networks for Secure and Efficient Private Inference5.001.416, 6, 3
2466TempCLR: Temporal Alignment Representation with Contrastive Learning5.001.223, 5, 6, 6
2467The Power of Regularization in Solving Extensive-Form Games5.000.005, 5, 5, 5
2468Neural Topic Modeling with Embedding Clustering Regularization5.001.223, 5, 6, 6
2469MLPInit: Embarrassingly Simple GNN Training Acceleration with MLP Initialization5.002.128, 6, 3, 3
2470Promoting Semantic Connectivity: Dual Nearest Neighbors Contrastive Learning for Unsupervised Domain Generalization5.001.223, 5, 6, 6
2471Towards Equivariant Graph Contrastive Learning via Cross-Graph Augmentation5.002.123, 8, 6, 3
2472One Ring to Bring Them All: Model Adaptation under Domain and Category Shift5.001.413, 6, 6
2473On the Importance of Architectures and Hyperparameters for Fairness in Face Recognition5.000.005, 5, 5, 5
2474Evaluating natural language processing models with generalization metrics that do not need access to any training or testing data5.001.226, 6, 5, 3
2475The Effects of Nonlinearity on Approximation Capacity of Recurrent Neural Networks5.002.555, 8, 1, 6
2476Curiosity-Driven Unsupervised Data Collection for Offline Reinforcement Learning5.001.226, 5, 6, 3
2477Understanding and Bridging the Modality Gap for Speech Translation5.001.223, 6, 6, 5
2478MIA: A Framework for Certified Robustness of Time-Series Classification and Forecasting Against Temporally-Localized Perturbations5.000.005, 5, 5
2479Spike Calibration: Bridging the Gap between ANNs and SNNs in ANN-SNN Conversion5.002.555, 6, 8, 1
2480Split and Merge Proxy: pre-training protein-protein contact prediction by mining rich information from monomer data5.001.226, 5, 6, 3
2481Adversarial Counterfactual Environment Model Learning5.001.413, 6, 6
2482PointDP: Diffusion-driven Purification against 3D Adversarial Point Clouds5.001.223, 5, 6, 6
2483DeSCo: Towards Scalable Deep Subgraph Counting5.001.413, 6, 6
2484Supervised Contrastive Regression5.001.226, 5, 6, 3
2485Provable Benefits of Representational Transfer in Reinforcement Learning5.001.416, 3, 6
2486Set Discrimination Contrastive Learning5.000.005, 5, 5, 5
2487A Class-Aware Representation Refinement Framework for Graph Classification5.000.005, 5, 5, 5
2488An information-theoretic approach to unsupervised keypoint representation learning5.001.226, 5, 3, 6
2489A simple but effective and efficient global modeling paradigm for image restoration5.002.126, 8, 3, 3
2490ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation5.001.226, 6, 3, 5
2491A Close Look at Token Mixer: From Attention to Convolution5.000.005, 5, 5
2492MiSAL: Active Learning for Every Budget5.002.128, 3, 6, 3
2493SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series5.001.413, 6, 6
2494CLIP-FLOW: CONTRASTIVE LEARNING WITH ITERATIVE PSEUDO LABELING FOR OPTICAL FLOW5.000.005, 5, 5
2495Bidirectional Learning for Offline Model-based Biological Sequence Design5.000.005, 5, 5
2496AQUILA: Communication Efficient Federated Learning with Adaptive Quantization of Lazily-Aggregated Gradients5.001.223, 6, 6, 5
2497Multi-User Reinforcement Learning with Low Rank Rewards5.001.103, 5, 5, 6, 6
2498Bayesian Robust Graph Contrastive Learning5.000.005, 5, 5, 5
2499SoundNeRirF: Receiver-to-Receiver Sound Neural Room Impulse Response Field5.001.416, 6, 3
2500Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization5.001.226, 6, 5, 3
2501Sparse Misinformation Detector5.000.005, 5, 5
2502Trainability Preserving Neural Pruning5.001.226, 3, 5, 6
2503Harnessing Out-Of-Distribution Examples via Augmenting Content and Style5.001.225, 6, 3, 6
2504A Unified Framework of Soft Threshold Pruning5.001.416, 6, 3
2505STViT: Semantic Tokens for Efficient Global and Local Vision Transformers5.001.223, 6, 5, 6
2506Expanding Datasets With Guided Imagination5.002.123, 6, 8, 3
2507Communication Efficient Fair Federated Recommender System5.001.225, 3, 6, 6
2508Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment5.000.005, 5, 5
2509Multi-Domain Long-Tailed Learning by Augmenting Disentangled Representations5.001.226, 5, 6, 3
2510Mesh-free Eulerian Physics-Informed Neural Networks4.831.346, 3, 6, 3, 6, 5
2511Show and Write: Entity-aware Article Generation with Image Information4.831.343, 6, 6, 3, 6, 5
2512Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression4.831.675, 8, 3, 5, 3, 5
2513Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance4.831.343, 6, 3, 5, 6, 6
2514Implicit Neural Spatial Representations for Time-dependent PDEs4.831.346, 5, 6, 3, 6, 3
2515Benchmarking and Improving Robustness of 3D Point Cloud Recognition against Common Corruptions4.831.675, 5, 8, 5, 3, 3
2516Adaptive IMLE for Few-shot Image Synthesis4.801.476, 6, 3, 3, 6
2517Curriculum-inspired Training for Selective Neural Networks4.800.986, 5, 5, 5, 3
2518Actor-Critic Alignment for Offline-to-Online Reinforcement Learning4.800.985, 5, 3, 5, 6
2519Learning Deep Operator Networks: The Benefits of Over-Parameterization4.801.833, 3, 5, 5, 8
2520A distinct unsupervised reference model from the environment helps continual learning4.800.985, 5, 6, 5, 3
2521Gradient Gating for Deep Multi-Rate Learning on Graphs4.800.985, 3, 5, 6, 5
2522Self-Supervised Extreme Compression of Gigapixel Images4.800.985, 5, 6, 3, 5
2523Evaluating Robustness of Cooperative MARL: A Model-based Approach4.800.983, 5, 5, 5, 6
2524Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations4.801.476, 6, 3, 3, 6
2525An alternative approach to train neural networks using monotone variational inequality4.800.986, 5, 5, 3, 5
2526Risk-aware Bayesian RL for Cautious Exploration4.802.713, 3, 10, 5, 3
2527Attention Enables Zero Approximation Error4.800.985, 5, 3, 6, 5
2528The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels4.800.985, 3, 6, 5, 5
2529Efficient Personalized Federated Learning via Sparse Model-Adaptation4.800.986, 3, 5, 5, 5
2530Deformable Graph Transformer4.800.986, 5, 5, 5, 3
2531Data-efficient Supervised Learning is Powerful for Neural Combinatorial Optimization4.800.983, 6, 5, 5, 5
2532Entropy-Regularized Model-Based Offline Reinforcement Learning4.800.986, 3, 5, 5, 5
2533KITE: A Kernel-based Improved Transferability Estimation Method4.800.985, 6, 5, 3, 5
2534FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training4.800.985, 5, 5, 3, 6
2535Sensitivity-aware Visual Parameter-efficient Tuning4.800.985, 5, 6, 3, 5
2536Variational Imbalanced Regression4.801.945, 6, 6, 6, 1
2537MotifExplainer: a Motif-based Graph Neural Network Explainer4.800.985, 5, 3, 5, 6
2538QCRS: Improve Randomized Smoothing using Quasi-Concave Optimization4.800.985, 6, 3, 5, 5
2539Self-attentive Rationalization for Graph Contrastive Learning4.800.985, 6, 3, 5, 5
2540NeuralStagger: accelerating physics constrained neural PDE solver with spatial-temporal decomposition4.751.096, 5, 3, 5
2541Latent Linear ODEs with Neural Kalman Filtering for Irregular Time Series Forecasting4.751.095, 3, 5, 6
2542Learning with Non-Uniform Label Noise: A Cluster-Dependent Semi-Supervised Approach4.751.095, 6, 3, 5
2543SWRM: Similarity Window Reweighting and Margins for Long-Tailed Recognition4.751.095, 6, 5, 3
2544Supervised Q-Learning can be a Strong Baseline for Continuous Control4.751.095, 6, 3, 5
2545Self-Supervised Off-Policy Ranking via Crowd Layer4.751.096, 3, 5, 5
2546Incremental Predictive Coding: A Parallel and Fully Automatic Learning Algorithm4.751.093, 5, 6, 5
2547When and Why Is Pretraining Object-Centric Representations Good for Reinforcement Learning?4.751.093, 6, 5, 5
2548Contrastive Representation Learning for Multi-scale Spatial Scenes4.752.498, 5, 5, 1
2549Exploiting Personalized Invariance for Better Out-of-distribution Generalization in Federated Learning4.751.096, 5, 5, 3
2550Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management4.751.095, 3, 6, 5
2551Adaptive Computation with Elastic Input Sequence4.751.093, 6, 5, 5
2552Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?4.751.095, 3, 6, 5
2553Contrastive Learning of Molecular Representation with Fragmented Views4.752.055, 3, 3, 8
2554Contextualized Generative Retrieval4.751.093, 5, 6, 5
2555Discrete State-Action Abstraction via the Successor Representation4.752.053, 8, 3, 5
2556MiDAS: Multi-integrated Domain Adaptive Supervision for Fake News Detection4.751.093, 5, 6, 5
2557Walking the Tightrope: An Investigation of the Convolutional Autoencoder Bottleneck4.751.095, 5, 6, 3
2558The Role of Pre-training Data in Transfer Learning4.751.095, 5, 6, 3
2559Limits of Algorithmic Stability for Distributional Generalization4.752.053, 5, 8, 3
2560VQR: Automated Software Vulnerability Repair Through Vulnerability Queries4.751.095, 6, 5, 3
2561Fully Online Meta Learning4.752.498, 5, 1, 5
2562What Do We Maximize in Self-Supervised Learning And Why Does Generalization Emerge?4.751.096, 3, 5, 5
2563Sufficient Subgraph Embedding Memory for Continual Graph Representation Learning4.752.053, 8, 5, 3
2564Iterative Task-adaptive Pretraining for Unsupervised Word Alignment4.751.093, 5, 6, 5
2565Pretraining One Language Model for All With the Text-To-Text Framework Using Model-Generated Signals4.751.093, 6, 5, 5
2566TOWARD RELIABLE NEURAL SPECIFICATIONS4.752.053, 5, 8, 3
2567Pyramidal Denoising Diffusion Probabilistic Models4.751.093, 6, 5, 5
2568Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning4.751.095, 5, 6, 3
2569An Analytic Framework for Robust Training of Differentiable Hypothesis4.751.095, 6, 5, 3
2570Supervised Metric Learning for Retrieval via Contextual Similarity Optimization4.752.053, 8, 5, 3
2571How Can Deep Learning Performs Deep (Hierarchical) Learning4.751.093, 6, 5, 5
2572Sequential Brick Assembly with Efficient Constraint Satisfaction4.751.093, 5, 5, 6
2573Augmentation Curriculum Learning For Generalization in RL4.751.095, 6, 5, 3
2574Using the Training History to Detect and Prevent Overfitting in Deep Learning Models4.751.095, 5, 6, 3
2575CORE-PERIPHERY PRINCIPLE GUIDED REDESIGN OF SELF-ATTENTION IN TRANSFORMERS4.751.093, 5, 6, 5
2576How Hard is Trojan Detection in DNNs? Fooling Detectors With Evasive Trojans4.751.093, 5, 6, 5
2577Less Is More: Training on Low-Fidelity Images Improves Robustness to Adversarial Attacks4.751.093, 5, 5, 6
2578A Differentiable Loss Function for Learning Heuristics in A*4.752.058, 3, 3, 5
2579AsymQ: Asymmetric Q-loss to mitigate overestimation bias in off-policy reinforcement learning4.752.055, 3, 8, 3
2580Balance is Essence: Accelerating Sparse Training via Adaptive Gradient Correction4.751.095, 3, 6, 5
2581TEXTCRAFT: ZERO-SHOT GENERATION OF HIGH FIDELITY AND DIVERSE SHAPES FROM TEXT4.751.095, 5, 3, 6
2582Transformer-based World Models Are Happy With 100k Interactions4.752.058, 3, 3, 5
2583Robust Federated Learning with Majority Adversaries via Projection-based Re-weighting4.751.095, 5, 6, 3
2584Resource Efficient Self-Supervised Learning for Speech Recognition4.751.096, 5, 5, 3
2585HyperTime: Implicit Neural Representations for Time Series Generation4.751.095, 6, 5, 3
2586Unsupervised Pretraining for Neural Value Approximation4.752.055, 3, 8, 3
2587MALIBO: Meta-Learning for Likelihood-free Bayesian Optimization4.751.095, 5, 3, 6
2588Asynchronous Message Passing: A new Framework for Learning in Graphs4.751.095, 3, 6, 5
2589From Adaptive Query Release to Machine Unlearning4.751.096, 3, 5, 5
2590Meta-Learning Black-Box Optimization via Black-Box Optimization4.751.095, 5, 6, 3
2591Optimal Membership Inference Bounds for Adaptive Composition of Sampled Gaussian Mechanisms4.752.058, 5, 3, 3
2592SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling4.751.095, 5, 6, 3
2593Data Feedback Loops: Model-driven Amplification of Dataset Biases4.751.093, 6, 5, 5
2594A Large Scale Sample Complexity Analysis of Neural Policies in the Low-Data Regime4.752.058, 3, 3, 5
2595Action Matching: A Variational Method for Learning Stochastic Dynamics from Samples4.751.095, 5, 6, 3
2596An Empirical Study on the Efficacy of Deep Active Learning Techniques4.751.096, 5, 3, 5
2597EF21-P and Friends: Improved Theoretical Communication Complexity for Distributed Optimization with Bidirectional Compression4.752.491, 8, 5, 5
2598Unleash Model Capacity for Universal Dense Retrieval by Task Specialty Optimization4.751.095, 5, 3, 6
2599Key Design Choices for Double-transfer in Source-free Unsupervised Domain Adaptation4.751.096, 5, 3, 5
2600$Phi$-DVAE: Learning Physically Interpretable Representations with Nonlinear Filtering4.751.096, 5, 3, 5
2601Rethinking Uniformity in Self-Supervised Representation Learning4.751.095, 6, 5, 3
2602Self-Supervised Learning of Maximum Manifold Capacity Representations4.751.095, 3, 6, 5
2603PMI-guided Masking Strategy to Enable Few-shot Learning for Genomic Applications4.752.055, 3, 8, 3
2604Fast Bayesian Updates for Deep Learning with a Use Case in Active Learning4.751.095, 5, 6, 3
2605FP_AINet: Fusion Prototype with Adaptive Induction Network for Few-Shot Learning4.751.093, 6, 5, 5
2606DCT-DiffStride: Differentiable Strides with Real-Valued Data4.751.095, 6, 5, 3
2607Removing Structured Noise with Diffusion Models4.752.053, 8, 3, 5
2608Closed-loop Transcription via Convolutional Sparse Coding4.751.095, 5, 6, 3
2609MC-SSL: Towards Multi-Concept Self-Supervised Learning4.751.093, 5, 6, 5
2610Latent Hierarchical Imitation Learning for Stochastic Environments4.752.058, 5, 3, 3
2611Efficient Discovery of Dynamical Laws in Symbolic Form4.752.058, 3, 5, 3
2612Human-AI Coordination via Human-Regularized Search and Learning4.752.058, 3, 3, 5
2613Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention4.751.095, 5, 6, 3
2614CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations4.753.033, 10, 3, 3
2615Adaptive Smoothing Gradient Learning for Spiking Neural Networks4.752.058, 3, 3, 5
2616Going Beyond Approximation: Encoding Constraints for Explainable Multi-hop Inference via Differentiable Combinatorial Solvers4.751.095, 5, 3, 6
2617DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention4.751.093, 6, 5, 5
2618Client-agnostic Learning and Zero-shot Adaptation for Federated Domain Generalization4.751.095, 6, 5, 3
2619Prompt-Based Metric Learning for Few-Shot NER4.751.095, 6, 3, 5
2620MetaPhysiCa: Causality-aware Robustness to OOD Initial Conditions in Physics-informed Machine Learning4.751.095, 6, 5, 3
2621InteriorSim: A Photorealistic Simulator for Embodied AI4.751.095, 3, 5, 6
2622Spatial Entropy as an Inductive Bias for Vision Transformers4.751.095, 6, 5, 3
2623A Simple Framework for Low-Resolution Detection with High-resolution Knowledge4.751.093, 5, 6, 5
2624Zero-Label Prompt Selection4.751.095, 3, 5, 6
2625Adversarial Text to Continuous Image Generation4.751.095, 5, 6, 3
2626A GNN-Guided Predict-and-Search Framework for Mixed-Integer Linear Programming4.751.093, 6, 5, 5
2627A Weight Variation-Aware Training Method for Hardware Neuromorphic Chips4.751.096, 5, 5, 3
2628Hybrid-Regressive Neural Machine Translation4.751.093, 5, 6, 5
2629PET-NeuS: Positional Encoding Triplanes for Neural Surfaces4.752.055, 8, 3, 3
2630Effective Offline Reinforcement Learning via Conservative State Value Estimation4.752.058, 3, 5, 3
2631Visually-augmented pretrained language models for NLP Tasks without Images4.751.093, 5, 6, 5
2632Cold Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator4.751.095, 5, 3, 6
2633Out-of-Domain Intent Detection Considering Multi-turn Dialogue Contexts4.751.093, 5, 5, 6
2634$epsilon$-Invariant Hierarchical Reinforcement Learning for Building Generalizable Policy4.751.095, 5, 6, 3
2635CCIL: Context-conditioned imitation learning for urban driving4.751.095, 6, 5, 3
2636Conditional Policy Similarity: An Overlooked Factor in Zero-Shot Coordination4.751.093, 5, 5, 6
2637ECLAD: Extracting Concepts with Local Aggregated Descriptors4.751.095, 3, 5, 6
2638So-TVAE: Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting4.751.095, 3, 5, 6
2639SDAC: Efficient Safe Reinforcement Learning with Low-Biased Distributional Actor-Critic4.751.095, 3, 5, 6
2640Prompt Tuning for Graph Neural Networks4.752.058, 3, 5, 3
2641Neural Unbalanced Optimal Transport via Cycle-Consistent Semi-Couplings4.751.093, 5, 6, 5
2642Learning to Boost Resilience of Complex Networks via Neural Edge Rewiring4.752.058, 3, 5, 3
2643Adversarial Robustness based on Randomized Smoothing in Quantum Machine Learning4.751.093, 6, 5, 5
2644Linear Convergence of Decentralized FedAvg for Non-Convex Objectives: The Interpolation Regime4.751.095, 3, 5, 6
2645Rethinking Missing Modality Learning: From a Decoding View4.751.095, 3, 5, 6
2646Meta-Weighted Language Model Tuning for Augmentation-Enhanced Few-Shot Learning4.751.095, 5, 3, 6
2647Graph-informed Neural Point Process With Monotonic Nets4.751.095, 6, 3, 5
2648Learning to Decouple Complex System for Sequential Data4.752.058, 5, 3, 3
2649Efficient Large-scale Transformer Training via Random and Layerwise Token Dropping4.751.093, 5, 5, 6
2650Hypothetical Training for Robust Machine Reading Comprehension of Tabular Context4.752.055, 3, 3, 8
2651On the Efficacy of Server-Aided Federated Learning against Partial Client Participation4.751.095, 6, 5, 3
2652Toxicity in Multilingual Machine Translation at Scale4.752.058, 5, 3, 3
2653Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds4.751.093, 5, 5, 6
2654Adaptive Sparse Softmax: An Effective and Efficient Softmax Variant for Text Classification4.751.093, 5, 6, 5
2655Continuous Goal Sampling: A Simple Technique to Accelerate Automatic Curriculum Learning4.751.096, 3, 5, 5
2656Towards Better Selective Classification4.752.053, 3, 5, 8
2657Offline Equilibrium Finding4.751.095, 5, 6, 3
2658Brainformers: Trading Simplicity for Efficiency4.751.093, 6, 5, 5
2659Effective Self-Supervised Transformers For Sparse Time Series Data4.751.096, 5, 3, 5
2660Efficient Shapley Values Estimation by Amortization for Text Classification4.752.058, 3, 5, 3
2661Precision Collaboration for Federated Learning4.751.093, 5, 5, 6
2662Offline RL of the Underlying MDP from Heterogeneous Data Sources4.751.093, 5, 6, 5
2663On the Importance of Calibration in Semi-supervised Learning4.751.095, 5, 6, 3
2664Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs4.751.096, 3, 5, 5
2665Fast Adaptation via Human Diagnosis of Task Distribution Shift4.751.093, 5, 6, 5
2666Shortcut Learning Through the Lens of Early Training Dynamics4.752.171, 6, 6, 6
2667Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures4.752.058, 3, 3, 5
2668EmbedDistill: A geometric knowledge distillation for information retrieval4.751.095, 5, 3, 6
2669Learning from Labeled Images and Unlabeled Videos for Video Segmentation4.752.055, 8, 3, 3
2670REV: Information-Theoretic Evaluation of Free-Text Rationales4.751.095, 3, 5, 6
2671Uncertainty-Driven Exploration for Generalization in Reinforcement Learning4.751.093, 5, 6, 5
2672Adaptive Parametric Prototype Learning for Cross-Domain Few-Shot Classification4.751.093, 5, 5, 6
2673Building compact representations for image-language learning4.752.058, 3, 5, 3
2674HEAV: Hierarchical Ensembling of Augmented Views for Image Captioning4.751.093, 5, 5, 6
2675Dynamic Pretraining of Vision-Language Models4.751.095, 6, 3, 5
2676Epistemological Bias As a Means for the Automated Detection of Injustices in News Media4.752.053, 8, 3, 5
2677Have Missing Data? Make It Miss More! Imputing Tabular Data with Masked Autoencoding4.751.095, 5, 3, 6
2678Federated Self-supervised Learning for Heterogeneous Clients4.751.095, 6, 5, 3
2679Waveformer: Linear-Time Attention with Forward and Backward Wavelet Transform4.751.093, 6, 5, 5
2680Semantic Image Manipulation with Background-guided Internal Learning4.751.095, 5, 3, 6
2681Reconciling Security and Communication Efficiency in Federated Learning4.751.095, 5, 3, 6
2682Noise Injection Node Regularization for Robust Learning4.751.095, 3, 5, 6
2683Taming the Long Tail of Deep Probabilistic Forecasting4.751.095, 3, 6, 5
2684Risk Control for Online Learning Models4.752.053, 8, 5, 3
2685Perturbation Analysis of Neural Collapse4.751.095, 3, 6, 5
2686Leveraging the Third Dimension in Contrastive Learning4.751.096, 5, 5, 3
2687Learning Top-k Classification with Label Ranking4.751.095, 6, 5, 3
2688Language-Aware Soft Prompting for Vision & Language Foundation Models4.751.096, 5, 3, 5
2689Theoretical Characterization of How Neural Network Pruning Affects its Generalization4.751.096, 3, 5, 5
2690Collaborative Symmetricity Exploitation for Offline Learning of Hardware Design Solver4.751.096, 5, 3, 5
2691Policy Expansion for Bridging Offline-to-Online Reinforcement Learning4.751.095, 3, 6, 5
2692Prosody-TTS: Self-Supervised Prosody Pretraining with Latent Diffusion For Text-to-Speech4.751.095, 5, 3, 6
2693Confounder Identification-free Causal Visual Feature Learning4.752.491, 5, 5, 8
2694A Neural Mean Embedding Approach for Back-door and Front-door Adjustment4.752.491, 5, 5, 8
2695Multi-View Independent Component Analysis with Shared and Individual Sources4.752.053, 8, 3, 5
2696Label-Efficient Online Continual Object Detection in Streaming Video4.751.095, 3, 5, 6
2697Multi-Agent Multi-Game Entity Transformer4.751.093, 5, 6, 5
2698RealSinger: Ultra-Realistic Singing Voice Generation via Stochastic Differential Equations4.752.053, 3, 8, 5
2699On the Role of Self-supervision in Deep Multi-view Clustering4.752.053, 8, 5, 3
2700Skill Machines: Temporal Logic Composition in Reinforcement Learning4.751.095, 3, 5, 6
2701Learning Basic Interpretable Factors from Temporal Signals via Physics Symmetry4.751.095, 5, 6, 3
2702Can Single-Pass Contrastive Learning Work for Both Homophilic and Heterophilic Graph?4.752.053, 8, 5, 3
2703Dynamical Equations With Bottom-up Self-Organizing Properties Learn Accurate Dynamical Hierarchies Without Any Loss Function4.751.095, 3, 5, 6
2704Video Scene Graph Generation from Single-Frame Weak Supervision4.751.096, 5, 3, 5
2705Contrastive Consistent Representation Distillation4.751.096, 5, 5, 3
2706CLEEGN: A Convolutional Neural Network for Plug-and-Play Automatic EEG Reconstruction4.751.093, 5, 6, 5
2707Unified neural representation model for physical and conceptual spaces4.752.058, 3, 3, 5
2708Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models4.751.095, 3, 6, 5
2709What's Behind the Mask: Estimating Uncertainty in Image-to-Image Problems4.751.096, 5, 3, 5
2710Style Balancing and Test-Time Style Shifting for Domain Generalization4.751.093, 5, 6, 5
2711Least Disagree Metric-based Active Learning4.751.093, 6, 5, 5
2712Selective Classifier Ensemble4.751.096, 3, 5, 5
2713Few-Shot Anomaly Detection on Industrial Images through Contrastive Fine-Tuning4.751.095, 5, 3, 6
2714On the robustness of self-supervised models for generative spoken language modeling4.751.096, 5, 3, 5
2715Multi-Level Contrastive Learning for Dense Prediction Task4.751.095, 5, 6, 3
2716ETSformer: Exponential Smoothing Transformers for Time-series Forecasting4.751.095, 6, 5, 3
2717Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization4.751.096, 5, 5, 3
2718SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data4.752.055, 3, 3, 8
2719Scalable 3D Object-centric Learning4.751.096, 3, 5, 5
2720Analysis of Error Feedback in Compressed Federated Non-Convex Optimization4.751.095, 6, 5, 3
2721Causal Proxy Models For Concept-Based Model Explanations4.751.095, 3, 6, 5
2722Graph Contrastive Learning Under Heterophily: Utilizing Graph Filters to Generate Graph Views4.752.055, 3, 8, 3
2723Output Distribution over the Entire Input Space: A Novel Perspective to Understand Neural Networks4.751.095, 6, 3, 5
2724Decentralized Robust V-learning for Solving Markov Games with Model Uncertainty4.751.095, 6, 3, 5
2725A Unified Framework for Comparing Learning Algorithms4.751.095, 6, 3, 5
2726KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal4.751.096, 5, 3, 5
2727Reward-free Policy Learning through Active Human Involvement4.752.053, 5, 8, 3
2728Robust Attention for Contextual Biased Visual Recognition4.751.095, 5, 6, 3
2729Complex-Target-Guided Open-Domain Conversation based on offline reinforcement learning4.752.055, 8, 3, 3
2730ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3D4.751.096, 3, 5, 5
2731Don't Throw Your Old Policies Away: Knowledge-based Policy Recycling Protects Against Adversarial Attacks4.752.058, 3, 3, 5
2732Contrastive Adversarial Loss for Point Cloud Reconstruction4.751.093, 6, 5, 5
2733Ahead-of-Time P-Tuning4.751.096, 3, 5, 5
2734SimST: A GNN-Free Spatio-Temporal Learning Framework for Traffic Forecasting4.751.096, 5, 5, 3
2735Social and environmental impact of recent developments in machine learning on biology and chemistry research4.752.055, 3, 8, 3
2736Environment Partitioning For Invariant Learning By Decorrelation4.751.093, 5, 6, 5
2737Style Spectroscope: Improve Interpretability and Controllability through Fourier Analysis4.752.058, 5, 3, 3
2738Cascaded Teaching Transformers with Data Reweighting for Long Sequence Time-series Forecasting4.751.093, 5, 6, 5
2739Hazard Gradient Penalty for Survival Analysis4.751.093, 5, 5, 6
2740Reach the Remote Neighbors: Dual-Encoding Transformer for Graphs4.751.095, 5, 6, 3
2741Only For You: Deep Neural Anti-Forwarding Watermark Preserves Image Privacy4.751.095, 6, 3, 5
2742PromptCast: A New Prompt-based Learning Paradigm for Time Series Forecasting4.752.058, 3, 5, 3
2743Revealing Single Frame Bias for Video-and-Language Learning4.751.095, 6, 3, 5
2744Union Subgraph Neural Networks4.751.096, 5, 5, 3
2745NEW TRAINING FRAMEWORK FOR SPEECH ENHANCEMENT USING REAL NOISY SPEECH4.752.055, 3, 3, 8
2746Dataset Condensation with Latent Space Knowledge Factorization and Sharing4.751.095, 5, 3, 6
2747Can GNNs Learn Heuristic Information for Link Prediction?4.751.093, 6, 5, 5
2748Spatial Attention Kinetic Networks with E(n)-Equivariance4.751.095, 6, 5, 3
2749Human Pose Estimation in the Dark4.751.095, 6, 3, 5
2750ETAD: A Sampling-Based Approach for Efficient Temporal Action Detection4.751.093, 5, 5, 6
2751HierBatching: Locality-Aware Out-of-Core Training of Graph Neural Networks4.751.093, 5, 5, 6
2752Bias Mitigation Framework for Intersectional Subgroups in Neural Networks4.752.058, 5, 3, 3
2753HyperQuery: A Framework for Higher Order Link Prediction4.751.096, 5, 5, 3
2754Tiny Adapters for Vision Transformers4.751.095, 5, 6, 3
2755Proximal Curriculum for Reinforcement Learning Agents4.751.095, 5, 3, 6
2756Random Weight Factorization improves the training of Continuous Neural Representations4.752.058, 5, 3, 3
2757Improving group robustness under noisy labels using predictive uncertainty4.751.095, 3, 6, 5
2758MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection4.751.093, 5, 5, 6
2759Your Neighbors Are Communicating: Towards Powerful and Scalable Graph Neural Networks4.751.096, 5, 5, 3
2760Fair Attribute Completion on Graph with Missing Attributes4.751.096, 3, 5, 5
2761Improving Generalization with Domain Convex Game4.751.093, 5, 5, 6
2762Edge Wasserstein Distance Loss for Oriented Object Detection4.751.096, 5, 5, 3
2763StyleGenes: Discrete and Efficient Latent Distributions for GANs4.752.055, 3, 3, 8
2764ZERO: A Large-scale Chinese Cross-modal Benchmark with a New Vision-Language Framework4.752.055, 3, 3, 8
2765SinGRAV: Learning a Generative Radiance Volume from a Single Natural Scene4.752.053, 5, 8, 3
2766ConBaT: Control Barrier Transformer for Safety-Critical Policy Learning4.751.095, 6, 5, 3
2767Reinforced Sample Reweighting Policy for Semi-supervised Learning4.751.093, 6, 5, 5
2768Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning4.751.095, 3, 6, 5
2769TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second4.751.095, 3, 5, 6
2770Friends to Help: Saving Federated Learning from Client Dropout4.751.093, 5, 6, 5
2771GraphPNAS: Learning Distribution of Good Neural Architectures via Deep Graph Generative Models4.751.095, 5, 6, 3
2772Interpretability with full complexity by constraining feature information4.751.095, 6, 3, 5
2773Stealing and Defending Transformer-based Encoders4.751.093, 6, 5, 5
2774Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution4.751.095, 3, 5, 6
2775Token-Label Alignment for Vision Transformers4.751.095, 5, 3, 6
2776Efficient Covariance Estimation for Sparsified Functional Data4.751.093, 5, 5, 6
2777Does Continual Learning Equally Forget All Parameters?4.752.176, 1, 6, 6
2778EAGLE: Large-scale Learning of Turbulent Fluid Dynamics with Mesh Transformers4.751.093, 5, 5, 6
2779Cross-Domain Autonomous Driving Perception using Contrastive Appearance Adaptation4.751.095, 3, 5, 6
2780On The Inadequacy of Optimizing Alignment and Uniformity in Contrastive Learning of Sentence Representations4.751.095, 3, 5, 6
2781Approximated Anomalous Diffusion: Gaussian Mixture Score-based Generative Models4.752.053, 5, 3, 8
2782AutoSKDBERT: Learn to Stochastically Distill BERT4.751.095, 5, 3, 6
2783An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models4.751.095, 5, 6, 3
2784Unsupervised Learning of Causal Relationships from Unstructured Data4.752.058, 5, 3, 3
2785Parameterized projected Bellman operator4.751.095, 5, 3, 6
2786Examining the Value of Neural Filter Pruning -- Retrospect and Prospect4.751.096, 5, 5, 3
2787Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program4.751.095, 5, 6, 3
2788DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training4.751.096, 3, 5, 5
2789Measuring Asymmetric Gradient Discrepancy in Parallel Continual Learning4.751.095, 6, 3, 5
2790Design of the topology for contrastive visual-textual alignment4.751.093, 5, 6, 5
2791Defactorization Transformer: Modeling Long Range Dependency with Local Window Cost4.751.095, 6, 5, 3
2792Multi-Modal Few-Shot Temporal Action Detection4.751.095, 6, 3, 5
2793In the ZONE: Measuring difficulty and progression in curriculum generation4.751.093, 5, 5, 6
2794Dual-Domain Diffusion Based Progressive Style Rendering towards Semantic Structure Preservation4.671.253, 5, 6
2795Mini-batch $k$-means terminates within $O(d/epsilon)$ iterations4.671.253, 5, 6
2796Functional Risk Minimization4.671.256, 5, 3
2797Causal Inference for Knowledge Graph Completion4.671.253, 6, 5
2798Rethinking Metric Based Contrastive Learning Method’s Generalization Capability4.671.256, 5, 3
2799Enriching Online Knowledge Distillation with Specialist Ensemble4.671.253, 5, 6
2800Variational Learning ISTA4.671.253, 6, 5
2801Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning4.671.255, 6, 3
2802FedGC: An Accurate and Efficient Federated Learning under Gradient Constraint for Heterogeneous Data4.671.256, 5, 3
2803MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers4.671.255, 3, 6
2804Some Practical Concerns and Solutions for Using Pretrained Representation in Industrial Systems4.671.255, 3, 6
2805Efficient Bayesian Optimization with Deep Kernel Learning and Transformer Pre-trained on Muliple Heterogeneous Datasets4.671.255, 3, 6
2806Untangling Effect and Side Effect: Consistent Causal Inference in Non-Targeted Trials4.671.256, 5, 3
2807Pseudometric guided online query and update for offline reinforcement learning4.671.256, 3, 5
2808Convergence Analysis of Split Learning on Non-IID Data4.671.255, 6, 3
2809Do Not Blindly Imitate the Teacher: Loss Perturbation for Knowledge Distillation4.672.363, 3, 8
2810Beyond Deep Learning: An Evolutionary Feature Engineering Approach to Tabular Data Classification4.671.255, 3, 6
2811Is margin all you need? An extensive empirical study of active learning on tabular data4.671.253, 6, 5
2812MolEBM: Molecule Generation and Design by Latent Space Energy-Based Modeling4.671.253, 6, 5
2813How Does Self-supervised Learning Work? A Representation Learning Perspective4.671.255, 6, 3
2814A Reproducible and Realistic Evaluation of Partial Domain Adaptation Methods4.671.253, 5, 6
2815Accelerated Training via Principled Methods for Incrementally Growing Neural Networks4.671.255, 6, 3
2816Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization4.671.255, 3, 6
2817System identification of neural systems: If we got it right, would we know?4.672.368, 3, 3
2818Axiomatic Explainer Locality With Optimal Transport4.671.253, 5, 6
2819Progressive Knowledge Distillation: Constructing Ensembles for Efficient Inference4.671.253, 5, 6
2820Blockwise self-supervised learning with Barlow Twins4.671.253, 6, 5
2821Achieving Communication-Efficient Policy Evaluation for Multi-Agent Reinforcement Learning: Local TD-Steps or Batching?4.671.253, 5, 6
2822Two-Tailed Averaging: Anytime Adaptive Once-in-a-while Optimal Iterate Averaging for Stochastic Optimization4.672.368, 3, 3
2823Replay Buffer with Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning4.671.256, 3, 5
2824DECODING LAYER SALIENCY IN TRANSFORMERS4.671.253, 5, 6
2825Decision Transformer under Random Frame Dropping4.671.253, 5, 6
2826On the Importance of Contrastive Loss in Multimodal Learning4.671.253, 6, 5
2827Generative Adversarial Federated Model4.671.256, 5, 3
2828EENet: Learning to Early Exit for Adaptive Inference4.671.256, 3, 5
2829Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue4.671.253, 6, 5
2830Continual Learning with Soft-Masking of Parameter-Level Gradient Flow4.671.255, 3, 6
2831Unsupervised Adaptation for Fairness under Covariate Shift4.672.368, 3, 3
2832Towards convergence to Nash equilibria in two-team zero-sum games4.671.255, 3, 6
2833Towards Understanding How Machines Can Learn Causal Overhypotheses4.671.255, 3, 6
2834The Union of Manifolds Hypothesis4.672.363, 8, 3
2835P2PRISM - Peer to peer learning with individual prism for secure aggregation4.671.253, 6, 5
2836Few-shot Backdoor Attacks via Neural Tangent Kernels4.671.256, 5, 3
2837MMVAE+: Enhancing the Generative Quality of Multimodal VAEs without Compromises4.671.255, 6, 3
2838$ell$Gym: Natural Language Visual Reasoning with Reinforcement Learning4.671.253, 5, 6
2839Towards Antisymmetric Neural Ansatz Separation4.671.253, 6, 5
2840Optimal Scalarizations for Provable Multiobjective Optimization4.671.255, 6, 3
2841A new photoreceptor-inspired CNN layer enables deep learning models of retina to generalize across lighting conditions4.671.253, 6, 5
2842Deep Probabilistic Time Series Forecasting over Long Horizons4.672.363, 8, 3
2843AN OPERATOR NORM BASED PASSIVE FILTER PRUNING METHOD FOR EFFICIENT CNNS4.671.255, 3, 6
2844Weighted Regularization for Efficient Neural Network Compression4.672.368, 3, 3
2845HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE4.671.256, 3, 5
2846Learning Privacy-Preserving Graph Embeddings Against Sensitive Attributes Inference4.671.255, 3, 6
2847Finding Generalization Measures by Contrasting Signal and Noise4.671.255, 6, 3
2848Learning Dictionaries over Datasets through Wasserstein Barycenters4.671.256, 5, 3
2849KeyCLD: Learning Constrained Lagrangian Dynamics in Keypoint Coordinates from Images4.671.253, 5, 6
2850Score Matching via Differentiable Physics4.671.253, 5, 6
2851Short-Term Memory Convolutions4.671.253, 5, 6
2852Unbiased Decisions Reduce Regret: Adversarial Optimism for the Bank Loan Problem4.671.255, 3, 6
2853Diversity of Generated Unlabeled Data Matters for Few-shot Hypothesis Adaptation4.672.363, 8, 3
2854CAKE: CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain Adaptation4.671.255, 6, 3
2855How to Keep Cool While Training4.671.253, 5, 6
2856Model-Based Decentralized Policy Optimization4.671.256, 3, 5
2857Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction4.671.255, 6, 3
2858Pruning by Active Attention Manipulation4.671.256, 3, 5
2859On Threshold Functions in Learning to Generate Feasible Solutions of Mixed Integer Programs4.672.363, 3, 8
2860Closed Boundary Learning for NLP Classification Tasks with the Universum Class4.671.255, 3, 6
2861UNREAL: Unlabeled Nodes Retrieval and Labeling for Heavily-imbalanced Node Classification4.671.253, 6, 5
2862GRAPHSENSOR: A Graph Attention Network for Time-Series Sensor Data4.671.256, 5, 3
2863CRISP: Curriculum inducing Primitive Informed Subgoal Prediction for Hierarchical Reinforcement Learning4.671.256, 5, 3
2864An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation4.671.253, 5, 6
2865NeuralEQ: Neural-Network-Based Equalizer for High-Speed Wireline Communication4.671.255, 6, 3
2866Network Controllability Perspectives on Graph Representation4.671.253, 6, 5
2867VARIATIONAL ADAPTIVE GRAPH TRANSFORMER FOR MULTIVARIATE TIME SERIES MODELING4.671.256, 5, 3
2868Large Language Models Can Self-improve4.672.363, 3, 8
2869COMBAT: Alternated Training for Near-Perfect Clean-Label Backdoor Attacks4.671.256, 3, 5
2870Safe Reinforcement Learning with Contrastive Risk Prediction4.671.256, 3, 5
2871Imbalanced Lifelong Learning with AUC Maximization4.671.255, 3, 6
2872MoCa: Cognitive Scaffolding for Language Models in Causal and Moral Judgment Tasks4.672.363, 3, 8
2873Lattice Convolutional Networks for Learning Ground States of Quantum Many-Body Systems4.672.363, 8, 3
2874Learning to Optimize Quasi-Newton Methods4.671.253, 5, 6
2875An Adaptive Policy to Employ Sharpness-Aware Minimization4.671.256, 3, 5
2876Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning4.671.256, 5, 3
2877Latent Bottlenecked Attentive Neural Processes4.671.253, 5, 6
2878VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment4.671.253, 5, 6
2879Annealed Training for Combinatorial Optimization on Graphs4.671.255, 3, 6
2880A Novel Fast Exact Subproblem Solver for Stochastic Quasi-Newton Cubic Regularized Optimization4.671.255, 3, 6
2881On the Mysterious Optimization Geometry of Deep Neural Networks4.671.255, 3, 6
2882On the Implicit Bias Towards Depth Minimization in Deep Neural Networks4.671.255, 3, 6
2883Quantum 3D graph structure learning with applications to molecule computing4.671.256, 5, 3
2884Score-based Generative 3D Mesh Modeling4.671.253, 5, 6
2885Why Self Attention is Natural for Sequence-to-Sequence Problems? A Perspective from Symmetries4.671.255, 6, 3
2886Large Learning Rate Matters for Non-Convex Optimization4.671.255, 6, 3
2887Value-Based Membership Inference Attack on Actor-Critic Reinforcement Learning4.671.255, 6, 3
2888FOCUS: Fairness via Agent-Awareness for Federated Learning on Heterogeneous Data4.671.253, 5, 6
2889RainProof: An Umbrella to Shield Text Generator from Out-Of-Distribution Data4.671.255, 6, 3
2890PerFedMask: Personalized Federated Learning with Optimized Masking Vectors4.671.255, 3, 6
2891Neural Implicit Manifold Learning for Topology-Aware Generative Modelling4.671.256, 3, 5
2892Characterizing neural representation of cognitively-inspired deep RL agents during an evidence accumulation task4.671.255, 3, 6
2893Rule-based policy regularization for reinforcement learning-based building control4.671.253, 6, 5
2894Deep Dependency Networks for Action Classification in Video4.671.253, 5, 6
2895Structural Adversarial Objectives for Self-Supervised Representation Learning4.671.255, 6, 3
2896Defending against Reconstruction attacks using Rényi Differential Privacy4.671.255, 6, 3
2897Abstracting Imperfect Information Away from Two-Player Zero-Sum Games4.671.253, 5, 6
2898Black-Box Adversarial Attack Guided by Model Behavior for Programming Pre-trained Language Models4.671.255, 3, 6
2899Joint Embedding Self-Supervised Learning in the Kernel Regime4.671.256, 5, 3
2900SeedGNN: Graph Neural Network for Supervised Seeded Graph Matching4.671.253, 6, 5
2901Variational Counterfactual Prediction under Runtime Domain Corruption4.671.255, 6, 3
2902Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger4.671.256, 5, 3
2903ELBO-ing Stein Mixtures4.672.363, 3, 8
2904Breaking the Curse of Dimensionality for Parametric Elliptic PDEs4.673.861, 3, 10
2905Accelerated Riemannian Optimization: Handling Constraints to Bound Geometric Penalties4.671.255, 6, 3
2906Volumetric Disentanglement for 3D Scene Manipulation4.671.253, 5, 6
2907DEEP ACCURATE SOLVER FOR THE GEODESIC PROBLEM4.672.363, 8, 3
2908Signal to Sequence Attention-Based Multiple Instance Network for Segmentation Free Inference of RNA Modifications4.671.255, 6, 3
2909Global-Local Bayesian Transformer for Semantic Correspondence4.671.255, 6, 3
2910Deep Graph-Level Clustering Using Pseudo-Label-Guided Mutual Information Maximization Network4.671.253, 5, 6
2911Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories4.671.255, 3, 6
2912Semi-Implicit Variational Inference via Score Matching4.671.256, 5, 3
2913Non-equispaced Fourier Neural Solvers for PDEs4.671.253, 5, 6
2914Group-oriented Cooperation in Multi-Agent Reinforcement Learning4.671.253, 6, 5
2915Horizon-Free Reinforcement Learning for Latent Markov Decision Processes4.671.255, 3, 6
2916Estimating Riemannian Metric with Noise-Contaminated Intrinsic Distance4.672.363, 3, 8
2917EMP: Effective Multidimensional Persistence for Graph Representation Learning4.671.256, 5, 3
2918Self-Adaptive Perturbation Radii for Adversarial Training4.671.253, 5, 6
2919Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning4.671.253, 5, 6
2920EM-Network: Learning Better Latent Variable for Sequence-to-Sequence Models4.671.253, 5, 6
2921HotProtein: A Novel Framework for Protein Thermostability Prediction and Editing4.671.255, 3, 6
2922On the Neural Tangent Kernel of Equilibrium Models4.671.253, 6, 5
2923HYPERPRUNING: EFFICIENT PRUNING THROUGH LYAPUNOV METRIC HYPERSEARCH4.671.253, 6, 5
2924Minimum Curvature Manifold Learning4.671.255, 6, 3
2925Min-Max Zero-Shot Multi-Label Classification4.671.253, 6, 5
2926Generated Graph Detection4.671.256, 3, 5
2927Quantum Fourier Networks for solving Parametric PDEs4.671.256, 3, 5
2928ADVERSARIALLY BALANCED REPRESENTATION FOR CONTINUOUS TREATMENT EFFECT ESTIMATION4.671.256, 5, 3
2929D-CIPHER: Discovery of Closed-form Partial Differential Equations4.672.363, 3, 8
2930Learning with MISELBO: The Mixture Cookbook4.671.253, 5, 6
2931Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes4.671.255, 6, 3
2932Analyzing the Effects of Classifier Lipschitzness on Explainers4.671.255, 6, 3
2933Enhance Local Consistency for Free: A Multi-Step Inertial Momentum Approach4.671.255, 3, 6
2934Robust Constrained Reinforcement Learning4.671.253, 5, 6
2935CorruptEncoder: Data Poisoning Based Backdoor Attacks to Contrastive Learning4.671.253, 5, 6
2936Revitalize Region Feature for Democratizing Video-language Pre-training of Retrieval4.671.256, 3, 5
2937Byzantine-robust Decentralized Learning via ClippedGossip4.671.256, 3, 5
2938Towards the Out-of-Distribution Generalization of Contrastive Self-Supervised Learning4.671.255, 6, 3
2939ColoristaNet for Photorealistic Video Style Transfer4.671.253, 5, 6
2940Low-complexity Deep Video Compression with A Distributed Coding Architecture4.671.256, 5, 3
2941Property Inference Attacks Against t-SNE Plots4.671.253, 5, 6
2942D4AM: A General Denoising Framework for Downstream Acoustic Models4.671.255, 6, 3
2943Saliency-guided Vision Transformer for Few-shot Keypoint Detection4.671.256, 5, 3
2944Holistically Explainable Vision Transformers4.671.255, 3, 6
2945Instance-wise Batch Label Restoration via Gradients in Federated Learning4.671.253, 6, 5
2946GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation4.671.255, 3, 6
2947Simultaneously Learning Stochastic and Adversarial Markov Decision Process with Linear Function Approximation4.671.255, 6, 3
2948Gated Domain Units for Multi-source Domain Generalization4.671.255, 6, 3
2949Bag of Tricks for FGSM Adversarial Training4.671.253, 5, 6
2950Exploring interactions between modalities for deepfake detection4.671.255, 6, 6, 3, 5, 3
2951A Causal Approach to Detecting Multivariate Time-series Anomalies and Root Causes4.671.256, 5, 3
2952A Closer Look at Self-supervised Lightweight Vision Transformers4.671.256, 5, 3
2953Exploring the Generalizability of CNNs via Activated Representational Substitution4.671.256, 3, 5
2954FedFA: Federated Learning with Feature Alignment for Heterogeneous Data4.671.256, 5, 3
2955MABA-Net: Masked Additive Binary Activation Network4.671.255, 3, 6
2956Quantum-Inspired Tensorized Embedding with Application to Node Representation Learning4.672.363, 8, 3
2957Federated Learning of Large Models at the Edge via Principal Sub-Model Training4.671.256, 5, 3
2958Sharper Rates and Flexible Framework for Nonconvex SGD with Client and Data Sampling4.671.253, 6, 5
2959Rademacher Complexity Over $mathcal{H} Delta mathcal{H}$ Class for Adversarially Robust Domain Adaptation4.671.253, 6, 5
2960Differentially Private Dataset Condensation4.671.253, 6, 5
2961Dynamics-inspired Neuromorphic Representation Learning4.672.363, 3, 8
2962Radial Spike and Slab Bayesian Neural Networks for Sparse Data in Ransomware Attacks4.671.256, 5, 3
2963Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks4.671.253, 5, 6
2964Receding Neuron Importances for Structured Pruning4.671.256, 3, 5
2965CONTINUAL MODEL EVOLVEMENT WITH INNER-PRODUCT RESTRICTION4.671.256, 5, 3
2966PREF: Phasorial Embedding Fields for Compact Neural Representations4.671.256, 3, 5
2967FedPSE: Personalized Sparsification with Element-wise Aggregation for Federated Learning4.671.253, 6, 5
2968Multigraph Topology Design for Cross-Silo Federated Learning4.671.253, 6, 5
2969Exploit Unlabeled Data on the Server! Federated Learning via Uncertainty-aware Ensemble Distillation and Self-Supervision4.671.253, 5, 6
2970Parallel Federated Learning over Heterogeneous Devices4.671.255, 3, 6
2971Mugs: A Multi-Granular Self-Supervised Learning Framework4.672.363, 8, 3
2972Grafting Vision Transformers4.671.256, 3, 5
2973PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction4.671.256, 3, 5
2974NIERT: Accurate Numerical Interpolation through Unifying Scattered Data Representations using Transformer Encoder4.671.255, 3, 6
2975Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets4.671.253, 6, 5
2976Manifold Characteristics That Predict Downstream Task Performance4.671.255, 3, 6
2977Improved Fully Quantized Training via Rectifying Batch Normalization4.671.255, 3, 6
2978Lottery Aware Sparsity Hunting: Enabling Federated Learning on Resource-Limited Edge4.671.253, 6, 5
2979Phase transition for detecting a small community in a large network4.671.253, 6, 5
2980Zipper: Decoupling the tradeoff Between Robustness and Accuracy4.671.256, 3, 5
2981Learning Visual Representation with Synthetic Images and Topologically-defined Labels4.671.253, 6, 5
2982A prototype-oriented clustering for domain shift with source privacy4.671.255, 6, 3
2983FADE: Enabling Large-Scale Federated Adversarial Training on Resource-Constrained Edge Devices4.671.253, 6, 5
2984Temporal Relevance Analysis for Video Action Models4.671.253, 5, 6
2985Towards Understanding Convergence and Generalization of AdamW4.671.255, 3, 6
2986Learning from Interval-valued Data4.672.363, 3, 8
2987Efficient Hyperdimensional Computing4.671.255, 6, 3
2988Auxiliary task discovery through generate and test4.671.255, 3, 6
2989Categorial Grammar Induction as a Compositionality Measure for Emergent Languages in Signaling Games4.671.253, 6, 5
2990Exploring Neural Network Representational Similarity using Filter Subspaces4.671.256, 5, 3
2991Probing into Overfitting for Video Recognition4.671.256, 3, 5
2992Universal Unlearnable Examples: Cluster-wise Perturbations without Label-consistency4.671.256, 5, 3
2993Interpretable Single/Multi-label Text Classification with Unsupervised Constituent-label alignments4.671.253, 6, 5
2994Functional Relation Field: A Model-Agnostic Framework for Multivariate Time Series Forecasting4.671.255, 6, 3
2995Generalized Category Discovery via Adaptive GMMs without Knowing the Class Number4.671.256, 3, 5
2996A Mutual Information Duality Algorithm for Multi-Agent Specialization4.621.323, 3, 5, 6, 6, 3, 6, 5
2997Graph Mixup with Soft Alignments4.601.363, 6, 6, 3, 5
2998Emergence of shared sensory-motor graphical language from visual input4.601.363, 6, 3, 5, 6
2999Temporal Dynamics Aware Adversarial Attacks On Discrete-Time Graph Models4.601.851, 5, 6, 6, 5
3000Escaping saddle points in zeroth-order optimization: two function evaluations suffice4.601.366, 5, 3, 6, 3
3001Variational Causal Dynamics: Discovering Modular World Models from Interventions4.601.366, 3, 6, 3, 5
3002Feed-Forward Latent Domain Adaptation4.602.063, 3, 3, 6, 8
3003Test-time Adaptation for Segmentation via Image Synthesis4.601.363, 6, 6, 3, 5
3004Similarity of Neural Architectures Based on Input Gradient Transferability4.602.425, 3, 1, 6, 8
3005Equivariant Descriptor Fields: SE(3)-Equivariant Energy-Based Models for End-to-End Visual Robotic Manipulation Learning4.601.363, 3, 5, 6, 6
3006Look in The Mirror: Molecular Graph Contrastive Learning with Line Graph4.602.063, 8, 3, 3, 6
3007Linear convergence for natural policy gradient with log-linear policy parametrization4.600.805, 5, 5, 5, 3
3008Chopping Formers is what you need in Vision4.601.363, 6, 6, 3, 5
3009Variance Covariance Regularization Enforces Pairwise Independence in Self-Supervised Representations4.601.363, 6, 3, 5, 6
3010Multi-Label Knowledge Distillation4.602.063, 3, 6, 8, 3
3011FrAug: Frequency Domain Augmentation for Time Series Forecasting4.600.803, 5, 5, 5, 5
3012Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity4.601.363, 6, 3, 6, 5
3013A Unimodal, Uncertainty-Aware Deep Learning Approach for Ordinal Regression4.600.805, 5, 3, 5, 5
3014Does Dataset Lottery Ticket Hypothesis Exist?4.601.363, 3, 6, 6, 5
3015Exploring The Capacity Mismatch Problem in Knowledge Distillation from the View of Soft Labels4.600.805, 3, 5, 5, 5
3016Revisiting Residual Networks for Adversarial Robustness4.600.805, 3, 5, 5, 5
3017QFuture: Learning Future Expectations in Multi-Agent Reinforcement Learning4.601.366, 3, 6, 3, 5
3018Free Bits: Platform-Aware Latency Optimization of Mixed-Precision Neural Networks for Edge Deployment4.500.875, 5, 5, 3
3019DELTA: Diverse Client Sampling for Fasting Federated Learning4.501.506, 6, 3, 3
3020Grounded Contrastive Learning for Open-world Semantic Segmentation4.500.875, 5, 3, 5
3021Batch Normalization and Bounded Activation Functions4.500.875, 5, 5, 3
3022Deep Equilibrium Non-Autoregressive Sequence Learning4.500.875, 3, 5, 5
3023On the Adversarial Robustness against Natural Weather Perturbations4.500.875, 3, 5, 5
3024Optimistic Exploration in Reinforcement Learning Using Symbolic Model Estimates4.501.506, 3, 3, 6
3025Rényi Supervised Contrastive Learning for Transferable Representation4.500.875, 3, 5, 5
3026Topology Matters in Fair Graph Learning: a Theoretical Pilot Study4.501.503, 3, 6, 6
3027Beyond the injective assumption in causal representation learning4.501.506, 3, 6, 3
3028Approximation ability of Transformer networks for functions with various smoothness of Besov spaces: error analysis and token extraction4.500.873, 5, 5, 5
3029Reinforcement Logic Rule Learning for Temporal Point Processes4.501.506, 3, 3, 6
3030UNDERSTANDING HTML WITH LARGE LANGUAGE MODELS4.500.875, 5, 3, 5
3031Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows4.501.506, 3, 6, 3
3032ACE-EM: Boosted ab initio Cryo-EM 3D Reconstruction with Asymmetric Complementary Autoencoder4.501.506, 6, 3, 3
3033A Fast, Well-Founded Approximation to the Empirical Neural Tangent Kernel4.500.875, 5, 5, 3
3034Towards Unsupervised Time Series Representation Learning: A Decomposition Perspective4.501.506, 6, 3, 3
3035Steerable Equivariant Representation Learning4.500.875, 3, 5, 5
3036Federated Learning with Heterogeneous Label Noise: A Dual Structure Approach4.500.875, 3, 5, 5
3037Spatiotemporal Modeling of Multivariate Signals with Graph Neural Networks and Structured State Space Models4.500.875, 5, 5, 3
3038Domain-Invariant Auxiliary Learning for Robust Few-Shot Predictions from Noisy Data4.501.503, 3, 6, 6
3039ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning4.500.875, 5, 3, 5
3040ProtFIM: Fill-in-Middle Protein Sequence Design via Protein Language Models4.500.873, 5, 5, 5
3041MUG: Interactive Multimodal Grounding on User Interfaces4.500.873, 5, 5, 5
3042SIMPLE: A Gradient Estimator for k-Subset Sampling4.501.506, 3, 3, 6
3043Greedy Information Maximization for Online Feature Selection4.501.126, 5, 3, 3, 5, 5
3044Cross-Domain Few-Shot Relation Extraction via Representation Learning and Domain Adaptation4.500.875, 5, 5, 3
3045Koopman Operator Learning for Accelerating Quantum Optimization and Machine Learning4.501.506, 3, 6, 3
3046Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property4.501.503, 6, 6, 3
3047Variable Compositionality Reliably Emerges in Neural Networks4.500.873, 5, 5, 5
3048Causally-guided Regularization of Graph Attention improves Generalizability4.500.873, 5, 5, 5
3049A Simple Approach for State-Action Abstraction using a Learned MDP Homomorphism4.501.506, 3, 3, 6
3050Optimal Transport-Based Supervised Graph Summarization4.501.503, 3, 6, 6
3051Double Wins: Boosting Accuracy and Efficiency of Graph Neural Networks by Reliable Knowledge Distillation4.501.506, 3, 6, 3
3052Motif-based Graph Representation Learning with Application to Chemical Molecules4.500.875, 3, 5, 5
3053Beam Tree Recursive Cells4.500.875, 5, 3, 5
3054Cross-Silo Training of Differentially Private Models with Secure Multiparty Computation4.501.503, 6, 6, 3
3055Illusory Adversarial Attacks on Sequential Decision-Makers and Countermeasures4.500.875, 5, 3, 5
3056Catastrophic overfitting is a bug but it is caused by features4.501.506, 3, 6, 3
3057Robust Universal Adversarial Perturbations4.500.875, 3, 5, 5
3058SARNET: SARCASM VS TRUE-HATE DETECTION NETWORK4.500.875, 5, 5, 3
3059On Gradient Descent Convergence beyond the Edge of Stability4.500.875, 3, 5, 5
3060Robustifying Language Models via Adversarial Training with Masked Gradient4.500.875, 5, 5, 3
3061Convexifying Transformers: Improving optimization and understanding of transformer networks4.500.875, 5, 3, 5
3062TimeSeAD: Benchmarking Deep Time-Series Anomaly Detection4.500.875, 5, 5, 3
3063Towards Multi-spatiotemporal-scale Generalized PDE Modeling4.500.875, 3, 5, 5
3064REST: REtrieve & Self-Train for generative action recognition4.500.873, 5, 5, 5
3065Internet-augmented language models through few-shot prompting for open-domain question answering4.501.506, 6, 3, 3
3066Generalized Belief Transport4.502.065, 6, 6, 1
3067Maximal Correlation-Based Post-Nonlinear Learning for Bivariate Causal Discovery4.501.506, 6, 3, 3
3068Interactive Sequential Generative Models4.501.503, 6, 3, 6
3069Relaxed Attention for Transformer Models4.500.875, 5, 3, 5
3070Vector Quantization and Shifting: Exploiting Latent Properties to Optimize Neural Codecs4.501.506, 3, 3, 6
3071MARLlib: Extending RLlib for Multi-agent Reinforcement Learning4.500.875, 3, 5, 5
3072Energy Consumption-Aware Tabular Benchmarks for Neural Architecture Search4.500.873, 5, 5, 5
3073Delve into the Layer Choice of BP-based Attribution Explanations4.500.875, 3, 5, 5
3074Query The Agent: Improving Sample Efficiency Through Epistemic Uncertainty Estimation4.500.875, 5, 3, 5
3075Cold Posteriors through PAC-Bayes4.500.875, 3, 5, 5
3076Toward Effective Deep Reinforcement Learning for 3D Robotic Manipulation: End-to-End Learning from Multimodal Raw Sensory Data4.500.875, 3, 5, 5
3077ChemAlgebra : Algebraic Reasoning on Chemical Reactions4.501.506, 3, 3, 6
3078Improving Adversarial Robustness via Frequency Regularization4.500.875, 3, 5, 5
3079$omega$GNNs: Deep Graph Neural Networks Enhanced by Multiple Propagation Operators4.500.875, 5, 3, 5
3080Learning from Asymmetrically-corrupted Data in Regression for Sensor Magnitude4.502.066, 1, 6, 5
3081Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation4.500.875, 3, 5, 5
3082Adversarial Causal Augmentation for Graph Covariate Shift4.501.506, 3, 3, 6
3083On the Robustness of Randomized Ensembles to Adversarial Perturbations4.501.506, 6, 3, 3
3084Deep Transformer Q-Networks for Partially Observable Reinforcement Learning4.502.066, 6, 5, 1
3085Visual Expertise and the Log-Polar Transform Explain Image Inversion Effects4.500.873, 5, 5, 5
3086Neural Semi-Counterfactual Risk Minimization4.502.698, 6, 3, 1
3087FedDebias: Reducing the Local Learning Bias Improves Federated Learning on Heterogeneous Data4.500.875, 3, 5, 5
3088Best Possible Q-Learning4.501.503, 6, 6, 3
3089Self-Supervised Logit Adjustment4.500.875, 5, 3, 5
3090Leaves: Learning Views for Time-Series Data in Contrastive Learning4.500.873, 5, 5, 5
3091DeepGuiser: Learning to Disguise Neural Architectures for Impeding Adversarial Transfer Attacks4.501.503, 6, 3, 6
3092The Cost of Privacy in Fair Machine Learning4.500.873, 5, 5, 5
3093When Majorities Prevent Learning: Eliminating Bias to Improve Worst-group and Out-of-distribution Generalization4.500.873, 5, 5, 5
3094Fairness-Aware Model-Based Multi-Agent Reinforcement Learning for Traffic Signal Control4.500.875, 5, 5, 3
3095Learning Unified Representations for Multi-Resolution Face Recognition4.500.875, 3, 5, 5
3096Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution4.501.506, 3, 3, 6
3097Adaptive Weight Decay: On The Fly Weight Decay Tuning for Improving Robustness4.500.875, 3, 5, 5
3098Machine Unlearning of Federated Clusters4.501.506, 3, 3, 6
3099Link Prediction with Non-Contrastive Learning4.500.873, 5, 5, 5
3100Goal-Space Planning with Subgoal Models4.500.875, 5, 5, 3
3101Learning Unsupervised Forward Models from Object Keypoints4.500.873, 5, 5, 5
3102Meta Temporal Point Processes4.500.873, 5, 5, 5
3103DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability4.500.873, 5, 5, 5
3104OTCOP: Learning optimal transport maps via constraint optimizations4.501.506, 6, 3, 3
3105Graduated Non-Convexity for Robust Self-Trained Language Understanding4.501.503, 6, 6, 3
3106SemSup-XC: Semantic Supervision for Extreme Classification4.500.875, 5, 5, 3
3107Wide Graph Neural Network4.502.066, 5, 1, 6
3108Integrating Episodic and Global Novelty Bonuses for Efficient Exploration4.500.875, 3, 5, 5
3109Dynamics-aware Skill Generation from Behaviourally Diverse Demonstrations4.501.506, 3, 6, 3
3110Calibrating Transformers via Sparse Gaussian Processes4.501.503, 6, 3, 6
3111Multimodal Open-Vocabulary Video Classification via Vision and Language Models4.501.506, 6, 3, 3
3112When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning4.500.875, 5, 3, 5
3113Domain-Unified Prompt Representations for Source-Free Domain Generalization4.500.875, 5, 3, 5
3114Disentangling Learning Representations with Density Estimation4.500.875, 5, 3, 5
3115A Risk-Averse Equilibrium for Multi-Agent Systems4.501.506, 3, 6, 3
3116A Learning Based Hypothesis Test for Harmful Covariate Shift4.500.875, 5, 3, 5
3117On the Relationship Between Adversarial Robustness and Decision Region in Deep Neural Networks4.500.875, 3, 5, 5
3118Noether Embeddings: Fast Temporal Association Mining4.500.875, 5, 5, 3
3119Poisson Process for Bayesian Optimization4.500.875, 5, 5, 3
3120Where prior learning can and can't work in unsupervised inverse problems4.501.506, 6, 3, 3
3121Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training4.501.506, 3, 3, 6
3122An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems4.502.061, 6, 6, 5
3123Schedule-Robust Online Continual Learning4.500.873, 5, 5, 5
3124Contrastive Hierarchical Clustering4.500.873, 5, 5, 5
3125On Incremental Learning with Long Short Term Strategy4.500.875, 5, 5, 3
3126ESP: Exponential Smoothing on Perturbations for Increasing Robustness to Data Corruptions4.500.875, 5, 5, 3
3127Multiple Invertible and Equivariant Transformation for Disentanglement in VAEs4.500.875, 5, 3, 5
3128Revisiting Fast Adversarial Training4.500.875, 5, 3, 5
3129Bayesian semi-supervised learning with a principled likelihood from a generative model of data curation4.500.875, 5, 3, 5
3130Deep High-Frequency Extrapolation for Neuronal Spike Restoration4.501.503, 3, 6, 6
3131Emergent Communication with Attention4.500.875, 3, 5, 5
3132Black-box Knowledge Distillation4.500.873, 5, 5, 5
3133Self-Consistent Learning: Cooperation between Generators and Discriminators4.502.061, 5, 6, 6
3134Personalized Decentralized Bilevel Optimization over Stochastic and Directed Networks4.500.875, 3, 5, 5
3135Data-Free Continual Graph Learning4.501.506, 3, 3, 6
3136Can you Trust your Disentanglement?4.502.698, 6, 3, 1
3137Visual Reinforcement Learning with Self-Supervised 3D Representations4.501.506, 6, 3, 3
3138CUSTOMIZING PRE-TRAINED DIFFUSION MODELS FOR YOUR OWN DATA4.500.875, 5, 3, 5
3139Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data4.500.875, 5, 3, 5
3140Adversarially Robust Neural Lyapunov Control4.500.875, 5, 5, 3
3141Domain-Specific Risk Minimization for Out-of-Distribution Generalization4.500.875, 5, 5, 3
3142Temporally-Weighted Spike Encoding for Event-based Object Detection and Classification4.501.503, 3, 6, 6
3143SimA: Simple Softmax-free Attention For Vision Transformers4.500.873, 5, 5, 5
3144What does a platypus look like? Generating customized prompts for zero-shot image classification4.501.506, 3, 3, 6
3145Hyperbolic Contrastive Learning for Visual Representations beyond Objects4.500.875, 3, 5, 5
3146Hybrid RL: Using both offline and online data can make RL efficient4.502.061, 5, 6, 6
3147Scalable and Privacy-enhanced Graph Generative Model for Graph Neural Networks4.501.503, 6, 6, 3
3148Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization4.502.061, 6, 6, 5
3149Heterogeneous Continual Learning4.500.873, 5, 5, 5
3150Momentum Diminishes the Effect of Spectral Bias in Physics-Informed Neural Networks4.502.693, 1, 8, 6
3151SeqSHAP: Subsequence Level Shapley Value Explanations for Sequential Predictions4.500.875, 5, 3, 5
3152Group-level Brain Decoding with Deep Learning4.500.873, 5, 5, 5
3153Learning Inductive Object-Centric Slot Initialization via Clustering4.500.875, 5, 3, 5
3154The Continuous CNN: from Task-Specific to Unified CNN Architecture4.501.503, 3, 6, 6
3155Pixel-Aligned Non-parametric Hand Mesh Reconstruction4.500.875, 5, 3, 5
3156Is the Deep Model Representation Sparse and Symbolic with Causal Patterns?4.500.873, 5, 5, 5
3157TransformMix: Learning Transformation and Mixing Strategies for Sample-mixing Data Augmentation4.500.873, 5, 5, 5
3158Disentangled Knowledge Transfer: A New Perspective for Personalized Federated Learning4.500.873, 5, 5, 5
3159DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization4.500.873, 5, 5, 5
3160Defense against Backdoor Attacks via Identifying and Purifying Bad Neurons4.500.875, 5, 3, 5
3161DSP: Dynamic Semantic Prototype for Generative Zero-Shot Learning4.500.875, 5, 5, 3
3162Topic Aware Transformer: Domain Shift for Unconditional Text Generation Model4.501.506, 6, 3, 3
3163Extracting Expert's Goals by What-if Interpretable Modeling4.500.873, 5, 5, 5
3164Improving Molecular Pretraining with Complementary Featurizations4.501.506, 3, 6, 3
3165AutoSparse: Towards Automated Sparse Training4.501.125, 5, 3, 3, 5, 6
3166PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets4.502.691, 3, 8, 6
3167Bootstrap Motion Forecasting With Self-Consistent Constraints4.500.875, 3, 5, 5
3168Learning to Split for Automatic Bias Detection4.501.503, 6, 3, 6
3169Physics-empowered Molecular Representation Learning4.500.873, 5, 5, 5
3170MINI: Mining Implicit Novel Instances for Few-Shot Object Detection4.500.875, 5, 3, 5
3171FedGSNR: Accelerating Federated Learning on Non-IID Data via Maximum Gradient Signal to Noise Ratio4.501.506, 3, 3, 6
3172Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning4.500.875, 3, 5, 5
3173Light-weight probing of unsupervised representations for Reinforcement Learning4.501.506, 3, 3, 6
3174Tackling the Retrieval Trilemma with Cross-Modal Indexing4.500.873, 5, 5, 5
3175Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models4.501.503, 6, 6, 3
3176Shot Retrieval and Assembly with Text Script for Video Montage Generation4.501.503, 6, 3, 6
3177Margin-based Neural Network Watermarking4.500.875, 5, 3, 5
3178Revisiting Global Pooling through the Lens of Optimal Transport4.500.875, 5, 3, 5
3179Towards Expressive Graph Representations for Graph Neural Networks4.500.875, 3, 5, 5
3180Efficient, Stable, and Analytic Differentiation of the Sinkhorn Loss4.501.503, 6, 6, 3
3181Dynamical Isometry for Residual Networks4.501.506, 3, 3, 6
3182Deep Learning meets Nonparametric Regression: Are Weight-Decayed DNNs Locally Adaptive?4.500.873, 5, 5, 5
3183Minibatch Stochastic Three Points Method for Unconstrained Smooth Minimization4.500.875, 3, 5, 5
3184Learning Symbolic Rules for Reasoning in Quasi-Natural Language4.500.875, 3, 5, 5
3185Least-to-Most Prompting Enables Complex Reasoning in Large Language Models4.502.066, 1, 6, 5
3186Approximate Bayesian Inference with Stein Functional Variational Gradient Descent4.500.875, 3, 5, 5
3187It Takes Two: Masked Appearance-Motion Modeling for Self-Supervised Video Transformer Pre-Training4.500.875, 3, 5, 5
3188In-the-wild Pretrained Models Are Good Feature Extractors for Video Quality Assessment4.500.875, 5, 3, 5
3189Mitigating Forgetting in Online Continual Learning via Contrasting Semantically Distinct Augmentations4.500.875, 5, 3, 5
3190Contextual Symbolic Policy For Meta-Reinforcement Learning4.500.875, 3, 5, 5
3191Node Classification Beyond Homophily: Towards a General Solution4.501.506, 3, 3, 6
3192Decouple Graph Neural Networks: Train Multiple Simple GNNs Simultaneously Instead of One4.500.875, 5, 3, 5
3193On the Effectiveness of Adapting Pre-trained Transformer Models via Adversarial Noise4.500.873, 5, 5, 5
3194Federated Learning in Non-IID Settings Aided by Differentially Private Synthetic Data4.500.873, 5, 5, 5
3195A UNIFIED VIEW OF FINDING AND TRANSFORMING WINNING LOTTERY TICKETS4.501.506, 3, 3, 6
3196Revisiting Group Robustness: Class-specific Scaling is All You Need4.501.503, 3, 6, 6
3197DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models4.500.875, 5, 3, 5
3198Semi-Supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data4.500.873, 5, 5, 5
3199Gamma Sampling: Fine-grained Controlling Language Models without Training4.500.875, 5, 5, 3
3200Contrastive Continuity on Augmentation Stability Rehearsal for Continual Self-Supervised Learning4.501.506, 3, 3, 6
3201Uncertainty Calibration via Knowledge Flow under Long-tailed Distribution4.500.875, 3, 5, 5
3202$1times1$ Convolution is All You Need for Image Super-Resolution4.500.873, 5, 5, 5
3203Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos4.500.873, 5, 5, 5
3204ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing4.500.875, 3, 5, 5
3205Parameter Averaging for Feature Ranking4.500.875, 5, 3, 5
3206Smooth-Reduce: Leveraging Patches for Improved Certified Robustness4.501.506, 3, 6, 3
3207Stochastic Differentially Private and Fair Learning4.500.873, 5, 5, 5
3208SegNeRF: 3D Part Segmentation with Neural Radiance Fields4.500.873, 5, 5, 5
3209Faster Neural Architecture 'Search' for Deep Image Prior4.500.875, 5, 3, 5
3210Object Localization helps Action Recognition Models Adapt to New Environments4.500.875, 3, 5, 5
3211Is Self-Supervised Contrastive Learning More Robust Than Supervised Learning?4.500.873, 5, 5, 5
3212Correcting the Sub-optimal Bit Allocation4.502.698, 1, 6, 3
3213Partial transportability for domain generalization4.501.503, 3, 6, 6
3214Quasi-Conservative Score-based Generative Models4.500.873, 5, 5, 5
3215Neural Attention Memory4.501.506, 6, 3, 3
3216Mimic before Reconstruct: Enhance Masked Autoencoders with Feature Mimicking4.500.873, 5, 5, 5
3217Meta Optimal Transport4.500.875, 3, 5, 5
3218Backpropagation Path Search On Adversarial Transferability4.500.875, 5, 3, 5
3219Efficient Exploration via Fragmentation and Recall4.500.875, 5, 5, 3
3220CLEP: Exploiting Edge Partitioning for Graph Contrastive Learning4.401.968, 5, 3, 3, 3
3221Behavior Proximal Policy Optimization4.401.205, 3, 6, 5, 3
3222Fairness via Adversarial Attribute Neighbourhood Robust Learning4.401.203, 5, 6, 5, 3
3223Representation Power of Graph Convolutions : Neural Tangent Kernel Analysis4.401.963, 5, 3, 3, 8
3224Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training4.401.965, 3, 8, 3, 3
3225End-to-end Invariance Learning with Relational Inductive Biases in Multi-Object Robotic Manipulation4.401.205, 6, 5, 3, 3
3226Homotopy-based training of NeuralODEs for accurate dynamics discovery4.401.203, 5, 3, 6, 5
3227Learning To Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning4.401.205, 6, 3, 5, 3
3228Robustify Transformers with Robust Kernel Density Estimation4.401.203, 6, 5, 3, 5
3229Rethinking Knowledge Distillation with Raw Features for Semantic Segmentation4.401.745, 6, 1, 5, 5
3230M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation4.401.205, 3, 3, 6, 5
3231Node Importance Specific Meta Learning in Graph Neural Networks4.401.205, 5, 6, 3, 3
3232Self-supervised Speech Enhancement using Multi-Modal Data4.401.203, 5, 6, 3, 5
3233Contrastive Graph Few-Shot Learning4.401.206, 5, 3, 5, 3
3234DropAut: Automatic Dropout Approaches to learn and adapt Drop Rates4.401.205, 6, 3, 5, 3
3235MUTUAL EXCLUSIVE MODULATOR FOR LONG-TAILED RECOGNITION4.401.206, 5, 3, 3, 5
3236Conditional Invariances for Conformer Invariant Protein Representations4.401.203, 6, 5, 3, 5
3237HOYER REGULARIZER IS ALL YOU NEED FOR EXTREMELY SPARSE SPIKING NEURAL NETWORKS4.401.205, 6, 3, 3, 5
3238Breaking Beyond COCO Object Detection4.401.203, 5, 3, 6, 5
3239A Deep Conjugate Direction Method for Iteratively Solving Linear Systems4.401.963, 3, 5, 3, 8
3240MixMask: Revisiting Masked Siamese Self-supervised Learning in Asymmetric Distance4.401.203, 5, 3, 6, 5
3241Topology-aware robust optimization4.401.203, 5, 5, 3, 6
3242Decoupling Concept Bottleneck Model4.401.203, 5, 5, 3, 6
3243Active Topological Mapping by Metric-Free Exploration via Task and Motion Imitation4.401.203, 3, 5, 5, 6
3244pFedKT: Personalized Federated Learning via Knowledge Transfer4.330.945, 5, 3
3245Deep Reinforcement Learning based Insight Selection Policy4.330.945, 3, 5
3246Coreset for Rational Functions4.330.945, 5, 3
3247Enabling Equation Learning with the Bayesian Model Evidence via systematic $R^2$-elimination4.330.945, 3, 5
3248PTUnifier: Pseudo Tokens as Paradigm Unifiers in Medical Vision-and-Language Pre-training4.330.945, 5, 3
3249Improving the Calibration of Fine-tuned Language Models via Denoising Variational Auto-Encoders4.330.945, 3, 5
3250SELCOR: Self-Correction for Weakly Supervised Learning4.330.945, 5, 3
3251An Experiment Design Paradigm using Joint Feature Selection and Task Optimization4.330.943, 5, 5
3252Intra-Instance VICReg: Bag of Self-Supervised Image Patch Embedding Explains the Performance4.330.943, 5, 5
3253Deep Latent State Space Models for Time-Series Generation4.330.945, 3, 5
3254Covariance Matrix Adaptation MAP-Annealing4.330.943, 5, 5
3255AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers4.330.945, 3, 5
3256Kuiper: Moderated Asynchronous Federated Learning on Heterogeneous Mobile Devices with Non-IID Data4.330.943, 5, 5
3257A Computationally Efficient Sparsified Online Newton Method4.330.943, 5, 5
3258MILE: Memory-Interactive Learning Engine for Solving Mathematical Problems4.330.945, 5, 3
3259Outlier-Robust Group Inference via Gradient Space Clustering4.330.945, 3, 5
3260The Vendi Score: A Diversity Evaluation Metric for Machine Learning4.330.945, 5, 3
3261Designing and Using Goal-Conditioned Tools4.330.945, 5, 3
3262Gradient Preconditioning for Non-Lipschitz smooth Nonconvex Optimization4.330.945, 5, 3
3263BertNet: Harvesting Knowledge Graphs from Pretrained Language Models4.330.945, 3, 5
32643D Surface Reconstruction in the Wild by Deforming Shape Priors from Synthetic Data4.330.945, 5, 3
3265Linkless Link Prediction via Relational Distillation4.330.945, 3, 5
3266Efficient Proxy for NAS is Extensible Now4.330.945, 3, 5
3267DIGEST: FAST AND COMMUNICATION EFFICIENT DECENTRALIZED LEARNING WITH LOCAL UPDATES4.330.945, 3, 5
3268Learning to Improve Code Efficiency4.330.945, 3, 5
3269Aging with GRACE: Lifelong Model Editing with Key-Value Adaptors4.330.945, 5, 3
3270Contrastive Vision Transformer for Self-supervised Out-of-distribution Detection4.330.943, 5, 5
3271Selection Collider Bias in Large Language Models4.330.945, 3, 5
3272Mind the Privacy Budget: How Generative Models Spend their Privacy Budgets4.330.945, 3, 5
3273MAD for Robust Reinforcement Learning in Machine Translation4.330.943, 5, 5
3274Zero-Shot Retrieval with Search Agents and Hybrid Environments4.330.945, 5, 3
3275Learning the Visualness of Text Using Large Vision-Language Models4.330.945, 5, 3
3276Explanation Uncertainty with Decision Boundary Awareness4.330.943, 5, 5
3277Do We Really Need Labels for Backdoor Defense?4.330.945, 5, 3
3278Non-Gaussian Process Regression4.330.945, 5, 3
3279The Adversarial Regulation of the Temporal Difference Loss Costs More Than Expected4.330.945, 3, 5
3280A Subspace Correction Method for ReLU Neural Networks for Solving PDEs4.330.943, 5, 5
3281$mathcal{O}$-GNN: incorporating ring priors into molecular modeling4.330.943, 5, 5
3282Graph Contrastive Learning with Model Perturbation4.330.945, 5, 3
3283Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models4.330.943, 5, 5
3284Highly Parallel Deep Ensemble Learning4.330.943, 5, 5
3285Brain2GAN; Reconstructing perceived faces from the primate brain via StyleGAN34.330.943, 5, 5
3286Learning to Cooperate and Communicate Over Imperfect Channels4.330.943, 5, 5
3287Towards Federated Learning of Deep Graph Neural Networks4.330.943, 5, 5
3288Hidden Markov Mixture of Gaussian Process Functional Regression: Utilizing Multi-Scale Structure for Time-Series Forecasting4.330.943, 5, 5
3289Multivariate Time Series Forecasting By Graph Attention Networks With Theoretical Guarantees4.330.945, 5, 3
3290Hierarchical Prototypes for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning4.330.943, 5, 5
3291Learning to Register Unbalanced Point Pairs4.332.366, 6, 1
3292Thinking fourth dimensionally: Treating Time as a Random Variable in EBMs4.330.945, 3, 5
3293FedProp: Cross-client Label Propagation for Federated Semi-supervised Learning4.330.943, 5, 5
3294Scalable Multi-Modal Continual Meta-Learning4.330.945, 3, 5
3295Does Structural Information have been Fully Exploited in Graph Data?4.330.943, 5, 5
3296DeepGRAND: Deep Graph Neural Diffusion4.330.945, 3, 5
3297ASIF: coupled data turns unimodal models to multimodal without training4.330.943, 5, 5
3298Two-Dimensional Weisfeiler-Lehman Graph Neural Networks for Link Prediction4.330.945, 5, 3
3299Object Detection with OOD Generalizable Neural Architecture Search4.330.943, 5, 5
3300Inverse Learning with Extremely Sparse Feedback for Recommendation4.330.945, 3, 5
3301CLUTR: Curriculum Learning via Unsupervised Task Representation Learning4.330.945, 5, 3
3302Robust Quantity-Aware Aggregation for Federated Learning4.330.943, 5, 5
3303Local Distance Preserving Auto-encoders using Continuous k-Nearest Neighbours Graphs4.330.945, 5, 3
3304PADDLES: Phase-Amplitude Spectrum Disentangled Early Stopping for Learning with Noisy Labels4.330.945, 3, 5
3305Textless Phrase Structure Induction from Visually-Grounded Speech4.330.943, 5, 5
3306On Regularization for Explaining Graph Neural Networks: An Information Theory Perspective4.332.366, 1, 6
3307COMNET : CORTICAL MODULES ARE POWERFUL4.330.945, 3, 5
3308Intrinsic Computational Complexity of Equivariant Neural Networks4.330.945, 3, 5
3309Weakly-Supervised Domain Adaptation in Federated Learning4.330.943, 5, 5
3310Text and Patterns: For Effective Chain of Thought It Takes Two to Tango4.330.945, 3, 5
3311Unlearning with Fisher Masking4.330.945, 5, 3
3312How Weakly Supervised Information helps Contrastive Learning4.330.945, 3, 5
3313Adaptive Kernel Selection for Convolutional Neural Network4.330.943, 5, 5
3314Online Min-max Optimization: Nonconvexity, Nonstationarity, and Dynamic Regret4.330.945, 3, 5
3315Treatment Effect Estimation with Collider Bias and Confounding Bias4.330.945, 3, 5
3316Upcycled-FL: Improving Accuracy and Privacy with Less Computation in Federated Learning4.330.943, 5, 5
3317Unsupervised Manifold Linearizing and Clustering4.330.945, 5, 3
3318Towards Class-Balanced Transductive Few-Shot Learning4.330.943, 5, 5
3319Eigenvalue Initialisation and Regularisation for Koopman Autoencoders4.330.945, 5, 3
3320A Quasistatic Derivation of Optimization Algorithms' Exploration on Minima Manifolds4.330.943, 5, 5
3321A Deep Learning Framework for Musical Acoustics Simulations4.330.943, 5, 5
3322Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale4.330.945, 3, 5
3323Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections4.332.361, 6, 6
3324uGLAD: A deep learning model to recover conditional independence graphs4.330.945, 3, 5
3325Graph in Graph Neural Network4.330.943, 5, 5
3326Generative Adversarial Training for Neural Combinatorial Optimization Models4.330.945, 5, 3
3327Spatially Resolved Temporal Networks: Online Unsupervised Representation Learning of High Frequency Time Series4.330.945, 5, 3
3328How does overparametrization affect performance on minority groups?4.330.945, 3, 5
3329MSQ-BioBERT: Ambiguity Resolution to Enhance BioBERT Medical Question-Answering4.330.945, 3, 5
3330G-CEALS: Gaussian Cluster Embedding in Autoencoder Latent Space for Tabular Data Representation4.330.945, 3, 5
3331Performance Disparities Between Accents in Automatic Speech Recognition4.330.943, 5, 5
3332Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge4.330.945, 5, 3
3333Adversarial Attack Detection Under Realistic Constraints4.330.945, 3, 5
3334Towards Estimating Transferability using Hard Subsets4.330.945, 5, 3
3335Trust Your $nabla$: Gradient-based Intervention Targeting for Causal Discovery4.330.945, 5, 3
3336Efficient Point Cloud Geometry Compression Through Neighborhood Point Transformer4.330.945, 5, 3
3337Uncovering the Effectiveness of Calibration on Open Intent Classification4.330.943, 5, 5
3338Lossy Compression with Gaussian Diffusion4.330.945, 5, 3
3339Deep Generative Wasserstein Gradient Flows4.330.945, 3, 5
3340DISCO-DANCE: Learning to Discover Skills with Guidance4.330.943, 5, 5
3341Lightweight Uncertainty for Offline Reinforcement Learning via Bayesian Posterior4.330.945, 5, 3
3342GPR-Net: Multi-view Layout Estimation via a Geometry-aware Panorama Registration Network4.330.945, 3, 5
3343Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios4.330.945, 3, 5
3344Deep Causal Generative Modeling for Tabular Data Imputation and Intervention4.330.945, 5, 3
3345Semantic Category Discovery with Vision-language Representations4.330.945, 3, 5
3346Non-Parametric State-Space Models: Identifiability, Estimation and Forecasting4.330.945, 3, 5
3347FedDM: Iterative Distribution Matching for Communication-Efficient Federated Learning4.330.945, 3, 5
3348Grounding High Dimensional Representation Similarity by Comparing Decodability and Network Performance4.330.943, 5, 5
3349Likelihood adjusted semidefinite programs for clustering heterogeneous data4.330.943, 5, 5
3350Hybrid and Collaborative Passage Reranking4.330.945, 3, 5
3351Few-Shot Learning with Representative Global Prototype4.330.945, 3, 5
3352Causal Knowledge Transfer from Task Affinity4.330.945, 5, 3
3353Hybrid Federated Learning for Feature & Sample Heterogeneity: Algorithms and Implementation4.330.943, 5, 5
3354RelationCLIP: Training-free Fine-grained Visual and Language Concept Matching4.330.943, 5, 5
3355Thinking Two Moves Ahead: Anticipating Other Users Improves Backdoor Attacks in Federated Learning4.330.943, 5, 5
3356Progressive Transformation Learning For Leveraging Virtual Images in Training4.330.945, 3, 5
3357Neighborhood Gradient Clustering: An Efficient Decentralized Learning Method for Non-IID Data Distributions4.330.943, 5, 5
3358Predicting Drug Repurposing Candidates and Their Mechanisms from A Biomedical Knowledge Graph4.330.945, 5, 3
3359Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees4.330.945, 5, 3
3360Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL4.330.945, 5, 3
3361NeuralPCG: Learning Preconditioner for Solving Partial Differential Equations with Graph Neural Network4.330.943, 5, 5
3362Parameter-varying neural ordinary differential equations with partition-of-unity networks4.330.945, 5, 3
3363OoD-Control: Out-of-Distribution Generalization for Adaptive UAV Flight Control4.330.943, 5, 5
3364VLG: General Video Recognition with Web Textual Knowledge4.330.945, 3, 5
3365Take 5: Interpretable Image Classification with a Handful of Features4.330.945, 3, 5
3366M$^3$Video: Masked Motion Modeling for Self-Supervised Video Representation Learning4.330.945, 3, 5
3367A New Paradigm for Federated Structure Non-IID Subgraph Learning4.330.945, 3, 5
3368Fine-Grained Image Retrieval with Neighbor-Attention Label Correction4.330.945, 3, 5
3369Provable Unsupervised Data Sharing for Offline Reinforcement Learning4.330.945, 5, 3
3370Boosting Discriminative Visual Representation Learning with Scenario-Agnostic Mixup4.330.943, 5, 5
3371AutoDisc: Automatic Distillation Schedule for Large Language Model Compression4.330.943, 5, 5
3372E$^2$: Entropy Discrimination and Energy Optimization for Source-free Universal Domain Adaptation4.330.945, 3, 5
3373Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Bone Shape Reconstruction4.330.945, 5, 3
3374AdaWAC: Adaptively Weighted Augmentation Consistency Regularization for Volumetric Medical Image Segmentation4.330.945, 3, 5
3375Implicit Offline Reinforcement Learning via Supervised Learning4.330.945, 5, 3
3376Learnable Visual Words for Interpreting Image Recognition Models4.330.945, 3, 5
3377PIPS: Path Integral Stochastic Optimal Control for Path Sampling in Molecular Dynamics4.330.943, 5, 5
3378Visual Transformation Telling4.330.945, 3, 5
3379Rethinking the Training Shot Number in Robust Model-Agnostic Meta-Learning4.330.945, 3, 5
3380OpenFE: Automated Feature Generation beyond Expert-level Performance4.330.943, 5, 5
3381Learning to Count Everything: Transformer-based Trackers are Strong Baselines for Class Agnostic Counting4.330.945, 3, 5
3382Optimal Neural Network Approximation of Wasserstein Gradient Direction via Convex Optimization4.330.943, 5, 5
3383DELVING INTO THE HIERARCHICAL STRUCTURE FOR EFFICIENT LARGE-SCALE BI-LEVEL LEARNING4.330.943, 5, 5
3384Towards predicting dynamic stability of power grids with Graph Neural Networks4.330.945, 5, 3
3385ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical Imaging4.330.943, 5, 5
3386Structural Generalization of Visual Imitation Learning with Position-Invariant Regularization4.330.945, 5, 3
3387Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation4.330.943, 5, 5
3388Triangle Inequality for Inverse Optimal Control4.330.943, 5, 5
3389CAMVR: Context-Adaptive Multi-View Representation Learning for Dense Retrieval4.330.943, 5, 5
3390BIL: Bandit Inference Learning for Online Representational Similarity Test4.330.943, 5, 5
3391Spatially constrained Adversarial Attack Detection and Localization in the Representation Space of Optical Flow Networks4.330.943, 5, 5
3392Learn the Time to Learn: Replay Scheduling in Continual Learning4.330.945, 3, 5
3393Coordinate and Generalize: A Unified Framework for Audio-Visual Zero-Shot Learning4.330.943, 5, 5
3394Iterative Relaxing Gradient Projection for Continual Learning4.330.945, 5, 3
3395Private GANs, Revisited4.330.945, 3, 5
3396FEW-SHOT NODE PROMPT TUNING4.330.943, 5, 5
3397Unfixed Bias Iterator: A New Iterative Format4.330.945, 3, 5
3398MS3: A Multimodal Supervised Pretrained Model for Semantic Segmentation4.330.945, 5, 3
3399Unified Vision and Language Prompt Learning4.330.945, 5, 3
3400Module-wise Training of Residual Networks via the Minimizing Movement Scheme4.330.945, 5, 3
3401On the Dynamics under the Averaged Sample Margin Loss and Beyond4.332.361, 6, 6
3402Learning a 3D-Aware Encoder for Style-based Generative Radiance Field4.330.945, 3, 5
3403TT-NF: Tensor Train Neural Fields4.330.945, 3, 5
3404Reward Learning with Trees: Methods and Evaluation4.330.943, 5, 5
3405HyperFeel: An Efficient Federated Learning Framework Using Hyperdimensional Computing4.330.943, 5, 5
3406Learning to aggregate: A parameterized aggregator to debias aggregation for cross-device federated learning4.251.306, 3, 5, 3
3407Long-horizon video prediction using a dynamic latent hierarchy4.251.303, 3, 5, 6
3408Gene finding revisited: improved robustness through structured decoding from learning embeddings4.252.598, 3, 5, 1
3409Towards a Complete Theory of Neural Networks with Few Neurons4.251.303, 6, 3, 5
3410Gradient-Based Transfer Learning4.251.303, 3, 5, 6
3411FLOP: Tasks for Fitness Landscapes Of Protein families using sequence- and structure-based representations4.251.303, 5, 6, 3
3412Diversity Boosted Learning for Domain Generalization with a Large Number of Domains4.251.305, 6, 3, 3
3413The guide and the explorer: smart agents for resource-limited iterated batch reinforcement learning4.251.306, 5, 3, 3
3414Smooth image-to-image translations with latent space interpolations4.251.305, 3, 6, 3
3415Protein Sequence Design in a Latent Space via Model-based Reinforcement Learning4.252.173, 3, 3, 8
3416On the convergence of SGD under the over-parameter setting4.251.921, 6, 5, 5
3417Exphormer: Scaling Graph Transformers with Expander Graphs4.251.305, 3, 3, 6
3418Challenging Common Assumptions about Catastrophic Forgetting4.251.303, 6, 5, 3
3419How to fine-tune vision models with SGD4.251.303, 5, 3, 6
3420Machine Learning Force Fields with Data Cost Aware Training4.251.303, 6, 3, 5
3421A Probabilistic Framework For Modular Continual Learning4.251.303, 3, 5, 6
3422Automatic Data Augmentation via Invariance-Constrained Learning4.251.303, 5, 6, 3
3423NEURAL HAMILTONIAN FLOWS IN GRAPH NEURAL NETWORKS4.251.303, 3, 5, 6
3424Finding Private Bugs: Debugging Implementations of Differentially Private Stochastic Gradient Descent4.251.303, 5, 6, 3
3425Robust Generative Flows on Reliable Image Reconstruction without Training Data4.251.305, 3, 6, 3
3426Boomerang: Local sampling on image manifolds using diffusion models4.252.173, 3, 8, 3
3427Latent Topology Induction for Understanding Contextualized Representations4.251.925, 1, 6, 5
3428Adaptive Anchor for Robust Keypoint Localization4.251.926, 1, 5, 5
3429Getting away with more network pruning: From sparsity to geometry and linear regions4.252.591, 8, 3, 5
3430Faster Hyperparameter Search for GNNs via Calibrated Dataset Condensation4.251.303, 5, 6, 3
3431High-dimensional Continuum Armed and High-dimensional Contextual Bandit: with Applications to Assortment and Pricing4.251.305, 3, 3, 6
3432Do Summarization Models Synthesize?4.251.303, 5, 3, 6
3433$beta$-Stochastic Sign SGD: A Byzantine Resilient and Differentially Private Gradient Compressor for Federated Learning4.251.303, 5, 6, 3
3434Graph Fourier MMD for signals on data graphs4.251.306, 3, 5, 3
3435Proportional Multicalibration4.251.305, 3, 3, 6
3436Effectively Modeling Time Series with Simple Discrete State Spaces4.252.173, 3, 3, 8
3437Tabular Deep Learning when $d gg n$ by Using an Auxiliary Knowledge Graph4.252.591, 3, 5, 8
3438Preserving In-Context Learning Ability in Large Language Model Fine-tuning4.251.306, 3, 5, 3
3439Meta-Learning with Explicit Task Information4.252.598, 5, 1, 3
3440Differentiable Channel Selection for Self-Attention4.251.306, 3, 3, 5
3441Membership Inference Attacks Against Text-to-image Generation Models4.251.306, 5, 3, 3
3442Fair Graph Message Passing with Transparency4.251.306, 5, 3, 3
3443DeepReShape: Redesigning Neural Networks for Private Inference4.251.303, 3, 5, 6
3444Learning to reason with relational abstractions4.251.303, 5, 3, 6
3445General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States4.251.303, 6, 5, 3
3446Does the Half Adversarial Robustness Represent the Whole? It Depends... A Theoretical Perspective of Subnetwork Robustness4.251.303, 6, 3, 5
3447Few-Shot Incremental Learning Using HyperTransformers4.251.305, 3, 3, 6
3448Graph schemas as abstractions for transfer learning, inference, and planning4.251.305, 6, 3, 3
3449Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits4.251.305, 3, 6, 3
3450Efficient One-Shot Neural Architecture Search With Progressive Choice Freezing Evolutionary Search4.252.173, 8, 3, 3
3451GraphEditor: An Efficient Graph Representation Learning and Unlearning Approach4.251.303, 3, 6, 5
3452Towards a More Rigorous Science of Blindspot Discovery in Image Models4.251.303, 3, 6, 5
3453Self-supervised video pretraining yields strong image representations4.251.303, 3, 5, 6
3454Loop Unrolled Shallow Equilibrium Regularizer (LUSER) - A Memory-Efficient Inverse Problem Solver4.251.306, 3, 3, 5
3455FedLite: Improving Communication Efficiency in Federated Split Learning4.251.303, 6, 5, 3
3456Reinforcement Learning for Bandits with Continuous Actions and Large Context Spaces4.251.305, 3, 3, 6
3457How to Enable Uncertainty Estimation in Proximal Policy Optimization4.251.303, 5, 6, 3
3458Training Equilibria in Reinforcement Learning4.251.305, 6, 3, 3
3459Planning with Large Language Models for Code Generation4.252.173, 3, 8, 3
3460Conformal Prediction is Robust to Label Noise4.251.303, 6, 5, 3
3461MyoDex: Generalizable Representations for Dexterous Physiological Manipulation4.251.306, 5, 3, 3
3462On the Expressive Power of Geometric Graph Neural Networks4.252.173, 8, 3, 3
3463CLMIU: Commonsense Learning in Multimodal Image Understanding.4.251.305, 3, 3, 6
3464TOWARDS AN OBJECTIVE EVALUATION OF THE TRUSTWORTHINESS OF CLASSIFIERS4.252.591, 3, 8, 5
3465Direct-Effect Risk Minimization4.251.303, 6, 5, 3
3466Predicting Out-of-Domain Generalization with Local Manifold Smoothness4.252.173, 8, 3, 3
3467Burstormer: Burst Image Restoration and Enhancement Transformer4.252.173, 3, 8, 3
3468$sigma$Reparam: Stable Transformer Training with Spectral Reparametrization4.252.173, 3, 8, 3
3469Federated Learning on Adaptively Weighted Nodes by Bilevel Optimization4.251.306, 5, 3, 3
3470MultiQuan RDP: Rate-Distortion-Perception Coding via Offset Quantizers4.251.303, 5, 6, 3
3471Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training4.251.306, 3, 3, 5
3472CLAS: Central Latent Action Spaces for Coordinated Multi-Robot Manipulation4.251.303, 6, 3, 5
3473Sample-efficient multi-objective molecular optimization with GFlowNets4.252.593, 8, 5, 1
3474A Simple Nadaraya-Watson Head for Explainable and Calibrated Classification4.251.303, 5, 3, 6
3475Conditional Execution Of Cascaded Models Improves The Accuracy-Efficiency Trade-Off4.252.173, 3, 8, 3
3476DynaMS: Dyanmic Margin Selection for Efficient Deep Learning4.251.303, 3, 6, 5
3477Dimensionless instance segmentation by learning graph representations of point clouds4.252.173, 8, 3, 3
3478Semantic Prior for Weakly Supervised Class-Incremental Segmentation4.251.305, 3, 3, 6
3479Biological Factor Regulatory Neural Network4.251.303, 6, 3, 5
3480Differentiable Logic Programming for Probabilistic Reasoning4.251.306, 3, 5, 3
3481Graph Neural Networks as Gradient Flows: understanding graph convolutions via energy4.251.306, 3, 3, 5
3482Memory Learning of Multivariate Asynchronous Time Series4.251.305, 6, 3, 3
3483Improving Generative Flow Networks with Path Regularization4.251.305, 3, 6, 3
3484Calibration for Decision Making via Empirical Risk Minimization4.251.305, 3, 3, 6
3485Contextual Transformer for Offline Reinforcement Learning4.251.305, 3, 3, 6
3486Improving Continual Learning by Accurate Gradient Reconstructions of the Past4.251.306, 3, 5, 3
3487FairGrad: Fairness Aware Gradient Descent4.251.303, 6, 3, 5
3488A Mathematical Framework for Characterizing Dependency Structures of Multimodal Learning4.251.926, 1, 5, 5
3489Unbiased Representation of Electronic Health Records for Patient Outcome Prediction4.251.303, 5, 6, 3
3490Class-wise Visual Explanations for Deep Neural Networks4.251.305, 6, 3, 3
3491Identification of the Adversary from a Single Adversarial Example4.251.303, 3, 5, 6
3492A HIERARCHICAL FRAGMENT-BASED MODEL FOR 3D DRUG-LIKE MOLECULE GENERATION4.251.305, 6, 3, 3
3493Poisoning Generative Models to Promote Catastrophic Forgetting4.251.306, 5, 3, 3
3494Equivariant Disentangled Transformation for Domain Generalization under Combination Shift4.251.303, 5, 3, 6
3495Deep Contrastive Learning Approximates Ensembles of One-Class SVMs with Neural Tangent Kernels4.251.305, 6, 3, 3
3496Limitations of Piecewise Linearity for Efficient Robustness Certification4.251.306, 3, 5, 3
3497Leveraged Asymmetric Loss with Disambiguation for Multi-label Recognition with One-Positive Annotations4.251.303, 3, 5, 6
3498A Semantic Hierarchical Graph Neural Network for Text Classification4.252.178, 3, 3, 3
3499DROP: Conservative Model-based Optimization for Offline Reinforcement Learning4.251.303, 5, 3, 6
3500Semi-Supervised Segmentation-Guided Tumor-Aware Generative Adversarial Network for Multi-Modality Brain Tumor Translation4.251.305, 3, 6, 3
3501HSVC: Transformer-based Hierarchical Distillation for Software Vulnerability Classification4.251.305, 3, 6, 3
3502Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning4.251.306, 3, 5, 3
3503What Deep Representations Should We Learn? -- A Neural Collapse Perspective4.251.303, 6, 3, 5
3504Towards Adversarially Robust Deepfake Detection: An Ensemble Approach4.252.173, 3, 3, 8
3505AlphaDesign: A graph protein design method and benchmark on AlphaFold DB4.251.925, 1, 6, 5
3506A Scalable and Exact Gaussian Process Sampler via Kernel Packets4.251.305, 3, 6, 3
3507Model ChangeLists: Characterizing Changes in ML Prediction APIs4.251.303, 5, 6, 3
3508Towards Large Scale Transfer Learning for Differentially Private Image Classification4.251.305, 6, 3, 3
3509Mixed Federated Learning: Joint Decentralized and Centralized Learning4.251.303, 6, 5, 3
3510Toward Discovering Options that Achieve Faster Planning4.251.306, 3, 3, 5
3511Stable Optimization of Gaussian Likelihoods4.251.305, 3, 6, 3
3512Efficient Sequence Packing without Cross-contamination: Accelerating Large Language Models without Impacting Performance4.251.306, 5, 3, 3
3513Evaluating Counterfactual Explainers4.251.303, 5, 3, 6
3514A Reinforcement Learning Approach to Estimating Long-term Treatment Effects4.251.306, 3, 3, 5
3515Conceptual SCAN: Learning With and About Rules4.251.305, 6, 3, 3
3516Unsupervised learning of features and object boundaries from local prediction4.251.303, 3, 5, 6
3517On the Activation Function Dependence of the Spectral Bias of Neural Networks4.251.305, 3, 6, 3
3518MERMADE: $K$-shot Robust Adaptive Mechanism Design via Model-Based Meta-Learning4.251.303, 5, 3, 6
3519Unpacking Large Language Models with Conceptual Consistency4.252.178, 3, 3, 3
3520StarGraph: Knowledge Representation Learning based on Incomplete Two-hop Subgraph4.252.173, 3, 8, 3
3521Memory-efficient Trajectory Matching for Scalable Dataset Distillation4.251.303, 6, 3, 5
3522Attentional Context Alignment for Multimodal Sequential Learning4.251.305, 3, 3, 6
3523REAP: A Large-Scale Realistic Adversarial Patch Benchmark4.251.306, 5, 3, 3
3524Federated Training of Dual Encoding Models on Small Non-IID Client Datasets4.251.305, 6, 3, 3
3525REDUCING OVERSMOOTHING IN GRAPH NEURAL NETWORKS BY CHANGING THE ACTIVATION FUNCTION4.251.303, 3, 5, 6
3526Multitask Reinforcement Learning by Optimizing Neural Pathways4.251.303, 5, 6, 3
3527Input Perturbation Reduces Exposure Bias in Diffusion Models4.251.303, 3, 6, 5
3528RangeAugment: Efficient Online Augmentation with Range Learning4.252.173, 3, 3, 8
3529Privacy-Preserving Vision Transformer on Permutation-Encrypted Images4.251.925, 1, 5, 6
3530FastDiff 2: Dually Incorporating GANs into Diffusion Models for High-Quality Speech Synthesis4.251.305, 6, 3, 3
3531On the Convergence and Calibration of Deep Learning with Differential Privacy4.251.305, 6, 3, 3
3532Critical Batch Size Minimizes Stochastic First-Order Oracle Complexity of Deep Learning Optimizer using Hyperparameters Close to One4.251.306, 5, 3, 3
3533Restricted Generative Projection for One-Class Classification and Anomaly detection4.251.305, 3, 3, 6
3534learning hierarchical multi-agent cooperation with long short-term intention4.251.306, 3, 3, 5
3535Pixel-Level Task Helps Pruned Network Transfer to Downstream Tasks4.251.305, 3, 3, 6
3536Efficient block contrastive learning via parameter-free meta-node approximation4.251.306, 3, 5, 3
3537Improving Model Consistency of Decentralized Federated Learning via Sharpness Aware Minimization and Multiple Gossip Approaches4.251.303, 3, 5, 6
3538Supplementing Domain Knowledge to BERT with Semi-structured Information of Documents4.251.305, 3, 3, 6
3539Window Projection Features are All You Need for Time Series Anomaly Detection4.251.303, 3, 6, 5
3540Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes4.251.303, 6, 3, 5
3541MetaFS: An Effective Wrapper Feature Selection via Meta Learning4.251.303, 3, 6, 5
3542A Time-Consistency Curriculum for Learning from Instance-Dependent Noisy Labels4.251.303, 6, 5, 3
3543Learning Object Affordance with Contact and Grasp Generation4.251.303, 5, 6, 3
3544Benchmarking Approximate k-Nearest Neighbour Search for Big High Dimensional Dynamic Data4.251.303, 6, 5, 3
3545Bias Mimicking: A Simple Sampling Approach for Bias Mitigation4.251.303, 5, 3, 6
3546From Coarse to Fine-grained Concept based Discrimination for Phrase Detection4.251.303, 6, 3, 5
3547k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy4.251.305, 3, 6, 3
3548Randomized Smoothing with Masked Inference for Adversarially Robust NLP Systems4.251.306, 3, 5, 3
3549A Data-Based Perspective on Transfer Learning4.251.303, 5, 6, 3
3550GeONet: a neural operator for learning the Wasserstein geodesic4.251.303, 3, 6, 5
3551The Convergence Rate of SGD's Final Iterate: Analysis on Dimension Dependence4.251.303, 6, 5, 3
3552FAME: Fast Adaptive Moment Estimation based on Triple Exponential Moving Average4.252.173, 8, 3, 3
3553No Double Descent in PCA: Training and Pre-Training in High Dimensions4.251.303, 5, 3, 6
3554To be robust and to be fair: aligning fairness with robustness4.252.178, 3, 3, 3
3555Fair Clustering via Equalized Confidence4.251.306, 3, 3, 5
3556Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning4.251.305, 3, 3, 6
3557Improving Information Retention in Large Scale Online Continual Learning4.251.303, 6, 3, 5
3558ON INJECTING NOISE DURING INFERENCE4.251.303, 6, 3, 5
3559Uncertainty-based Multi-Task Data Sharing for Offline Reinforcement Learning4.251.303, 3, 6, 5
3560Differentiable Meta-Logical Programming4.251.303, 5, 3, 6
3561Regularizing hard examples improves robustness4.251.303, 3, 5, 6
3562From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models4.251.305, 6, 3, 3
3563High probability error bounds of SGD in unbounded domain4.251.306, 3, 3, 5
3564MAXENT LOSS: CONSTRAINED MAXIMUM ENTROPY FOR CALIBRATING DEEP NEURAL NETWORKS4.251.303, 5, 3, 6
3565Efficient and Stealthy Backdoor Attack Triggers are Close at Hand4.251.303, 3, 5, 6
3566Teaching Others is Teaching Yourself Regularization For Controllable Language Models4.251.303, 3, 5, 6
3567On Intriguing Layer-Wise Properties of Robust Overfitting in Adversarial Training4.251.303, 5, 3, 6
3568Uncertainty-Aware Meta-Learning for Multimodal Task Distributions4.251.305, 3, 6, 3
3569Federated Learning for Inference at Anytime and Anywhere4.251.303, 5, 6, 3
3570Low-Rank Graph Neural Networks Inspired by the Weak-balance Theory in Social Networks4.251.303, 5, 3, 6
3571Node-Level Membership Inference Attacks Against Graph Neural Networks4.251.303, 6, 5, 3
3572Holding Monotonic Improvement and Generality for Multi-Agent Proximal Policy Optimization4.252.173, 3, 8, 3
3573Towards the gradient adjustment by loss status for Neural Network Optimization4.251.305, 6, 3, 3
3574Linear Video Transformer with Feature Fixation4.251.303, 3, 6, 5
3575Local Coefficient Optimization in Federated Learning4.251.303, 3, 6, 5
3576DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning4.251.303, 3, 6, 5
3577Why pseudo-label based algorithm is effective? --from the perspective of pseudo-labeled data4.251.303, 5, 3, 6
3578RbX: Region-based explanations of prediction models4.251.303, 5, 3, 6
3579Motif-induced Graph Normalization4.251.305, 6, 3, 3
3580Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks4.251.303, 3, 6, 5
3581Evaluation of Attribution Explanations without Ground Truth4.251.303, 5, 6, 3
3582Going Deeper with Spiking Neurons: Towards Binary Outputs of Deep Logic Spiking Neural Network4.252.591, 8, 5, 3
3583Correcting Three Existing Beliefs on Mutual Information in Contrastive Learning4.251.305, 6, 3, 3
3584Batch Normalization Is Blind to the First and Second Derivatives of the Loss w.r.t. Features4.251.925, 1, 6, 5
3585Node Number Awareness Representation for Graph Similarity Learning4.251.303, 5, 6, 3
3586Improving the Transferability of Adversarial Attacks through Experienced Precise Nesterov Momentum4.251.303, 5, 6, 3
3587Sparse Random Networks for Communication-Efficient Federated Learning4.251.305, 3, 6, 3
3588WaveMix-Lite: A Resource-efficient Neural Network for Image Analysis4.251.303, 5, 6, 3
3589Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask4.251.305, 6, 3, 3
3590Imposing conservation properties in deep dynamics modeling via contrastive learning4.251.303, 5, 3, 6
3591Accumulative Poisoning Defense with Memorization Discrepancy4.251.305, 6, 3, 3
3592S^2-Transformer for Mask-Aware Hyperspectral Image Reconstruction4.251.306, 3, 3, 5
3593A deep top-down approach to hierarchically coherent probabilistic forecasting4.251.303, 3, 6, 5
3594Smart Multi-tenant Federated Learning4.252.173, 8, 3, 3
3595Accelerating Inverse Reinforcement Learning with Expert Bootstrapping4.251.303, 3, 6, 5
3596Intepreting & Improving Pretrained Language Models: A Probabilistic Conceptual Approach4.252.178, 3, 3, 3
3597Efficient Trojan Injection: 90% Attack Success Rate Using 0.04% Poisoned Samples4.251.305, 3, 3, 6
3598Multi-Dataset Multi-Task Framework for Learning Molecules and Protein-target Interactions Properties4.251.306, 3, 3, 5
3599Deep Ensembles for Graphs with Higher-order Dependencies4.251.306, 3, 5, 3
3600MEGAN: Multi Explanation Graph Attention Network4.252.173, 8, 3, 3
3601Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes4.251.921, 5, 6, 5
3602FedREP: A Byzantine-Robust, Communication-Efficient and Privacy-Preserving Framework for Federated Learning4.251.303, 5, 6, 3
3603Targeted Adversarial Self-Supervised Learning4.251.303, 6, 3, 5
3604Triplet Similarity Learning on Concordance Constraint4.251.303, 3, 5, 6
3605Robust Transfer Learning Based on Minimax Principle4.251.303, 5, 6, 3
3606Interpreting Neural Networks Through the Lens of Heat Flow4.251.303, 5, 3, 6
3607DCE: Offline Reinforcement Learning With Double Conservative Estimates4.251.303, 5, 3, 6
3608Efficient Surrogate Gradients for Training Spiking Neural Networks4.251.303, 5, 3, 6
3609Extreme Masking for Learning Instance and Distributed Visual Representations4.252.593, 8, 5, 1
3610Leveraging Hierarchical Structure for Multi-Domain Active Learning with Theoretical Guarantees4.251.306, 3, 5, 3
3611Configuring Mixed-Integer Linear Programming Solvers with Deep Metric Learning4.252.178, 3, 3, 3
3612Graph Neural Bandits4.251.303, 6, 5, 3
3613Deep Power Laws for Hyperparameter Optimization4.251.303, 6, 3, 5
3614Prompt-Matched Semantic Segmentation4.251.303, 3, 5, 6
3615GeoVeX: Geospatial Vectors with Hexagonal Convolutional Autoencoders4.251.303, 6, 5, 3
3616Feature Synchronization in Backdoor Attacks4.251.306, 3, 3, 5
3617MMTSA: Multi-Modal Temporal Segment Attention Network for Efficient Human Activity Recognition4.251.305, 3, 6, 3
3618Look Back When Surprised: Stabilizing Reverse Experience Replay for Neural Approximation4.252.593, 5, 8, 1
3619Multiscale Neural Operator: Learning Fast and Grid-independent PDE Solvers4.251.303, 6, 5, 3
3620A Massively Parallel Benchmark for Safe Dexterous Manipulation4.251.303, 5, 6, 3
3621Rethinking the Explanation of Graph Neural Network via Non-parametric Subgraph Matching4.252.173, 8, 3, 3
3622Q-Match: Self-Supervised Learning For Tabular Data by Matching Distributions Induced by a Queue4.251.303, 3, 6, 5
3623Voting from Nearest Tasks: Meta-Vote Pruning of Pretrained Models for Downstream Tasks4.251.303, 5, 3, 6
3624Momentum in Momentum for Adaptive Optimization4.252.178, 3, 3, 3
3625NICO++: Towards Better Benchmarking for Domain Generalization4.251.305, 3, 6, 3
3626Gradient Norm Regularizer Seeks Flat Minima and Improves Generalization4.251.303, 3, 5, 6
3627Calibrating Multimodal Learning4.251.305, 3, 6, 3
3628Token Turing Machines4.251.303, 5, 6, 3
3629Cutting Long Gradient Flows: Decoupling End-to-End Backpropagation Based on Supervised Contrastive Learning4.251.303, 5, 3, 6
3630ThinkSum: Probabilistic reasoning over sets using large language models4.252.178, 3, 3, 3
3631Model-agnostic Measure of Generalization Difficulty4.252.173, 3, 3, 8
3632Hedge Your Actions: Flexible Reinforcement Learning for Complex Action Spaces4.252.591, 3, 5, 8
3633Online Learning for Obstacle Avoidance4.201.943, 6, 6, 5, 1
3634FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels4.200.983, 5, 5, 5, 3
3635Game-Theoretic Understanding of Misclassification4.201.943, 5, 6, 6, 1
3636Improving Vision Attention with Random Walk Graph Kernel4.200.985, 5, 3, 3, 5
3637Lifting the Curse of Capacity Gap in Distilling Large Language Models4.200.983, 5, 5, 3, 5
3638Semi-supervised learning of partial differential operators and dynamical flows4.200.983, 5, 5, 3, 5
3639Language Models Can See: Plugging Visual Controls in Text Generation4.201.473, 3, 3, 6, 6
3640Logic-aware Pre-training of Language Models4.201.601, 5, 5, 5, 5
3641Towards Discovering Neural Architectures from Scratch4.201.476, 3, 6, 3, 3
3642Neural Autoregressive Refinement for Self-Supervised Outlier Detection beyond Images4.171.675, 5, 5, 1, 6, 3
3643Data Leakage in Tabular Federated Learning4.001.416, 3, 3
3644Towards Robust Online Dialogue Response Generation4.001.003, 5, 5, 3
3645MolBART: Generative Masked Language Models for Molecular Representations4.001.003, 5, 3, 5
3646Formal Specifications from Natural Language4.001.005, 3, 3, 5
3647Pseudo-Differential Integral Operator for Learning Solution Operators of Partial Differential Equations4.001.003, 3, 5, 5
3648A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration4.001.005, 3, 5, 3
3649Moment Distributionally Robust Probabilistic Supervised Learning4.001.003, 5, 5, 3
3650Accelerating spiking neural network training using the $d$-block model4.001.263, 3, 6, 5, 3
3651RG: OUT-OF-DISTRIBUTION DETECTION WITH REACTIVATE GRADNORM4.001.003, 5, 5, 3
3652Breaking Large Language Model-based Code Generation4.001.413, 6, 3
3653Proximal Validation Protocol4.001.003, 5, 3, 5
3654AUTOMATIC CURRICULUM FOR UNSUPERVISED REIN- FORCEMENT LEARNING4.002.161, 5, 6
3655On Representation Learning in the First Layer of Deep CNNs and the Dynamics of Gradient Descent4.001.005, 3, 5, 3
3656Learning Layered Implicit Model for 3D Avatar Clothing Representation4.001.003, 5, 5, 3
3657Generalizable Multi-Relational Graph Representation Learning: A Message Intervention Approach4.003.101, 10, 3, 3, 3
3658Explicitly Maintaining Diverse Playing Styles in Self-Play4.001.413, 6, 3
3659Label Similarity Aware Contrastive Learning4.001.005, 5, 3, 3
3660Incompatibility between Deterministic Policy and Generative Adversarial Imitation Learning4.001.263, 3, 6, 3, 5
3661CAT: Collaborative Adversarial Training4.001.005, 3, 3, 5
3662Therbligs in Action: Video Understanding through Motion Primitives4.001.005, 3, 3, 5
3663DEFENDING BACKDOOR ATTACKS VIA ROBUSTNESS AGAINST NOISY LABEL4.001.005, 3, 5, 3
3664Efficient, probabilistic analysis of combinatorial neural codes4.001.413, 6, 3
3665Simple and Deep Graph Attention Networks4.001.005, 3, 5, 3
3666GNN Domain Adaptation using Optimal Transport4.001.003, 5, 5, 3
3667An Integrated Multi-Label Multi-Modal Framework in Deep Metric Learning4.001.416, 3, 3
3668Autoregressive Graph Network for Learning Multi-step Physics4.001.003, 3, 5, 5
3669Layer-wise Balanced Activation Mechanism4.001.003, 5, 3, 5
3670Neural Integral Equations4.001.416, 3, 3
3671Consistent Data Distribution Sampling for Large-scale Retrieval4.001.003, 5, 3, 5
3672Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness4.001.266, 3, 3, 3, 5
3673Dynamics Model Based Adversarial Training For Competitive Reinforcement Learning4.001.005, 3, 3, 5
3674A Stable and Scalable Method for Solving Initial Value PDEs with Neural Networks4.001.413, 3, 6
3675CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets4.001.005, 3, 3, 5
3676Forgetful causal masking makes causal language models better zero-shot learners4.002.121, 6, 6, 3
3677Marich: A Query-efficient & Online Model Extraction Attack using Public Data4.001.413, 3, 6
3678Connecting representation and generation via masked vision-language transformer4.001.005, 3, 5, 3
3679Current Anomaly Detectors are Anomalous: On Semantic Treatment of OOD Inputs4.001.005, 3, 3, 5
3680Event-former: A Self-supervised Learning Paradigm for Temporal Point Processes4.002.123, 1, 6, 6
3681Controllable Concept Transfer of Intermediate Representations4.001.003, 3, 5, 5
3682SaiT: Sparse Vision Transformers through Adaptive Token Pruning4.001.005, 3, 3, 5
3683Differentiable Rendering with Reparameterized Volume Sampling4.001.003, 3, 5, 5
3684Just Avoid Robust Inaccuracy: Boosting Robustness Without Sacrificing Accuracy4.001.413, 6, 3
3685Invariant Aggregator for Defending against Federated Backdoor Attacks4.001.005, 3, 5, 3
3686UNDERSTANDING THE ROLE OF POSITIONAL ENCODINGS IN SENTENCE REPRESENTATIONS4.001.003, 5, 3, 5
3687Attribution Scores are Redundant: Explaining Feature Contribution By Trajectories4.001.263, 3, 6, 5, 3
3688Neural Networks as Paths through the Space of Representations4.001.003, 3, 5, 5
3689From Points to Functions: Infinite-dimensional Representations in Diffusion Models4.001.005, 5, 3, 3
3690ESC: A Benchmark For Multi-Domain End-to-End Speech Recognition4.001.005, 3, 5, 3
3691Towards Dynamic Sparsification by Iterative Prune-Grow LookAheads4.001.416, 3, 3
3692Skill Decision Transformer4.001.003, 3, 5, 5
36933D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction4.001.413, 3, 6
3694Synthetic Pre-Training Tasks for Neural Machine Translation4.001.005, 3, 5, 3
3695Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function4.001.003, 5, 3, 5
3696A $2$-parameter Persistence Layer for Learning4.001.003, 5, 5, 3
3697UniS-MMC: Learning Unimodality-supervised Multimodal Contrastive Representations4.001.003, 3, 5, 5
3698NAG-GS: semi-implicit, accelerated and robust stochastic optimizer.4.001.003, 5, 3, 5
3699Adversarial Policies Beat Professional-Level Go AIs4.001.413, 6, 3
3700Pre-train Graph Neural Networks for Brain Network Analysis4.001.003, 5, 3, 5
3701Unscented Autoencoder4.002.121, 3, 6, 6
3702AQuaMaM: An Autoregressive, Quaternion Manifold Model for Rapidly Estimating Complex SO(3) Distributions4.001.413, 3, 6
3703Multi-Objective GFlowNets4.001.413, 6, 3
3704A Scalable Training Strategy for Blind Multi-Distribution Noise Removal4.001.005, 5, 3, 3
3705Triplet learning of task representations in latent space for continual learning4.001.003, 5, 3, 5
3706The Robustness Limits of SoTA Vision Models to Natural Variation4.001.005, 3, 3, 5
3707DLP: Data-Driven Label-Poisoning Backdoor Attack4.001.003, 5, 5, 3
3708ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech4.001.003, 3, 5, 5
3709Semantic Transformation-based Data Augmentation for Few-Shot Learning4.001.413, 6, 3
3710COC curve: operating neural networks at high accuracy and low manual effort4.001.416, 3, 3
3711Wide Attention is the Way Forward for Transformers4.001.005, 5, 3, 3
3712Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning4.001.005, 3, 3, 5
3713SAGE: Semantic-Aware Global Explanations for Named Entity Recognition4.001.265, 3, 6, 3, 3
3714On the Forward Invariance of Neural ODEs4.002.123, 1, 6, 6
3715Learning Debiased Representations via Conditional Attribute Interpolation4.001.413, 6, 3
3716Multi-stationary point losses for robust model4.002.941, 3, 8
3717Learning Stackelberg Equilibria and Applications to Economic Design Games4.002.123, 1, 6, 6
3718Personalized federated composite learning with forward-backward envelopes4.001.005, 3, 3, 5
3719Attention Based Models for Cell Type Classification on Single-Cell RNA-Seq Data4.001.003, 5, 3, 5
3720Robust and accelerated single-spike spiking neural network training with applicability to challenging temporal tasks4.001.005, 3, 3, 5
3721Annealed Fisher Implicit Sampler4.001.003, 5, 5, 3
3722Differentiable and transportable structure learning4.001.003, 3, 5, 5
3723SeKron: A Decomposition Method Supporting Many Factorization Structures4.002.161, 6, 5
3724Deep Class Conditional Gaussians for Continual Learning4.001.413, 6, 3
3725On Feature Diversity in Energy-based Models4.001.795, 5, 1, 6, 3
3726How does Uncertainty-aware Sample-selection Help Decision against Action Noise?4.001.413, 3, 6
3727QuAFL: Federated Averaging Made Asynchronous and Communication-Efficient4.001.003, 5, 3, 5
3728Targeted Attacks on Timeseries Forecasting4.001.003, 3, 5, 5
3729Flareon: Stealthy Backdoor Injection via Poisoned Augmentation4.001.413, 3, 6
3730Multi-Head State Space Model for Sequence Modeling4.002.123, 6, 1, 6
3731Rewiring with Positional Encodings for GNNs4.001.005, 3, 3, 5
3732Gated Inference Network: Inferencing and Learning State-Space Models4.001.416, 3, 3
3733Optimizing Spca-based Continual Learning: A Theoretical Approach4.002.126, 3, 1, 6
3734Learning Task Agnostic Temporal Consistency Correction4.001.003, 5, 5, 3
3735Transformers with Multiresolution Attention Heads4.001.413, 6, 3
3736Reinforcement Learning using a Molecular Fragment Based Approach for Reaction Discovery4.001.263, 3, 3, 6, 5
3737Invariance Makes a Difference: Disentangling the Role of Invariance and Equivariance in Representations4.001.413, 3, 6
3738Learning DAGs from Fourier-Sparse Data4.001.003, 5, 3, 5
3739Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments4.001.005, 5, 3, 3
3740Neural Image Compression with a Diffusion-based Decoder4.001.413, 3, 6
3741Caption supervision enables robust learners: a controlled study of distributionally robust model training4.001.796, 1, 5, 3, 5
3742Pessimistic Policy Iteration for Offline Reinforcement Learning4.001.263, 6, 3, 3, 5
3743Prototypical Context-aware Dynamics Generalization for High-dimensional Model-based Reinforcement Learning4.001.003, 3, 5, 5
3744Efficient Hyperparameter Optimization Through Tensor Completion4.001.005, 3, 5, 3
3745UTS: When Monotonic Value Factorisation Meets Non-monotonic and Stochastic Targets4.001.413, 3, 6
3746Learning Rotation-Equivariant Features for Visual Correspondence4.001.005, 3, 5, 3
3747PAVI: Plate-Amortized Variational Inference4.001.003, 3, 5, 5
3748Multimodal Masked Autoencoders Learn Transferable Representations4.001.003, 3, 5, 5
3749Test-Time AutoEval with Supporting Self-supervision4.001.005, 3, 3, 5
3750MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning4.001.005, 3, 5, 3
3751Partial Differential Equation-Regularized Neural Networks: An Application to Image Classification4.001.003, 5, 5, 3
3752On Nullspace of Vision Transformers and What Does it Tell Us?4.001.005, 3, 5, 3
3753Which is Better for Learning with Noisy Labels: The Semi-supervised Method or Modeling Label Noise?4.001.003, 5, 3, 5
3754FACS: FAST ADAPTIVE CHANNEL SQUEEZING4.001.003, 5, 5, 3
3755DYNAMIC ENSEMBLE FOR PROBABILISTIC TIME- SERIES FORECASTING VIA DEEP REINFORCEMENT LEARNING4.001.005, 3, 5, 3
3756Understanding Pruning at Initialization: An Effective Node-Path Balancing Perspective4.001.003, 5, 3, 5
3757Oracle-oriented Robustness: Robust Image Model Evaluation with Pretrained Models as Surrogate Oracle4.001.003, 3, 5, 5
3758Mitigating Demographic Bias of Federated Learning Models via Global Domain Smoothing4.002.165, 6, 1
3759Analysis of differentially private synthetic data: a general measurement error approach4.001.005, 3, 5, 3
3760Counterfactual Contrastive Learning for Robust Text Classification4.001.003, 5, 3, 5
3761Which Invariance Should We Transfer? A Causal Minimax Learning Approach4.001.003, 5, 3, 5
3762Graph Contrastive Learning with Reinforced Augmentation4.001.003, 5, 5, 3
3763Trusted Aggregation (TAG): Model Filtering Backdoor Defense In Federated Learning4.001.005, 5, 3, 3
3764BiViT: Exploring Binary Vision Transformers4.001.416, 3, 3
3765LVQ-VAE:End-to-end Hyperprior-based Variational Image Compression with Lattice Vector Quantization4.001.003, 3, 5, 5
3766Towards Solving Industrial Sequential Decision-making Tasks under Near-predictable Dynamics via Reinforcement Learning: an Implicit Corrective Value Estimation Approach4.001.003, 3, 5, 5
3767The Graph Learning Attention Mechanism: Learnable Sparsification Without Heuristics4.001.003, 5, 3, 5
3768On Convergence of Federated Averaging Langevin Dynamics4.001.413, 6, 3
3769BYPASSING THE STABILITY-PLASTICITY TRADEOFF TO REDUCE PREDICTIVE CHURN4.002.371, 8, 3, 5, 3
3770Learning Object-Centric Dynamic Modes from Video and Emerging Properties4.001.003, 5, 5, 3
3771Invertible normalizing flow neural networks by JKO scheme4.001.005, 3, 3, 5
3772Towards Causal Concepts for Explaining Language Models4.001.003, 3, 5, 5
3773Leveraging Human Features at Test-Time4.001.413, 3, 6
3774SaMoE: Parameter Efficient MoE Language Models via Self-Adaptive Expert Combination4.001.413, 3, 6
3775Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size4.001.005, 3, 5, 3
3776Learning from Others: Similarity-based Regularization for Mitigating Artifacts4.001.005, 5, 3, 3
3777Red PANDA: Disambiguating Anomaly Detection by Removing Nuisance Factors4.002.126, 1, 6, 3
3778Taming Policy Constrained Offline Reinforcement Learning for Non-expert Demonstrations4.001.005, 5, 3, 3
3779Internal Purity: A Differential Entropy based Internal Validation Index for Clustering Validation4.001.003, 5, 3, 5
3780PromptSum: Planning with Mixed Prompts for Parameter-Efficient Controllable Abstractive Summarization4.001.003, 5, 3, 5
3781A Theory of Equivalence-Preserving Program Embeddings4.001.003, 5, 3, 5
3782Formal Interpretability with Merlin-Arthur Classifiers4.001.005, 5, 3, 3
3783How deep convolutional neural networks lose spatial information with training4.001.413, 6, 3
3784gGN: learning to represent nodes in directed graphs as low-rank Gaussian distributions4.001.005, 5, 3, 3
3785Provable Sharpness-Aware Minimization with Adaptive Learning Rate4.001.003, 5, 5, 3
3786Beyond re-balancing: distributionally robust augmentation against class-conditional distribution shift in long-tailed recognition4.001.005, 3, 5, 3
3787Offline Communication Learning with Multi-source Datasets4.001.005, 5, 3, 3
3788Training via Confidence Ranking4.001.003, 5, 3, 5
3789Computational Doob h-transforms for Online Filtering of Discretely Observed Diffusions4.001.735, 5, 1, 5
3790Reconciling feature sharing and multiple predictions with MIMO Vision Transformers4.001.005, 3, 3, 5
3791$Q$-learning with regularization converges with non-linear non-stationary features4.001.413, 6, 3
3792Backdoor or Feature? A New Perspective on Data Poisoning4.001.003, 5, 5, 3
3793SpeedyZero: Mastering Atari with Limited Data and Time4.001.413, 3, 6
3794Source-Target Coordinated Training with Multi-head Hybrid-Attention for Domain Adaptive Semantic Segmentation4.001.005, 3, 3, 5
3795Revisiting Activation Function Design for Improving Adversarial Robustness at Scale4.001.005, 5, 3, 3
3796What Does Vision Supervision Bring to Language Models? A Case Study of CLIP4.001.005, 3, 5, 3
3797Learning to Counter: Stochastic Feature-based Learning for Diverse Counterfactual Explanations4.001.005, 3, 5, 3
3798Exploiting Certified Defences to Attack Randomised Smoothing4.001.005, 3, 5, 3
3799How and Why We Detect Distribution Shift: Critical Analysis of Methods and Benchmarks4.001.413, 3, 6
3800$textrm{D}^3textrm{Former}$: Debiased Dual Distilled Transformer for Incremental Learning4.001.003, 5, 3, 5
3801Score-Based Graph Generative Modeling with Self-Guided Latent Diffusion4.001.005, 3, 3, 5
3802BrGANs: Stabilizing GANs' Training Process with Brownian Motion Control4.001.005, 5, 3, 3
3803Unfair geometries: exactly solvable data model with fairness implications4.001.005, 3, 3, 5
3804ExtraMix: Extrapolatable Data Augmentation for Regression using Generative Models4.001.005, 5, 3, 3
3805Learning Combinatorial Node Labeling Algorithms4.001.005, 3, 3, 5
3806PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer4.001.003, 5, 5, 3
3807Addressing Variable Dependency in GNN-based SAT Solving4.001.003, 5, 5, 3
3808Adversarial Examples Guided Pseudo-label Refinement for Decentralized Domain Adaptation4.001.005, 3, 5, 3
3809Molecule Generation for Target Receptor Binding via Continuous Normalizing Flows4.001.003, 5, 5, 3
3810Lost Domain Generalization Is a Natural Consequence of Lack of Training Domains4.001.413, 6, 3
3811ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading4.002.166, 5, 1
3812OCD: Learning to Overfit with Conditional Diffusion Models4.001.003, 5, 5, 3
3813Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives4.001.005, 3, 3, 5
3814$z$-SignFedAvg: A Unified Stochastic Sign-based Compression for Federated Learning4.001.416, 3, 3
3815DECN: Evolution Inspired Deep Convolution Network for Black-box Optimization4.001.263, 5, 6, 3, 3
3816Multi-Treatment Effect Estimation with Proxy: Contrastive Learning and Rank Weighting4.001.003, 5, 5, 3
3817DeepTime: Deep Time-index Meta-learning for Non-stationary Time-series Forecasting4.001.003, 5, 5, 3
3818Efficient Method for Bi-level Optimization with Non-smooth Lower-Level Problem4.001.003, 5, 3, 5
3819Learning an Invertible Output Mapping Can Mitigate Simplicity Bias in Neural Networks4.001.003, 3, 5, 5
3820Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk4.001.003, 5, 5, 3
3821MaskConver: A Universal Panoptic and Semantic Segmentation Model with Pure Convolutions4.001.003, 3, 5, 5
3822AxBERT: An Explainable Chinese Spelling Correction Method Driven by Associative Knowledge Network4.001.005, 5, 3, 3
3823Towards Efficient Posterior Sampling in Deep Neural Networks via Symmetry Removal4.002.003, 3, 8, 3, 3
3824Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations4.001.003, 3, 5, 5
3825Knowledge-Driven New Drug Recommendation4.001.003, 5, 5, 3
3826Contrastive Prompt Tuning Improves Generalization in Vision-Language Models4.001.005, 3, 5, 3
3827On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs4.001.416, 3, 3
3828Robust Reinforcement Learning with Distributional Risk-averse formulation4.001.003, 5, 5, 3
3829Model-based Value Exploration in Actor-critic Deep Reinforcement Learning4.001.005, 5, 3, 3
3830Adversarial Detector for Decision Tree Ensembles Using Representation Learning4.001.005, 3, 3, 5
3831'Why did the Model Fail?': Attributing Model Performance Changes to Distribution Shifts4.001.003, 5, 3, 5
3832Imitation Improvement Learning for Large-scale Capacitated Vehicle Routing Problems4.001.005, 5, 3, 3
3833Points2NeRF: Generating Neural Radiance Fields from 3D point cloud4.001.003, 5, 5, 3
3834DEEPER-GXX: DEEPENING ARBITRARY GNNS4.001.003, 3, 5, 5
3835Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings4.001.005, 3, 3, 5
3836HyperMAML: Few-Shot Adaptation of Deep Models with Hypernetworks4.001.005, 3, 5, 3
3837EIT: Enhanced Interactive Transformer for Sequence Generation4.001.003, 5, 3, 5
3838Local Attention Layers for Vision Transformers4.001.005, 5, 3, 3
3839Neural Discrete Reinforcement Learning4.001.005, 3, 3, 5
3840Memory-Augmented Variational Adaptation for Online Few-Shot Segmentation4.001.003, 3, 5, 5
3841QUANTILE-LSTM: A ROBUST LSTM FOR ANOMALY DETECTION4.001.005, 3, 3, 5
3842Auto-Encoding Adversarial Imitation Learning4.001.003, 5, 3, 5
3843BiTAT: Neural Network Binarization with Task-Dependent Aggregated Transformation4.001.003, 5, 5, 3
3844Constrained Reinforcement Learning for Safety-Critical Tasks via Scenario-Based Programming4.001.413, 3, 6
3845Physically Plausible and Conservative Solutions to Navier-Stokes Equations Using Physics-Informed CNNs4.001.413, 6, 3
3846Does Federated Learning Really Need Backpropagation?4.001.416, 3, 3
3847Specialization of Sub-paths for Adaptive Depth Networks4.001.005, 3, 5, 3
3848Closing the Performance Gap between Cumbersome and Lightweight Contrastive Models4.001.263, 3, 6, 5, 3
3849MAGA: Modeling a Group Action4.001.003, 3, 5, 5
3850Recursion of Thought: Divide and Conquer Reasoning with Language Models4.002.948, 1, 3
3851Geo-NN: An End-to-End Framework for Geodesic Mean Estimation on the Manifold of Symmetric Positive Definite Matrices4.001.413, 3, 6
3852Progressive Image Synthesis from Semantics to Details with Denoising Diffusion GAN4.001.005, 5, 3, 3
3853Learning large-scale Kernel Networks4.001.005, 3, 3, 5
3854Path Regularization: A Convexity and Sparsity Inducing Regularization for Parallel ReLU Networks4.001.416, 3, 3
3855MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning4.001.003, 5, 5, 3
3856MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition4.001.003, 5, 3, 5
3857MQSP: Micro-Query Sequence Parallelism for Linearly Scaling Long Sequence Transformer4.001.005, 3, 3, 5
3858Schrödinger's FP: Training Neural Networks with Dynamic Floating-Point Containers4.001.005, 3, 3, 5
3859Continual Learning with Group-wise Neuron Normalization4.001.005, 3, 3, 5
3860Sparse Hyperbolic Representation Learning4.001.413, 6, 3
3861Universal embodied intelligence: learning from crowd, recognizing the world, and reinforced with experience4.002.121, 6, 6, 3
3862LAMDA: Latent mapping for domain adaption of image generators4.001.005, 3, 5, 3
3863Novel Class Discovery under Unreliable Sampling4.001.416, 3, 3
3864Teach me how to Interpolate a Myriad of Embeddings4.001.413, 3, 6
3865Interventional Rationalization4.001.003, 3, 5, 5
3866Diverse, Difficult, and Odd Instances (D2O): A New Test Set for Object Classification4.001.003, 5, 3, 5
3867Effective dimension of machine learning models4.001.003, 5, 3, 5
3868A theory of representation learning in neural networks gives a deep generalisation of kernel methods4.001.413, 6, 3
3869A spatiotemporal graph neural network with multi granularity for air quality prediction4.001.413, 3, 6
3870OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions4.001.003, 3, 5, 5
3871How you start matters for generalization4.001.413, 3, 6
3872PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework4.001.005, 5, 3, 3
3873Dimensionality-Varying Diffusion Process4.001.003, 3, 5, 5
3874Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents4.001.005, 3, 5, 3
3875On Storage Neural Network Augmented Approximate Nearest Neighbor Search4.001.003, 5, 3, 5
3876Sample Importance in SGD Training4.001.003, 5, 3, 5
3877Critical Learning Periods Augmented Model Poisoning Attacks to Byzantine-Robust Federated Learning4.001.003, 3, 5, 5
3878Individual Fairness of Data Provider Regarding Privacy Risk and Gain4.001.005, 3, 3, 5
3879Semi-supervised Node Classification with Imbalanced Receptive Field4.001.003, 5, 5, 3
3880CEREAL: Few-Sample Clustering Evaluation4.001.005, 3, 3, 5
3881Computational-Unidentifiability in Representation for Fair Downstream Tasks4.001.416, 3, 3
3882Accelerating Federated Learning Convergence via Opportunistic Mobile Relaying4.001.416, 3, 3
3883Learning Control Lyapunov Functions For High-dimensional Unknown Systems using Guided Iterative State Space Exploration4.001.005, 3, 3, 5
3884Universal Mini-Batch Consistency for Set Encoding Functions4.001.005, 5, 3, 3
3885Soundness and Completeness: An Algorithmic Perspective on Evaluation of Feature Attribution4.001.003, 5, 3, 5
3886Improving Differentially-Private Deep Learning with Gradients Index Pruning4.001.263, 5, 6, 3, 3
3887Shuffle Gaussian Mechanism for Differential Privacy4.001.416, 3, 3
3888MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning4.001.003, 5, 3, 5
3889Distributional Reinforcement Learning via Sinkhorn Iterations4.001.003, 5, 3, 5
3890MLM with Global Co-occurrence4.001.003, 5, 5, 3
3891Breaking Correlation Shift via Conditional Invariant Regularizer4.001.005, 5, 3, 3
3892How Powerful is Implicit Denoising in Graph Neural Networks4.002.126, 1, 3, 6
3893ChemSpacE: Interpretable and Interactive Chemical Space Exploration4.001.005, 3, 5, 3
3894Probing into the Fine-grained Manifestation in Multi-modal Image Synthesis4.001.416, 3, 3
3895Eliminating Catastrophic Overfitting Via Abnormal Adversarial Examples Regularization4.001.003, 3, 5, 5
3896Factor Learning Portfolio Optimization Informed by Continuous-Time Finance Models4.001.416, 3, 3
3897Closing the Gap Between SVRG and TD-SVRG with Gradient Splitting4.001.735, 1, 5, 5
3898Sorted eigenvalue comparison $d_{mathsf{Eig}}$: A simple alternative to $d_{mathsf{FID}}$4.001.003, 5, 3, 5
3899Never Revisit: Continuous Exploration in Multi-Agent Reinforcement Learning4.001.003, 5, 5, 3
3900SepRep-Net: Multi-source Free Domain Adaptation via Model Separation and Reparameterization4.001.005, 3, 3, 5
3901Generalizability of Adversarial Robustness Under Distribution Shifts4.001.003, 5, 3, 5
3902Uncertainty-Driven Active Vision for Implicit Scene Reconstruction4.001.003, 3, 5, 5
3903Spurious Local Minima Provably Exist for Deep Convolutional Neural Networks4.001.003, 3, 5, 5
3904Graph Contrastive Learning with Personalized Augmentation4.001.003, 5, 5, 3
3905Variational Reparametrized Policy Learning with Differentiable Physics4.001.413, 3, 6
3906Stable, Efficient, and Flexible Monotone Operator Implicit Graph Neural Networks4.001.416, 3, 3
3907LSAP: Rethinking Inversion Fidelity, Perception and Editability in GAN Latent Space4.001.005, 3, 3, 5
3908Neural Sorting Networks with Error-Free Differentiable Swap Functions4.001.413, 3, 6
3909SWORD: Demystify the Secrets of Open-world Instance Recognition4.001.416, 3, 3
3910Learning Antidote Data to Individual Unfairness4.001.003, 5, 3, 5
3911TiDAL: Learning Training Dynamics for Active Learning4.001.416, 3, 3
3912CompletionFormer: Depth Completion with Convolutions and Vision Transformers4.002.121, 3, 6, 6
3913Demystifying the Optimization and Generalization of Deep PAC-Bayesian Learning4.001.003, 3, 5, 5
3914Nearing or Surpassing: Overall Evaluation of Human-Machine Dynamic Vision Ability4.001.413, 3, 6
3915Learn to Know Unknowns: A Bionic Memory Network for Unsupervised Anomaly Detection4.001.003, 5, 3, 5
3916Double dynamic sparse training for GANs4.001.003, 3, 5, 5
3917Improving Corruption Robustness with Adversarial Feature Alignment Transformers4.001.413, 6, 3
3918Slimmable Networks for Contrastive Self-supervised Learning4.001.003, 3, 5, 5
3919TEAS: Exploiting Spiking Activity for Temporal-wise Adaptive Spiking Neural Networks4.001.005, 3, 3, 5
3920Exploring Visual Interpretability for Contrastive Language-Image Pretraining4.001.263, 3, 5, 3, 6
3921BiBench: Benchmarking and Analyzing Network Binarization4.001.416, 3, 3
3922Identifying Phase Transition Thresholds of Permuted Linear Regression via Message Passing3.801.941, 6, 6, 3, 3
3923Speech denoising by listening to noise3.800.983, 3, 5, 3, 5
3924Knowledge-Grounded Reinforcement Learning3.800.983, 3, 5, 5, 3
3925Auditing Fairness Online through Interactive Refinement3.800.983, 5, 5, 3, 3
3926G-Censor: Graph Contrastive Learning with Task-Oriented Counterfactual Views3.800.983, 5, 5, 3, 3
3927GLASU: A Communication-Efficient Algorithm for Federated Learning with Vertically Distributed Graph Data3.800.983, 5, 3, 3, 5
3928MODULAR FEDERATED CONTRASTIVE LEARNING WITH PEER NORMALIZATION3.800.983, 3, 3, 5, 5
3929SwinZS3: Zero-Shot Semantic Segmentation with a Swin Transformer3.751.921, 5, 3, 6
3930Thresholded Lexicographic Ordered Multi-Objective Reinforcement Learning3.751.303, 3, 3, 6
3931xTrimoABFold: Improving Antibody Structure Prediction without Multiple Sequence Alignments3.751.923, 6, 5, 1
3932Gandalf : Data Augmentation is all you need for Extreme Classification3.751.306, 3, 3, 3
3933Model-based Unknown Input Estimation via Partially Observable Markov Decision Processes3.751.925, 1, 6, 3
3934Help Me Explore: Combining Autotelic and Social Learning via Active Goal Queries3.751.925, 6, 3, 1
3935Learning to reason over visual objects3.751.303, 3, 6, 3
3936VER: Learning Natural Language Representations for Verbalizing Entities and Relations3.751.303, 3, 3, 6
3937Training Neural Networks with Low-Precision Model Memory3.751.303, 6, 3, 3
3938FoveaTer: Foveated Transformer for Image Classification3.751.921, 3, 5, 6
3939TG-Gen: A Deep Generative Model Framework for Temporal Graphs3.751.303, 6, 3, 3
3940Comparing Human and Machine Bias in Face Recognition3.751.303, 3, 6, 3
3941Finding the smallest tree in the forest: Monte Carlo Forest Search for UNSAT solving3.751.303, 3, 6, 3
3942Predictive Coding with Approximate Laplace Monte Carlo3.751.303, 6, 3, 3
3943The Ultimate Combo: Boosting Adversarial Example Transferability by Composing Data Augmentations3.751.303, 3, 6, 3
3944Improving Aspect Ratio Distribution Fairness in Detector Pretraining via Cooperating RPN’s3.751.923, 6, 5, 1
3945How to Do a Vocab Swap? A Study of Embedding Replacement for Pre-trained Transformers3.751.303, 3, 3, 6
3946UnDiMix: Hard Negative Sampling Strategies for Contrastive Representation Learning3.751.921, 3, 6, 5
3947Exploring Connections Between Memorization And Membership Inference3.751.306, 3, 3, 3
3948FedAvg Converges to Zero Training Loss Linearly: The Power of Overparameterized Multi-Layer Neural Networks3.751.303, 3, 3, 6
3949ResFed: Communication Efficient Federated Learning by Transmitting Deep Compressed Residuals3.751.303, 3, 3, 6
3950Multi-instance Interactive Segmentation with Self-Supervised Transformer3.751.303, 3, 6, 3
3951CLUSTERBERT: MULTI-STAGE FINE-TUNING OF TRANSFORMERS FOR DEEP TEXT CLUSTERING3.751.303, 3, 6, 3
3952Distilling Pre-trained Knowledge in Chemical Reactions for Molecular Property Prediction3.751.303, 3, 3, 6
3953Batch Normalization Explained3.751.303, 6, 3, 3
3954CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration3.751.303, 3, 3, 6
3955RoCourseNet: Distributionally Robust Training of a Prediction Aware Recourse Model3.751.303, 3, 3, 6
3956Global-Scale Species Mapping From Crowdsourced Data3.751.303, 3, 3, 6
3957Learning Robust Kernel Ensembles with Kernel Average Pooling3.751.303, 3, 6, 3
3958Harnessing Client Drift with Decoupled Gradient Dissimilarity3.751.303, 6, 3, 3
3959VQ-TR: Vector Quantized Attention for Time Series Forecasting3.751.925, 3, 6, 1
3960Emergent collective intelligence from massive-agent cooperation and competition3.751.923, 6, 5, 1
3961CLIP model is an Efficient Continual Learner3.751.303, 3, 6, 3
3962GAML: geometry-aware meta-learning via a fully adaptive preconditioner3.751.926, 1, 3, 5
3963Graph Neural Networks for Aerodynamic Flow Reconstruction from Sparse Sensing3.751.303, 3, 6, 3
3964Revisiting the Activation Function for Federated Image Classification3.751.921, 3, 6, 5
3965Route, Interpret, Repeat: Blurring the Line Between Posthoc Explainability and Interpretable Models3.751.923, 5, 1, 6
3966Analysis of Radio Localiser Networks under Distribution Shift3.751.303, 3, 3, 6
3967Bayesian Optimal Experimental Design for the Survey Bandit Setting3.751.306, 3, 3, 3
3968Pathfinding Neural Cellular Automata3.751.303, 3, 6, 3
3969Unleashing the Potential of Data Sharing in Ensemble Deep Reinforcement Learning3.751.923, 5, 6, 1
3970K-SAM: Sharpness-Aware Minimization at the Speed of SGD3.751.303, 3, 6, 3
3971A Simple Unsupervised Data Depth-based Method to Detect Adversarial Images3.752.593, 1, 8, 3
3972Counterfactual Memorization in Neural Language Models3.751.303, 3, 6, 3
3973Safer Reinforcement Learning with Counterexample-guided Offline Training3.751.303, 3, 3, 6
3974Enhancing Cross-Category Learning in Recommendation Systems with Multi-Layer Embedding Training3.751.306, 3, 3, 3
3975Populating memory in Continual Learning with Consistency Aware Sampling3.751.303, 3, 3, 6
3976System Identification as a Reinforcement Learning Problem3.751.925, 3, 1, 6
3977Projected Latent Distillation for Data-Agnostic Consolidation in Multi-Agent Continual Learning3.751.303, 3, 6, 3
3978Domain Generalization in Regression3.751.303, 6, 3, 3
3979Latent-space disentanglement with untrained generator networks allows to isolate different motion types in video data3.751.921, 3, 6, 5
3980FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder3.751.303, 6, 3, 3
3981Learning Sampling Policy to Achieve Fewer Queries for Zeroth-Order Optimization3.751.925, 6, 3, 1
3982Learning Graph Neural Network Topologies3.751.303, 6, 3, 3
3983Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning3.751.303, 6, 3, 3
3984Deep Generative Model based Rate-Distortion for Image Downscaling Assessment3.751.303, 3, 6, 3
3985Impact of the Last Fully Connected Layer on Out-of-distribution Detection3.751.303, 6, 3, 3
3986Optformer: Beyond Transformer for Black-box Optimization3.751.303, 3, 6, 3
3987Group-Equivariant Transformers Without Positional Encoding3.751.303, 6, 3, 3
3988Beyond Counting Linear Regions of Neural Networks, Simple Linear Regions Dominate!3.751.303, 6, 3, 3
3989Local Stochastic Bilevel Optimization with Momentum-Based Variance Reduction3.751.303, 6, 3, 3
3990FixEval: Execution-based Evaluation of Program Fixes for Competitive Programming Problems3.751.303, 6, 3, 3
3991Learning with Instance-Dependent Label Noise: Balancing Accuracy and Fairness3.751.303, 3, 6, 3
3992VC Theoretical Explanation of Double Descent3.751.303, 3, 3, 6
3993Distraction is All You Need For Fairness3.751.303, 6, 3, 3
3994Formal Conceptual Views in Neural Networks3.751.306, 3, 3, 3
3995Understanding Masked Image Modeling via Learning Occlusion Invariant Feature3.752.955, 8, 1, 1
3996RegQ: Convergent Q-Learning with Linear Function Approximation using Regularization3.751.923, 1, 5, 6
3997Fast 6D Object Pose Refinement via Implicit Surface Representation Driven Optimization3.751.303, 3, 6, 3
3998Variation-based Cause Effect Identification3.751.306, 3, 3, 3
3999Physics-Regularized Stereo Matching for Depth Estimation3.752.591, 8, 3, 3
4000Additive Poisson Process: Learning Intensity of Higher-Order Interaction in Poisson Processes3.751.306, 3, 3, 3
4001Hyperbolic Binary Neural Network3.751.926, 1, 5, 3
4002Training Instability and Disharmony Between ReLU and Batch Normalization3.751.303, 3, 3, 6
4003The Biased Artist: Exploiting Cultural Biases via Homoglyphs in Text-Guided Image Generation Models3.751.925, 3, 1, 6
4004Semantic Grouping Network for Audio Source Separation3.751.925, 1, 6, 3
4005On Stability and Generalization of Bilevel Optimization Problems3.751.921, 6, 3, 5
4006Do Spiking Neural Networks Learn Similar Representation with Artificial Neural Networks? A Pilot Study on SNN Representation3.751.303, 3, 6, 3
4007Learning to Perturb for Contrastive Learning of Unsupervised Sentence Representations3.670.943, 3, 5
4008A Hybrid Framework for Generating A Country-scale Synthetic Population3.670.943, 3, 5
4009Pocket-specific 3D Molecule Generation by Fragment-based Autoregressive Diffusion Models3.670.943, 5, 3
4010Graph Spline Networks for Efficient Continuous Simulation of Dynamical Systems3.670.943, 5, 3
4011AMA: Asymptotic Midpoint Augmentation for Margin Balancing and Moderate Broadening3.670.943, 5, 3
4012Towards A Unified Neural Architecture for Visual Recognition and Reasoning3.670.945, 3, 3
4013Estimating Treatment Effects using Neurosymbolic Program Synthesis3.670.943, 3, 5
4014Boosting Drug-Target Affinity Prediction from Nearest Neighbors3.670.943, 3, 5
4015Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm3.670.943, 3, 5
4016PBES: PCA Based Exemplar Sampling Algorithm for Continual Learning3.670.943, 3, 5
4017Multi-scale Sinusoidal Embeddings Enable Learning on High Resolution Mass Spectrometry Data3.671.895, 5, 1
4018Protecting DNN from Evasion Attacks using Ensemble of High Focal Diversity3.670.943, 3, 5
4019Giving Robots a Hand: Broadening Generalization via Hand-Centric Human Video Demonstrations3.670.943, 3, 5
4020No Pairs Left Behind: Improving Metric Learning with Regularized Triplet Objective3.670.943, 5, 3
4021Matrix factorization under the constraint of connectivity between observed and source data ~ Muscle synergy analysis based on connectivity between muscle and brain activities ~3.670.943, 5, 3
4022VISION TRANSFORMER FOR MULTIVARIATE TIME- SERIES CLASSIFICATION (VITMTSC)3.670.945, 3, 3
4023Factors Influencing Generalization in Chaotic Dynamical Systems3.670.943, 3, 5
4024Query by Self3.670.945, 3, 3
4025Graph Neural Networks Are More Powerful Than we Think3.670.943, 5, 3
4026On a Benefit of Masked Language Model Pretraining: Robustness to Simplicity Bias3.670.943, 5, 3
4027Improving Subgraph Representation Learning via Multi-View Augmentation3.670.943, 5, 3
4028CrystalBox: Efficient Model-Agnostic Explanations for Deep RL Controllers3.670.945, 3, 3
4029Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction3.670.943, 3, 5
4030RankMe: Assessing the Downstream Performance of Pretrained Self-Supervised Representations by Their Rank3.670.943, 3, 5
4031Soft Diffusion: Score Matching For General Corruptions3.670.943, 3, 5
4032Online Continual Learning with Feedforward Adaptation3.670.943, 5, 3
4033Learning parsimonious dynamics for generalization in reinforcement learning3.670.945, 3, 3
4034Homotopy Learning of Parametric Solutions to Constrained Optimization Problems3.670.943, 3, 5
4035Re-calibrated Wasserstein GAN for large-scale imputation with informative missing3.670.943, 5, 3
4036Domain Invariant Q-Learning for model-free robust continuous control under visual distractions3.670.943, 3, 5
4037Learning Useful Representations for Shifting Tasks and Distributions3.670.943, 5, 3
4038Architectural Backdoors in Neural Networks3.670.943, 3, 5
4039Can we achieve robustness from data alone?3.670.943, 3, 5
4040Perceptual Grouping in Vision-Language Models3.670.943, 3, 5
4041A Deep Dive into Dataset Imbalance and Bias in Face Identification3.670.943, 3, 5
4042Causally Constrained Data Synthesis For Private Data Release3.670.943, 3, 5
4043A simple Training-Free Method for Rejection Option3.670.943, 5, 3
4044Reducing the Capacity Gap via Spherical Knowledge Distillation3.671.895, 5, 1
4045Time Series Subsequence Anomaly Detection via Graph Neural Networks3.670.945, 3, 3
4046Improving Generalization of Motor-Imagery Brainwave Decoding via Dynamic Convolutions3.671.895, 5, 1
4047Bridging between Pool- and Stream-Based Active Learning with Temporal Data Coherence3.671.895, 1, 5
4048SYNC: Efficient Neural Code Search Through Structurally Guided Hard Negative Curricula3.670.945, 3, 3
4049Semi-parametric Prompt-Generation for Model Editing3.670.945, 3, 3
4050Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation3.670.945, 3, 3
4051Fourier PINNs: From Strong Boundary Conditions to Adaptive Fourier Bases3.670.943, 3, 5
4052Exploring Methods for Parsing Movie Scripts - Feature Extraction for Further Social Injustice Analysis3.670.945, 3, 3
4053Conceptual Behavior and Human-Likeness in Vision-and-Language Models3.670.943, 3, 5
4054Quantization-aware Policy Distillation (QPD)3.670.943, 5, 3
4055Active Learning at the ImageNet Scale3.670.943, 5, 3
4056Automatic Curriculum Generation for Reinforcement Learning in Zero-Sum Games3.670.945, 3, 3
4057Language Modeling Using Tensor Trains3.671.891, 5, 5
4058Would decentralization hurt generalization?3.671.495, 3, 1, 5, 3, 5
4059Tackling Imbalanced Class in Federated Learning via Class Distribution Estimation3.670.945, 3, 3
4060Solving Math Word Problems with Process-based and Outcome-based Feedback3.670.943, 3, 5
4061Few-shot Lifelong Reinforcement Learning with Generalization Guarantees: An Empirical PAC-Bayes Approach3.670.943, 3, 5
4062SEQuence-rPPG: A Fast BVP Signal Extraction Method From Frame Sequences3.670.943, 3, 5
4063Linearised Implicit Variational Inference3.670.943, 3, 5
4064Learning Interpretable Neural Discrete Representation for Time Series Classification3.670.943, 5, 3
4065SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data3.670.943, 5, 3
4066Perturbation Defocusing for Adversarial Defense3.671.895, 1, 5
4067Preserving Semantics in Textual Adversarial Attacks3.670.943, 5, 3
4068A Decomposition Based Dual Projection Model for Multivariate Time Series Forecasting and Anomaly Detection3.670.943, 5, 3
4069FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization3.670.943, 5, 3
4070Cyclophobic Reinforcement Learning3.670.943, 3, 5
4071Dynamic-Aware GANs: Time-Series Generation with Handy Self-Supervision3.670.943, 5, 3
4072Learning System Dynamics from Sensory Input under Optimal Control Principles3.670.943, 5, 3
4073On the Shortcut Learning in Multilingual Neural Machine Translation3.670.943, 3, 5
4074I Speak, You Verify: Toward Trustworthy Neural Program Synthesis3.671.895, 1, 5
4075ACQL: An Adaptive Conservative Q-Learning Framework for Offline Reinforcement Learning3.670.945, 3, 3
4076Efficient Controllable Generation with Guarantee3.671.895, 1, 5
4077Weak Supervision Variational Auto-Encoder3.670.945, 3, 3
4078Extending graph transformers with quantum computed aggregation3.670.945, 3, 3
4079Self-Supervised SVDE from Videos with Depth Variance to Shifted Positional Information3.670.943, 3, 5
4080TransLog: A Unified Transformer-based Framework for Log Anomaly Detection3.670.943, 3, 5
4081Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning3.670.943, 3, 5
4082Continuous Monte Carlo Graph Search3.670.943, 3, 5
4083Backdoor Mitigation by Correcting Activation Distribution Alteration3.670.943, 5, 3
4084Pose Transfer using a Single Spatial Transformation3.671.895, 1, 5
4085How Distinguishable Are Vocoder Models? Analyzing Vocoder Fingerprints for Fake Audio3.670.943, 3, 5
4086Holographic-(V)AE: an end-to-end SO(3)-Equivariant (Variational) Autoencoder in Fourier Space3.670.943, 5, 3
4087Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity of Neural Networks3.670.945, 3, 3
4088Robust Multi-Agent Reinforcement Learning against Adversaries on Observation3.670.945, 3, 3
4089Self-supervised Learning for Cell Segmentation and Quantification in Digital Pathology Images3.670.943, 5, 3
4090Learning to Generate Pseudo Anomalies3.670.945, 3, 3
4091Scalable feature selection via sparse learnable masks3.670.943, 3, 5
4092Dataset Projection: Finding Target-aligned Subsets of Auxiliary Data3.670.943, 5, 3
4093Decentralized Federated Learning via Overlapping Data Augmentation3.670.943, 5, 3
4094An interpretable contrastive logical knowledge learning method for sentiment analysis3.670.943, 3, 5
4095Training image classifiers using Semi-Weak Label Data3.670.945, 3, 3
4096Magnum: Tackling High-Dimensional Structures with Self-Organization3.671.891, 5, 5
4097Vector Quantized Wasserstein Auto-Encoder3.670.943, 5, 3
4098A Sample Based Method for Understanding The Decisions of Neural Networks Semantically3.670.945, 3, 3
4099Deep Biological Pathway Informed Pathology-Genomic Multimodal Survival Prediction3.670.943, 3, 5
4100Explaining Patterns in Data with Language Models via Interpretable Autoprompting3.670.943, 5, 3
4101Neural DAEs: Constrained neural networks3.670.943, 3, 5
4102Variance Double-Down: The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning3.670.945, 3, 3
4103Adversarial Representation Learning for Canonical Correlation Analysis3.670.943, 5, 3
4104Explaining Image Classification through Knowledge-aware Neuron Interpretation3.670.943, 3, 5
4105PointConvFormer: Revenge of the Point-Based Convolution3.670.943, 3, 5
4106Recurrent Real-valued Neural Autoregressive Density Estimator for Online Density Estimation and Classification of Streaming Data3.670.943, 3, 3, 5, 3, 5
4107StructViT: Learning Correlation Structures for Vision Transformers3.670.945, 3, 3
4108Stationary Deep Reinforcement Learning with Quantum K-spin Hamiltonian Equation3.670.943, 3, 5
4109Interpolating Compressed Parameter Subspaces3.670.943, 3, 5
4110Multi-Modality Alone is Not Enough: Generating Scene Graphs using Cross-Relation-Modality Tokens3.670.943, 3, 5
4111Clustering and Ordering Variable-Sized Sets: The Catalog Problem3.670.945, 3, 3
4112KerDEQ: Optimization induced Deep Equilibrium models via Gaussian Kernel3.670.943, 5, 3
4113TCNL: Transparent and Controllable Network Learning Via Embedding Human-Guided Concepts3.670.945, 3, 3
4114From ChebNet to ChebGibbsNet3.670.945, 3, 3
4115Towards Understanding Robust Memorization in Adversarial Training3.670.943, 3, 5
4116FV-MgNet: Fully Connected V-cycle MgNet for Interpretable Time Series Forecasting3.670.943, 3, 5
4117FFCV: Accelerating Training by Removing Data Bottlenecks3.670.943, 5, 3
4118Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding3.670.943, 3, 5
4119Uncertainty and Traffic Light Aware Pedestrian Crossing Intention Prediction3.670.945, 3, 3
4120Token-level Fitting Issues of Seq2seq Models3.670.943, 5, 3
4121Worst-case Few-shot Evaluation: Are Neural Networks Robust Few-shot Learners?3.671.891, 5, 5
4122Leveraging Online Semantic Point Fusion for 3D-Aware Object Goal Navigation3.671.895, 5, 1
4123Robust Manifold Estimation Approach for Evaluating Fidelity and Diversity3.670.943, 5, 3
4124CAPE: Channel-Attention-Based PDE Parameter Embeddings for SciML3.670.943, 3, 5
4125Solving Partial Label Learning Problem with Multi-Agent Reinforcement Learning3.670.945, 3, 3
4126SDT: Specific Domain Training in Domain Generalization3.670.943, 5, 3
4127Is Class Incremental Learning Truly Learning Representations Continually?3.670.943, 3, 5
4128Understanding Adversarial Transferability in Federated Learning3.670.943, 3, 5
4129Attribute Alignment and Enhancement for Generalized Zero-Shot Learning3.670.943, 5, 3
4130Unified Probabilistic Modeling of Image Aesthetic Rating Distributions towards Measuring Subjectivity3.670.943, 5, 3
4131Analyzing adversarial robustness of vision transformers against spatial and spectral attacks3.670.943, 5, 3
4132The Progressive Alignment-aware Multimodal Fusion with Easy2hard Strategy for Multimodal Neural Machine Translation3.670.945, 3, 3
4133CacheGNN: Enhancing Graph Neural Networks with Global Information Caching3.670.943, 3, 5
4134Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning3.670.943, 5, 3
4135Towards Identification of Microaggressions in real-life and Scripted conversations, using Context-Aware Machine Learning Techniques.3.670.945, 3, 3
4136When does Bias Transfer in Transfer Learning?3.670.943, 5, 3
4137Towards Realtime Distributed Virtual Flow Meter via Compressed Continual Learning3.670.943, 3, 5
4138Robust Neural ODEs via Contractivity-promoting Regularization3.670.943, 5, 3
4139A Robust Stacking Framework for Training Deep Graph Models with Multifaceted Node Features3.670.943, 5, 3
4140Learning Diverse and Effective Policies with Non-Markovian Rewards3.670.943, 3, 5
4141BAMBI: Vertical Federated Bilevel Optimization with Privacy-Preserving and Computation Efficiency3.670.943, 5, 3
4142MULTILEVEL XAI: VISUAL AND LINGUISTIC BONDED EXPLANATIONS3.670.945, 3, 3
4143miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings3.670.943, 3, 5
4144When Few-shot Meets Cross-domain Object Detection: Learning Instance-level Class Prototypes for Knowledge Transfer3.670.945, 3, 3
4145Unsupervised Threshold Learning with '$L$'-trend Prior For Visual Anomaly Detection3.670.945, 3, 3
4146Grouped self-attention mechanism for a memory-efficient Transformer3.670.943, 5, 3
4147Synergistic Neuromorphic Federated Learning with ANN-SNN Conversion For Privacy Protection3.670.943, 3, 5
4148Time Series Anomaly Detection via Hypothesis Testing for Dynamical Systems3.671.895, 1, 5
4149Identifying Latent Causal Content for Multi-Source Domain Adaptation3.670.945, 3, 3
4150NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants3.670.943, 3, 5
4151On the Difficulties of Video Summarization: Structure and Subjectivity3.670.943, 5, 3
4152Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer3.670.943, 3, 5
4153Personalized Subgraph Federated Learning3.670.945, 3, 3
4154Adversarial Learned Fair Representations using Dampening and Stacking3.670.943, 3, 5
4155On the Importance of Pretrained Knowledge Distillation for 3D Object Detection3.670.943, 3, 5
4156Harnessing spectral representations for subgraph alignment3.670.945, 3, 3
4157Mixed-Precision Inference Quantization: Problem Resetting, Mapping math concept and Branch&bound methods3.670.943, 3, 5
4158Partial Advantage Estimator for Proximal Policy Optimization3.670.943, 5, 3
4159PatchBlender: A Motion Prior for Video Transformers3.670.943, 3, 5
4160Similarity and Generalization: from Noise to Corruption3.670.943, 5, 3
4161A Generalized EigenGame With Extensions to Deep Multiview Representation Learning3.670.943, 5, 3
4162Offline Model-Based Reinforcement Learning with Causal Structure3.670.943, 5, 3
4163Temporal Label Smoothing for Early Prediction of Adverse Events3.670.943, 3, 5
4164What's Wrong with the Robustness of Object Detectors?3.671.895, 1, 5
4165Corruption Depth: Analysis of DNN depth for Misclassification3.670.945, 3, 3
4166How Does Value Distribution in Distributional Reinforcement Learning Help Optimization?3.670.943, 5, 3
4167Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking3.670.943, 3, 5
4168GOAT: A Global Transformer on Large-scale Graphs3.670.943, 5, 3
4169HeatDETR: Hardware-Efficient DETR with Device-Adaptive Thinning3.670.943, 3, 5
4170An Incremental Learning Approach for Sustainable Regional Isolation and Integration3.670.943, 5, 3
4171Very Large Scale Multi-Agent Reinforcement Learning with Graph Attention Mean Field3.670.943, 5, 3
4172Representation Mutual Learning for End-to-End Weakly-Supervised Semantic Segmentation3.670.943, 3, 5
4173Consistent and Truthful Interpretation with Fourier Analysis3.670.945, 3, 3
4174GENERALIZED MATRIX LOCAL LOW RANK REPRESENTATION BY RANDOM PROJECTION AND SUBMATRIX PROPAGATION3.670.943, 3, 5
4175Formulating and Proving the Trend of DNNs Learning Simple Concepts3.670.945, 3, 3
4176Selective Classification Via Neural Network Training Dynamics3.670.945, 3, 3
4177FlexPose: Pose Distribution Adaptation with Few-shot Guidance3.670.943, 3, 5
4178Structure-Sensitive Graph Dictionary Embedding for Graph Classification3.670.943, 5, 3
4179Variational Autoencoders with Decremental Information Bottleneck for Disentanglement3.670.943, 3, 5
4180FreeSeg: Free Mask from Interpretable Contrastive Language-Image Pretraining for Semantic Segmentation3.670.943, 3, 5
4181(LA)YER-NEIGH(BOR) SAMPLING: DEFUSING NEIGHBORHOOD EXPLOSION3.670.945, 3, 3
4182Feint in Multi-Player Games3.671.895, 5, 1
4183Metro: Memory-Enhanced Transformer for Retrosynthetic Planning via Reaction Tree3.670.943, 3, 5
4184Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing3.670.943, 5, 3
4185Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies3.601.203, 6, 3, 3, 3
4186SPC-Net: A New Scalable Point Cloud Compression Framework for Both Machine and Human Vision Tasks3.601.203, 3, 6, 3, 3
4187Addressing High-dimensional Continuous Action Space via Decomposed Discrete Policy-Critic3.601.206, 3, 3, 3, 3
4188Fully Continuous Gated Recurrent Units For processing Time Series3.601.743, 6, 5, 1, 3
4189Why Adversarial Training of ReLU Networks Is Difficult?3.601.743, 3, 5, 1, 6
4190Machine Learning from Explanations3.501.663, 5, 5, 1
4191Transformer needs NMDA receptor nonlinearity for long-term memory3.500.873, 3, 3, 5
4192Rethinking the Value of Prompt Learning for Vision-Language Models3.500.873, 3, 3, 5
4193Towards Performance-maximizing Network Pruning via Global Channel Attention3.500.873, 3, 3, 5
4194Object-Centric Learning with Slot Mixture Models3.500.873, 3, 5, 3
4195RISC-V MICROARCHITECTURE EXPLORATION VIA REINFORCEMENT LEARNING3.500.873, 3, 3, 5
4196How (Un)Fair is Text Summarization?3.500.875, 3, 3, 3
4197Simulating Task-Free Continual Learning Streams From Existing Datasets3.500.873, 3, 5, 3
4198Attention Flows for General Transformers3.500.873, 3, 3, 5
4199Group-Disentangling Conditional Shift3.500.873, 3, 3, 5
4200Distance VS. Coordinate: Distance Based Embedding Improves Model Generalization for Routing Problems3.500.873, 3, 3, 5
4201Text2Model: Model Induction for Zero-shot Generalization Using Task Descriptions3.500.873, 3, 5, 3
4202Opportunistic Actor-Critic (OPAC) with Clipped Triple Q-learning3.500.875, 3, 3, 3
4203On Information Maximisation in Multi-View Self-Supervised Learning3.501.663, 1, 5, 5
4204SRBGCN: Tangent space-Free Lorentz Transformations for Graph Feature Learning3.500.875, 3, 3, 3
4205Mirror Training for Input Convex Neural Network3.501.665, 3, 5, 1
4206A Benchmark Dataset for Learning from Label Proportions3.500.875, 3, 3, 3
4207Don’t Bet on Sparsity: Designing Brain-inspired Distance-preserving Encoder3.500.873, 3, 5, 3
4208Learned Nearest-Class-Mean for Biased Representations in Long-Tailed Recognition3.500.873, 3, 3, 5
4209DYNAMIC BATCH NORM STATISTICS UPDATE FOR NATURAL ROBUSTNESS3.500.873, 5, 3, 3
4210MixBin: Towards Budgeted Binarization3.500.873, 3, 3, 5
4211Corruption-free Single-view Self-supervised Learning on Graphs3.500.875, 3, 3, 3
4212Quasiconvex Shallow Neural Network3.500.873, 5, 3, 3
4213Text-Conditioned Graph Generation Using Discrete Graph Variational Autoencoders3.500.873, 5, 3, 3
4214Diffusion-based point cloud generation with smoothness constraints3.500.875, 3, 3, 3
4215Towards Out-of-Distribution Adversarial Robustness3.500.873, 3, 3, 5
4216Learning to perceive objects by prediction3.500.873, 3, 5, 3
4217Why do Models with Conditional Computation Learn Suboptimal Solutions?3.500.873, 5, 3, 3
4218Divide-and-Cluster: Spatial Decomposition Based Hierarchical Clustering3.500.873, 3, 3, 5
4219Fast Yet Effective Graph Unlearning through Influence Analysis3.500.873, 3, 3, 5
4220TI-VAE: A temporally independent VAE with applications to latent factor learning in neuroimaging3.500.873, 5, 3, 3
4221On Representation Learning Under Class Imbalance3.500.873, 3, 5, 3
4222Efficient Stochastic Optimization for Attacking Randomness Involved Inference3.500.873, 3, 3, 5
4223GLINKX: A Scalable Unified Framework For Homophilous and Heterophilous Graphs3.500.873, 5, 3, 3
4224Graph Neural Networks as Multi-View Learning3.500.875, 3, 3, 3
4225A Retrieve-and-Read Framework for Knowledge Graph Reasoning3.500.873, 3, 5, 3
4226FLGAME: A Game-theoretic Defense against Backdoor Attacks In Federated Learning3.501.665, 1, 5, 3
4227High-Precision Regressors for Particle Physics3.501.661, 5, 5, 3
4228Fine-Tuning Offline Policies With Optimistic Action Selection3.500.873, 3, 5, 3
4229Test-Time Training on Video Streams3.501.665, 3, 5, 1
4230The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses3.500.873, 3, 3, 5
4231CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data3.501.665, 5, 3, 1
4232Semi-supervised consistency regularization for accurate cell type fraction and gene expression estimation3.500.873, 3, 5, 3
4233Pareto Rank-Preserving Supernetwork for HW-NAS3.500.873, 3, 5, 3
4234PGASL: Predictive and Generative Adversarial Semi-supervised Learning for imbalanced data3.500.873, 5, 3, 3
4235MaxMin-Novelty: Maximizing Novelty via Minimizing the State-Action Values in Deep Reinforcement Learning3.501.661, 3, 5, 5
4236Handling Covariate Shifts in Federated Learning with Generalization Guarantees3.500.873, 3, 5, 3
4237Hierarchical Neural Program Synthesis3.500.875, 3, 3, 3
4238SPIDER: Searching Personalized Neural Architecture for Federated Learning3.500.873, 5, 3, 3
4239Robust Graph Representation Learning via Predictive Coding3.500.875, 3, 3, 3
4240Brain Signal Generation and Data Augmentation with a Single-Step Diffusion Probabilistic Model3.501.661, 5, 3, 5
4241Bounded Attacks and Robustness in Image Transform Domains3.500.875, 3, 3, 3
4242Efficient Exploration using Model-Based Quality-Diversity with Gradients3.500.873, 3, 5, 3
4243Distinguishing Feature Model for Ranking From Pairwise Comparisons3.501.663, 5, 1, 5
4244Applying Second Order Optimization to Deep Transformers with Parameter-Efficient Tuning3.500.873, 3, 5, 3
4245Mask-tuning: Towards Improving Pre-trained Language Models' Generalization3.500.873, 5, 3, 3
4246Tiered Pruning for Efficient Differentialble Inference-Aware Neural Architecture Search3.500.873, 5, 3, 3
4247Spurious Features in Continual Learning3.500.875, 3, 3, 3
4248Why Did This Model Forecast This Future? Information-Theoretic Temporal Saliency for Counterfactual Explanations of Probabilistic Forecasts3.500.873, 3, 5, 3
4249Topological Data Analysis-Deep Learning Framework for Predicting Cancer Phenotypes3.501.661, 3, 5, 5
4250Reprogramming Large Pretrained Language Models for Antibody Sequence Infilling3.500.875, 3, 3, 3
4251Differentially Private Conditional Text Generation For Synthetic Data Production3.500.873, 5, 3, 3
4252FUN: Filter-based Unlearnable Datasets3.500.873, 5, 3, 3
4253GMML is All you Need3.500.873, 5, 3, 3
4254Variational Pseudo Labels for Meta Test-time Adaptation3.500.875, 3, 3, 3
4255Continuously Parameterized Mixture Models3.500.873, 3, 5, 3
4256DP-InstaHide: Data Augmentations Provably Enhance Guarantees Against Dataset Manipulations3.500.875, 3, 3, 3
4257Affinity-VAE for clustering and classification of objects in multidimensional image data3.500.873, 5, 3, 3
4258Guided Safe Shooting: model based reinforcement learning with safety constraints3.500.873, 3, 5, 3
4259Counterfactual Explanation via Search in Gaussian Mixture Distributed Latent Space3.500.873, 5, 3, 3
4260AGREE: A Simple Aggregator of Detectors’ Decisions3.500.873, 3, 5, 3
4261Prompt Injection: Parameterization of Fixed Inputs3.500.875, 3, 3, 3
4262LEXA: Language-agnostic Cross-consistency Training for Question Answering Tasks3.500.873, 3, 5, 3
4263RulE: Neural-Symbolic Knowledge Graph Reasoning with Rule Embedding3.500.873, 3, 3, 5
4264Improving the generalization ability of the chaotic time-series classification models by residual component extraction3.501.665, 1, 3, 5
4265Consciousness-Aware Multi-Agent Reinforcement Learning3.501.661, 5, 3, 5
4266Pseudo-Edge: Semi-Supervised Link Prediction with Graph Neural Networks3.500.873, 3, 3, 5
4267Can Fair Federated Learning reduce the need for personalization?3.500.873, 3, 3, 5
4268Dynamical Signatures of Learning in Recurrent Networks3.500.873, 5, 3, 3
4269Preventing Mode Collapse When Imitating Latent Policies from Observations3.500.875, 3, 3, 3
4270Compositional Image Generation and Manipulation with Latent Diffusion Models3.500.875, 3, 3, 3
4271Cross-Protein Wasserstein Transformer for Protein-Protein Interactions3.500.873, 3, 5, 3
4272Inverse Optimal Transport with Application to Contrastive Learning3.500.873, 3, 3, 5
4273Demystifying black-box DNN training processes through Concept-Monitor3.501.663, 1, 5, 5
4274Improving the Estimation of Instance-dependent Transition Matrix by using Self-supervised Learning3.501.661, 5, 3, 5
4275A general differentially private learning framework for decentralized data3.500.873, 3, 3, 5
4276ReG-NAS: Graph Neural Network Architecture Search using Regression Proxy Task3.500.873, 5, 3, 3
4277MaskNeRF: Masked Neural Radiance Fields for Sparse View Synthesis3.500.873, 5, 3, 3
4278Penalizing the High-likelihood: A Novel Sampling Method for Open-ended Neural Text Generation via Inverse Probability Weighting3.500.873, 3, 3, 5
4279Injecting Image Details into CLIP's Feature Space3.500.873, 3, 3, 5
4280OCIM : Object-centric Compositional Imagination for Visual Abstract Reasoning3.500.873, 5, 3, 3
4281SplitMixer: Fat Trimmed From MLP-like Models3.501.661, 3, 5, 5
4282Effectively Clarify Confusion via Visualized Aggregation and Separation of Deep Representation3.500.873, 3, 3, 5
4283The Impact of Neighborhood Distribution in Graph Convolutional Networks3.500.873, 3, 3, 5
4284Structural Code Representation Learning for Auto-Vectorization3.500.873, 3, 5, 3
4285MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features3.500.873, 3, 5, 3
4286Learning-Based Radiomic Prediction of Type 2 Diabetes Mellitus Using Image-Derived Phenotypes3.500.875, 3, 3, 3
4287Revisiting Instance-Reweighted Adversarial Training3.500.873, 3, 3, 5
4288Examining the Difference Among Transformers and CNNs with Explanation Methods3.501.663, 1, 5, 5
4289Few-Shot Text Classification with Dual Contrastive Consistency Training3.500.873, 3, 3, 5
4290Capsa: A Unified Framework for Quantifying Risk in Deep Neural Networks3.500.873, 3, 3, 5
4291Self-supervised Continual Learning based on Batch-mode Novelty Detection3.500.873, 3, 3, 5
4292A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games3.501.661, 5, 5, 3
4293TRIDE: A Temporal, Robust, and Informative Data Augmentation Framework for Disease Progression Modeling3.500.873, 5, 3, 3
4294Approximate Conditional Coverage via Neural Model Approximations3.500.873, 5, 3, 3
4295Towards Representative Subset Selection for Self-Supervised Speech Recognition3.500.873, 5, 3, 3
4296Learning to Act through Activation Function Optimization in Random Networks3.500.875, 3, 3, 3
4297Representation Learning via Consistent Assignment of Views over Random Partitions3.500.873, 3, 3, 5
4298PRANC: Pseudo RAndom Networks for Compacting deep models3.500.873, 3, 3, 5
4299Biological connectomes as a representation for the architecture of artificial neural networks3.501.665, 5, 1, 3
4300Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation3.500.875, 3, 3, 3
4301Task Regularized Hybrid Knowledge Distillation For Continual Object Detection3.500.875, 3, 3, 3
4302GOING BEYOND 1-WL EXPRESSIVE POWER WITH 1-LAYER GRAPH NEURAL NETWORKS3.500.873, 3, 3, 5
4303Backdoors Stuck At The Frontdoor: Multi-Agent Backdoor Attacks That Backfire3.501.661, 5, 5, 3
4304Less is More: Rethinking Few-Shot Learning and Recurrent Neural Nets3.500.873, 5, 3, 3
4305When Neural ODEs meet Neural Operators3.500.873, 5, 3, 3
4306Reducing Forgetting In Federated Learning with Truncated Cross-Entropy3.500.873, 5, 3, 3
4307FedEED: Efficient Federated Distillation with Ensemble of Aggregated Models3.500.873, 3, 5, 3
4308A Simple, Yet Effective Approach to Finding Biases in Code Generation3.500.873, 5, 3, 3
4309Surrogate Gradient Design for LIF networks3.500.873, 3, 3, 5
4310The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks3.500.873, 5, 3, 3
4311Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces3.502.501, 6, 1, 6
4312Linear Scalarization for Byzantine-Robust Learning on non-IID data3.500.873, 3, 3, 5
4313Planning With Uncertainty: Deep Exploration in Model-Based Reinforcement Learning3.500.873, 3, 3, 5
4314A Hierarchical Hyper-rectangle Mass Model for Fine-grained Entity Typing3.500.873, 5, 3, 3
4315Enhancing the Transferability of Adversarial Examples via a Few Queries and Fuzzy Domain Eliminating3.501.661, 5, 3, 5
4316Towards Information-Theoretic Pattern Mining in Time Series3.501.661, 5, 5, 3
4317SuperMarioDomains: Generalizing to Domains with Evolving Graphics3.500.873, 3, 5, 3
4318AIA: learn to design greedy algorithm for NP-complete problems using neural networks3.500.873, 3, 3, 5
4319AVT: Audio-Video Transformer for Multimodal Action Recognition3.500.873, 3, 5, 3
4320Accelerating Adaptive Federated Optimization with Local Gossip Communications3.500.873, 5, 3, 3
4321On the Complexity of Bayesian Generalization3.500.873, 3, 3, 5
4322Compound Tokens: Channel Fusion for Vision-Language Representation Learning3.500.875, 3, 3, 3
4323Are vision transformers more robust than CNNs for Backdoor attacks?3.500.875, 3, 3, 3
4324Fair Federated Learning via Bounded Group Loss3.500.873, 5, 3, 3
4325Target-Free Ligand Scoring via One-Shot Learning3.500.873, 3, 3, 5
4326Beyond Traditional Transfer Learning: Co-finetuning for Action Localisation3.500.873, 3, 5, 3
4327Neural Embeddings for Text3.500.873, 3, 3, 5
4328Tessellated Neural Networks: A Robust Defence against Adversarial Attacks3.500.873, 3, 3, 5
4329Deep Reinforcement learning on Adaptive Pairwise Critic and Asymptotic Actor3.500.873, 5, 3, 3
4330Causal Inference via Nonlinear Variable Decorrelation in Healthcare3.500.873, 5, 3, 3
4331DoE2Vec: Representation Learning for Exploratory Landscape Analysis3.500.873, 5, 3, 3
4332Test-time recalibration of conformal predictors under distribution shift based on unlabeled examples3.500.875, 3, 3, 3
4333Newton Losses: Efficiently Including Second-Order Information into Gradient Descent3.500.873, 5, 3, 3
4334When is Adversarial Robustness Transferable?3.500.873, 3, 5, 3
4335On the Connection between Fisher's Criterion and Shannon's Capacity: Theoretical Concepts and Implementation3.500.873, 5, 3, 3
4336Neural Operator Variational Inference based on Regularized Stein Discrepancy for Deep Gaussian Processes3.500.873, 3, 3, 5
4337Self Check-in: Tight Privacy Amplification for Practical Distributed Learning3.501.661, 5, 5, 3
4338Understanding Catastrophic Overfitting in Fast Adversarial Training From a Non-robust Feature Perspective3.500.873, 3, 3, 5
4339TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective3.500.873, 3, 3, 5
4340Generative Multi-Flow Networks: Centralized, Independent and Conservation3.500.873, 3, 5, 3
4341motifNet: Functional motif interactions discovered in mRNA sequences with implicit neural representation learning3.500.873, 3, 3, 5
4342Rethinking Data Augmentation for Improving Transferable Targeted Attacks3.500.875, 3, 3, 3
4343ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets3.500.873, 3, 3, 5
4344Cali-NCE: Boosting Cross-modal Video Representation Learning with Calibrated Alignment3.500.873, 5, 3, 3
4345Strength-Adaptive Adversarial Training3.500.875, 3, 3, 3
4346Deep Deformation Based on Feature-Constraint for 3D Human Mesh Correspondence3.500.875, 3, 3, 3
4347An Improved Baseline for Masked Contrastive Learning3.501.661, 5, 5, 3
4348Towards Generalized Combinatorial Solvers via Reward Adjustment Policy Optimization3.501.661, 3, 5, 5
4349Revisiting Embeddings for Graph Neural Networks3.500.873, 5, 3, 3
4350Empirical analysis of representation learning and exploration in neural kernel bandits3.500.873, 3, 3, 5
4351EMO: Episodic Memory Optimization for Few-Shot Meta-Learning3.500.873, 5, 3, 3
4352Explainability of deep reinforcement learning algorithms in robotic domains by using Layer-wise Relevance Propagation3.500.873, 5, 3, 3
4353High Dimensional Bayesian Optimization with Reinforced Transformer Deep Kernels3.500.873, 3, 3, 5
4354Latent Offline Distributional Actor-Critic3.500.875, 3, 3, 3
4355Leveraging variational autoencoders for multiple data imputation3.501.665, 1, 3, 5
4356Rethinking Learning Dynamics in RL using Adversarial Networks3.500.873, 5, 3, 3
4357Is Stochastic Gradient Descent Near Optimal?3.500.875, 3, 3, 3
4358Elastic Mean-Teacher Distillation Mitigates the Continual Learning Stability Gap3.500.873, 3, 3, 5
4359FONDUE: an Algorithm to Find the Optimal Dimensionality of the Latent Representations of Variational Autoencoders3.500.873, 5, 3, 3
4360Interpreting Distributional Reinforcement Learning: A Regularization Perspective3.500.873, 3, 3, 5
4361Global Hardest Example Mining with Prototype-based Triplet Loss3.500.873, 3, 3, 5
4362MGMA: Mesh Graph Masked Autoencoders for Self-supervised Learning on 3D Shape3.500.873, 3, 3, 5
4363Improving the Latent Space of Image Style Transfer3.500.875, 3, 3, 3
4364Language-Guided Artistic Style Transfer Using the Latent Space of DALL-E3.500.873, 3, 5, 3
4365Out-of-distribution Detection with Diffusion-based Neighborhood3.500.873, 3, 3, 5
4366SELF-SUPERVISED PRETRAINING FOR DIFFERENTIALLY PRIVATE LEARNING3.500.873, 3, 5, 3
4367Learning Axis-Aligned Decision Trees with Gradient Descent3.500.875, 3, 3, 3
4368A Fairness Analysis on Differentially Private Aggregation of Teacher Ensembles3.500.873, 5, 3, 3
4369Domain Specific Denoising Diffusion Probabilistic Models for Brain Dynamics3.501.663, 5, 5, 1
4370A Simple and Provable Method to Adapt Pre-trained Model across Domains with Few Samples3.500.875, 3, 3, 3
4371EyeDAS: Securing Perception of Autonomous Cars Against the Stereoblindness Syndrome3.501.665, 1, 5, 3
4372Hardware-restriction-aware training (HRAT) for memristor neural networks3.500.873, 3, 5, 3
4373ViTKD: Practical Guidelines for ViT Feature Knowledge Distillation3.500.873, 5, 3, 3
4374Sharpness-aware Quantization for Deep Neural Networks3.500.873, 3, 5, 3
4375DOTIN: Dropping Out Task-Irrelevant Nodes for GNNs3.500.875, 3, 3, 3
4376GraphCG: Unsupervised Discovery of Steerable Factors in Graphs3.500.873, 5, 3, 3
4377Rethinking Knowledge Distillation via Cross-Entropy3.500.873, 3, 5, 3
4378Progressive Mixup Augmented Teacher-Student Learning for Unsupervised Domain Adaptation3.400.803, 3, 3, 5, 3
4379On Making Graph Continual Learning Easy, Fool-Proof, and Extensive: a Benchmark Framework and Scenarios3.401.503, 3, 5, 1, 5
4380Off Policy Average Reward Actor Critic with Deterministic Policy Search3.401.501, 3, 3, 5, 5
4381Rethinking Deep Spiking Neural Networks: A Multi-Layer Perceptron Approach3.400.805, 3, 3, 3, 3
4382Cooperative Adversarial Learning via Closed-Loop Transcription3.401.505, 1, 3, 3, 5
4383Dealing with missing data using attention and latent space regularization3.400.803, 5, 3, 3, 3
4384Revisiting Information-Based Clustering with Pseudo-Posterior Models3.332.051, 6, 3
4385BiasPAD: A Bias-Progressive Auto-Debiasing Framework3.332.053, 1, 6
4386Are Graph Attention Networks Attentive Enough? Rethinking Graph Attention by Capturing Homophily and Heterophily3.332.053, 6, 1
4387Human alignment of neural network representations3.332.053, 1, 6
4388ON COMPLEX-DOMAIN CNN REPRESENTATIONS FOR CLASSIFYING REAL/COMPLEX-VALUED DATA3.332.056, 1, 3
4389Enhancing Robustness of Deep Networks Based on a Two-phase Model of Their Training with Noisy Labels3.332.053, 1, 6
4390How Erdös and Rényi Win the Lottery3.332.056, 3, 1
4391Convergence Rate of Primal-Dual Approach to Constrained Reinforcement Learning with Softmax Policy3.251.796, 3, 1, 3
4392Towards biologically plausible Dreaming and Planning3.251.791, 3, 6, 3
4393On the Convergence of Federated Deep AUC Maximization3.251.791, 6, 3, 3
4394Who are playing the games?3.251.796, 3, 1, 3
4395Post-mortem on a deep learning contest: a Simpson’s paradox and the complementary roles of scale metrics versus shape metrics3.251.793, 3, 1, 6
4396Complete Likelihood Objective for Latent Variable Models3.252.861, 3, 1, 8
4397Meta-Learning via Classifier(-free) Guidance3.251.791, 3, 6, 3
4398Marginal Probability Explanation: A Saliency Map with Closed-loop Validation3.252.281, 5, 6, 1
4399Representation Interference Suppression via Non-linear Value Factorization for Indecomposable Markov Games3.252.281, 5, 6, 1
4400Rank-1 Matrix Completion with Gradient Descent and Small Random Initialization3.251.791, 3, 6, 3
4401Exploring semantic information in disease: Simple Data Augmentation Techniques for Chinese Disease Normalization3.251.793, 1, 6, 3
4402The Curse of Low Task Diversity: On the Failure of Transfer Learning to Outperform MAML and their Empirical Equivalence3.251.793, 1, 3, 6
4403Contrastive Unsupervised Learning of World Model with Invariant Causal Features3.251.793, 1, 3, 6
4404Certification of Attribution Robustness for Euclidean Distance and Cosine Similarity Measure3.251.791, 3, 6, 3
4405Quark: A Gradient-Free Quantum Learning Framework for Classification Tasks3.251.793, 6, 1, 3
4406On the Impact of Adversarially Robust Models on Algorithmic Recourse3.251.793, 1, 6, 3
4407Link Prediction without Graph Neural Networks3.251.791, 6, 3, 3
4408scFormer: a universal representation learning approach for single-cell data using transformers3.251.793, 6, 1, 3
4409The Crossword Puzzle: Simplifying Deep Neural Network Pruning with Fabulous Coordinates3.202.046, 5, 1, 1, 3
4410Suppression helps: Lateral Inhibition-inspired Convolutional Neural Network for Image Classification3.001.411, 3, 5, 3
4411Detecting Out-of-Distribution Data with Semi-supervised Graph “Feature' Networks3.001.413, 3, 1, 5
4412Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation3.000.003, 3, 3
4413Towards scalable and non-IID robust Hierarchical Federated Learning via Label-driven Knowledge Aggregator3.000.003, 3, 3
4414Loss Adapted Plasticity: Learning From Data With Unreliable Sources3.000.003, 3, 3
4415Online black-box adaptation to label-shift in the presence of conditional-shift3.000.003, 3, 3, 3
4416Improving Protein Interaction Prediction using Pretrained Structure Embedding3.000.003, 3, 3, 3
4417Versatile Energy-Based Models for High Energy Physics3.001.413, 5, 3, 1
4418Mixture of Basis for Interpretable Continual Learning with Distribution Shifts3.001.413, 1, 5, 3
4419Scrunch: Preventing sensitive property inference through privacy-preserving representation learning3.000.003, 3, 3
4420GM-VAE: Representation Learning with VAE on Gaussian Manifold3.000.003, 3, 3
4421Learning Test Time Augmentation with Cascade Loss Prediction3.000.003, 3, 3, 3
4422Optimizing Data-Flow in Binary Neural Networks3.000.003, 3, 3, 3
4423Neural Representations in Multi-Task Learning guided by Task-Dependent Contexts3.000.003, 3, 3, 3
4424Multi Task Learning of Different Class Label Representations for Stronger Models3.001.413, 3, 1, 5
4425Oscillation Neural Ordinary Differential Equations3.000.003, 3, 3
4426Noise Transforms Feed-Forward Networks into Sparse Coding Networks3.000.003, 3, 3, 3
4427Robust attributions require rethinking robustness metrics3.001.263, 3, 3, 5, 1
4428Atomized Deep Learning Models3.000.003, 3, 3, 3, 3
4429How Should I Plan? A Performance Comparison of Decision-Time vs. Background Planning3.001.415, 1, 3, 3
4430Towards Diverse Perspective Learning with Switch over Multiple Temporal Pooling3.001.631, 3, 5
4431Probe Into Multi-agent Adversarial Reinforcement Learning through Mean-Field Optimal Control3.001.413, 1, 5, 3
4432LEARNING DYNAMIC ABSTRACT REPRESENTATIONS FOR SAMPLE-EFFICIENT REINFORCEMENT LEARNING3.000.003, 3, 3
4433Boosting Adversarial Training with Masked Adaptive Ensemble3.000.003, 3, 3, 3
4434Disentangled Conditional Variational Autoencoder for Unsupervised Anomaly Detection3.000.003, 3, 3, 3
4435Protecting Bidder Information in Neural Auctions3.000.003, 3, 3
4436META-LEARNING FOR UNSUPERVISED OUTLIER DETECTION WITH OPTIMAL TRANSPORT3.001.415, 1, 3, 3
4437ADVL: Adaptive Distillation for Vision-Language Tasks3.000.003, 3, 3
4438Learning Arborescence with An Efficient Inference Algorithm3.000.003, 3, 3
4439Cross-Domain Self-Supervised Deep Learning for Robust Alzheimer's Disease Progression Modeling3.001.413, 3, 1, 5
4440Systematic Generalization and Emergent Structures in Transformers Trained on Structured Tasks3.001.635, 1, 3
4441DeepDFA: Dataflow Analysis-Guided Efficient Graph Learning for Vulnerability Detection3.000.003, 3, 3, 3
4442Spatial Reasoning Network for Zero-shot Constrained Scene Generation3.001.635, 1, 3
4443Optimal control neural networks for data-driven discovery of gradient flows.3.000.003, 3, 3, 3
4444NOTELA: A Generalizable Method for Source Free Domain Adaptation3.000.003, 3, 3, 3
4445Federated Representation Learning via Maximal Coding Rate Reduction3.001.411, 3, 5, 3
4446Memory Efficient Dynamic Sparse Training3.000.003, 3, 3, 3
4447Temporal Change Sensitive Representation for Reinforcement Learing3.000.003, 3, 3, 3
4448TKIL: Tangent Kernel Optimization for Class Balanced Incremental Learning3.000.003, 3, 3, 3
4449A Framework for Comprehensive Evaluations of Graph Neural Network based Community Detection using Node Clustering3.000.003, 3, 3
4450Improving the Strength of Human-Like Models in Chess3.000.003, 3, 3, 3
4451Domain Transfer with Large Dynamics Shift in Offline Reinforcement Learning3.000.003, 3, 3
4452Real Data Distributions Prefer Simplicity and So Do Our Models: Why Machine Learning and Model Selection Are Possible3.000.003, 3, 3, 3
4453Continual Active Learning3.001.413, 5, 3, 1
4454Pessimistic Model-Based Actor-Critic for Offline Reinforcement Learning: Theory and Algorithms3.000.003, 3, 3, 3
4455Improving Adversarial Robustness of Deep Neural Networks via Self-adaptive Margin Defense3.000.003, 3, 3
4456Knowledge Cascade: Reverse Knowledge Distillation3.000.003, 3, 3
4457Membership Leakage in Pre-trained Language Models3.001.633, 1, 5
4458An Exploration of Conditioning Methods in Graph Neural Networks3.000.003, 3, 3
4459Robust Policy Optimization in Deep Reinforcement Learning3.000.003, 3, 3, 3
4460EiX-GNN : Concept-level eigencentrality explainer for graph neural networks3.001.411, 5, 3, 3
4461The Minimal Feature Removal Problem in Neural Networks3.001.633, 5, 1
4462Continuous Depth Recurrent Neural Differential Equations3.000.003, 3, 3
4463Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning3.000.003, 3, 3, 3
4464Progressive Data Dropout: An Adaptive Training Strategy for Large-Scale Supervised Learning3.000.003, 3, 3, 3
4465Towards a Mathematics Formalisation Assistant using Large Language Models3.001.413, 1, 5, 3
4466Learning Portable Skills by Identifying Generalizing Features with an Attention-Based Ensemble3.000.003, 3, 3
4467Data dependent frequency sensitivity of convolutional neural networks3.000.003, 3, 3
4468Is end-to-end learning enough for fitness activity recognition?3.000.943, 3, 3, 5, 3, 3, 3, 1, 3
4469Forget to Learn (F2L): Rethinking Replay Loss in Unsupervised Continuous Domain Adaptation3.001.633, 5, 1
4470Single SMPC Invocation DPHelmet: Differentially Private Distributed Learning on a Large Scale3.000.003, 3, 3, 3
4471Robust Exploration via Clustering-based Online Density Estimation3.000.003, 3, 3
4472Using semantic distance for diverse and sample efficient genetic programming3.001.631, 3, 5
4473Soft Sampling for Efficient Training of Deep Neural Networks on Massive Data3.000.003, 3, 3
4474Improving Adversarial Robustness by Contrastive Guided Diffusion Process3.000.003, 3, 3
4475Revealing Dominant Eigendirections via Spectral Non-Robustness Analysis in the Deep Reinforcement Learning Policy Manifold3.000.003, 3, 3, 3, 3
4476Enhanced Spatio-Temporal Image Encoding for Online Human Activity Recognition3.000.003, 3, 3, 3
4477SmilesFormer: Language Model for Molecular Design3.001.631, 3, 5
4478A NEW PARADIGM FOR CROSS-MODALITY PERSON RE-IDENTIFICATION3.000.003, 3, 3, 3
4479Using Planning to Improve Semantic Parsing of Instructional Texts3.001.413, 3, 1, 5
4480Model Stealing Attacks Against Vision-Language Models3.001.415, 1, 3, 3
4481Improved Stein Variational Gradient Descent with Importance Weights3.001.633, 5, 1
4482Reducing Communication Entropy in Multi-Agent Reinforcement Learning3.000.003, 3, 3, 3
4483Time-Myopic Go-Explore: Learning A State Representation for the Go-Explore Paradigm3.000.003, 3, 3, 3
4484Physics Model-based Autoencoding for Magnetic Resonance Fingerprinting3.000.003, 3, 3, 3
4485Lightweight Equivariant Graph Representation Learning for Protein Engineering3.001.631, 3, 5
4486Optimizing Connectivity through Network Gradients for the Restricted Machine3.000.003, 3, 3
4487QUIC-FL: : Quick Unbiased Compression for Federated Learning3.000.003, 3, 3
4488FedMEKT: Split Multimodal Embedding Knowledge Transfer in Federated Learning3.000.003, 3, 3, 3
4489End-to-End Speech Synthesis Based on Deep Conditional Schrödinger Bridges3.001.413, 5, 1, 3
4490CCT: Cross-consistency training for Clone Detection and Code Search Tasks3.001.415, 3, 3, 1
4491GraphVF: Controllable Protein-Specific 3D Molecule Generation with Variational Flow3.000.003, 3, 3, 3
4492Comparing Auxiliary Tasks for Learning Representations for Reinforcement Learning3.000.003, 3, 3, 3
4493UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion3.000.003, 3, 3
4494Server Aggregation as Linear Regression: Reformulation for Federated Learning3.000.003, 3, 3, 3
4495The Effective coalitions of Shapley value For Integrated Gradients3.000.003, 3, 3
4496Tree-structure segmentation for logistic regression3.000.003, 3, 3
4497Learning to solve the Hidden Clique Problem with Graph Neural Networks3.000.003, 3, 3
4498PREDICTION OF TOURISM FLOW WITH SPARSE DATA INCORPORATING TOURIST GEOLOCATIONS3.000.003, 3, 3, 3
4499Meta-learning with Auto-generated Tasks for Predicting Human Behaviour in Normal Form Games3.001.415, 3, 3, 1
4500Decentralized Policy Optimization3.000.003, 3, 3
4501Image Segmentation using Transfer Learning with DeepLabv3 to Facilitate Photogrammetric Limb Scanning3.000.003, 3, 3
4502Augmentative Topology Agents For Open-Ended Learning3.000.003, 3, 3, 3
4503Revisiting Over-smoothing in Graph Neural Networks3.000.003, 3, 3, 3
4504StepGCN: Step-oriented Graph Convolutional Networks in Representation Learning3.000.003, 3, 3
4505Gradient-based Algorithms for Pessimistic Bilevel Optimization3.000.003, 3, 3
4506ENHANCING THE PRIVACY OF FEDERATED LEARNING THROUGH DATA SYNTHESIS3.001.411, 3, 3, 5
4507The Emergence of Prototypicality: Unsupervised Feature Learning in Hyperbolic Space3.000.003, 3, 3, 3
4508Coordinated Strategy Identification Multi-Agent Reinforcement Learning3.000.003, 3, 3
4509Evaluating Robustness of Generative Models with Adversarial Networks3.000.003, 3, 3
4510Approximating How Single Head Attention Learns3.000.003, 3, 3
4511MVP: Multi-task Supervised Pre-training for Natural Language Generation3.001.635, 1, 3
4512Improving Inductive Link Prediction through Learning Generalizable Node Representations3.001.413, 3, 1, 5
4513ATTRIBUTES RECONSTRUCTION IN HETEROGENEOUS NETWORKS VIA GRAPH AUGMENTATION3.001.635, 1, 3
4514HAS IT REALLY IMPROVED? KNOWLEDGE GRAPH BASED SEPARATION AND FUSION FOR RECOMMENDATION3.000.003, 3, 3
4515On Assimilating Learned Views in Contrastive Learning3.000.003, 3, 3, 3
4516Block-Diagonal Structure Learning for Subspace Clustering3.000.003, 3, 3
4517Thrust: Adaptively Propels Large Language Models with External Knowledge3.001.413, 3, 5, 1
4518SGD and Weight Decay Provably Induce a Low-Rank Bias in Neural Networks3.001.411, 3, 3, 5
4519Transfer Learning with Context-aware Feature Compensation3.000.003, 3, 3
4520TuneUp: A Training Strategy for Improving Generalization of Graph Neural Networks3.000.003, 3, 3, 3
4521Logical view on fairness of a binary classification task3.001.631, 5, 3
4522Active Sampling for Node Attribute Completion on Graphs3.001.413, 3, 1, 5
4523Emb-GAM: an Interpretable and Efficient Predictor using Pre-trained Language Models3.001.633, 1, 5
4524FedCUAU: Clustered Federated Learning using weight divergence3.000.003, 3, 3
4525A Probabilistic Approach to Self-Supervised Learning using Cyclical Stochastic Gradient MCMC3.000.003, 3, 3
4526Tabular Data to Image Generation: Benchmark Data, Approaches, and Evaluation3.000.003, 3, 3
4527Representing Latent Dimensions Using Compressed Number Lines3.001.631, 5, 3
4528Deep Invertible Approximation of Topologically Rich Maps between Manifolds3.001.413, 1, 5, 3
4529Neural Graphical Models3.000.003, 3, 3, 3
4530Meta-learning from demonstrations improves compositional generalization3.000.003, 3, 3, 3
4531Communication-Optimal Distributed Graph Clustering under Duplication Models3.001.631, 3, 5
4532LSTM-BASED-AUTO-BI-LSTM for Remaining Useful Life (RUL) Prediction: the first round of test results3.000.003, 3, 3
4533ModReduce: A Multi-Knowledge Distillation Framework with Online Learning3.001.413, 5, 3, 1
4534Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning3.000.003, 3, 3, 3, 3
4535Isometric Representations in Neural Networks Improve Robustness3.000.003, 3, 3, 3
4536CBP-QSNN: Spiking Neural Networks Quantized Using Constrained Backpropagation3.000.003, 3, 3
4537Disentangled (Un)Controllable Features3.000.003, 3, 3, 3
4538CWATR: Generating Richer Captions with Object Attributes3.000.003, 3, 3
4539QUANTIZATION AWARE FACTORIZATION FOR DEEP NEURAL NETWORK COMPRESSION3.000.003, 3, 3, 3
4540Fairness of Federated Learning with Dynamic Participants3.000.003, 3, 3
4541Context and History Aware Other-Shaping3.001.413, 1, 3, 5
4542SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation3.000.003, 3, 3
4543Bidirectional global to local attention for deep metric learning.3.001.413, 5, 1, 3
4544Class Interference of Deep Networks3.001.633, 5, 1
4545Bi-Level Dynamic Parameter Sharing among Individuals and Teams for Promoting Collaborations in Multi-Agent Reinforcement Learning3.000.003, 3, 3, 3
4546Uplift Modelling based on Graph Neural Network Combined with Causal Knowledge3.000.003, 3, 3
4547SynMotor: A Benchmark Suite for Object Attribute Regression and Multi-task Learning3.001.631, 3, 5
4548Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning3.000.003, 3, 3
4549Signs in the Lottery: Structural Similarities Between Winning Tickets3.001.413, 1, 5, 3
4550To be private and robust: Differentially Private Optimizers Can Learn Adversarially Robust Models3.001.265, 3, 1, 3, 3
4551Fine-Grained Source Code Vulnerability Detection via Graph Neural Networks3.001.411, 3, 5, 3
4552APLA: Class-imbalanced Semi-supervised Learning with Adapative Pseudo-labeling and Loss Adjustment3.001.631, 3, 5
4553Hypernetwork approach to Bayesian MAML3.000.003, 3, 3
4554Deep Leakage from Model in Federated Learning3.001.633, 5, 1
4555Existence of a bad local minimum of neural networks with general smooth activation functions3.000.003, 3, 3, 3
4556ADVERSARY-AWARE PARTIAL LABEL LEARNING WITH LABEL DISTILLATION3.001.413, 1, 3, 5
4557Identical Initialization: A Universal Approach to Fast and Stable Training of Neural Networks3.000.003, 3, 3, 3
4558Detecting Backdoor Attacks via Layer-wise Feature Analysis3.000.003, 3, 3
4559Neural Layered Min-sum Decoders for Algebraic Codes3.000.003, 3, 3
4560The Importance of Suppressing Complete Reconstruction in Autoencoders for Unsupervised Outlier Detection3.000.003, 3, 3, 3
4561CENTROID-BASED JOINT REPRESENTATION FOR HUMAN POSE ESTIMATION AND INSTANCE SEGMENTATION3.001.633, 1, 5
4562Leveraging Hard Negative Priors for Automatic Medical Report Generation3.000.003, 3, 3, 3
4563MULTI-VIEW DEEP EVIDENTIAL FUSION NEURAL NETWORK FOR ASSESSMENT OF SCREENING MAMMOGRAMS3.001.413, 5, 3, 1
4564Probable Dataset Searching Method with Uncertain Dataset Information in Adjusting Architecture Hyper Parameter3.000.003, 3, 3
4565Scaled Neural Multiplicative Model for Tractable Optimization3.001.631, 5, 3
4566On the Power-Law Hessian Spectra in Deep Learning3.000.003, 3, 3
4567Theoretical generalization bounds for improving the efficiency of deep online training3.000.003, 3, 3, 3
4568A Representation Bottleneck of Bayesian Neural Networks3.000.003, 3, 3
4569LAU: A novel two-parameter learnable Logmoid Activation Unit3.001.631, 3, 5
4570N-Student Learning: An Approach to Model Uncertainty and Combat Overfitting3.000.003, 3, 3
4571Better handling unlabeled entity problem using PU-learning and negative sampling3.000.003, 3, 3, 3
4572Communication-Efficient and Drift-Robust Federated Learning via Elastic Net3.000.003, 3, 3, 3
4573Partition Matters in Learning and Learning-to-Learn Implicit Neural Representations3.000.003, 3, 3, 3
4574Substructured Graph Convolution for Non-overlapping Graph Decomposition3.000.003, 3, 3
4575Inverse Kernel Decomposition3.000.003, 3, 3, 3
4576An Investigation of Domain Generalization with Rademacher Complexity3.000.003, 3, 3, 3
4577ProGen2: Exploring the Boundaries of Protein Language Models3.000.003, 3, 3
4578Convergence of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss3.000.003, 3, 3, 3
4579Spotting Expressivity Bottlenecks and Fixing Them Optimally3.000.003, 3, 3, 3
4580Diffusing Graph Attention3.000.003, 3, 3, 3
4581AdaptFSP: Adaptive Fictitious Self Play3.000.003, 3, 3, 3
4582An Intrinsic Dimension Perspective of Transformers for Sequential Modeling3.001.411, 3, 3, 5
4583TabDDPM: Modelling Tabular Data with Diffusion Models3.001.413, 1, 5, 3
4584ErGOT: entropy-regularized graph optimal transport3.000.003, 3, 3, 3
4585Considering Layerwise Importance in the Lottery Ticket Hypothesis3.000.003, 3, 3
4586Memory of Unimaginable Outcomes in Experience Replay3.000.003, 3, 3, 3
4587Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting3.000.003, 3, 3
4588TGP: Explainable Temporal Graph Neural Networks for Personalized Recommendation3.001.411, 3, 5, 3
4589Efficient Policy Space Response Oracles3.001.633, 5, 1
4590RetinexUTV: ROBUST RETINEX MODEL WITH UNFOLDING TOTAL VARIATION3.001.413, 1, 3, 5
4591Learning in Compressed Domain via Knowledge Transfer3.000.003, 3, 3, 3
4592Generative Recorrupted-to-Recorrupted: An Unsupervised Image Denoising Network for Arbitrary Noise Distribution3.001.413, 1, 5, 3
4593Protective Label Enhancement for Label Privacy3.001.631, 3, 5
4594Low-Entropy Features Hurt Out-of-Distribution Performance3.000.003, 3, 3, 3
4595Determinant regularization for Deep Metric Learning3.001.413, 1, 3, 5
4596Learning to Communicate using Contrastive Learning3.000.003, 3, 3
4597Flexible Relation Preserving for Adversarial Training3.001.633, 1, 5
4598Joint Spatiotemporal Attention for Mortality Prediction of Patients with Long COVID3.000.003, 3, 3
4599PA-LoFTR: Local Feature Matching with 3D Position-Aware Transformer3.000.003, 3, 3
4600Explaining Representation Bottlenecks of Convolutional Decoder Networks3.000.003, 3, 3, 3
4601Divide and conquer policy for efficient GAN training3.001.413, 3, 5, 1
4602TaylorNet: A Taylor-Driven Generic Neural Architecture3.000.003, 3, 3
4603Coupling Semi-supervised Learning with Reinforcement Learning for Better Decision Making -- An application to Cryo-EM Data Collection3.000.003, 3, 3
4604ProtoVAE: Using Prototypical Networks for Unsupervised Disentanglement3.000.003, 3, 3, 3
4605Abstract Visual Reasoning by Self-supervised Contrastive Learning3.000.003, 3, 3, 3
4606i-MAE: Are Latent Representations in Masked Autoencoders Linearly Separable?3.000.003, 3, 3
4607Mixed-Precision Inference Quantization: Radically Towards Faster inference speed, Lower Storage requirement, and Lower Loss3.001.631, 3, 5
4608Leveraging Double Descent for Scientific Data Analysis: Face-Based Social Behavior as a Case Study3.001.413, 1, 3, 5
4609Deep Duplex Learning for Weak Supervision3.001.413, 1, 3, 5
4610Fast Test-Time Adaptation Using Hints3.000.003, 3, 3
4611Gradient Properties of Hard Thresholding Operator3.001.411, 3, 3, 5
4612Accurate and Efficient Soma Reconstruction in a Full Adult Fly Brain3.001.635, 1, 3
4613An Encryption Framework for Pre-Trained Neural Networks3.001.631, 3, 5
4614Wasserstein Fair Autoencoders3.001.633, 5, 1
4615Low-Rank Winograd Transformation for 3D Convolutional Neural Networks3.000.003, 3, 3, 3
4616Structure-based Drug Design with Equivariant Diffusion Models3.000.003, 3, 3, 3
4617Deep reinforced active learning for multi-class image classification3.000.003, 3, 3
4618Big Learning: A Universal Machine Learning Paradigm?3.001.635, 1, 3
4619Interpretable Out-of-Distribution Detection using Pattern Identification3.000.003, 3, 3
4620NASiam: Efficient Representation Learning using Neural Architecture Search for Siamese Networks3.001.411, 3, 3, 5
4621On a Built-in Conflict between Deep Learning and Systematic Generalization3.001.415, 1, 3, 3
4622Block-level Stiffness Analysis of Residual Networks3.000.003, 3, 3
4623Explainable Artificial Intelligence: Reaping the Fruits of Decision Trees3.001.415, 1, 3, 3
4624Hard Regularization to Prevent Collapse in Online Deep Clustering without Data Augmentation3.001.413, 3, 1, 5
46253D-Scene-Entities: Using Phrase-to-3D-Object Correspondences for Richer Visio-Linguistic Models in 3D Scenes3.000.003, 3, 3, 3
4626MultiWave: Multiresolution Deep Architectures through Wavelet Decomposition for Multivariate Timeseries Forecasting and Prediction3.000.003, 3, 3, 3
4627Shared Knowledge Lifelong Learning3.000.003, 3, 3, 3
4628WeightRelay: Efficient Heterogenous Federated Learning on Time Series3.000.003, 3, 3, 3
4629Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders3.000.003, 3, 3
4630Generaling Multimodal Variational Methods to Sets3.001.411, 3, 3, 5
4631Training A Multi-stage Deep Classifier with Feedback Signals3.001.631, 5, 3
4632Normalized Activation Function: Toward Better Convergence3.000.003, 3, 3, 3
4633Hybrid Neuro-Symbolic Reasoning based on Multimodal Fusion3.001.413, 1, 5, 3
4634Distilling Text-Image Foundation Models3.000.003, 3, 3, 3
4635Refining Visual Representation for Generalized Zero-Shot Recognition through Implicit-Semantics-Guided Metric Learning3.000.003, 3, 3, 3
4636A MULTI-SCALE STRUCTURE-PRESERVING HETEROLOGOUS IMAGE TRANSFORMATION ALGORITHM BASED ON CONDITIONAL ADVERSARIAL NETWORK LEARNING3.000.003, 3, 3
4637When do Convolutional Neural Networks Stop Learning?2.801.833, 3, 6, 1, 1
4638Universal Graph Neural Networks without Message Passing2.802.231, 5, 6, 1, 1
4639Understanding ReLU Network Robustness Through Test Set Certification Performance2.752.051, 1, 6, 3
4640Sparsity by Redundancy: Solving $L_1$ with a Simple Reparametrization2.752.051, 6, 1, 3
4641Self-Programming Artificial Intelligence Using Code-Generating Language Models2.600.803, 3, 3, 3, 1
4642Exploring Generalization of Non-Contrastive self-supervised Learning2.600.803, 3, 3, 1, 3
4643Quantized Disentangled Representations for Object-Centric Visual Tasks2.500.873, 1, 3, 3
4644HOW SAMPLING AFFECTS TRAINING: AN EFFECTIVE SAMPLING THEORY STUDY FOR LONG-TAILED IMAGE CLASSIFICATION2.500.871, 3, 3, 3
4645Farsighter: Efficient Multi-step Exploration for Deep Reinforcement Learning2.500.873, 3, 3, 1
4646CLASSIFICATION OF INCOMPLETE DATA USING AUGMENTED MLP2.500.873, 3, 1, 3
4647Correspondences between word learning in children and captioning models2.500.873, 3, 1, 3
4648Stabilized training of joint energy-based models and its practical applications2.500.873, 3, 1, 3
4649Robustness Evaluation Using Local Substitute Networks2.500.873, 3, 1, 3
4650An Empirical Study of the Neural Contextual Bandit Algorithms2.500.871, 3, 3, 3
4651Global View For GCN: Why Go Deep When You Can Be Shallow?2.501.663, 1, 5, 1
4652BIG-Graph: Brain Imaging Genetics by Graph Neural Network2.500.871, 3, 3, 3
4653Combining pretrained speech and text encoders for spoken language processing2.500.873, 3, 3, 1
4654Image Emotion Recognition using Cognitive Contextual Summarization Framework2.500.873, 3, 3, 1
4655FedPD: Defying data heterogeneity through privacy distillation2.500.871, 3, 3, 3
4656Multivariate Gaussian Representation of Previous Tasks for Continual Learning2.500.871, 3, 3, 3
4657Automatic Dictionary Generation: Could Brothers Grimm Create a Dictionary with BERT?2.500.871, 3, 3, 3
4658Indoor Localisation for Detecting Medication Use in Parkinson's Disease2.500.871, 3, 3, 3
4659Skill Graph for Real-world Quadrupedal Robot Reinforcement Learning2.500.873, 3, 1, 3
4660Hierarchical Multi-Resolution Graph Generation Networks2.500.873, 1, 3, 3
4661TT-Rules: Extracting & Optimizing Exact Rules of a CNN-Based Model - Application to Fairness2.500.873, 3, 1, 3
4662A sampling framework for value-based reinforcement learning2.500.871, 3, 3, 3
4663Change Detection for bi-temporal images classification based on Siamese Variational AutoEncoder and Transfer Learning2.500.873, 1, 3, 3
4664Coarse-to-fine Knowledge Graph Domain Adaptation based on Distantly-supervised Iterative Training2.500.871, 3, 3, 3
4665Representing Multi-view Time-series Graph Structures for Multivariate Long-term Time-series Forecasting2.501.661, 3, 5, 1
4666Automaton Distillation: A Neuro-Symbolic Transfer Learning Approach for Deep RL2.500.871, 3, 3, 3
4667Point-based Molecular Representation Learning from Conformers2.501.661, 5, 1, 3
4668Inferring Causal Relations between Temporal Events2.500.871, 3, 3, 3
4669On the Nonconvex Convergence of SGD2.500.873, 1, 3, 3
4670Comparative Analysis between Vision Transformers and CNNs from the view of Neuroscience2.500.873, 1, 3, 3
4671A Robustly and Effectively Optimized Pretraining Approach for Masked Autoencoder2.500.871, 3, 3, 3
4672Transmission Dynamics of Hepatitis B: Analysis and Control2.500.873, 3, 1, 3
4673Enhancement and Numerical Assessment of Novel SARS-CoV-2 Virus Transmission Model2.500.873, 3, 1, 3
4674DEEAPR: Controllable Depth Enhancement via Adaptive Parametric Feature Rotation2.500.873, 3, 3, 1
4675BinaryVQA: A Versatile Dataset to Push the Limits of VQA Models2.500.873, 1, 3, 3
4676Causal Information Bottleneck Boosts Adversarial Robustness of Deep Neural Network2.501.661, 3, 1, 5
4677Go-Explore with a guide: Speeding up search in sparse reward settings with goal-directed intrinsic rewards2.500.871, 3, 3, 3
4678Exploring Over-smoothing in Graph Attention Networks from the Markov Chain Perspective2.500.873, 3, 1, 3
4679Multiple output samples for each input in a single-output Gaussian process2.500.873, 3, 3, 1
4680Supervised Random Feature Regression via Projection Pursuit2.330.943, 1, 3
4681Geometry Problem Solving based on Counterfactual Evolutionary Reasoning2.330.943, 1, 3
4682Improve distance metric learning by learning positions of class centers2.330.943, 3, 1
4683MCTransformer: Combining Transformers And Monte-Carlo Tree Search For Offline Reinforcement Learning2.330.943, 1, 3
4684NOVEL FEATURE REPRESENTATION STRATEGIES FOR TIME SERIES FORECASTING WITH PREDICTED FUTURE COVARIATES2.330.943, 1, 3
4685CNN Compression and Search Using Set Transformations with Width Modifiers on Network Architectures2.330.941, 3, 3
4686Discerning Hydroclimatic Behavior with a Deep Convolutional Residual Regressive Neural Network2.330.943, 3, 1
4687Multi-scale Attention for Diabetic Retinopathy Detection in Retinal Fundus Images2.330.943, 3, 1
4688PES: Probabilistic Exponential Smoothing for Time Series Forecasting2.330.941, 3, 3
4689The batch size can affect inference results2.330.943, 1, 3
4690Multi-Reward Fusion: Learning from Other Policies by Distilling2.330.943, 1, 3
4691Break the Wall Between Homophily and Heterophily for Graph Representation Learning2.330.943, 3, 1
4692SC2EGSet: StarCraft II Esport Replay and Game-state Dataset2.330.943, 1, 3
4693Structural Privacy in Graphs2.330.943, 3, 1
4694Personalized Federated Hypernetworks for Privacy Preservation in Multi-Task Reinforcement Learning2.330.943, 3, 1
4695Uncertainty Guided Depth Fusion for Spike Camera2.330.943, 3, 1
4696$$CONVOLUTION AND POOLING OPERATION MODULE WITH ADAPTIVE STRIDE PROCESSING EFFEC$$2.331.895, 1, 1
4697Towards Global Optimality in Cooperative MARL with Sequential Transformation2.330.941, 3, 3
4698Towards Controllable Policy through Goal-Masked Transformers2.330.943, 3, 1
4699Monkeypox with Cross Infection Hypothesis via Epidemiological Mode2.330.943, 3, 1
4700MANDERA: Malicious Node Detection in Federated Learning via Ranking2.330.943, 1, 3
4701C3PO: Learning to Achieve Arbitrary Goals via Massively Entropic Pretraining2.330.941, 3, 3
4702SAE: Estimation for Transition Matrix in Annotation Algorithms2.330.943, 1, 3
4703Do We Really Achieve Fairness with Explicit Sensitive Attributes?2.330.941, 3, 3
4704Rethinking Backdoor Data Poisoning Attacks in the Context of Semi-Supervised Learning2.330.941, 3, 3
4705CoGANs: Collaborative Generative Adversarial Networks2.330.943, 3, 1
4706S-SOLVER: Numerically Stable Adaptive Step Size Solver for Neural ODEs2.331.891, 1, 5
4707Probing for Correlations of Causal Facts: Large Language Models and Causality2.252.171, 1, 1, 6
4708CI-VAE: a Class-Informed Deep Variational Autoencoder for Enhanced Class-Specific Data Interpolation2.252.171, 1, 6, 1
4709Improved Gradient Descent Optimization Algorithm based on Inverse Model-Parameter Difference2.001.001, 3, 1, 3
4710Emergence of Exploration in Policy Gradient Reinforcement Learning via Resetting2.001.001, 3, 1, 3
4711Counterfactual Vision-Language Data Synthesis with Intra-Sample Contrast Learning2.001.003, 3, 1, 1
4712Shallow Learning In Materio.2.001.003, 1, 1, 3
4713Improving Accuracy and Explainability of Online Handwriting Recognition2.001.001, 3, 1, 3
4714ESEAD: An Enhanced Simple Ensemble and Distillation Framework for Natural Language Processing2.001.003, 3, 1, 1
4715Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment2.001.001, 1, 3, 3
4716'I pick you choose': Joint human-algorithm decision making in multi-armed bandits2.001.003, 1, 1, 3
4717Unsupervised Non-Parametric Signal Separation Using Bayesian Neural Networks2.001.003, 1, 1, 3
4718Re-Benchmarking Out-of-Distribution Detection in Deep Neural Networks2.001.003, 1, 1, 3
4719Smooth Mathematical Functions from Compact Neural Networks2.001.003, 1, 3, 1
4720Online Reinforcement Learning via Posterior Sampling of Policy2.001.001, 1, 3, 3
4721Comparing semantic and morphological analogy completion in word embeddings2.001.001, 3, 1, 3
4722Co-Evolution As More Than a Scalable Alternative for Multi-Agent Reinforcement Learning2.001.003, 3, 1, 1
4723Self-Paced Learning Enhanced Physics-informed Neural Networks for Solving Partial Differential Equations2.001.001, 3, 3, 1
4724Searching optimal adjustment features for treatment effect estimation2.001.003, 3, 1, 1
4725Feature-Driven Talking Face Generation with StyleGAN22.001.001, 3, 1, 3
4726GENERATIVE OF ORIGIN MODEL DISTRIBUTION MASKED WITH EMOTIONS AND TOPICS DISTRIBUTION IN HYBRID METHOD2.001.003, 1, 1, 3
4727MESSAGENET: MESSAGE CLASSIFICATION USING NATURAL LANGUAGE PROCESSING AND META-DATA2.001.001, 3, 1, 3
4728Semi-connected Joint Entity Recognition and Relation Extraction of Contextual Entities in Family History Records2.001.001, 3, 3, 1
4729An Empirical Study on Anomaly detection Using Density Based and Representative Based Clustering algorithms2.001.003, 3, 1, 1
4730Tree Structure LSTM for Chinese Named Entity Recognition2.001.001, 1, 3, 3
4731MixQuant: A Quantization Bit-width Search that Can Optimize the Performance of your Quantization Method2.001.003, 3, 1, 1
4732The GANfather: Controllable generation of malicious activity to expose detection weaknesses and improve defence systems.1.670.941, 1, 3
4733Vectorial Graph Convolutional Networks1.670.943, 1, 1
4734Learning Discriminative Representations for Chromosome Classification with Small Datasets1.670.941, 1, 3
4735REPRESENTATIVE PROTOTYPE WITH CONSTRASTIVE LEARNING FOR SEMI-SUPENVISED FEW-SHOT CLASSIFICATION1.670.941, 1, 3
4736Adaptive Gradient Methods with Local Guarantees1.670.941, 1, 3
4737Predicting Antimicrobial MICs for Nontyphoidal Salmonella Using Multitask Representations Learning1.670.941, 3, 1
4738Convergence of the mini-batch SIHT algorithm1.670.941, 1, 3
4739Partial Output Norm: Mitigating the Model Output Blow-up Effect of Cross Entropy Loss1.500.873, 1, 1, 1
4740State Decomposition for Model-free Partially observable Markov Decision Process1.500.871, 3, 1, 1
4741Recurrent Back-Projection Generative Adversarial Network for Video Super Resolution1.500.871, 1, 3, 1
4742Ensemble Homomorphic Encrypted Data Classification1.500.873, 1, 1, 1
4743The Use of Open-Source Boards for Data Collection and Machine Learning in Remote Deployments1.500.871, 3, 1, 1
4744Speeding up Policy Optimization with Vanishing Hypothesis and Variable Mini-Batch Size1.500.871, 1, 1, 3
4745URVoice: An Akl-Toussaint/ Graham- Sklansky Approach towards Convex Hull Computation for Sign Language Interpretation1.500.871, 3, 1, 1
4746Generalization Mechanics in Deep Learning1.500.871, 3, 1, 1
4747Fusion of Deep Transfer Learning with Mixed convolution network1.500.871, 3, 1, 1
4748Evaluating Weakly Supervised Object Localization Methods Right? A Study on Heatmap-based XAI and Neural Backed Decision Tree1.500.871, 1, 1, 3
4749Quantum reinforcement learning1.000.001, 1, 1, 1
4750Manipulating Multi-agent Navigation Task via Emergent Communications1.000.001, 1, 1
4751A comparison of dataset distillation and active learning in text classification1.000.001, 1, 1
4752Activation Function: Absolute Function,One Function Behaves more Individualized1.000.001, 1, 1, 1
4753Rotation Invariant Quantization for Model Compression1.000.001, 1, 1
\ No newline at end of file +ICLR2023 Statistics
ICLR 2023 Statistics
Github
# (4753)TitleR1stdRatings
1Git Re-Basin: Merging Models modulo Permutation Symmetries8.670.9410, 8, 8
2Rethinking the Expressive Power of GNNs via Graph Biconnectivity8.670.9410, 8, 8
3Emergence of Maps in the Memories of Blind Navigation Agents8.500.878, 8, 8, 10
4DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems8.500.8710, 8, 8, 8
5Graph Neural Networks for Link Prediction with Subgraph Sketching8.500.878, 8, 8, 10
6Revisiting the Entropy Semiring for Neural Speech Recognition8.501.6610, 8, 6, 10
7Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning8.252.058, 10, 10, 5
8Learning a Data-Driven Policy Network for Pre-Training Automated Feature Engineering8.000.008, 8, 8
9Fast Nonlinear Vector Quantile Regression8.000.008, 8, 8
10Scaling Up Probabilistic Circuits by Latent Variable Distillation8.000.008, 8, 8
11​​What learning algorithm is in-context learning? Investigations with linear models8.000.008, 8, 8
12FedExP: Speeding up Federated Averaging via Extrapolation8.000.008, 8, 8
13DreamFusion: Text-to-3D using 2D Diffusion8.000.008, 8, 8, 8
14Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching8.001.6310, 8, 6
15ReAct: Synergizing Reasoning and Acting in Language Models8.000.008, 8, 8
16The Lie Derivative for Measuring Learned Equivariance8.000.008, 8, 8
17Agree to Disagree: Diversity through Disagreement for Better Transferability8.000.008, 8, 8, 8
18Can We Find Nash Equilibria at a Linear Rate in Markov Games?8.000.008, 8, 8, 8
19Aligning Model and Macaque Inferior Temporal Cortex Representations Improves Model-to-Human Behavioral Alignment and Adversarial Robustness8.000.008, 8, 8
20Robust Scheduling with GFlowNets8.000.008, 8, 8, 8
21Transformers Learn Shortcuts to Automata8.001.638, 10, 6
22Strong inductive biases provably prevent harmless interpolation8.000.008, 8, 8
23Confidential-PROFITT: Confidential PROof of FaIr Training of Trees8.000.008, 8, 8
24Minimum Variance Unbiased N:M Sparsity for the Neural Gradients8.000.008, 8, 8
25Asymptotic Instance-Optimal Algorithms for Interactive Decision Making8.001.268, 8, 10, 8, 6
26Targeted Hyperparameter Optimization with Lexicographic Preferences Over Multiple Objectives8.000.008, 8, 8
27Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning8.000.008, 8, 8
28Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability8.000.008, 8, 8
29Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness8.000.008, 8, 8, 8
30AudioGen: Textually Guided Audio Generation8.000.008, 8, 8, 8
31Geometric Networks Induced by Energy Constrained Diffusion8.001.418, 6, 8, 10
32A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification8.001.638, 10, 6
33Martingale Posterior Neural Processes8.000.008, 8, 8
34Relative representations enable zero-shot latent space communication8.001.6310, 6, 8
35Sign and Basis Invariant Networks for Spectral Graph Representation Learning8.000.008, 8, 8, 8
36Conditional Antibody Design as 3D Equivariant Graph Translation8.000.008, 8, 8, 8
37Evaluating Long-Term Memory in 3D Mazes8.000.008, 8, 8
38Generate rather than Retrieve: Large Language Models are Strong Context Generators8.001.418, 10, 8, 6
39Betty: An Automatic Differentiation Library for Multilevel Optimization8.001.418, 6, 10, 8
40Benchmarking Deformable Object Manipulation with Differentiable Physics8.000.008, 8, 8
41Generating Diverse Cooperative Agents by Learning Incompatible Policies8.000.008, 8, 8, 8
42On the duality between contrastive and non-contrastive self-supervised learning7.751.798, 5, 8, 10
43Flow Matching for Generative Modeling7.751.7910, 8, 8, 5
44DiffEdit: Diffusion-based semantic image editing with mask guidance7.751.798, 5, 8, 10
45GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation7.672.058, 5, 10
46Selection-Inference: Exploiting Large Language Models for Interpretable Logical Reasoning7.600.808, 8, 8, 6, 8
47BigVGAN: A Universal Neural Vocoder with Large-Scale Training7.600.808, 8, 8, 8, 6
48Exponential Generalization Bounds with Near-Optimal Rates for $L_q$-Stable Algorithms7.600.808, 6, 8, 8, 8
49CROM: Continuous Reduced-Order Modeling of PDEs Using Implicit Neural Representations7.600.808, 6, 8, 8, 8
50Concept-level Debugging of Part-Prototype Networks7.500.876, 8, 8, 8
51WikiWhy: Answering and Explaining Cause-and-Effect Questions7.500.878, 6, 8, 8
52GEASS: Neural causal feature selection for high-dimensional biological data7.500.878, 8, 6, 8
53Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions7.500.876, 8, 8, 8
54SMART: Self-supervised Multi-task pretrAining with contRol Transformers7.500.878, 8, 8, 6
55The Surprising Effectiveness of Equivariant Models in Domains with Latent Symmetry7.500.878, 8, 8, 6
56Provably Efficient Neural Offline Reinforcement Learning via Perturbed Rewards7.500.878, 8, 8, 6
57Near-optimal Coresets for Robust Clustering7.500.878, 8, 8, 6
58PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification7.500.876, 8, 8, 8
59GLM-130B: An Open Bilingual Pre-trained Model7.500.878, 8, 8, 6
60Provably Auditing Ordinary Least Squares in Low Dimensions7.500.878, 8, 6, 8
61Effects of Graph Convolutions in Multi-layer Networks7.500.878, 8, 8, 6
62Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?7.501.668, 6, 10, 6
63Few-shot Cross-domain Image Generation via Inference-time Latent-code Learning7.500.878, 8, 6, 8
64Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs7.500.878, 8, 8, 6
65Symbolic Physics Learner: Discovering governing equations via Monte Carlo tree search7.500.878, 8, 8, 6
66Prompt-to-Prompt Image Editing with Cross-Attention Control7.500.878, 8, 6, 8
67PV3D: A 3D Generative Model for Portrait Video Generation7.501.666, 8, 10, 6
68UNIFIED-IO: A Unified Model for Vision, Language, and Multi-modal Tasks7.500.878, 6, 8, 8
69Omnigrok: Grokking Beyond Algorithmic Data7.500.876, 8, 8, 8
70A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics7.500.878, 8, 8, 6
71Accurate Image Restoration with Attention Retractable Transformer7.500.878, 8, 8, 6
72Generalized structure-aware missing view completion network for incomplete multi-view clustering7.500.878, 8, 6, 8
73PEER: A Collaborative Language Model7.500.876, 8, 8, 8
74Empowering Networks With Scale and Rotation Equivariance Using A Similarity Convolution7.500.878, 8, 6, 8
75Token Merging: Your ViT But Faster7.500.876, 8, 8, 8
76Image as Set of Points7.500.878, 8, 6, 8
77H2RBox: Horizonal Box Annotation is All You Need for Oriented Object Detection7.501.668, 6, 6, 10
78Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore7.500.878, 8, 8, 6
79Minimax Optimal Kernel Operator Learning via Multilevel Training7.401.7410, 5, 8, 8, 6
80Few-Shot Domain Adaptation For End-to-End Communication7.330.948, 6, 8
81Improved Training of Physics-Informed Neural Networks Using Energy-Based Priors: a Study on Electrical Impedance Tomography7.331.8910, 6, 6
82Combinatorial Pure Exploration of Causal Bandits7.330.948, 8, 6
83The In-Sample Softmax for Offline Reinforcement Learning7.330.948, 6, 8
84Discrete Predictor-Corrector Diffusion Models for Image Synthesis7.330.948, 6, 8
85Binding Language Models in Symbolic Languages7.330.948, 8, 6
86Evolve Smoothly, Fit Consistently: Learning Smooth Latent Dynamics For Advection-Dominated Systems7.330.948, 8, 6
87Learning Language Representations with Logical Inductive Bias7.330.946, 8, 8
88Implicit Bias of Large Depth Networks: a Notion of Rank for Nonlinear Functions7.331.8010, 8, 5, 8, 5, 8
89Contrastive Corpus Attribution for Explaining Representations7.330.948, 8, 6
90SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments7.330.948, 6, 8
91Disentanglement of Correlated Factors via Hausdorff Factorized Support7.330.948, 6, 8
92Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping7.330.946, 8, 8
93DiffusER: Diffusion via Edit-based Reconstruction7.330.946, 8, 8
94Efficient recurrent architectures through activity sparsity and sparse back-propagation through time7.330.946, 8, 8
95Symmetric Pruning in Quantum Neural Networks7.330.948, 8, 6
96Incremental Learning of Structured Memory via Closed-Loop Transcription7.330.948, 6, 8
97Scaling Forward Gradient With Local Losses7.330.948, 6, 8
98Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning7.330.948, 6, 8
99Progress measures for grokking via mechanistic interpretability7.330.946, 8, 8
100Simplified State Space Layers for Sequence Modeling7.330.948, 6, 8
101Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms7.330.946, 8, 8
102Post-hoc Concept Bottleneck Models7.330.948, 6, 8
103Open-Vocabulary Object Detection upon Frozen Vision and Language Models7.330.948, 6, 8
104Temporal Dependencies in Feature Importance for Time Series Prediction7.330.946, 8, 8
105Pre-training via Denoising for Molecular Property Prediction7.330.946, 8, 8
106A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning7.330.946, 8, 8
107SCALE-UP: An Efficient Black-box Input-level Backdoor Detection via Analyzing Scaled Prediction Consistency7.330.948, 6, 8
108Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve7.330.946, 8, 8
109A framework for benchmarking Class-out-of-distribution detection and its application to ImageNet7.330.948, 8, 6
110SketchKnitter: Vectorized Sketch Generation with Diffusion Models7.330.946, 8, 8
111Tailoring Language Generation Models under Total Variation Distance7.330.948, 6, 8
112Bag of Tricks for Unsupervised Text-to-Speech7.330.948, 8, 6
113Statistical Efficiency of Score Matching: The View from Isoperimetry7.330.946, 8, 8
114Multifactor Sequential Disentanglement via Structured Koopman Autoencoders7.330.948, 6, 8
115View Synthesis with Sculpted Neural Points7.330.948, 6, 8
116AutoGT: Automated Graph Transformer Architecture Search7.330.948, 8, 6
117Neural Optimal Transport7.330.946, 8, 8
118Deep Ranking Ensembles for Hyperparameter Optimization7.330.948, 8, 6
119Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms7.330.948, 6, 8
120Measuring axiomatic identifiability of counterfactual image models7.330.948, 8, 6
121GFlowNets and variational inference7.331.8910, 6, 6
122Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes7.251.928, 6, 10, 5
123gDDIM: Generalized denoising diffusion implicit models7.251.308, 8, 8, 5
124A Theoretical Framework for Inference and Learning in Predictive Coding Networks7.252.598, 3, 10, 8
125The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich Regimes7.251.308, 8, 5, 8
126The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks7.251.928, 10, 5, 6
127Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation7.251.305, 8, 8, 8
128A probabilistic framework for task-aligned intra- and inter-area neural manifold estimation7.251.308, 5, 8, 8
129Neuromechanical Autoencoders: Learning to Couple Elastic and Neural Network Nonlinearity7.251.308, 8, 5, 8
130Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning7.251.305, 8, 8, 8
131Efficient Learning of Rationalizable Equilibria in General-Sum Games7.251.308, 8, 8, 5
132ExpressivE: A Spatio-Functional Embedding For Knowledge Graph Completion7.251.928, 5, 10, 6
133Fundamental Limits in Formal Verification of Message-Passing Neural Networks7.252.593, 8, 10, 8
134Learning on Large-scale Text-attributed Graphs via Variational Inference7.251.305, 8, 8, 8
135Extreme Q-Learning: MaxEnt RL without Entropy7.251.928, 5, 10, 6
136STaSy: Score-based Tabular data Synthesis7.251.305, 8, 8, 8
137BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS7.251.308, 5, 8, 8
138A Convergent Single-Loop Algorithm for Gromov-Wasserstein in Graph Data7.251.308, 8, 8, 5
139Provable Memorization Capacity of Transformers7.251.308, 5, 8, 8
140Mega: Moving Average Equipped Gated Attention7.251.308, 5, 8, 8
141Domain-Indexing Variational Bayes for Domain Adaptation7.251.308, 8, 5, 8
142Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?7.251.928, 6, 10, 5
143ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor7.251.308, 8, 8, 5
144Multi-skill Mobile Manipulation for Object Rearrangement7.251.928, 10, 6, 5
145MocoSFL: enabling cross-client collaborative self-supervised learning7.251.308, 8, 8, 5
146MECTA: Memory-Economic Continual Test-Time Model Adaptation7.251.308, 8, 8, 5
147Diversify and Disambiguate: Out-of-Distribution Robustness via Disagreement7.251.308, 8, 8, 5
148Depth Separation with Multilayer Mean-Field Networks7.200.986, 8, 6, 8, 8
149A Holistic View of Noise Transition Matrix in Deep Learning and Beyond7.200.988, 6, 8, 6, 8
150Masked Unsupervised Self-training for Label-free Image Classification7.171.218, 6, 8, 8, 5, 8
151Softened Symbol Grounding for Neuro-symbolic Systems7.002.125, 5, 8, 10
152Learning Group Importance using the Differentiable Hypergeometric Distribution7.001.008, 6, 8, 6
153A Message Passing Perspective on Learning Dynamics of Contrastive Learning7.001.418, 5, 8
154LiftedCL: Lifting Contrastive Learning for Human-Centric Perception7.001.418, 5, 8
155Learning with Logical Constraints but without Shortcut Satisfaction7.001.008, 8, 6, 6
156Automatically Answering and Generating Machine Learning Final Exams7.002.948, 10, 3
157A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias7.002.128, 10, 5, 5
158What Makes Convolutional Models Great on Long Sequence Modeling?7.001.008, 6, 8, 6
159The Role of Coverage in Online Reinforcement Learning7.001.418, 5, 8
160Diffusion-GAN: Training GANs with Diffusion7.001.006, 6, 8, 8
161Real-time variational method for learning neural trajectory and its dynamics7.001.008, 6, 6, 8
162When and why Vision-Language Models behave like Bags-of-Words, and what to do about it?7.001.006, 6, 8, 8
163Learning Iterative Neural Optimizers for Image Steganography7.001.006, 6, 8, 8
164Interpretable Geometric Deep Learning via Learnable Randomness Injection7.001.008, 8, 6, 6
165Is Reinforcement Learning (Not) for Natural Language Processing?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization7.001.006, 6, 8, 8
166Learning rigid dynamics with face interaction graph networks7.001.736, 10, 6, 6
167Why (and When) does Local SGD Generalize Better than SGD?7.001.415, 8, 8
168Do We Really Need Complicated Model Architectures For Temporal Networks?7.001.418, 8, 5
169Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization7.001.008, 8, 6, 6
170(Certified!!) Adversarial Robustness for Free!7.001.008, 6, 8, 6
171Efficient Conditionally Invariant Representation Learning7.001.418, 5, 8
172Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries7.001.418, 8, 5
173Learning Fair Graph Representations via Automated Data Augmentations7.001.008, 8, 6, 6
174Latent Neural ODEs with Sparse Bayesian Multiple Shooting7.001.008, 8, 6, 6
175Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games7.001.008, 8, 6, 6
176Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training7.001.008, 6, 8, 6
177A Higher Precision Algorithm for Computing the $1$-Wasserstein Distance7.001.415, 8, 8
178Imitating Human Behaviour with Diffusion Models7.001.008, 6, 6, 8
179LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval7.001.008, 8, 6, 6
180Sampling-based inference for large linear models, with application to linearised Laplace7.001.008, 8, 6, 6
181Dual Algorithmic Reasoning7.001.415, 8, 8
182Almost Linear Constant-Factor Sketching for $ell_1$ and Logistic Regression7.001.418, 8, 5
183Spectral Subgraph Localization7.001.418, 8, 5
184FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation7.002.1210, 8, 5, 5
185On Compositional Uncertainty Quantification for Seq2seq Graph Parsing7.002.948, 3, 10
186Efficient Attention via Control Variates7.001.006, 8, 6, 8
187Augmented Lagrangian is Enough for Optimal Offline RL with General Function Approximation and Partial Coverage7.001.006, 6, 8, 8
188DocPrompting: Generating Code by Retrieving the Docs7.001.008, 6, 8, 6
189Words are all you need? Language as an approximation for representational similarity7.002.125, 8, 5, 10
190FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning7.001.418, 5, 8
191Spectral Decomposition Representation for Reinforcement Learning7.001.418, 8, 5
192Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication7.001.418, 8, 5
193Learning Sparse Group Models Through Boolean Relaxation7.001.006, 8, 6, 8
194Deconstructing Distributions: A Pointwise Framework of Learning7.001.008, 6, 6, 8
195Parametrizing Product Shape Manifolds by Composite Networks7.001.418, 8, 5
196Learning Hyper Label Model for Programmatic Weak Supervision7.001.008, 6, 6, 8
197STOCHASTIC NO-REGRET LEARNING FOR GENERAL GAMES WITH VARIANCE REDUCTION7.001.008, 6, 8, 6
198TAN without a burn: Scaling laws of DP-SGD7.001.008, 8, 6, 6
199Pink Noise Is All You Need: Colored Noise Exploration in Deep Reinforcement Learning7.001.415, 8, 8
200A Unified Algebraic Perspective on Lipschitz Neural Networks7.001.006, 6, 8, 8
201Sparsity-Constrained Optimal Transport7.001.7910, 8, 5, 6, 6
202Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement7.001.006, 8, 8, 6
203HT-Net: Hierarchical Transformer based Operator Learning Model for Multiscale PDEs7.002.125, 10, 8, 5
204On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation7.001.006, 8, 8, 6
205Accurate Bayesian Meta-Learning by Accurate Task Posterior Inference7.001.008, 8, 6, 6
206Context-enriched molecule representations improve few-shot drug discovery7.001.008, 8, 6, 6
207A Universal 3D Molecular Representation Learning Framework7.002.943, 8, 10
208The Generalized Eigenvalue Problem as a Nash Equilibrium7.001.008, 6, 6, 8
209Language Modelling with Pixels7.001.008, 6, 6, 8
210Faster Gradient-Free Methods for Escaping Saddle Points7.001.008, 6, 8, 6
211Classically Approximating Variational Quantum Machine Learning with Random Fourier Features7.001.415, 8, 8
212Self-supervision through Random Segments with Autoregressive Coding (RandSAC)7.001.415, 8, 8
213Exploring Temporally Dynamic Data Augmentation for Video Recognition7.001.006, 6, 8, 8
214Meta-Learning in Games7.001.006, 8, 8, 6
215Continuized Acceleration for Quasar Convex Functions in Non-Convex Optimization7.001.008, 6, 6, 8
216InCoder: A Generative Model for Code Infilling and Synthesis7.001.006, 6, 8, 8
217Benchmarking Offline Reinforcement Learning on Real-Robot Hardware7.001.008, 8, 6, 6
218Transformers are Sample-Efficient World Models7.001.008, 6, 6, 8
219Scalable Subset Sampling with Neural Conditional Poisson Networks7.001.008, 6, 6, 8
220Diffusion Posterior Sampling for General Noisy Inverse Problems7.001.006, 8, 6, 8
221Learning the Positions in CountSketch7.001.008, 6, 8, 6
222DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection7.001.268, 8, 5, 8, 6
223Provable Sim-to-real Transfer in Continuous Domain with Partial Observations7.001.418, 5, 8
224Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation7.001.418, 8, 5
225Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning7.001.006, 8, 8, 6
226NeRN: Learning Neural Representations for Neural Networks7.001.008, 6, 6, 8
227Rank Preserving Framework for Asymmetric Image Retrieval7.001.006, 8, 8, 6
228Closing the gap: Exact maximum likelihood training of generative autoencoders using invertible layers7.001.006, 8, 8, 6
229Switch-NeRF: Learning Scene Decomposition with Mixture of Experts for Large-scale Neural Radiance Fields7.001.008, 6, 6, 8
230Plateau in Monotonic Linear Interpolation --- A 'Biased' View of Loss Landscape for Deep Networks7.001.006, 8, 8, 6
231Automated Data Augmentations for Graph Classification7.001.415, 8, 8
232Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance7.001.7310, 6, 6, 6
233Human Motion Diffusion Model7.001.006, 8, 8, 6
234More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity6.801.945, 8, 10, 6, 5
235Understanding Edge-of-Stability Training Dynamics with a Minimalist Example6.801.478, 5, 5, 8, 8
236Self-Distillation for Further Pre-training of Transformers6.800.986, 8, 6, 6, 8
237Neural Networks and the Chomsky Hierarchy6.800.986, 8, 8, 6, 6
238Implicit Bias in Leaky ReLU Networks Trained on High-Dimensional Data6.752.5910, 6, 3, 8
239Certified Training: Small Boxes are All You Need6.751.306, 5, 8, 8
240A Kernel Perspective of Skip Connections in Convolutional Networks6.751.305, 8, 8, 6
241Chasing All-Round Graph Representation Robustness: Model, Training, and Optimization6.752.178, 3, 8, 8
242Robust Algorithms on Adaptive Inputs from Bounded Adversaries6.751.308, 6, 5, 8
243Simple initialization and parametrization of sinusoidal networks via their kernel bandwidth6.751.308, 6, 8, 5
244Reparameterization through Spatial Gradient Scaling6.751.305, 8, 6, 8
245Guiding Energy-based Models via Contrastive Latent Variables6.751.306, 8, 5, 8
246Gradient Descent Converges Linearly for Logistic Regression on Separable Data6.751.308, 5, 8, 6
247Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport6.751.926, 5, 6, 10
248On the Sensitivity of Reward Inference to Misspecified Human Models6.752.178, 8, 3, 8
249Promptagator: Few-shot Dense Retrieval From 8 Examples6.751.305, 6, 8, 8
250Label Propagation with Weak Supervision6.751.308, 8, 6, 5
251Learning MLPs on Graphs: A Unified View of Effectiveness, Robustness, and Efficiency6.751.306, 8, 8, 5
252Disentangling with Biological Constraints: A Theory of Functional Cell Types6.751.308, 6, 5, 8
253DINO as a von Mises-Fisher mixture model6.751.308, 5, 6, 8
254Scalable Batch-Mode Deep Bayesian Active Learning via Equivalence Class Annealing6.751.308, 8, 6, 5
255Provable Defense Against Geometric Transformations6.751.306, 5, 8, 8
256Taking a Step Back with KCal: Multi-Class Kernel-Based Calibration for Deep Neural Networks6.751.306, 5, 8, 8
257Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints6.751.305, 8, 8, 6
258Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics6.751.308, 6, 5, 8
259In-Situ Text-Only Adaptation of Speech Models with Low-Overhead Speech Imputations6.751.305, 6, 8, 8
260Choreographer: Learning and Adapting Skills in Imagination6.751.305, 8, 8, 6
261In-context Reinforcement Learning with Algorithm Distillation6.751.308, 8, 6, 5
262User-Interactive Offline Reinforcement Learning6.752.598, 3, 6, 10
263Lower Bounds on the Depth of Integral ReLU Neural Networks via Lattice Polytopes6.751.308, 6, 5, 8
264Learning Vortex Dynamics for Fluid Inference and Prediction6.751.305, 8, 8, 6
265Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data6.751.308, 5, 6, 8
266Unsupervised Semantic Segmentation with Self-supervised Object-centric Representations6.751.305, 8, 6, 8
267Decompositional Generation Process for Instance-Dependent Partial Label Learning6.752.173, 8, 8, 8
268Building a Subspace of Policies for Scalable Continual Learning6.751.306, 8, 8, 5
269Visually-Augmented Language Modeling6.751.926, 5, 10, 6
270Understanding the Role of Nonlinearity in Training Dynamics of Contrastive Learning6.751.305, 6, 8, 8
271CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis6.751.308, 5, 8, 6
272SAM as an Optimal Relaxation of Bayes6.751.308, 8, 5, 6
273Partial Label Unsupervised Domain Adaptation with Class-Prototype Alignment6.751.305, 8, 8, 6
274Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics6.751.306, 5, 8, 8
275Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification6.751.308, 8, 6, 5
276Sampling with Mollified Interaction Energy Descent6.751.308, 6, 8, 5
277Does Zero-Shot Reinforcement Learning Exist?6.752.596, 3, 8, 10
278PaLI: A Jointly-Scaled Multilingual Language-Image Model6.751.305, 8, 8, 6
279Learning with Stochastic Orders6.751.308, 6, 5, 8
280Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement6.751.308, 6, 8, 5
281Powderworld: A Platform for Understanding Generalization via Rich Task Distributions6.752.173, 8, 8, 8
282Is Attention All That NeRF Needs?6.751.308, 6, 5, 8
283The Influence of Learning Rule on Representation Dynamics in Wide Neural Networks6.751.306, 5, 8, 8
284RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch6.751.305, 6, 8, 8
285Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!6.751.306, 8, 8, 5
286Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search6.751.308, 5, 6, 8
287Does Deep Learning Learn to Abstract? A Systematic Probing Framework6.751.308, 5, 6, 8
288Variance-Aware Sparse Linear Bandits6.751.305, 8, 6, 8
289Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction6.751.306, 8, 5, 8
290Self-Consistency Improves Chain of Thought Reasoning in Language Models6.751.925, 6, 6, 10
291Combinatorial-Probabilistic Trade-Off: P-Values of Community Properties Test in the Stochastic Block Models6.751.308, 5, 6, 8
292Improving Deep Regression with Ordinal Entropy6.752.178, 8, 3, 8
293Clifford Neural Layers for PDE Modeling6.751.305, 8, 8, 6
294Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning6.751.306, 8, 8, 5
295A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning6.751.305, 8, 8, 6
296Contextual bandits with concave rewards, and an application to fair ranking6.751.308, 6, 5, 8
297When to Make and Break Commitments?6.751.305, 6, 8, 8
298Advancing Radiograph Representation Learning with Masked Record Modeling6.751.308, 6, 5, 8
299Quadratic models for understanding neural network dynamics6.751.308, 8, 6, 5
300Hidden Markov Transformer for Simultaneous Machine Translation6.751.308, 6, 5, 8
301Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model6.751.305, 8, 6, 8
302Masked Visual-Textual Prediction for Document Image Representation Pretraining6.751.308, 8, 6, 5
303Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting6.751.306, 8, 5, 8
304Linear Connectivity Reveals Generalization Strategies6.751.308, 5, 8, 6
305ViT-Adapter: Exploring Plain Vision Transformer for Accurate Dense Predictions6.751.306, 5, 8, 8
306Collaborative Pure Exploration in Kernel Bandit6.751.308, 8, 6, 5
307LAVA: Data Valuation without Pre-Specified Learning Algorithms6.751.305, 6, 8, 8
308Generative Augmented Flow Networks6.751.306, 5, 8, 8
309Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language6.751.308, 6, 5, 8
310Automating Nearest Neighbor Search Configuration with Constrained Optimization6.751.308, 8, 6, 5
311Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders6.751.308, 8, 5, 6
312Can discrete information extraction prompts generalize across language models?6.751.308, 8, 6, 5
313Contextual Convolutional Networks6.751.308, 5, 8, 6
314Easy Differentially Private Linear Regression6.751.306, 8, 8, 5
315Towards Stable Test-time Adaptation in Dynamic Wild World6.752.178, 8, 8, 3
316Neural ePDOs: Spatially Adaptive Equivariant Partial Differential Operator Based Networks6.751.305, 8, 6, 8
317An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion6.751.306, 8, 5, 8
318PatchDCT: Patch Refinement for High Quality Instance Segmentation6.751.306, 5, 8, 8
319Representation Learning for Low-rank General-sum Markov Games6.751.306, 5, 8, 8
320DFPC: Data flow driven pruning of coupled channels without data.6.670.946, 6, 8
321Transformer-based model for symbolic regression via joint supervised learning6.670.946, 6, 8
322Curriculum-based Co-design of Morphology and Control of Voxel-based Soft Robots6.670.946, 8, 6
323Modeling content creator incentives on algorithm-curated platforms6.670.948, 6, 6
324Efficient Deep Reinforcement Learning Requires Regulating Statistical Overfitting6.670.946, 6, 8
325The Tilted Variational Autoencoder: Improving Out-of-Distribution Detection6.670.946, 8, 6
326Mind the Pool: Convolutional Neural Networks Can Overfit Input Size6.670.948, 6, 6
327Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection6.670.946, 6, 8
328On Achieving Optimal Adversarial Test Error6.670.946, 8, 6
329KwikBucks: Correlation Clustering with Cheap-Weak and Expensive-Strong Signals6.670.946, 6, 8
330Integrating Symmetry into Differentiable Planning with Steerable Convolutions6.670.948, 6, 6
331Revisiting Populations in multi-agent Communication6.670.946, 6, 8
332Sublinear Algorithms for Kernel Matrices via Kernel Density Estimation6.670.946, 6, 8
333Representational Dissimilarity Metric Spaces for Stochastic Neural Networks6.670.946, 6, 8
334Guess the Instruction! Making Language Models Stronger Zero-Shot Learners6.670.946, 6, 8
335TDR-CL: Targeted Doubly Robust Collaborative Learning for Debiased Recommendations6.670.946, 8, 6
336Scaffolding a Student to Instill Knowledge6.670.946, 8, 6
337The Implicit Bias of Minima Stability in Multivariate Shallow ReLU Networks6.670.946, 8, 6
338MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning6.670.946, 8, 6
339Capturing the Motion of Every Joint: 3D Human Pose and Shape Estimation with Independent Tokens6.670.946, 6, 8
340Quality-Similar Diversity via Population Based Reinforcement Learning6.670.946, 8, 6
341Mind's Eye: Grounded Language Model Reasoning through Simulation6.670.946, 8, 6
342Understanding Embodied Reference with Touch-Line Transformer6.670.946, 8, 6
343Domain Generalization via Heckman-type Selection Models6.670.946, 6, 8
344Hyperbolic Deep Reinforcement Learning6.670.946, 8, 6
345Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated6.670.946, 8, 6
346Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier6.670.946, 6, 8
347AutoTransfer: AutoML with Knowledge Transfer - An Application to Graph Neural Networks6.670.948, 6, 6
348Text Summarization with Oracle Expectation6.670.946, 6, 8
349Out-of-Distribution Detection and Selective Generation for Conditional Language Models6.670.946, 6, 8
350Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions6.670.946, 8, 6
351Active Image Indexing6.670.946, 6, 8
352Efficient Model Updates for Approximate Unlearning of Graph-Structured Data6.670.946, 6, 8
353DiGress: Discrete Denoising diffusion for graph generation6.670.948, 6, 6
354Differentially private Bias-Term Only Fine-tuning of Foundation Models6.670.946, 6, 8
355Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats6.670.946, 6, 8
356KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP6.670.948, 6, 6
357MARS: Meta-learning as Score Matching in the Function Space6.670.948, 6, 6
358Simplicial Hopfield networks6.670.946, 8, 6
359MICN: Multi-scale Local and Global Context Modeling for Long-term Series Forecasting6.670.946, 8, 6
360Progressive Voronoi Diagram Subdivision Enables Accurate Data-free Class-Incremental Learning6.670.946, 8, 6
361Hungry Hungry Hippos: Towards Language Modeling with State Space Models6.670.946, 8, 6
362Near-optimal Policy Identification in Active Reinforcement Learning6.670.946, 8, 6
363Generative Modeling Helps Weak Supervision (and Vice Versa)6.670.946, 6, 8
364AIM: Adapting Image Models for Efficient Video Understanding6.670.946, 6, 8
365GAIN: On the Generalization of Instructional Action Understanding6.670.948, 6, 6
366Efficient Federated Domain Translation6.670.948, 6, 6
367Improved Convergence of Differential Private SGD with Gradient Clipping6.670.946, 8, 6
368Learning QUBO Forms in Quantum Annealing6.670.948, 6, 6
369Backstepping Temporal Difference Learning6.670.946, 6, 8
370Learning to Jointly Share and Prune Weights for Grounding Based Vision and Language Models6.670.946, 6, 8
371TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis6.670.948, 6, 6
372Active Learning in Bayesian Neural Networks with Balanced Entropy Learning Principle6.670.946, 8, 6
373Robust Active Distillation6.670.946, 8, 6
374Neural Episodic Control with State Abstraction6.670.948, 6, 6
375Learning to Generate Columns with Application to Vertex Coloring6.670.946, 6, 8
376EVA3D: Compositional 3D Human Generation from 2D Image Collections6.670.948, 6, 6
377Alternating Differentiation for Optimization Layers6.670.946, 6, 8
378MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction6.670.946, 6, 8
379Learning Domain-Agnostic Representation for Disease Diagnosis6.670.948, 6, 6
380Object Tracking by Hierarchical Part-Whole Attention6.670.946, 6, 8
381Boosting the Cycle Counting Power of Graph Neural Networks with I$^2$-GNNs6.601.208, 5, 6, 6, 8
382Pitfalls of Gaussians as a noise distribution in NCE6.601.208, 6, 6, 5, 8
383Theoretical Characterization of Neural Network Generalization with Group Imbalance6.602.0610, 5, 8, 5, 5
384Flow Annealed Importance Sampling Bootstrap6.601.206, 5, 6, 8, 8
385FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification6.601.206, 6, 8, 5, 8
386Sub-Task Decomposition Enables Learning in Sequence to Sequence Tasks6.601.205, 8, 8, 6, 6
387Adversarial Training descends without descent: Finding actual descent directions based on Danskin's theorem6.500.876, 8, 6, 6
388Generating Intuitive Fairness Specifications for Natural Language Processing6.500.876, 6, 8, 6
389LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning6.501.505, 8, 5, 8
390Selective Frequency Network for Image Restoration6.501.508, 8, 5, 5
391Multi-Objective Online Learning6.501.505, 8, 5, 8
392Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient6.500.876, 6, 8, 6
393Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks6.501.505, 8, 5, 8
394On the Importance and Applicability of Pre-Training for Federated Learning6.501.505, 8, 5, 8
395Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward6.501.508, 8, 5, 5
396Weighted Clock Logic Point Process6.501.508, 8, 5, 5
397Diffusion-based Image Translation using disentangled style and content representation6.500.878, 6, 6, 6
398How Much Data Are Augmentations Worth? An Investigation into Scaling Laws, Invariance, and Implicit Regularization6.501.505, 8, 5, 8
399Artificial Neuronal Ensembles with Learned Context Dependent Gating6.501.505, 8, 5, 8
400Backpropagation at the Infinitesimal Inference Limit of Energy-Based Models: Unifying Predictive Coding, Equilibrium Propagation, and Contrastive Hebbian Learning6.501.505, 8, 5, 8
401Dichotomy of Control: Separating What You Can Control from What You Cannot6.501.508, 5, 8, 5
402Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization6.500.876, 8, 6, 6
403Associative Memory Augmented Asynchronous Spatiotemporal Representation Learning for Event-based Perception6.500.876, 8, 6, 6
404Semi Parametric Inducing Point Networks6.500.878, 6, 6, 6
405Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation6.500.876, 8, 6, 6
406Transfer Learning with Deep Tabular Models6.501.505, 8, 8, 5
407Interneurons accelerate learning dynamics in recurrent neural networks for statistical adaptation6.501.505, 5, 8, 8
408HypeR: Multitask Hyper-Prompted Training Enables Large-Scale Retrieval Generalization6.500.878, 6, 6, 6
409On the Trade-Off between Actionable Explanations and the Right to be Forgotten6.500.876, 6, 6, 8
410Learning What and Where - Unsupervised Disentangling Location and Identity Tracking6.501.505, 5, 8, 8
411CANIFE: Crafting Canaries for Empirical Privacy Measurement in Federated Learning6.501.508, 8, 5, 5
412Training language models for deeper understanding improves brain alignment6.501.505, 8, 5, 8
413Sampling-free Inference for Ab-Initio Potential Energy Surface Networks6.501.508, 8, 5, 5
414Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees6.501.505, 5, 8, 8
415Solving Constrained Variational Inequalities via a First-order Interior Point-based Method6.500.876, 6, 8, 6
416Calibration Matters: Tackling Maximization Bias in Large-scale Advertising Recommendation Systems6.500.878, 6, 6, 6
417Data Continuity Matters: Improving Sequence Modeling with Lipschitz Regularizer6.500.876, 6, 6, 8
418Control Graph as Unified IO for Morphology-Task Generalization6.501.505, 8, 8, 5
419Restricted Strong Convexity of Deep Learning Models with Smooth Activations6.500.878, 6, 6, 6
420Koopman Neural Operator Forecaster for Time-series with Temporal Distributional Shifts6.501.505, 8, 5, 8
421The Surprising Computational Power of Nondeterministic Stack RNNs6.500.878, 6, 6, 6
422A Non-monotonic Self-terminating Language Model6.500.876, 6, 6, 8
423Differentially Private $L_2$-Heavy Hitters in the Sliding Window Model6.501.508, 8, 5, 5
424Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning6.501.505, 8, 8, 5
425EA-HAS-Bench: Energy-aware Hyperparameter and Architecture Search Benchmark6.500.876, 6, 8, 6
426Versatile Neural Processes for Learning Implicit Neural Representations6.501.508, 5, 5, 8
427Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning6.500.876, 6, 8, 6
428Characterizing the Influence of Graph Elements6.500.876, 6, 8, 6
429Personalized Federated Learning with Feature Alignment and Classifier Collaboration6.501.508, 5, 5, 8
430Simple Yet Effective Graph Contrastive Learning for Recommendation6.501.505, 8, 5, 8
431Dual Diffusion Implicit Bridges for Image-to-Image Translation6.502.065, 5, 10, 6
432Learning to Grow Pretrained Models for Efficient Transformer Training6.500.878, 6, 6, 6
433Learning to Estimate Shapley Values with Vision Transformers6.501.505, 8, 8, 5
434Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning6.500.878, 6, 6, 6
435Code Translation with Compiler Representations6.502.0610, 6, 5, 5
436AnyDA: Anytime Domain Adaptation6.500.876, 6, 8, 6
437Differentiable Mathematical Programming for Object-Centric Representation Learning6.501.508, 5, 8, 5
438Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding6.500.878, 6, 6, 6
439Mass-Editing Memory in a Transformer6.500.876, 6, 6, 8
440On the Saturation Effect of Kernel Ridge Regression6.500.876, 6, 8, 6
441AANG : Automating Auxiliary Learning6.501.508, 8, 5, 5
442Private Federated Learning Without a Trusted Server: Optimal Algorithms for Convex Losses6.500.876, 6, 6, 8
443Robust Fair Clustering: A Novel Fairness Attack and Defense Framework6.500.876, 8, 6, 6
444Dynamic Historical Adaptation for Continual Image-Text Modeling6.501.508, 5, 8, 5
445Learning without Prejudices: Continual Unbiased Learning via Benign and Malignant Forgetting6.501.508, 8, 5, 5
446Spherical Sliced-Wasserstein6.500.876, 8, 6, 6
447Causal Representation Learning for Instantaneous and Temporal Effects6.501.508, 8, 5, 5
448The Role of ImageNet Classes in Fréchet Inception Distance6.501.508, 5, 5, 8
449Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks6.500.876, 8, 6, 6
450Prompt Learning with Optimal Transport for Vision-Language Models6.500.876, 6, 6, 8
451DASHA: Distributed Nonconvex Optimization with Communication Compression and Optimal Oracle Complexity6.500.876, 6, 8, 6
452LDMIC: Learning-based Distributed Multi-view Image Coding6.500.876, 6, 6, 8
453Causal Balancing for Domain Generalization6.500.876, 6, 6, 8
454Multi-lingual Evaluation of Code Generation Models6.500.876, 6, 6, 8
455ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure6.500.878, 6, 6, 6
456Digging into Backbone Design on Face Detection6.500.878, 6, 6, 6
457Sparse Mixture-of-Experts are Domain Generalizable Learners6.501.508, 5, 8, 5
458STREET: A MULTI-TASK STRUCTURED REASONING AND EXPLANATION BENCHMARK6.501.508, 5, 8, 5
459Fairness-aware Contrastive Learning with Partially Annotated Sensitive Attributes6.501.505, 8, 8, 5
460Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning6.500.876, 6, 8, 6
461Excess Risk of Two-Layer ReLU Neural Networks in Teacher-Student Settings and its Superiority to Kernel Methods6.402.068, 3, 5, 8, 8
462Fundamental limits on the robustness of image classifiers6.401.368, 6, 5, 8, 5
463ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning6.401.365, 6, 8, 5, 8
464RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data6.402.068, 3, 8, 8, 5
465On Emergence of Activation Sparsity in Trained Transformers6.401.368, 5, 8, 5, 6
466ManyDG: Many-domain Generalization for Healthcare Applications6.402.068, 5, 8, 8, 3
467Neuro-Symbolic Procedural Planning with Commonsense Prompting6.401.366, 5, 8, 5, 8
468Direct Embedding of Temporal Network Edges via Time-Decayed Line Graphs6.382.0610, 8, 5, 3, 8, 6, 6, 5
469Quantifying and Mitigating the Impact of Label Errors on Model Disparity Metrics6.331.258, 6, 5
470Learning Continuous Normalizing Flows For Faster Convergence To Target Distribution via Ascent Regularizations6.331.256, 8, 5
471Learning Uncertainty for Unknown Domains with Zero-Target-Assumption6.331.258, 5, 6
472Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples6.331.255, 8, 6
473Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation6.331.255, 8, 6
474Ordered GNN: Ordering Message Passing to Deal with Heterophily and Over-smoothing6.331.256, 5, 8
475Masked Distillation with Receptive Tokens6.331.255, 6, 8
476On Representing Linear Programs by Graph Neural Networks6.331.258, 6, 5
477Implicit Regularization for Group Sparsity6.331.258, 6, 5
478Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-Oriented Dialogue Systems6.331.256, 8, 5
479Supervision Complexity and its Role in Knowledge Distillation6.331.258, 5, 6
480Neural Causal Models for Counterfactual Identification and Estimation6.331.256, 5, 8
481How I Learned to Stop Worrying and Love Retraining6.331.256, 8, 5
482Systematic Rectification of Language Models via Dead-end Analysis6.331.258, 5, 6
483f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation6.331.256, 8, 5
484Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation6.331.258, 6, 5
485Bispectral Neural Networks6.331.255, 6, 8
486Where to Diffuse, How to Diffuse and How to get back: Learning in Multivariate Diffusions6.332.363, 8, 8
487Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences6.331.255, 6, 8
488Explicitly Minimizing the Blur Error of Variational Autoencoders6.331.258, 5, 6
489Error Sensitivity Modulation based Experience Replay: Mitigating Abrupt Representation Drift in Continual Learning6.331.256, 8, 5
490Bayes-MIL: A New Probabilistic Perspective on Attention-based Multiple Instance Learning for Whole Slide Images6.331.258, 5, 6
491Using Language to Extend to Unseen Domains6.331.258, 5, 6
492Explainability as statistical inference6.331.255, 8, 6
493Bringing robotics taxonomies to continuous domains via GPLVM on hyperbolic manifolds6.331.256, 8, 5
494A Theory of Dynamic Benchmarks6.331.258, 5, 6
495Computing all Optimal Partial Transports6.331.258, 6, 5
496A View From Somewhere: Human-Centric Face Representations6.331.258, 6, 5
497Efficient Planning in a Compact Latent Action Space6.331.255, 6, 8
498Localized Randomized Smoothing for Collective Robustness Certification6.331.258, 6, 5
499Unbiased Supervised Contrastive Learning6.331.255, 8, 6
500Compressing multidimensional weather and climate data into neural networks6.331.255, 8, 6
501That Label's got Style: Handling Label Style Bias for Uncertain Image Segmentation6.331.255, 8, 6
502StableDR: Stabilized Doubly Robust Learning for Recommendation on Data Missing Not at Random6.331.256, 5, 8
503Learnable Graph Convolutional Attention Networks6.331.255, 6, 8
504How Sharpness-Aware Minimization Minimizes Sharpness?6.331.255, 8, 6
505Quantized Compressed Sensing with Score-Based Generative Models6.331.255, 8, 6
506On The Relative Error of Random Fourier Features for Preserving Kernel Distance6.332.368, 8, 3
507Weakly Supervised Neuro-Symbolic Image Manipulation via Multi-Hop Complex Instructions6.331.256, 5, 8
508Pushing the Accuracy-Fairness Tradeoff Frontier with Introspective Self-play6.331.258, 6, 5
509Imbalanced Semi-supervised Learning with Bias Adaptive Classifier6.331.258, 6, 5
510Excess risk analysis for epistemic uncertainty with application to variational inference6.332.363, 8, 8
511Meta-Learning General-Purpose Learning Algorithms with Transformers6.331.255, 8, 6
5123D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation6.332.368, 8, 3
513Re-calibrating Feature Attributions for Model Interpretation6.332.368, 8, 3
514Offline RL for Natural Language Generation with Implicit Language Q Learning6.332.368, 8, 3
515Fairness and Accuracy under Domain Generalization6.331.256, 5, 8
516Iteratively Learning Novel Strategies with Diversity Measured in State Distances6.331.255, 8, 6
517Contrastive Learning Can Find An Optimal Basis For Approximately View-Invariant Functions6.331.258, 6, 5
518Efficiently Computing Nash Equilibria in Adversarial Team Markov Games6.331.256, 8, 5
519SimPer: Simple Self-Supervised Learning of Periodic Targets6.332.368, 3, 8
520Causal Imitation Learning via Inverse Reinforcement Learning6.331.256, 8, 5
521Efficient Discrete Multi Marginal Optimal Transport Regularization6.331.255, 8, 6
522Human-level Atari 200x faster6.332.363, 8, 8
523Temporal Domain Generalization with Drift-Aware Dynamic Neural Networks6.331.256, 8, 5
524Matching receptor to odorant with protein language and graph neural networks6.331.256, 8, 5
525PGrad: Learning Principal Gradients For Domain Generalization6.332.368, 3, 8
526Statistical Guarantees for Consensus Clustering6.331.258, 5, 6
527Expressive Monotonic Neural Networks6.332.368, 8, 3
528Learning to CROSS exchange to solve min-max vehicle routing problems6.332.363, 8, 8
529Mitigating Dataset Bias by Using Per-Sample Gradient6.331.258, 5, 6
530Multiple Modes for Continual Learning6.332.873, 6, 10
531REVISITING PRUNING AT INITIALIZATION THROUGH THE LENS OF RAMANUJAN GRAPH6.331.256, 8, 5
532Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model6.331.255, 8, 6
533ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency6.332.368, 8, 3
534Neural Architecture Design and Robustness: A Dataset6.331.256, 8, 5
535Learning to Decompose Visual Features with Latent Textual Prompts6.331.258, 6, 5
536MATS: Memory Attention for Time-Series forecasting6.331.256, 5, 8
537MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer6.331.255, 6, 8
538Text-Driven Generative Domain Adaptation with Spectral Consistency Regularization6.331.258, 6, 5
539Transfer Learning with Pre-trained Conditional Generative Models6.331.255, 6, 8
540Treeformer: Dense Gradient Trees for Efficient Attention Computation6.331.256, 5, 8
541Cycle to Clique (Cy2C) Graph Neural Network: A Sight to See beyond Neighborhood Aggregation6.331.258, 6, 5
5423D Molecular Generation by Virtual Dynamics6.331.255, 6, 8
543Adversarial Attacks on Adversarial Bandits6.331.258, 5, 6
544On the Perils of Cascading Robust Classifiers6.331.255, 8, 6
545Diving into Unified Data-Model Sparsity for Class-Imbalanced Graph Representation Learning6.332.363, 8, 8
546Sparse tree-based Initialization for Neural Networks6.331.258, 6, 5
547On the Performance of Temporal Difference Learning With Neural Networks6.331.258, 6, 5
548Calibrating Sequence likelihood Improves Conditional Language Generation6.331.258, 6, 5
549SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric Models6.331.255, 6, 8
550Fuzzy Alignments in Directed Acyclic Graph for Non-Autoregressive Machine Translation6.331.256, 5, 8
551On the complexity of nonsmooth automatic differentiation6.331.256, 5, 8
552Masked Image Modeling with Denoising Contrast6.331.258, 5, 6
553HiViT: A Simpler and More Efficient Design of Hierarchical Vision Transformer6.331.258, 6, 5
554Risk-Aware Reinforcement Learning with Coherent Risk Measures and Non-linear Function Approximation6.331.256, 8, 5
555Learning Proximal Operators to Discover Multiple Optima6.331.258, 6, 5
556Formal Mathematics Statement Curriculum Learning6.332.368, 3, 8
557POPGym: Benchmarking Partially Observable Reinforcement Learning6.332.368, 8, 3
558Learning Sparse and Low-Rank Priors for Image Recovery via Iterative Reweighted Least Squares Minimization6.331.256, 5, 8
559Truthful Self-Play6.331.258, 5, 6
560Continual Transformers: Redundancy-Free Attention for Online Inference6.331.256, 5, 8
561Dirichlet-based Uncertainty Calibration for Active Domain Adaptation6.331.258, 6, 5
562Robustness to corruption in pre-trained Bayesian neural networks6.331.256, 5, 8
563Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction6.331.255, 8, 6
564Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint6.331.256, 5, 8
565A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta.6.331.258, 5, 6
566ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills6.331.255, 8, 6
567Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching6.331.258, 6, 5
568GANet: Graph-Aware Network for Point Cloud Completion with Displacement-Aware Point Augmentor6.332.8710, 6, 3
569Out-of-distribution Detection with Implicit Outlier Transformation6.331.256, 5, 8
570MCAL: Minimum Cost Human-Machine Active Labeling6.331.255, 6, 8
571Learnable Topological Features For Phylogenetic Inference via Graph Neural Networks6.332.363, 8, 8
572Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection6.332.363, 8, 8
573Surgical Fine-Tuning Improves Adaptation to Distribution Shifts6.331.256, 8, 5
574DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation6.331.255, 8, 6
575Understanding and Adopting Rational Behavior by Bellman Score Estimation6.291.166, 5, 8, 5, 8, 6, 6
576Solving stochastic weak Minty variational inequalities without increasing batch size6.251.096, 5, 6, 8
577WiNeRT: Towards Neural Ray Tracing for Wireless Channel Modelling and Differentiable Simulations6.251.096, 6, 5, 8
578On the Certification of Classifiers for Outperforming Human Annotators6.251.095, 6, 6, 8
579Don’t fear the unlabelled: safe semi-supervised learning via debiasing6.252.056, 3, 8, 8
580Boosting Causal Discovery via Adaptive Sample Reweighting6.251.098, 6, 5, 6
581Mole-BERT: Rethinking Pre-training Graph Neural Networks for Molecules6.251.096, 8, 6, 5
582Learning in temporally structured environments6.251.098, 6, 5, 6
583Efficient Certified Training and Robustness Verification of Neural ODEs6.251.096, 8, 5, 6
584UL2: Unifying Language Learning Paradigms6.252.058, 3, 8, 6
585Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts6.251.096, 6, 8, 5
586FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning6.252.053, 8, 6, 8
587Structured World Representations via Block-Slot Attention6.251.095, 6, 8, 6
588CktGNN: Circuit Graph Neural Network for Electronic Design Automation6.251.095, 8, 6, 6
589Linearly Mapping from Image to Text Space6.252.058, 8, 3, 6
590Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification6.251.096, 5, 8, 6
591Memorization Capacity of Neural Networks with Conditional Computation6.252.053, 6, 8, 8
592Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling6.252.058, 3, 6, 8
593Compositional Task Representations for Large Language Models6.251.096, 8, 5, 6
594Unsupervised Learning for Combinatorial Optimization Needs Meta Learning6.251.096, 8, 5, 6
595Unsupervised Meta-learning via Few-shot Pseudo-supervised Contrastive Learning6.252.056, 8, 3, 8
596Decepticons: Corrupted Transformers Breach Privacy in Federated Learning for Language Models6.253.038, 1, 8, 8
597Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descent6.252.053, 8, 6, 8
598Pruning Deep Neural Networks from a Sparsity Perspective6.251.096, 6, 8, 5
599Composite Slice Transformer: An Efficient Transformer with Composition of Multi-Scale Multi-Range Attentions6.251.096, 6, 8, 5
600Information-Theoretic Diffusion6.251.095, 6, 6, 8
601Robust Graph Dictionary Learning6.251.098, 6, 5, 6
602Understanding Influence Functions and Datamodels via Harmonic Analysis6.251.098, 6, 6, 5
603TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization6.251.096, 6, 8, 5
604Dynamical systems embedding with a physics-informed convolutional network6.251.095, 8, 6, 6
605Learning Interpretable Dynamics from Images of a Freely Rotating 3D Rigid Body6.251.096, 5, 6, 8
606Characteristic Neural Ordinary Differential Equation6.251.096, 5, 6, 8
607Forget Unlearning: Towards True Data-Deletion in Machine Learning6.251.098, 6, 5, 6
608Serving Graph Compression for Graph Neural Networks6.252.056, 3, 8, 8
609Learning where and when to reason in neuro-symbolic inference6.251.096, 5, 6, 8
610FIGARO: Controllable Music Generation using Learned and Expert Features6.251.095, 6, 6, 8
611Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function6.252.058, 3, 8, 6
612Hyper-Decision Transformer for Efficient Online Policy Adaptation6.252.056, 3, 8, 8
613Solving Continuous Control via Q-learning6.251.098, 5, 6, 6
614Rhino: Deep Causal Temporal Relationship Learning with History-dependent Noise6.251.098, 5, 6, 6
615Pseudoinverse-Guided Diffusion Models for Inverse Problems6.251.095, 6, 6, 8
616Sequential Gradient Coding For Straggler Mitigation6.251.098, 6, 6, 5
617Understanding DDPM Latent Codes Through Optimal Transport6.251.095, 6, 6, 8
618Self-supervised learning with rotation-invariant kernels6.251.096, 8, 5, 6
619Bidirectional Language Models Are Also Few-shot Learners6.251.096, 5, 8, 6
620EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data6.251.098, 6, 5, 6
621Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse6.251.096, 8, 6, 5
622Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning6.251.096, 8, 6, 5
623Contrastive Learning for Unsupervised Domain Adaptation of Time Series6.252.058, 8, 3, 6
624Fisher-Legendre (FishLeg) optimization of deep neural networks6.251.096, 5, 8, 6
625A law of adversarial risk, interpolation, and label noise6.251.098, 8, 5, 6, 6, 5, 6, 6
626Revisiting Dense Retrieval with Unaswerable Counterfactuals6.251.098, 6, 6, 5
627Pareto-Efficient Decision Agents for Offline Multi-Objective Reinforcement Learning6.251.098, 5, 6, 6
628Language Models are Realistic Tabular Data Generators6.251.096, 8, 6, 5
629CRISP: Curriculum based Sequential neural decoders for Polar code family6.251.095, 6, 6, 8
630Learning Diffusion Bridges on Constrained Domains6.251.098, 5, 6, 6
631Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models6.251.096, 8, 6, 5
632PartAfford: Part-level Affordance Discovery6.252.053, 6, 8, 8
633NewModel: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing6.251.096, 8, 6, 5
634Max-Margin Works while Large Margin Fails: Generalization without Uniform Convergence6.251.096, 8, 6, 5
635Preference Transformer: Modeling Human Preferences using Transformers for RL6.251.095, 6, 6, 8
636MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations6.251.096, 5, 6, 8
637PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm6.252.058, 8, 6, 3
638Language Models Can Teach Themselves to Program Better6.251.098, 6, 6, 5
639Learning Kernelized Contextual Bandits in a Distributed and Asynchronous Environment6.251.098, 6, 5, 6
640Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep Learning6.251.096, 5, 6, 8
641Diffusion Models for Causal Discovery via Topological Ordering6.252.056, 8, 3, 8
642MetaMD: Principled Optimiser Meta-Learning for Deep Learning6.252.056, 8, 8, 3
643When Source-Free Domain Adaptation Meets Learning with Noisy Labels6.251.096, 5, 6, 8
644Concept Gradient: Concept-based Interpretation Without Linear Assumption6.251.096, 5, 8, 6
645MetaGL: Evaluation-Free Selection of Graph Learning Models via Meta-Learning6.251.096, 6, 5, 8
646Critical Initialization of Wide and Deep Neural Networks through Partial Jacobians: General Theory and Applications6.252.056, 8, 3, 8
647MaskViT: Masked Visual Pre-Training for Video Prediction6.251.096, 6, 8, 5
648How to Train your HIPPO: State Space Models with Generalized Orthogonal Basis Projections6.251.098, 6, 6, 5
649Generalization and Estimation Error Bounds for Model-based Neural Networks6.251.098, 5, 6, 6
650SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization6.251.096, 5, 8, 6
651LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification6.251.096, 5, 6, 8
652Liquid Structural State-Space Models6.252.053, 8, 6, 8
653Ollivier-Ricci Curvature for Hypergraphs: A Unified Framework6.251.096, 8, 5, 6
654TiAda: A Time-scale Adaptive Algorithm For Nonconvex Minimax Optimization6.251.096, 5, 8, 6
655Teacher Guided Training: An Efficient Framework for Knowledge Transfer6.251.096, 6, 5, 8
656Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks6.251.098, 5, 6, 6
657Self-supervised Geometric Correspondence for Category-level 6D Object Pose Estimation in the Wild6.251.096, 6, 5, 8
658A Simple Yet Powerful Deep Active Learning With Snapshots Ensembles6.252.058, 6, 8, 3
659Towards Open Temporal Graph Neural Networks6.251.096, 5, 6, 8
660Batch Multivalid Conformal Prediction6.251.098, 6, 6, 5
661Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design6.252.058, 3, 8, 6
662UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer6.252.058, 6, 3, 8
663Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation6.251.098, 5, 6, 6
664Unsupervised visualization of image datasets using contrastive learning6.252.496, 10, 3, 6
665A Differential Geometric View and Explainability of GNN on Evolving Graphs6.251.098, 6, 6, 5
666Generative Modelling with Inverse Heat Dissipation6.251.095, 6, 8, 6
667Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images6.251.095, 6, 8, 6
668Recon: Reducing Conflicting Gradients From the Root For Multi-Task Learning6.252.058, 6, 8, 3
669Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework6.251.096, 5, 6, 8
670Hierarchical Sliced Wasserstein Distance6.251.096, 8, 5, 6
671Prototypical Calibration for Few-shot Learning of Language Models6.251.095, 8, 6, 6
672Your Contrastive Learning Is Secretly Doing Stochastic Neighbor Embedding6.252.053, 8, 6, 8
673Distributionally Robust Recourse Action6.251.098, 6, 5, 6
674Visual Classification via Description from Large Language Models6.251.095, 6, 6, 8
675The World is Changing: Improving Fair Training under Correlation Shifts6.252.058, 3, 6, 8
676Relational Attention: Generalizing Transformers for Graph-Structured Tasks6.251.096, 8, 6, 5
677Distilling Model Failures as Directions in Latent Space6.252.053, 6, 8, 8
678Countinuous pseudo-labeling from the start6.251.096, 6, 5, 8
679FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging6.251.096, 8, 5, 6
680FoSR: First-order spectral rewiring for addressing oversquashing in GNNs6.251.095, 8, 6, 6
681Deep Generative Symbolic Regression6.251.095, 6, 8, 6
682Diffusion Probabilistic Fields6.251.096, 5, 8, 6
683Novel View Synthesis with Diffusion Models6.251.098, 6, 6, 5
684LMC: Fast Training of GNNs via Subgraph Sampling with Provable Convergence6.252.058, 8, 6, 3
685How to Exploit Hyperspherical Embeddings for Out-of-Distribution Detection?6.251.095, 6, 8, 6
686Emergent world representations: Exploring a sequence model trained on a synthetic task6.252.056, 3, 8, 8
687Programmatically Grounded, Compositionally Generalizable Robotic Manipulation6.252.056, 8, 8, 3
688Anisotropic Message Passing: Graph Neural Networks with Directional and Long-Range Interactions6.251.096, 6, 8, 5
689Planckian Jitter: countering the color-crippling effects of color jitter on self-supervised training6.252.053, 8, 6, 8
690GAMR: A Guided Attention Model for (visual) Reasoning6.251.096, 6, 8, 5
691Monocular Scene Reconstruction with 3D SDF Transformers6.251.095, 8, 6, 6
692Re-parameterizing Your Optimizers rather than Architectures6.252.053, 8, 8, 6
693Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models6.251.098, 6, 5, 6
694Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation6.251.095, 6, 8, 6
695NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes6.251.095, 6, 8, 6
696Analyzing Tree Architectures in Ensembles via Neural Tangent Kernel6.251.098, 6, 5, 6
697Proactive Multi-Camera Collaboration for 3D Human Pose Estimation6.251.095, 8, 6, 6
698Become a Proficient Player with Limited Data through Watching Pure Videos6.251.098, 5, 6, 6
699Multi-domain image generation and translation with identifiability guarantees6.251.095, 6, 8, 6
700Information-Theoretic Analysis of Unsupervised Domain Adaptation6.252.056, 8, 8, 3
701Understanding Zero-shot Adversarial Robustness for Large-Scale Models6.252.058, 3, 8, 6
702Continual evaluation for lifelong learning: Identifying the stability gap6.251.095, 8, 6, 6
703A General Framework For Proving The Equivariant Strong Lottery Ticket Hypothesis6.251.096, 5, 6, 8
704CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning6.252.056, 8, 8, 3
705Everybody Needs Good Neighbours: An Unsupervised Locality-based Method for Bias Mitigation6.251.096, 8, 6, 5
706Towards Robust Object Detection Invariant to Real-World Domain Shifts6.251.098, 6, 6, 5
707Light Sampling Field and BRDF Representation for Physically-based Neural Rendering6.252.056, 8, 8, 3
708Bidirectional Propagation for Cross-Modal 3D Object Detection6.251.095, 6, 8, 6
709Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling6.251.096, 5, 8, 6
710EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data6.251.096, 5, 6, 8
711FINDE: Neural Differential Equations for Finding and Preserving Invariant Quantities6.252.058, 6, 3, 8
712Near-Optimal Adversarial Reinforcement Learning with Switching Costs6.252.058, 8, 6, 3
713Sparse Token Transformer with Attention Back Tracking6.251.095, 6, 6, 8
714Kernel Neural Optimal Transport6.251.098, 5, 6, 6
715Iterative $alpha$-(de)Blending: Learning a Deterministic Mapping Between Arbitrary Densities6.251.098, 6, 5, 6
716Diffusion Models Already Have A Semantic Latent Space6.251.096, 8, 6, 5
717Towards Real-Time Neural Image Compression With Mask Decay6.252.056, 3, 8, 8
718Predicting Cellular Responses with Variational Causal Inference and Refined Relational Information6.251.095, 6, 8, 6
719BrainBERT: Self-supervised representation learning for Intracranial Electrodes6.251.095, 6, 8, 6
720Nonlinear Reconstruction for Operator Learning of PDEs with Discontinuities6.252.058, 3, 6, 8
721Sound Randomized Smoothing in Floating-Point Arithmetic6.251.096, 6, 8, 5
722Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path6.252.056, 3, 8, 8
723Test-Time Robust Personalization for Federated Learning6.251.098, 6, 5, 6
724The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning6.252.056, 8, 8, 3
725MPCFORMER: FAST, PERFORMANT AND PRIVATE TRANSFORMER INFERENCE WITH MPC6.252.058, 8, 6, 3
726Disparate Impact in Differential Privacy from Gradient Misalignment6.251.096, 6, 5, 8
727Interactive Portrait Harmonization6.251.098, 5, 6, 6
728Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction6.251.095, 6, 8, 6
729Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning6.251.095, 8, 6, 6
730WaGI: Wavelet-based GAN Inversion for Preserving High-Frequency Image Details6.251.098, 6, 5, 6
731Continuous-Discrete Convolution for (3+1)D Geometry-Sequence Modeling in Proteins6.251.095, 8, 6, 6
732Uniform-in-time propagation of chaos for the mean field gradient Langevin dynamics6.200.988, 5, 6, 6, 6
733SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing6.201.478, 5, 5, 5, 8
734A Mixture-of-Expert Approach to RL-based Dialogue Management6.201.838, 6, 3, 6, 8
735Can Neural Networks Learn Implicit Logic from Physical Reasoning?6.200.986, 6, 6, 5, 8
736Quantitative Universal Approximation Bounds for Deep Belief Networks6.201.838, 6, 3, 8, 6
737Compositional Law Parsing with Latent Random Functions6.200.988, 6, 5, 6, 6
738StyleMorph: Disentangling Shape, Pose and Appearance through 3D Morphable Image and Geometry Generation6.201.833, 8, 8, 6, 6
739Multi-Prompt Alignment for Multi-source Unsupervised Domain Adaptation6.201.475, 8, 5, 5, 8
740Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning6.200.985, 6, 8, 6, 6
741GRACE-C: Generalized Rate Agnostic Causal Estimation via Constraints6.200.985, 6, 8, 6, 6
742TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding6.201.836, 3, 8, 6, 8
743Learning ReLU networks to high uniform accuracy is intractable6.171.678, 6, 3, 6, 8, 6
744Sharper Bounds for Uniformly Stable Algorithms with Stationary $varphi$-mixing Process6.170.906, 6, 5, 8, 6, 6
745FARE: Provably Fair Representation Learning6.002.453, 8, 8, 3, 8
746Encoding Recurrence into Transformers6.001.415, 8, 5
747Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS6.002.128, 5, 3, 8
748CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code6.002.128, 8, 3, 5
749Cross-Layer Retrospective Retrieving via Layer Attention6.001.225, 5, 8, 6
750xTrimoDock: Cross-Modal Transformer for Multi-Chain Protein Docking6.001.415, 8, 5
751RandProx: Primal-Dual Optimization Algorithms with Randomized Proximal Updates6.002.943, 10, 5
752Guarded Policy Optimization with Imperfect Online Demonstrations6.002.128, 3, 5, 8
753Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement6.001.415, 5, 8
754Arbitrary Virtual Try-On Network: Characteristics Representation and Trade-off between Body and Clothing6.002.128, 3, 8, 5
755Feature selection and low test error in shallow low-rotation ReLU networks6.001.225, 5, 8, 6
756Coupled Multiwavelet Operator Learning for Coupled Differential Equations6.000.006, 6, 6
757Mechanistic Mode Connectivity6.000.006, 6, 6, 6
758ADELT: Unsupervised Transpilation Between Deep Learning Frameworks6.001.225, 6, 5, 8
759Recursive Time Series Data Augmentation6.002.556, 3, 5, 10
760Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms6.001.226, 5, 5, 8
761Ask Me Anything: A simple strategy for prompting language models6.000.006, 6, 6, 6
762The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge Distillation6.001.225, 6, 8, 5
763Over-Training with Mixup May Hurt Generalization6.001.225, 5, 8, 6
764Principal Trade-off Analysis6.002.128, 3, 5, 8
765Federated Neural Bandits6.001.225, 8, 5, 6
766Contextual Subspace Approximation with Neural Householder Transforms6.001.418, 5, 5
767A second order regression model shows edge of stability behavior6.001.105, 8, 6, 6, 5
768Broken Neural Scaling Laws6.001.415, 8, 5
769LEARNING CONTEXT-AWARE ADAPTIVE SOLVERS TO ACCELERATE QUADRATIC PROGRAMMING6.001.415, 5, 8
770$mathrm{SE}(3)$-Equivariant Attention Networks for Shape Reconstruction in Function Space6.001.225, 5, 8, 6
771How Can GANs Learn Hierarchical Generative Models for Real-World Distributions6.000.006, 6, 6
772BiAdam: Fast Adaptive Bilevel Optimization Methods6.002.128, 8, 5, 3
773Lovasz Theta Contrastive Learning6.002.555, 10, 6, 3
774Information Plane Analysis for Dropout Neural Networks6.002.125, 8, 8, 3
775Learning Harmonic Molecular Representations on Riemannian Manifold6.001.228, 6, 5, 5
776Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement6.001.415, 8, 5
777STay-On-the-Ridge (STON'R): Guaranteed Convergence to Local Minimax Equilibrium in Nonconvex-Nonconcave Games6.001.415, 8, 5
778Understanding Multi-Task Scaling in Machine Translation6.001.228, 6, 5, 5
779A Simple Approach for Visual Room Rearrangement: 3D Mapping and Semantic Search6.000.006, 6, 6
780Neural Compositional Rule Learning for Knowledge Graph Reasoning6.002.123, 8, 5, 8
781Efficient approximation of neural population structure and correlations with probabilistic circuits6.001.228, 6, 5, 5
782AGRO: Adversarial discovery of error-prone Groups for Robust Optimization6.001.226, 5, 5, 8
783On The Specialization of Neural Modules6.001.415, 5, 8
784Language models are multilingual chain-of-thought reasoners6.001.006, 8, 5, 6, 6, 5
785Subsampling in Large Graphs Using Ricci Curvature6.001.225, 5, 6, 8
786Score-based Continuous-time Discrete Diffusion Models6.002.555, 6, 10, 3
787SurCo: Learning Linear Surrogates for Combinatorial Nonlinear Optimization Problems6.001.415, 8, 5
788Analogical Networks for Memory-Modulated 3D Parsing6.001.225, 8, 5, 6
789DySR: Adaptive Super-Resolution via Algorithm and System Co-design6.001.225, 6, 5, 8
790Synergies Between Disentanglement and Sparsity: a Multi-Task Learning Perspective6.000.006, 6, 6, 6
791Causal Reasoning in the Presence of Latent Confounders via Neural ADMG Learning6.001.226, 5, 8, 5
792Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD6.001.228, 6, 5, 5
793Pushing the limits of self-supervised learning: Can we outperform supervised learning without labels?6.001.225, 6, 8, 5
794DensePure: Understanding Diffusion Models towards Adversarial Robustness6.001.228, 6, 5, 5
795Automatically Auditing Large Language Models via Discrete Optimization6.001.225, 5, 6, 8
796How gradient estimator variance and bias impact learning in neural networks6.001.225, 5, 8, 6
797Distributed Extra-gradient with Optimal Complexity and Communication Guarantees6.001.415, 8, 5
798FIT: A Metric for Model Sensitivity6.001.908, 8, 3, 5, 6
799Revisiting Robustness in Graph Machine Learning6.000.006, 6, 6
800Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation6.001.226, 5, 8, 5
801Logical Message Passing Networks with One-hop Inference on Atomic Formulas6.000.006, 6, 6
802Symmetries, Flat Minima and the Conserved Quantities of Gradient Flow6.001.225, 8, 6, 5
803Synaptic Dynamics Realize First-order Adaptive Learning and Weight Symmetry6.001.415, 8, 5
804Order Matters: Agent-by-agent Policy Optimization6.001.105, 6, 5, 6, 8
805On the Convergence of AdaGrad on $mathbb{R}^d$: Beyond Convexity, Non-Asymptotic Rate and Acceleration6.001.415, 5, 8
806Large language models are not zero-shot communicators6.001.225, 8, 5, 6
807ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations6.001.415, 8, 5
808Improved Learning-augmented Algorithms for k-means and k-medians Clustering6.000.006, 6, 6
809DIFFUSION GENERATIVE MODELS ON SO(3)6.001.418, 5, 5
810Learning About Progress From Experts6.000.006, 6, 6
811Distributed Graph Neural Network Training with Periodic Stale Representation Synchronization6.001.226, 5, 8, 5
812Obtaining More Generalizable Fair Classifiers on Imbalanced Datasets6.000.006, 6, 6
813Understanding The Robustness of Self-supervised Learning Through Topic Modeling6.000.006, 6, 6
814Adversarial Cheap Talk6.001.228, 5, 5, 6
815Achieve Near-Optimal Individual Regret & Low Communications in Multi-Agent Bandits6.000.006, 6, 6
816Online Boundary-Free Continual Learning by Scheduled Data Prior6.001.105, 6, 8, 5, 6
817Revisiting adapters with adversarial training6.001.228, 6, 5, 5
818A Self-Attention Ansatz for Ab-initio Quantum Chemistry6.001.228, 6, 5, 5
819Multi-Behavior Dynamic Contrastive Learning for Recommendation6.001.228, 5, 5, 6
820HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork6.000.006, 6, 6
821Towards the Detection of Diffusion Model Deepfakes6.001.106, 5, 8, 5, 6
822Identifiability Results for Multimodal Contrastive Learning6.001.228, 6, 5, 5
823Causal Attention to Exploit Transient Emergence of Causal Effect6.001.418, 5, 5
824Correlative Information Maximization Based Biologically Plausible Neural Networks for Correlated Source Separation6.001.415, 8, 5
825Copy is All You Need6.001.226, 5, 5, 8
826Why adversarial training can hurt robust accuracy6.002.128, 3, 5, 8
827Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection6.000.006, 6, 6, 6
828TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization6.001.415, 8, 5
829Improving the imputation of missing data with Markov Blanket discovery6.001.225, 8, 6, 5
830Understanding Neural Coding on Latent Manifolds by Sharing Features and Dividing Ensembles6.000.006, 6, 6
831Defending against Adversarial Audio via Diffusion Model6.001.226, 5, 8, 5
832Theoretical Characterization of the Generalization Performance of Overfitted Meta-Learning6.001.225, 8, 5, 6
833Towards graph-level anomaly detection via deep evolutionary mapping6.001.415, 8, 5
834Global Explainability of GNNs via Logic Combination of Learned Concepts6.001.415, 8, 5
835Instance-Specific Augmentation: Capturing Local Invariances6.000.006, 6, 6
836$Lambda$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among Cells6.000.006, 6, 6, 6
837Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation6.001.418, 5, 5
838Inequality phenomenon in $l_{infty}$-adversarial training, and its unrealized threats6.002.123, 8, 5, 8
839Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow6.000.006, 6, 6
840Complexity-Based Prompting for Multi-step Reasoning6.002.128, 5, 3, 8
841Not All Tasks Are Born Equal: Understanding Zero-Shot Generalization6.001.226, 5, 5, 8
842What Do Self-Supervised Vision Transformers Learn?6.002.125, 3, 8, 8
843Sampled Transformer for Point Sets6.001.225, 5, 8, 6
844Squeeze Training for Adversarial Robustness6.000.006, 6, 6, 6
845Provably efficient multi-task Reinforcement Learning in large state spaces6.001.415, 5, 8
846Learning Multi-Object Positional Relationships via Emergent Communication6.002.128, 5, 3, 8
847The Dark Side of Invariance: Revisiting the Role of Augmentations in Contrastive Learning6.001.225, 5, 6, 8
848Long-Tailed Partial Label Learning via Dynamic Rebalancing6.001.226, 8, 5, 5
849How hard are computer vision datasets? Calibrating dataset difficulty to viewing time6.001.225, 8, 5, 6
850Do We Always Need to Penalize Variance of Losses for Learning with Label Noise?6.001.418, 5, 5
851Causal Estimation for Text Data with (Apparent) Overlap Violations6.000.006, 6, 6, 6
852Adversarial Diversity in Hanabi6.000.006, 6, 6
853CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos6.000.006, 6, 6, 6, 6
854CAREER: Transfer Learning for Economic Prediction of Labor Data6.001.415, 5, 8
855Federated Nearest Neighbor Machine Translation6.000.006, 6, 6, 6
856ROCO: A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs6.001.225, 5, 6, 8
857PiFold: Toward effective and efficient protein inverse folding6.001.418, 5, 5
858Distributional Signals for Node Classification in Graph Neural Networks6.001.415, 8, 5
859Planning Goals for Exploration6.001.903, 5, 6, 8, 8
860Scalable and Equivariant Spherical CNNs by Discrete-Continuous (DISCO) Convolutions6.001.226, 8, 5, 5
861Learning Efficient Hybrid Particle-continuum Representations of Non-equilibrium N-body Systems6.001.415, 8, 5
862Neural Network Approximation of Lipschitz Functions in High Dimensions with Applications to Inverse Problems6.001.226, 5, 5, 8
863Minimum Description Length Control6.001.225, 8, 5, 6
864Tuning Frequency Bias in Neural Network Training with Nonuniform Data6.001.226, 5, 8, 5
865Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?6.002.553, 6, 10, 5
866Does Decentralized Learning with Non-IID Unlabeled Data Benefit from Self Supervision?6.001.228, 5, 5, 6
867MEDFAIR: BENCHMARKING FAIRNESS FOR MEDICAL IMAGEING6.002.123, 5, 8, 8
868Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness6.001.105, 5, 8, 6, 6
869SMART: Sentences as Basic Units for Text Evaluation6.001.225, 8, 5, 6
870Neural Design for Genetic Perturbation Experiments6.001.226, 8, 5, 5
871Quantifying Memorization Across Neural Language Models6.001.225, 5, 8, 6
872Diffusion Adversarial Representation Learning for Self-supervised Vessel Segmentation6.000.006, 6, 6, 6
873A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games6.002.125, 8, 8, 3
874The Dark Side of AutoML: Towards Architectural Backdoor Search6.001.228, 5, 5, 6
875On the Data-Efficiency with Contrastive Image Transformation in Reinforcement Learning6.001.226, 5, 5, 8
876Energy-based Out-of-Distribution Detection for Graph Neural Networks6.001.225, 5, 8, 6
877Compositional Semantic Parsing with Large Language Models6.001.225, 5, 6, 8
878MEDICAL IMAGE UNDERSTANDING WITH PRETRAINED VISION LANGUAGE MODELS: A COMPREHENSIVE STUDY6.001.225, 6, 8, 5
879Adversarial Attack Detection Through Network Transport Dynamics6.001.418, 5, 5
880Knowledge-Driven Active Learning6.001.105, 5, 6, 6, 8
881CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment6.001.105, 5, 6, 8, 6
882Transferring Pretrained Diffusion Probabilistic Models6.001.225, 5, 6, 8
883Test-Time Adaptation via Self-Training with Nearest Neighbor Information6.001.225, 8, 5, 6
884Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting6.001.415, 8, 5
885Massively Scaling Heteroscedastic Classifiers6.001.735, 8, 3, 6, 8, 6
886Blurring Diffusion Models6.001.225, 5, 6, 8
887Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations6.001.226, 5, 5, 8
888On Uni-modal Feature Learning in Multi-modal Learning6.001.225, 6, 8, 5
889VA-DepthNet: A Variational Approach to Single Image Depth Prediction6.001.225, 5, 8, 6
890E-Forcing: Improving Autoregressive Models by Treating it as an Energy-Based One6.001.415, 8, 5
891TRANSFORMER-PATCHER: ONE MISTAKE WORTH ONE NEURON6.001.225, 6, 5, 8
892On the Edge of Benign Overfitting: Label Noise and Overparameterization Level6.000.006, 6, 6
893Measure the Predictive Heterogeneity6.001.225, 6, 8, 5
894In-sample Actor Critic for Offline Reinforcement Learning6.001.228, 5, 6, 5
895Joint Gaussian Mixture Model for Versatile Deep Visual Model Explanation6.002.128, 8, 3, 5
896Localized Graph Contrastive Learning6.001.225, 8, 6, 5
897CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling6.000.006, 6, 6
898Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting6.001.226, 5, 5, 8
899Ensuring DNN Solution Feasibility for Optimization Problems with Linear Constraints6.001.225, 8, 6, 5
900AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE6.001.415, 8, 5
901From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data6.002.125, 3, 8, 8
902FINE: Future-Aware Inference for Streaming Speech Translation6.001.106, 8, 5, 5, 6
903Stable Target Field for Reduced Variance Score Estimation6.001.415, 8, 5
904Dynamic Embeddings of Temporal High-Order Interactions via Neural Diffusion-Reaction Processes6.001.225, 5, 8, 6
905DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking6.003.083, 8, 10, 3
906Certified Defences Against Adversarial Patch Attacks on Semantic Segmentation6.001.228, 5, 5, 6
907How Much Space Has Been Explored? Measuring the Chemical Space Covered by Databases and Machine-Generated Molecules6.001.226, 8, 5, 5
908Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective6.001.105, 6, 8, 6, 5
909DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases6.001.228, 5, 6, 5
910NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis6.001.225, 5, 8, 6
911Iterative Patch Selection for High-Resolution Image Recognition6.002.128, 8, 5, 3
9123D Segmenter: 3D Transformer based Semantic Segmentation via 2D Panoramic Distillation6.001.225, 6, 5, 8
913GOOD: Exploring geometric cues for detecting objects in an open world6.001.226, 8, 5, 5
914TabCaps: A Capsule Neural Network for Tabular Data Classification with BoW Routing6.001.415, 5, 8
915Koopman neural operator for learning non-linear partial differential equations6.001.415, 5, 8
916CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling6.001.225, 5, 6, 8
917Toeplitz Neural Network for Sequence Modeling6.002.123, 8, 5, 8
918Deep Learning on Implicit Neural Representations of Shapes6.001.228, 5, 6, 5
919Learning Counterfactually Invariant Predictors6.001.228, 5, 6, 5
920ImaginaryNet: Learning Object Detectors without Real Images and Annotations6.001.225, 8, 6, 5
921Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased6.000.006, 6, 6, 6
922From $t$-SNE to UMAP with contrastive learning6.001.908, 5, 8, 3, 6
923Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning6.001.008, 5, 6, 6, 5, 6
924Generalize Learned Heuristics to Solve Large-scale Vehicle Routing Problems in Real-time6.001.226, 5, 8, 5
925Towards the Generalization of Contrastive Self-Supervised Learning6.002.285, 3, 6, 10, 6
926Do We Need Neural Collapse? Learning Diverse Features for Fine-grained and Long-tail Classification6.001.415, 8, 5
927DepthFL : Depthwise Federated Learning for Heterogeneous Clients6.001.225, 6, 5, 8
928BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers6.001.226, 5, 8, 5
929CooPredict : Cooperative Differential Games For Time Series Prediction6.001.415, 8, 5
930Molecule Generation For Target Protein Binding with Structural Motifs6.001.226, 5, 5, 8
931Towards Robustness Certification Against Universal Perturbations6.002.128, 8, 5, 3
932Multimodal Federated Learning via Contrastive Representation Ensemble6.001.225, 8, 5, 6
933Adversarial perturbation based latent reconstruction for domain-agnostic self-supervised learning6.001.225, 6, 8, 5
934Protein Representation Learning by Geometric Structure Pretraining6.001.225, 8, 5, 6
935Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation6.001.226, 8, 5, 5
936Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning6.001.228, 6, 5, 5
937Reversible Column Networks6.000.006, 6, 6
938What Is Missing in IRM Training and Evaluation? Challenges and Solutions6.000.006, 6, 6
939Multi-task Self-supervised Graph Neural Networks Enable Stronger Task Generalization6.000.006, 6, 6
940Hierarchies of Reward Machines6.001.418, 5, 5
941LatentAugment: Dynamically Optimized Latent Probabilities of Data Augmentation6.001.225, 8, 5, 6
942Policy Contrastive Imitation Learning6.001.415, 5, 8
943Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes6.000.006, 6, 6, 6
944Dataless Knowledge Fusion by Merging Weights of Language Models6.001.225, 6, 8, 5
945GReTo: Remedying dynamic graph topology-task discordance via target homophily6.001.106, 6, 8, 5, 5
946Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning6.000.006, 6, 6
947Particle-based Variational Inference with Preconditioned Functional Gradient Flow6.000.006, 6, 6
948Selective Annotation Makes Language Models Better Few-Shot Learners6.001.225, 5, 6, 8
949Adaptive Client Sampling in Federated Learning via Online Learning with Bandit Feedback6.001.225, 5, 6, 8
950SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation6.002.128, 3, 8, 5
951Learning Symbolic Models for Graph-structured Physical Mechanism6.001.415, 5, 8
952AdaDQH Optimizer: Evolving from Stochastic to Adaptive by Auto Switch of Precondition Matrix6.001.418, 5, 5
953Dataset Pruning: Reducing Training Data by Examining Generalization Influence6.001.225, 8, 6, 5
954Expected Gradients of Maxout Networks and Consequences to Parameter Initialization6.001.108, 6, 5, 5, 6
955Online Continual Learning for Progressive Distribution Shift (OCL-PDS): A Practitioner's Perspective6.002.555, 3, 10, 6
956Understanding Why Generalized Reweighting Does Not Improve Over ERM6.001.226, 5, 5, 8
957Composing Ensembles of Pre-trained Models via Iterative Consensus6.001.226, 8, 5, 5
958Learning Label Encodings for Deep Regression6.000.006, 6, 6, 6
959Riemannian Metric Learning via Optimal Transport6.001.225, 6, 5, 8
960Deep Variational Implicit Processes6.001.225, 6, 5, 8
961Estimating individual treatment effects under unobserved confounding using binary instruments6.000.006, 6, 6, 6
962Denoising Diffusion Error Correction Codes6.000.006, 6, 6
963Exploring Active 3D Object Detection from a Generalization Perspective6.000.006, 6, 6, 6
964Learning Object-Language Alignments for Open-Vocabulary Object Detection6.001.225, 8, 6, 5
965Inferring Fluid Dynamics via Inverse Rendering6.001.418, 5, 5
966Exploring Low-Rank Property in Multiple Instance Learning for Whole Slide Image Classification6.001.228, 6, 5, 5
967Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs6.001.225, 5, 6, 8
968IDP: Iterative Differentiable Pruning based on Attention for Deep Neural Networks6.001.228, 5, 6, 5
969OTOv2: Automatic, Generic, User-Friendly6.001.415, 5, 8
970Sparse Q-Learning: Offline Reinforcement Learning with Implicit Value Regularization6.001.415, 5, 8
971Admeta: A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers with Bidirectional Looking6.000.006, 6, 6, 6
972Statistical Inference for Fisher Market Equilibrium6.000.006, 6, 6
973Scenario-based Question Answering with Interacting Contextual Properties6.000.006, 6, 6
974Visual Recognition with Deep Nearest Centroids6.001.225, 6, 8, 5
975Continuous PDE Dynamics Forecasting with Implicit Neural Representations6.000.006, 6, 6, 6
976Towards Inferential Reproducibility of Machine Learning Research6.001.418, 5, 5
977Graph Contrastive Learning for Skeleton-based Action Recognition6.002.125, 8, 3, 8
978Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation6.001.108, 6, 5, 6, 5
979Spikformer: When Spiking Neural Network Meets Transformer6.002.555, 10, 3, 6
980Multimodal Analogical Reasoning over Knowledge Graphs6.001.415, 5, 8
981What shapes the loss landscape of self supervised learning?6.000.006, 6, 6
982Conditional Positional Encodings for Vision Transformers6.001.226, 8, 5, 5
983Label Distribution Learning via Implicit Distribution Representation6.001.228, 5, 6, 5
984Learning to Compose Soft Prompts for Compositional Zero-Shot Learning6.001.228, 6, 5, 5
985SQA3D: Situated Question Answering in 3D Scenes6.000.006, 6, 6, 6
986The Benefits of Model-Based Generalization in Reinforcement Learning6.001.225, 5, 6, 8
987Extracting Robust Models with Uncertain Examples6.001.225, 5, 6, 8
988Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks6.001.226, 5, 8, 5
989DifFace: Blind Face Restoration with Diffused Error Contraction6.001.226, 5, 8, 5
990ChiroDiff: Modelling chirographic data with Diffusion Models6.000.006, 6, 6
991Real-Time Image Demoir$acute{e}$ing on Mobile Devices6.002.123, 8, 5, 8
992Steering Prototypes with Prompt Tuning for Rehearsal-free Continual Learning6.000.006, 6, 6, 6
993Decompose to Generalize: Species-Generalized Animal Pose Estimation6.001.225, 5, 8, 6
994Learning Implicit Scale Conditioned Memory Compensation for Talking Head Generation6.000.006, 6, 6
995Logical Entity Representation in Knowledge-Graphs for Differentiable Rule Learning6.001.228, 5, 6, 5
996Suppressing the Heterogeneity: A Strong Feature Extractor for Few-shot Segmentation6.001.226, 5, 5, 8
997Sparse and Hierarchical Masked Modeling for Convolutional Representation Learning6.001.415, 8, 5
998On amortizing convex conjugates for optimal transport6.000.006, 6, 6, 6
999ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training6.001.228, 6, 5, 5
1000Generalization Bounds for Federated Learning: Fast Rates, Unparticipating Clients and Unbounded Losses5.831.075, 6, 5, 6, 8, 5
1001Corrupted Image Modeling for Self-Supervised Visual Pre-Training5.831.076, 5, 8, 6, 5, 5
1002Neural Probabilistic Logic Programming in Discrete-Continuous Domains5.801.175, 5, 5, 8, 6
1003Substructure-Atom Cross Attention for Molecular Representation Learning5.801.175, 5, 8, 5, 6
1004Language Models Can (kind of) Reason: A Systematic Formal Analysis of Chain-of-Thought5.801.178, 5, 5, 5, 6
1005Evaluation of Active Feature Acquisition Methods under Missing Data5.801.606, 8, 6, 6, 3
1006Learning to Induce Causal Structure5.801.176, 5, 5, 5, 8
1007Energy Transformer5.801.175, 5, 8, 6, 5
1008Sample Relationships through the Lens of Learning Dynamics with Label Information5.801.178, 5, 5, 6, 5
1009CUDA: Curriculum of Data Augmentation for Long-tailed Recognition5.801.176, 5, 8, 5, 5
1010Transport with Support: Data-Conditional Diffusion Bridges5.750.436, 6, 5, 6
1011FairGBM: Gradient Boosting with Fairness Constraints5.751.793, 6, 8, 6
1012Robust Training through Adversarially Selected Data Subsets5.750.436, 5, 6, 6
1013Face reconstruction from facial templates by learning latent space of a generator network5.750.435, 6, 6, 6
1014Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery5.751.793, 6, 8, 6
1015Gray-Box Gaussian Processes for Automated Reinforcement Learning5.751.305, 5, 5, 8
1016One-Step Estimator for Permuted Sparse Recovery5.750.436, 6, 6, 5
1017Leveraging Large Language Models for Multiple Choice Question Answering5.751.308, 5, 5, 5
1018Transfer NAS with Meta-learned Bayesian Surrogates5.750.436, 6, 5, 6
1019Mitigating the Limitations of Multimodal VAEs with Coordination-Based Approach5.751.305, 5, 5, 8
1020Sharp Convergence Analysis of Gradient Descent for Deep Linear Neural Networks5.751.305, 5, 8, 5
1021Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation5.750.436, 6, 6, 5
1022Sparse Distributed Memory is a Continual Learner5.751.305, 8, 5, 5
1023Hyper-parameter Tuning for Fair Classification without Sensitive Attribute Access5.751.308, 5, 5, 5
1024Rethinking Symbolic Regression: Morphology and Adaptability in the Context of Evolutionary Algorithms5.751.798, 6, 6, 3
1025Imitating Graph-Based Planning with Goal-Conditioned Policies5.751.796, 3, 8, 6
1026Computational Language Acquisition with Theory of Mind5.751.798, 6, 3, 6
1027Pareto Invariant Risk Minimization5.751.308, 5, 5, 5
1028Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories5.750.436, 6, 6, 5
1029STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables5.750.436, 5, 6, 6
1030Compressed Predictive Information Coding5.751.796, 6, 3, 8
1031WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus5.751.793, 6, 8, 6
1032Reinforcement Learning-Based Estimation for Partial Differential Equations5.750.436, 5, 6, 6
1033Heterogeneous-Agent Mirror Learning5.751.798, 3, 6, 6
1034TextShield: Beyond Successfully Detecting Adversarial Sentences in NLP5.751.305, 5, 8, 5
1035Minimalistic Unsupervised Learning with the Sparse Manifold Transform5.750.436, 6, 5, 6
1036Quantile Risk Control: A Flexible Framework for Bounding the Probability of High-Loss Predictions5.750.436, 5, 6, 6
1037HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention5.750.436, 5, 6, 6
1038Return Augmentation gives Supervised RL Temporal Compositionality5.750.436, 6, 5, 6
1039Characterizing intrinsic compositionality in transformers with Tree Projections5.751.796, 3, 6, 8
1040Open-Set 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning5.750.436, 6, 6, 5
1041Interaction-Based Disentanglement of Entities for Object-Centric World Models5.750.436, 6, 5, 6
1042PromptBoosting: Black-Box Text Classification with Ten Forward Passes5.750.436, 6, 6, 5
1043Adaptive Optimization in the $infty$-Width Limit5.751.305, 5, 5, 8
1044A Control-Centric Benchmark for Video Prediction5.751.796, 3, 8, 6
1045Data-Efficient Finetuning Using Cross-Task Nearest Neighbors5.751.796, 3, 8, 6
1046Unveiling Transformers with LEGO: A Synthetic Reasoning Task5.751.798, 3, 6, 6
1047Efficiently Controlling Multiple Risks with Pareto Testing5.751.796, 8, 6, 3
1048Learning Structured Representations by Embedding Class Hierarchy5.751.308, 5, 5, 5
1049FunkNN: Neural Interpolation for Functional Generation5.750.435, 6, 6, 6
1050Approximating any Function via Coreset for Radial Basis Functions: Towards Provable Data Subset Selection For Efficient Neural Networks training5.750.435, 6, 6, 6
1051Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation5.751.796, 6, 8, 3
1052A Statistical Framework for Personalized Federated Learning and Estimation: Theory, Algorithms, and Privacy5.750.435, 6, 6, 6
1053Learning Low Dimensional State Spaces with Overparameterized Recurrent Neural Networks5.750.436, 6, 5, 6
1054DT+GNN: A Fully Explainable Graph Neural Network using Decision Trees5.750.436, 6, 6, 5
1055Spatio-temporal point processes with deep non-stationary kernels5.750.435, 6, 6, 6
1056DAG Learning via Sparse Relaxations5.750.436, 5, 6, 6
1057Autoregressive Diffusion Model for Graph Generation5.750.436, 5, 6, 6
1058Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations5.750.436, 6, 6, 5
1059Don’t forget the nullspace! Nullspace occupancy as a mechanism for out of distribution failure5.751.305, 5, 8, 5
1060Towards Interpretable Deep Reinforcement Learning with Human-Friendly Prototypes5.750.435, 6, 6, 6
1061Compositional Task Generalization with Discovered Successor Feature Modules5.751.796, 6, 8, 3
1062Phenaki: Variable Length Video Generation from Open Domain Textual Descriptions5.751.793, 6, 8, 6
1063On the (Non-)Robustness of Two-Layer Neural Networks in Different Learning Regimes5.751.796, 3, 8, 6
1064CrAM: A Compression-Aware Minimizer5.751.798, 6, 3, 6
1065Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees5.751.796, 3, 8, 6
1066Hebbian Deep Learning Without Feedback5.750.435, 6, 6, 6
1067Learning to Abstain from Uninformative Data5.751.308, 5, 5, 5
1068Know Your Boundaries: The Advantage of Explicit Behavior Cloning in Offline RL5.751.793, 6, 8, 6
1069Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning5.751.793, 6, 8, 6
1070Maximum Entropy Information Bottleneck for Confidence-aware Stochastic Embedding5.751.305, 8, 5, 5
1071Certifiably Robust Transformers with 1-Lipschitz Self-Attention5.750.435, 6, 6, 6
1072$k$NN Prompting: Learning Beyond the Context with Nearest Neighbor Inference5.751.796, 6, 8, 3
1073Uncovering Directions of Instability via Quadratic Approximation of Deep Neural Loss in Reinforcement Learning5.751.308, 5, 5, 5
1074This Looks Like It Rather Than That: ProtoKNN For Similarity-Based Classifiers5.750.436, 5, 6, 6
1075Leveraging Importance Weights in Subset Selection5.751.798, 6, 6, 3
1076Gradient flow in the gaussian covariate model: exact solution of learning curves and multiple descent structures5.750.436, 6, 6, 5
1077MILAN: Masked Image Pretraining on Language Assisted Representation5.751.305, 8, 5, 5
1078Learning topology-preserving data representations5.751.796, 8, 6, 3
1079The Curious Case of Benign Memorization5.751.796, 3, 6, 8
1080Can Wikipedia Help Offline Reinforcement Learning?5.751.798, 6, 3, 6
1081Modeling Temporal Data as Continuous Functions with Process Diffusion5.750.435, 6, 6, 6
1082Model-based Causal Bayesian Optimization5.751.305, 8, 5, 5
1083Probabilistic Imputation for Time-series Classification with Missing Data5.751.305, 5, 5, 8
1084Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints5.751.796, 6, 8, 3
1085Statistical Theory of Differentially Private Marginal-based Data Synthesis Algorithms5.750.436, 5, 6, 6
1086A Primal-Dual Framework for Transformers and Neural Networks5.751.796, 3, 6, 8
1087Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization5.750.436, 5, 6, 6
1088MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors5.751.798, 6, 3, 6
1089Quantum Vision Transformers5.752.595, 10, 3, 5
1090Pre-training Protein Structure Encoder via Siamese Diffusion Trajectory Prediction5.751.305, 8, 5, 5
1091Scaling Laws in Mean-Field Games5.751.796, 6, 3, 8
1092Clustering for directed graphs using parametrized random walk diffusion kernels5.750.435, 6, 6, 6
1093ProsodyBERT: Self-Supervised Prosody Representation for Style-Controllable TTS5.752.595, 10, 3, 5
1094Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation5.750.436, 6, 5, 6
1095The hidden uniform cluster prior in self-supervised learning5.750.435, 6, 6, 6
1096Spacetime Representation Learning5.751.798, 6, 3, 6
1097CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks5.750.435, 6, 6, 6
1098LipsFormer: Introducing Lipschitz Continuity to Vision Transformers5.751.793, 8, 6, 6
1099Automatic Chain of Thought Prompting in Large Language Models5.751.793, 6, 6, 8
1100Latent Variable Representation for Reinforcement Learning5.751.793, 6, 8, 6
1101SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning5.751.798, 6, 3, 6
1102Attention-Guided Backdoor Attacks against Transformers5.751.305, 5, 8, 5
1103Overthinking the Truth: Understanding how Language Models process False Demonstrations5.751.305, 8, 5, 5
1104Re-Imagen: Retrieval-Augmented Text-to-Image Generator5.750.435, 6, 6, 6
1105Implicit regularization via Spectral Neural Networks and non-linear matrix sensing5.751.796, 6, 3, 8
1106A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning5.751.305, 8, 5, 5
1107Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning5.750.436, 6, 5, 6
1108Graph Convolutional Normalizing Flows for Semi-Supervised Classification and Clustering5.751.308, 5, 5, 5
1109Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic5.750.436, 6, 6, 5
1110Weighted Ensemble Self-Supervised Learning5.751.793, 6, 8, 6
1111TILP: Differentiable Learning of Temporal Logical Rules on Knowledge Graphs5.751.305, 5, 5, 8
1112Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP5.751.305, 5, 8, 5
1113CURE: A Pre-training Framework on Large-scale Patient Data for Treatment Effect Estimation5.751.305, 5, 8, 5
1114Bridging the Gap between Semi-supervised and Supervised Continual Learning via Data Programming5.751.305, 8, 5, 5
1115Measuring Forgetting of Memorized Training Examples5.750.436, 6, 5, 6
1116Efficient Edge Inference by Selective Query5.751.796, 8, 6, 3
1117Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments5.751.305, 8, 5, 5
1118Model Transferability with Responsive Decision Subjects5.751.305, 5, 5, 8
1119NTFields: Neural Time Fields for Physics-Informed Robot Motion Planning5.750.436, 6, 5, 6
1120ZiCo: Zero-shot NAS via inverse Coefficient of Variation on Gradients5.750.436, 6, 5, 6
1121Learning Simultaneous Navigation and Construction in Grid Worlds5.750.435, 6, 6, 6
1122PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs5.750.435, 6, 6, 6
1123Towards Minimax Optimal Reward-free Reinforcement Learning in Linear MDPs5.750.436, 5, 6, 6
1124Which Layer is Learning Faster? A Systematic Exploration of Layer-wise Convergence Rate for Deep Neural Networks5.750.436, 6, 6, 5
1125Scaleformer: Iterative Multi-scale Refining Transformers for Time Series Forecasting5.750.436, 6, 6, 5
1126Sparse MoE with Random Routing as the New Dropout: Training Bigger and Self-Scalable Models5.750.436, 5, 6, 6
1127Jump-Start Reinforcement Learning5.751.796, 8, 6, 3
1128Sequence to sequence text generation with diffusion models5.751.793, 6, 6, 8
1129BSTT: A Bayesian Spatial-Temporal Transformer for Sleep Staging5.751.308, 5, 5, 5
1130Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation5.750.436, 6, 5, 6
1131Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition5.750.436, 6, 6, 5
1132Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning5.751.305, 8, 5, 5
1133Equivariant Energy-Guided SDE for Inverse Molecular Design5.751.308, 5, 5, 5
1134Demystifying Approximate RL with $epsilon$-greedy Exploration: A Differential Inclusion View5.751.308, 5, 5, 5
1135Delving into the Openness of CLIP5.751.305, 5, 5, 8
1136Unsupervised Manifold Alignment with Joint Multidimensional Scaling5.751.798, 3, 6, 6
1137Learning with Auxiliary Activation for Memory-Efficient Training5.751.793, 6, 6, 8
1138Finding the global semantic representation in GAN through Fréchet Mean5.751.798, 3, 6, 6
1139E3Bind: An End-to-End Equivariant Network for Protein-Ligand Docking5.750.435, 6, 6, 6
1140Joint Generator-Ranker Learning for Natural Language Generation5.750.436, 5, 6, 6
1141Gromov-Wasserstein Autoencoders5.750.436, 6, 5, 6
1142Learning to Learn with Generative Models of Neural Network Checkpoints5.751.305, 8, 5, 5
1143Optimal Activation Functions for the Random Features Regression Model5.751.308, 5, 5, 5
1144Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap5.751.798, 3, 6, 6
1145Hierarchical Protein Representations via Complete 3D Graph Networks5.751.798, 6, 6, 3
1146Write and Paint: Generative Vision-Language Models are Unified Modal Learners5.750.436, 5, 6, 6
1147Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing5.751.796, 8, 3, 6
1148Contrastive Novelty Learning: Anticipating Outliers with Large Language Models5.750.436, 6, 5, 6
1149Adaptive Robust Evidential Optimization For Open Set Detection from Imbalanced Data5.750.435, 6, 6, 6
1150Learning Soft Constraints From Constrained Expert Demonstrations5.751.305, 5, 5, 8
1151Bridge the Inference Gaps of Neural Processes via Expectation Maximization5.751.793, 6, 6, 8
1152Masked Vision and Language Modeling for Multi-modal Representation Learning5.751.305, 5, 5, 8
1153Markup-to-Image Diffusion Models with Scheduled Sampling5.751.796, 6, 8, 3
1154Posterior Sampling Model-based Policy Optimization under Approximate Inference5.751.793, 8, 6, 6
1155What Can we Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers?5.750.436, 6, 6, 5
1156Transformer Meets Boundary Value Inverse Problems5.751.308, 5, 5, 5
1157Landscape Learning for Neural Network Inversion5.750.436, 5, 6, 6
1158Stochastic Multi-Person 3D Motion Forecasting5.751.798, 6, 6, 3
1159Multi-Objective Reinforcement Learning: Convexity, Stationarity and Pareto Optimality5.751.798, 6, 3, 6
1160Continual Unsupervised Disentangling of Self-Organizing Representations5.751.793, 8, 6, 6
1161Learning Human-Compatible Representations for Case-Based Decision Support5.750.436, 5, 6, 6
1162Unified Discrete Diffusion for Simultaneous Vision-Language Generation5.751.305, 8, 5, 5
1163Evidential Uncertainty and Diversity Guided Active Learning for Scene Graph Generation5.750.436, 6, 6, 5
1164Approximate Nearest Neighbor Search through Modern Error-Correcting Codes5.751.796, 8, 6, 3
1165DENSE RGB SLAM WITH NEURAL IMPLICIT MAPS5.750.436, 6, 6, 5
1166Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval5.751.796, 6, 8, 3
1167Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths5.751.793, 6, 8, 6
1168Understanding Rare Spurious Correlations in Neural Networks5.751.305, 8, 5, 5
1169Neural Diffusion Processes5.751.796, 8, 3, 6
1170Learning Locality and Isotropy in Dialogue Modeling5.751.796, 6, 3, 8
1171Adaptive Update Direction Rectification for Unsupervised Continual Learning5.750.436, 6, 6, 5
1172NORM: Knowledge Distillation via N-to-One Representation Matching5.751.305, 5, 5, 8
1173CroMA: Cross-Modality Adaptation for Monocular BEV Perception5.751.305, 5, 5, 8
1174Robust Multi-Agent Reinforcement Learning with State Uncertainties5.750.436, 6, 5, 6
1175Neural Optimal Transport with General Cost Functionals5.751.796, 3, 6, 8
1176Strategic Classification on Graphs5.751.793, 6, 8, 6
1177Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning5.751.308, 5, 5, 5
1178Visual Imitation Learning with Patch Rewards5.751.793, 6, 8, 6
1179Discovering Informative and Robust Positives for Video Domain Adaptation5.750.435, 6, 6, 6
1180Gradient-Guided Importance Sampling for Learning Binary Energy-Based Models5.750.435, 6, 6, 6
1181Single-shot General Hyper-parameter Optimization for Federated Learning5.751.796, 3, 6, 8
1182ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation5.751.798, 6, 6, 3
1183SCoMoE: Efficient Mixtures of Experts with Structured Communication5.750.436, 5, 6, 6
1184Uncertainty-Aware Self-Supervised Learning with Independent Sub-networks5.751.308, 5, 5, 5
1185Towards Semi-Supervised Learning with Non-Random Missing Labels5.750.435, 6, 6, 6
1186Masked Frequency Modeling for Self-Supervised Visual Pre-Training5.751.305, 5, 5, 8
1187S-NeRF: Neural Radiance Fields for Street Views5.751.796, 6, 8, 3
1188Differentiable Gaussianization Layers for Inverse Problems Regularized by Deep Generative Models5.751.793, 8, 6, 6
1189Evaluating and Inducing Personality in Pre-trained Language Models5.750.436, 5, 6, 6
1190Block and Subword-Scaling Floating-Point (BSFP) : An Efficient Non-Uniform Quantization For Low Precision Inference5.750.436, 6, 5, 6
1191CAST: Concurrent Recognition and Segmentation with Adaptive Segment Tokens5.750.436, 6, 5, 6
1192Effective Self-supervised Pre-training on Low-compute networks without Distillation5.751.308, 5, 5, 5
1193CoRTX: Contrastive Framework for Real-time Explanation5.751.308, 5, 5, 5
1194Networks are Slacking Off: Understanding Generalization Problem in Image Deraining5.750.436, 6, 6, 5
1195Towards Smooth Video Composition5.750.436, 5, 6, 6
1196GAIN: Enhancing Byzantine Robustness in Federated Learning with Gradient Decomposition5.751.796, 6, 3, 8
1197No Reason for No Supervision: Improved Generalization in Supervised Models5.751.798, 3, 6, 6
1198Clustering Structure Identification With Ordering Graph5.751.798, 3, 6, 6
1199Robust and Controllable Object-Centric Learning through Energy-based Models5.751.793, 6, 8, 6
1200Limitless Stability for Graph Convolutional Networks5.751.798, 3, 6, 6
1201Rethinking skip connection model as a learnable Markov chain5.750.436, 5, 6, 6
1202Neural Groundplans: Persistent Neural Scene Representations from a Single Image5.750.436, 5, 6, 6
1203Global Prototype Encoding for Incremental Video Highlights Detection5.751.798, 3, 6, 6
1204Neural-Symbolic Recursive Machine for Systematic Generalization5.750.436, 6, 6, 5
1205DrML: Diagnosing and Rectifying Vision Models using Language5.750.436, 6, 5, 6
1206MaSS: Multi-attribute Selective Suppression5.750.436, 6, 6, 5
1207Trust-consistent Visual Semantic Embedding for Image-Text Matching5.751.798, 3, 6, 6
1208Delving into Semantic Scale Imbalance5.751.305, 5, 5, 8
1209DAG Matters! GFlowNets Enhanced Explainer for Graph Neural Networks5.751.308, 5, 5, 5
1210Set-Level Self-Supervised Learning from Noisily-Labeled Data5.711.678, 3, 5, 5, 8, 5, 6
1211Distributed Least Square Ranking with Random Features5.672.058, 3, 6
1212EquiMod: An Equivariance Module to Improve Self-Supervised Learning5.672.056, 3, 8
1213Task-Aware Information Routing from Common Representation Space in Lifelong Learning5.670.475, 6, 6
1214Decision S4: Efficient Sequence-Based RL via State Spaces Layers5.670.476, 6, 5
1215Actionable Neural Representations: Grid Cells from Minimal Constraints5.672.053, 6, 8
1216A sparse, fast, and stable representation for multiparameter topological data analysis5.670.476, 6, 5
1217Causal Explanations of Structural Causal Models5.672.056, 8, 3
1218CASR: Generating Complex Sequences with Autoregressive Self-Boost Refinement5.670.475, 6, 6
1219SciRepEval: A Multi-Format Benchmark for Scientific Document Representations5.672.056, 8, 3
1220Seeing Differently, Acting Similarly: Heterogeneously Observable Imitation Learning5.672.056, 3, 8
1221Learning Globally Smooth Functions on Manifolds5.670.476, 6, 5
1222UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question Answering Over Knowledge Graph5.670.476, 6, 5
1223Large Language Models are Human-Level Prompt Engineers5.670.475, 6, 6
1224Enhancing Meta Learning via Multi-Objective Soft Improvement Functions5.672.053, 8, 6
1225Transferable Unlearnable Examples5.670.476, 5, 6
1226Random Laplacian Features for Learning with Hyperbolic Space5.672.056, 8, 3
1227Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding5.670.475, 6, 6
1228GOGGLE: Generative Modelling for Tabular Data by Learning Relational Structure5.672.058, 3, 6
1229Optimal Data Sampling for Training Neural Surrogates of Programs5.673.308, 8, 1
1230HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers5.670.476, 5, 6
1231Learning multi-scale local conditional probability models of images5.670.476, 5, 6
1232Adversarial Imitation Learning with Preferences5.670.476, 5, 6
1233Synthetic Data Generation of Many-to-Many Datasets via Random Graph Generation5.670.476, 6, 5
1234Function-space regularized Rényi divergences5.672.058, 3, 6
1235Constant-Factor Approximation Algorithms for Socially Fair $k$-Clustering5.670.475, 6, 6
1236Personalized Reward Learning with Interaction-Grounded Learning (IGL)5.670.476, 5, 6
1237Grounding Graph Network Simulators using Physical Sensor Observations5.672.053, 8, 6
1238Performance Bounds for Model and Policy Transfer in Hidden-parameter MDPs5.672.053, 8, 6
1239DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics5.670.475, 6, 6
1240Effective passive membership inference attacks in federated learning against overparameterized models5.672.056, 3, 8
1241Gaussian-Bernoulli RBMs Without Tears5.672.056, 8, 3
1242Proposal-Contrastive Pretraining for Object Detection from Fewer Data5.672.056, 8, 3
1243Neural Network Differential Equation Solvers allow unsupervised error estimation and correction5.672.056, 8, 3
1244Spectral Augmentation for Self-Supervised Learning on Graphs5.672.058, 6, 3
1245PAC Reinforcement Learning for Predictive State Representations5.670.476, 5, 6
1246Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning5.670.476, 6, 5
1247Active Learning based Structural Inference5.672.056, 8, 3
1248No-Regret Learning in Strongly Monotone Games Converges to a Nash Equilibrium5.670.476, 6, 5
1249Latent Graph Inference using Product Manifolds5.672.053, 8, 6
1250Representation Balancing with Decomposed Patterns for Treatment Effect Estimation5.670.476, 5, 6
1251Learning Probabilistic Topological Representations Using Discrete Morse Theory5.672.058, 6, 3
1252Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption5.672.058, 6, 3
1253Heterogeneous Loss Function with Aggressive Rejection for Contaminated data in anomaly detection5.670.476, 6, 5
1254Local KL Convergence Rate for Stein Variational Gradient Descent with Reweighted Kernel5.672.058, 6, 3
1255Learning Discrete Representation with Optimal Transport Quantized Autoencoders5.670.475, 6, 6
1256MonoFlow: A Unified Generative Modeling Framework for GAN Variants5.672.053, 8, 6
1257Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization Problems5.672.056, 8, 3
1258Coordination Scheme Probing for Generalizable Multi-Agent Reinforcement Learning5.672.053, 8, 6
1259Neural-based classification rule learning for sequential data5.672.056, 3, 8
1260Shifts 2.0: Extending The Dataset of Real Distributional Shifts5.670.476, 6, 5
1261Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning5.670.476, 5, 6
1262Budgeted Training for Vision Transformer5.670.476, 5, 6
1263Mosaic Representation Learning for Self-supervised Visual Pre-training5.670.476, 5, 6
1264Language model with Plug-in Knowldge Memory5.670.476, 6, 5
1265Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning5.670.475, 6, 6
1266Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic5.670.476, 6, 5
1267More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization5.670.476, 5, 6
1268Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks5.670.476, 6, 5
1269Any-scale Balanced Samplers for Discrete Space5.672.053, 8, 6
1270Pre-trained Language Models can be Fully Zero-Shot Learners5.670.476, 6, 5
1271Certified Robustness on Structural Graph Matching5.670.476, 6, 5
1272Explaining Temporal Graph Models through an Explorer-Navigator Framework5.670.476, 5, 6
1273On the Soft-Subnetwork for Few-Shot Class Incremental Learning5.672.053, 6, 8
1274Distributed Differential Privacy in Multi-Armed Bandits5.670.476, 6, 5
1275Prometheus: Endowing Low Sample and Communication Complexities to Constrained Decentralized Stochastic Bilevel Learning5.670.476, 6, 5
1276Mutual Partial Label Learning with Competitive Label Noise5.672.053, 8, 6
1277simpleKT: A Simple But Tough-to-Beat Baseline for Knowledge Tracing5.672.053, 8, 6
1278An Extensible Multi-modal Multi-task Object Dataset with Materials5.670.476, 6, 5
1279Revisiting the Assumption of Latent Separability for Backdoor Defenses5.670.475, 6, 6
1280Characterizing the spectrum of the NTK via a power series expansion5.672.053, 6, 8
1281ChordMixer: A Scalable Neural Attention Model for Sequences with Different Length5.672.056, 3, 8
1282A non-asymptotic analysis of oversmoothing in Graph Neural Networks5.672.058, 6, 3
1283Class-Incremental Learning with Repetition5.672.056, 3, 8
1284Imitation Learning for Mean Field Games with Correlated Equilibria5.670.476, 5, 6
1285Graph Neural Networks are Inherently Good Generalizers: Insights by Bridging GNNs and Multi-Layer Perceptrons5.670.476, 5, 6
1286Approximation and non-parametric estimation of functions over high-dimensional spheres via deep ReLU networks5.672.053, 6, 8
1287TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation5.670.476, 5, 6
1288Learning to Reason and Act in Cascading Processes5.672.053, 8, 6
1289PMixUp: Simultaneous Utilization of Part-of-Speech Replacement and Feature Space Interpolation for Text Data Augmentation5.672.056, 8, 3
1290Efficient Offline Policy Optimization with a Learned Model5.670.476, 6, 5
1291PowerQuant: Automorphism Search for Non-Uniform Quantization5.670.475, 6, 6
1292Leveraging Future Relationship Reasoning for Vehicle Trajectory Prediction5.672.056, 3, 8
1293Toward Adversarial Training on Contextualized Language Representation5.672.056, 3, 8
1294Learned Index with Dynamic $epsilon$5.670.475, 6, 6
1295Test-Time Adaptation for Visual Document Understanding5.670.476, 6, 5
1296Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation5.670.476, 5, 6
1297MemoNav: Working Memory Model for Visual Navigation5.670.476, 5, 6
1298The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation5.670.476, 5, 6
1299Algorithmic Determination of the Combinatorial Structure of the Linear Regions of ReLU Neural Networks5.670.476, 6, 5
1300Understanding new tasks through the lens of training data via exponential tilting5.670.476, 6, 5
1301Data Poisoning Attacks Against Multimodal Encoders5.670.475, 6, 6
1302InfoOT: Information Maximizing Optimal Transport5.670.476, 5, 6
1303Impossibly Good Experts and How to Follow Them5.670.476, 6, 5
1304Beyond calibration: estimating the grouping loss of modern neural networks5.672.058, 6, 3
1305Asynchronous Gradient Play in Zero-Sum Multi-agent Games5.670.476, 5, 6
1306An Exact Poly-Time Membership-Queries Algorithm for Extracting a Three-Layer ReLU Network5.670.476, 6, 5
1307SAAL: Sharpness-Aware Active Learning5.670.475, 6, 6
1308An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning5.672.053, 8, 6
1309Gradient Boosting Performs Gaussian Process Inference5.670.475, 6, 6
1310Distribution Shift Detection for Deep Neural Networks5.670.476, 5, 6
1311Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective5.670.476, 5, 6
1312FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy5.670.476, 6, 5
1313Globally Optimal Training of Neural Networks with Threshold Activation Functions5.670.475, 6, 6
1314A Laplace-inspired Distribution on SO(3) for Probabilistic Rotation Estimation5.672.056, 3, 8
1315Measuring and Narrowing the Compositionality Gap in Language Models5.670.476, 5, 6
1316Guiding continuous operator learning through Physics-based boundary constraints5.672.056, 8, 3
1317Human MotionFormer: Transferring Human Motions with Vision Transformers5.672.058, 3, 6
1318Can We Faithfully Represent Absence States to Compute Shapley Values on a DNN?5.670.476, 6, 5
1319One-Pixel Shortcut: On the Learning Preference of Deep Neural Networks5.670.475, 6, 6
1320Combating Exacerbated Heterogeneity for Robust Decentralized Models5.670.476, 6, 5
1321Offline Reinforcement Learning with Closed-Form Policy Improvement Operators5.670.475, 6, 6
1322Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam5.670.476, 5, 6
1323An Additive Instance-Wise Approach to Multi-class Model Interpretation5.672.058, 6, 3
1324Knowledge-Consistent Dialogue Generation with Language Models and Knowledge Graphs5.672.056, 6, 3, 8, 8, 3
1325Meta Knowledge Condensation for Federated Learning5.672.053, 6, 8
1326Cycle-consistent Masked AutoEncoder for Unsupervised Domain Generalization5.670.475, 6, 6
1327Towards Addressing Label Skews in One-shot Federated Learning5.670.476, 6, 5
1328Relaxed Combinatorial Optimization Networks with Self-Supervision: Theoretical and Empirical Notes on the Cardinality-Constrained Case5.670.476, 5, 6
1329Rethinking the Effect of Data Augmentation in Adversarial Contrastive Learning5.670.476, 6, 5
1330Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization5.670.476, 6, 5
1331DeepPipe: Deep, Modular and Extendable Representations of Machine Learning Pipelines5.670.475, 6, 6
1332TIB: Detecting Unknown Objects via Two-Stream Information Bottleneck5.670.475, 6, 6
1333Hidden Poison: Machine unlearning enables camouflaged poisoning attacks5.670.475, 6, 6
1334Adversarial Collaborative Learning on Non-IID Features5.670.476, 5, 6
1335D2Match: Leveraging Deep Learning and Degeneracy for Subgraph Matching5.670.475, 6, 6
1336Topologically faithful image segmentation via induced matching of persistence barcodes5.670.476, 5, 6
1337On the Lower Bound of Minimizing Polyak-Łojasiewicz functions5.670.475, 6, 6
1338Rotamer Density Estimators are Unsupervised Learners of the Effect of Mutations on Protein-Protein Interaction5.670.475, 6, 6
1339Cross-Level Distillation and Feature Denoising for Cross-Domain Few-Shot Classification5.672.058, 6, 3
1340Individual Privacy Accounting for Differentially Private Stochastic Gradient Descent5.672.058, 3, 6
1341Attention De-sparsification Matters: Inducing Diversity in Digital Pathology Representation Learning5.670.476, 6, 5
1342Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving5.670.476, 6, 5
1343The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image5.670.476, 5, 6
1344Contextual Image Masking Modeling via Synergized Contrasting without View Augmentation for Faster and Better Visual Pretraining5.670.476, 5, 6
1345Factorized Fourier Neural Operators5.602.243, 8, 3, 6, 8
1346INSPIRE: A Framework for Integrating Individual User Preferences in Recourse5.601.623, 5, 6, 6, 8
1347TypeT5: Seq2seq Type Inference using Static Analysis5.600.495, 6, 6, 5, 6
1348Contrastive Audio-Visual Masked Autoencoder5.601.625, 6, 3, 6, 8
1349SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations5.600.496, 6, 5, 5, 6
1350CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers5.601.626, 3, 8, 5, 6
1351Malign Overfitting: Interpolation and Invariance are Fundamentally at Odds5.601.628, 5, 6, 3, 6
1352How to prepare your task head for finetuning5.600.496, 6, 5, 6, 5
1353Searching Lottery Tickets in Graph Neural Networks: A Dual Perspective5.601.626, 3, 8, 5, 6
1354Out-of-distribution Representation Learning for Time Series Classification5.601.205, 8, 5, 5, 5
1355Early Stopping for Deep Image Prior5.600.495, 6, 5, 6, 6
1356Agent-based Graph Neural Networks5.601.628, 6, 3, 6, 5
1357GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis5.601.625, 6, 8, 3, 6
1358The KFIoU Loss for Rotated Object Detection5.601.628, 6, 6, 5, 3
1359Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning5.601.626, 5, 6, 3, 8
1360On the Word Boundaries of Emergent Languages Based on Harris's Articulation Scheme5.601.626, 3, 6, 5, 8
1361SoundCount: Sound Counting from Raw Audio with Dyadic Decomposition Neural Network5.601.626, 6, 3, 5, 8
1362SGD Through the Lens of Kolmogorov Complexity5.571.405, 6, 6, 6, 3, 5, 8
1363TVSPrune - Pruning Non-discriminative filters via Total Variation separability of intermediate representations without fine tuning5.501.803, 5, 6, 8
1364Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flow5.500.505, 5, 6, 6
1365Adaptive Block-wise Learning for Knowledge Distillation5.501.803, 8, 5, 6
1366Share Your Representation Only: Guaranteed Improvement of the Privacy-Utility Tradeoff in Federated Learning5.501.808, 5, 3, 6
1367Cross-utterance Conditioned Coherent Speech Editing via Biased Training and Entire Inference5.501.805, 8, 3, 6
1368Learning Geometric Representations of Interactive Objects5.501.803, 5, 6, 8
1369Online Bias Correction for Task-Free Continual Learning5.501.805, 3, 8, 6
1370Meta-Learning the Inductive Biases of Simple Neural Circuits5.501.808, 3, 6, 5
1371Iterative Circuit Repair Against Formal Specifications5.500.506, 6, 5, 5
1372Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples5.501.803, 5, 8, 6
1373Toward Learning Geometric Eigen-Lengths Crucial for Robotic Fitting Tasks5.501.803, 8, 6, 5
1374Individual Privacy Accounting with Gaussian Differential Privacy5.500.506, 5, 5, 6
1375Improving Differentiable Neural Architecture Search by Encouraging Transferability5.500.506, 5, 6, 5
1376Cross-Window Self-Training via Context Variations from Sparsely-Labeled Time Series5.500.505, 6, 5, 6
1377A theoretical study of inductive biases in contrastive learning5.500.506, 6, 5, 5
1378M$^3$SAT: A Sparsely Activated Transformer for Efficient Multi-Task Learning from Multiple Modalities5.501.805, 6, 8, 3
1379Importance of Class Selectivity in Early Epochs of Training5.500.505, 6, 5, 6
1380Conservative Exploration in Linear MDPs under Episode-wise Constraints5.500.505, 5, 6, 6
1381Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation5.500.506, 6, 5, 5
1382Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel5.500.506, 5, 6, 5
1383Game Theoretic Mixed Experts for Combinational Adversarial Machine Learning5.501.805, 3, 6, 8
1384Reproducible Bandits5.501.805, 8, 3, 6
1385Solving Continual Learning via Problem Decomposition5.501.805, 8, 3, 6
1386How Useful are Gradients for OOD Detection Really?5.501.805, 3, 8, 6
1387Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games5.501.803, 5, 6, 8
1388Simple Emergent Action Representations from Multi-Task Policy Training5.500.506, 5, 5, 6
1389Avoiding spurious correlations via logit correction5.500.506, 6, 5, 5
1390HesScale: Scalable Computation of Hessian Diagonals5.502.508, 3, 3, 8
1391Building Normalizing Flows with Stochastic Interpolants5.501.808, 5, 6, 3
1392Does progress on ImageNet transfer to real world datasets?5.501.803, 8, 6, 5
1393Competitive Physics Informed Networks5.501.805, 6, 8, 3
1394Decomposed Prompting: A Modular Approach for Solving Complex Tasks5.500.506, 5, 5, 6
1395Energy-Inspired Self-Supervised Pretraining for Vision Models5.500.505, 5, 6, 5, 6, 6
1396A Time Series is Worth 64 Words: Long-term Forecasting with Transformers5.500.505, 6, 5, 6
1397Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay5.500.506, 5, 5, 6
1398Confidence-Conditioned Value Functions for Offline Reinforcement Learning5.501.806, 8, 5, 3
1399Stochastic Constrained DRO with a Complexity Independent of Sample Size5.501.803, 5, 8, 6
1400Kernel Regression with Infinite-Width Neural Networks on Millions of Examples5.501.808, 3, 5, 6
1401Evaluating Unsupervised Denoising Requires Unsupervised Metrics5.500.505, 5, 6, 6
1402The Value of Out-of-distribution Data5.502.8710, 3, 6, 3
1403First Steps Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains5.500.506, 5, 5, 6
1404LogicDP: Creating Labels for Graph Data via Inductive Logic Programming5.501.806, 5, 3, 8
1405A VAE for Transformers with Nonparametric Variational Information Bottleneck5.500.505, 6, 6, 5
1406Information-Theoretic Underpinnings of Generalization and Translation in Emergent Communication5.501.806, 3, 8, 5
1407The Brainy Student: Scalable Unlearning by Selectively Disobeying the Teacher5.500.506, 5, 5, 6
1408A Neural PDE Solver with Temporal Stencil Modeling5.501.805, 8, 6, 3
1409Recitation-Augmented Language Models5.500.505, 5, 6, 6
1410Credible, Sealed-bid, Optimal Repeated Auctions With Differentiable Economics5.502.503, 8, 8, 3
1411Towards Efficient Gradient-Based Meta-Learning in Heterogenous Environments5.501.805, 6, 8, 3
1412Optimal Transport for Offline Imitation Learning5.500.506, 5, 6, 5
1413FedorAS: Federated Architecture Search under system heterogeneity5.500.505, 6, 6, 5
1414Towards A Unified View of Sparse Feed-Forward Network in Transformer5.501.803, 5, 6, 8
1415SuperFed: Weight Shared Federated Learning5.500.505, 5, 6, 6
1416Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules5.500.506, 6, 5, 5
1417SGD with large step sizes learns sparse features5.501.803, 5, 8, 6
1418ProSampler: Improving Contrastive Learning by Better Mini-batch Sampling5.501.808, 6, 5, 3
1419Make-A-Video: Text-to-Video Generation without Text-Video Data5.500.506, 5, 6, 5
1420In-distribution and Out-of-distribution Generalization for Graph Neural Networks5.500.506, 6, 5, 5
1421Effectively using public data in privacy preserving Machine learning5.500.505, 5, 6, 6
1422CADet: Fully Self-Supervised Anomaly Detection With Contrastive Learning5.500.505, 6, 5, 6
1423On the System-Level Effectiveness of Physical Object-Hiding Adversarial Attack in Autonomous Driving5.500.505, 6, 6, 5
1424Is Conditional Generative Modeling all you need for Decision Making?5.501.806, 8, 5, 3
1425META-STORM: Generalized Fully-Adaptive Variance Reduced SGD for Unbounded Functions5.500.505, 6, 5, 6
1426TEMPERA: Test-Time Prompt Editing via Reinforcement Learning5.500.505, 5, 6, 6
1427What Matters In The Structured Pruning of Generative Language Models?5.500.505, 6, 5, 6
1428Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning5.501.805, 8, 3, 6
1429Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning5.501.805, 6, 8, 3
1430Differentially Private Adaptive Optimization with Delayed Preconditioners5.501.803, 8, 6, 5
1431Long Range Language Modeling via Gated State Spaces5.500.505, 5, 6, 6
1432Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts5.500.506, 5, 5, 6
1433Investigating Multi-task Pretraining and Generalization in Reinforcement Learning5.501.805, 6, 8, 3
1434Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models5.500.506, 6, 5, 5
1435Noise-Robust De-Duplication at Scale5.500.506, 6, 5, 5
1436Hyperparameter Optimization through Neural Network Partitioning5.501.808, 5, 6, 3
1437Concept-based Explanations for Out-of-Distribution Detectors5.500.505, 6, 5, 6
1438Architectural optimization over subgroups of equivariant neural networks5.500.505, 6, 5, 6
1439Accelerating Hamiltonian Monte Carlo via Chebyshev Integration Time5.501.808, 6, 5, 3
1440Revisiting Structured Dropout5.500.505, 6, 5, 6
1441HiT-MDP: Learning the SMDP option framework on MDPs with Hidden Temporal Variables5.501.806, 8, 3, 5
1442Fusion over the Grassmann Manifold for Incomplete-Data Clustering5.502.875, 8, 8, 1
1443Unsupervised Model-based Pre-training for Data-efficient Control from Pixels5.501.808, 3, 5, 6
1444Fine-grain Inference on Out-of-Distribution Data with Hierarchical Classification5.501.803, 8, 6, 5
1445TTN: A Domain-Shift Aware Batch Normalization in Test-Time Adaptation5.500.505, 6, 6, 5
1446Repository-Level Prompt Generation for Large Language Models of Code5.501.808, 6, 3, 5
1447Variational Prompt Tuning Improves Generalization of Vision-Language Models5.500.506, 6, 5, 5
1448Bridging the Gap to Real-World Object-Centric Learning5.501.803, 8, 6, 5
1449Energy-Based Test Sample Adaptation for Domain Generalization5.500.505, 6, 5, 6
1450A GENERAL SCENARIO-AGNOSTIC REINFORCEMENT LEARNING FOR TRAFFIC SIGNAL CONTROL5.500.505, 6, 6, 5
1451BALTO: efficient tensor program optimization with diversity-based active learning5.501.806, 3, 8, 5
1452Autoregressive Generative Modeling with Noise Conditional Maximum Likelihood Estimation5.502.508, 8, 3, 3
1453How robust is unsupervised representation learning to distribution shift?5.501.803, 5, 8, 6
1454Affinity-Aware Graph Networks5.500.505, 6, 6, 5
1455Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis5.501.803, 5, 6, 8
1456Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach5.500.506, 5, 5, 6
1457Enhancing the Inductive Biases of Graph Neural ODE for Modeling Dynamical Systems5.500.506, 5, 5, 6
1458Mastering Spatial Graph Prediction of Road Networks5.501.805, 8, 6, 3
1459A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning5.501.803, 5, 8, 6
1460Multi-objective optimization via equivariant deep hypervolume approximation5.500.506, 5, 6, 5
1461Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems5.501.808, 3, 6, 5
1462On Explaining Neural Network Robustness with Activation Path5.500.505, 6, 5, 6
1463Structure by Architecture: Structured Representations without Regularization5.501.806, 8, 5, 3
1464DECAP: Decoding CLIP Latents for Zero-shot Captioning5.500.505, 6, 6, 5, 5, 6
1465Robust Explanation Constraints for Neural Networks5.501.803, 6, 5, 8
1466Hidden Schema Networks5.502.503, 3, 8, 8
1467Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance5.500.506, 5, 6, 5
1468Domain Generalisation via Domain Adaptation: An Adversarial Fourier Amplitude Approach5.500.505, 5, 6, 6
1469Anti-Symmetric DGN: a stable architecture for Deep Graph Networks5.501.805, 3, 6, 8
1470FastFill: Efficient Compatible Model Update5.501.803, 6, 5, 8
1471SLTUNET: A Simple Unified Model for Sign Language Translation5.500.505, 6, 5, 6
1472DetectBench: An Object Detection Benchmark for OOD Generalization Algorithms5.501.805, 3, 8, 6
1473Leveraging Unlabeled Data to Track Memorization5.500.505, 5, 6, 6
1474Efficient Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy5.500.506, 5, 6, 5
1475NAGphormer: A Tokenized Graph Transformer for Node Classification in Large Graphs5.500.506, 5, 6, 5
1476Near Optimal Private and Robust Linear Regression5.500.506, 6, 5, 5
1477Tensor-Based Sketching Method for the Low-Rank Approximation of Data Streams.5.500.505, 5, 6, 6
1478Data augmentation alone can improve adversarial training5.500.505, 6, 6, 5
1479Valid P-Value for Deep Learning-driven Salient Region5.500.505, 6, 5, 6
1480Learning from conflicting data with hidden contexts5.502.503, 8, 8, 3
1481MeGraph: Graph Representation Learning on Connected Multi-scale Graphs5.502.503, 8, 8, 3
1482Self-supervised debiasing using low rank regularization5.501.803, 6, 5, 8
1483Multi-Vector Retrieval as Sparse Alignment5.500.505, 6, 5, 6
1484Knowledge Unlearning for Mitigating Privacy Risks in Language Models5.500.506, 5, 6, 5
1485Open-domain Visual Entity Linking5.501.805, 3, 6, 8
1486The Final Ascent: When Bigger Models Generalize Worse on Noisy-Labeled Data5.501.805, 3, 8, 6
1487Proportional Amplitude Spectrum Training Augmentation for Synthetic-to-Real Domain Generalization5.501.803, 5, 8, 6
1488Equivariant Shape-Conditioned Generation of 3D Molecules for Ligand-Based Drug Design5.500.506, 5, 6, 5
1489Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach5.500.506, 5, 5, 6
1490Memorization-Dilation: Modeling Neural Collapse Under Noise5.500.505, 6, 5, 6
1491Multi-level Protein Structure Pre-training via Prompt Learning5.500.506, 6, 5, 5
1492Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small5.502.503, 3, 8, 8
1493FedMT: Federated Learning with Mixed-type Labels5.501.806, 8, 5, 3
1494Denoising MCMC for Accelerating Diffusion-Based Generative Models5.500.506, 6, 5, 5
1495Confidence Estimation Using Unlabeled Data5.501.808, 5, 6, 3
1496Sequential Attention for Feature Selection5.501.803, 6, 5, 8
1497Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning5.500.506, 5, 6, 5
1498Learning Listwise Domain-Invariant Representations for Ranking5.500.505, 6, 5, 6
1499Exp-$alpha$: Beyond Proportional Aggregation in Federated Learning5.500.505, 6, 5, 6
1500Guiding Safe Exploration with Weakest Preconditions5.501.803, 8, 6, 5
1501Gated Neural ODEs: Trainability, Expressivity and Interpretability5.501.803, 8, 6, 5
1502Learning Multimodal Data Augmentation in Feature Space5.501.805, 3, 8, 6
1503Achieving Sub-linear Regret in Infinite Horizon Average Reward Constrained MDP with Linear Function Approximation5.501.806, 8, 3, 5
1504FedFA: Federated Feature Augmentation5.500.506, 5, 6, 5
1505A critical look at evaluation of GNNs under heterophily: Are we really making progress?5.500.505, 6, 5, 6
1506Interpretable Debiasing of Vectorized Language Representations with Iterative Orthogonalization5.500.506, 6, 5, 5
1507Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations5.500.506, 5, 6, 5
1508VIMA: General Robot Manipulation with Multimodal Prompts5.501.803, 6, 5, 8
1509AUTOJOIN: EFFICIENT ADVERSARIAL TRAINING FOR ROBUST MANEUVERING VIA DENOISING AUTOEN- CODER AND JOINT LEARNING5.500.505, 6, 6, 5
1510The power of choices in decision tree learning5.501.806, 3, 8, 5
1511Boosting Adversarial Transferability using Dynamic Cues5.500.506, 5, 5, 6
1512MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models5.500.506, 5, 6, 5
1513Part-Based Models Improve Adversarial Robustness5.500.506, 5, 6, 5
1514Extremely Simple Activation Shaping for Out-of-Distribution Detection5.501.805, 8, 6, 3
1515Hebbian and Gradient-based Plasticity Enables Robust Memory and Rapid Learning in RNNs5.500.505, 6, 5, 6
1516Equivariant Hypergraph Diffusion Neural Operators5.500.506, 5, 6, 5
1517Biases in Evaluation of Molecular Optimization Methods and Bias Reduction Strategies5.501.803, 5, 6, 8
1518Neural Agents Struggle to Take Turns in Bidirectional Emergent Communication5.501.808, 6, 3, 5
1519Observational Robustness and Invariances in Reinforcement Learning via Lexicographic Objectives5.501.505, 3, 8, 5, 6, 6
1520Prompting GPT-3 To Be Reliable5.500.505, 6, 5, 6
1521Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection5.501.806, 3, 5, 8
1522Neural Lagrangian Schr'{o}dinger Bridge: Diffusion Modeling for Population Dynamics5.500.505, 6, 5, 6
1523Warping the Space: Weight Space Rotation for Class-Incremental Few-Shot Learning5.501.805, 3, 6, 8
1524Jointly Learning Visual and Auditory Speech Representations from Raw Data5.501.808, 5, 3, 6
1525On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning5.500.505, 6, 6, 5
1526Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC5.500.505, 6, 5, 6
1527Discovering Policies with DOMiNO5.500.505, 6, 6, 5
1528Improving Out-of-distribution Generalization with Indirection Representations5.501.806, 5, 3, 8
1529SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient5.502.068, 3, 5, 6, 8, 3
1530Sinkhorn Discrepancy for Counterfactual Generalization5.500.506, 5, 6, 5
1531Distributional Meta-Gradient Reinforcement Learning5.501.805, 8, 6, 3
1532Interval-based Offline Policy Evaluation without Sufficient Exploration or Realizability5.501.808, 3, 5, 6
1533Dense Correlation Fields for Motion Modeling in Action Recognition5.501.808, 3, 6, 5
1534CBLab: Scalable Traffic Simulation with Enriched Data Supporting5.501.808, 5, 6, 3
1535Time to augment visual self-supervised learning5.501.805, 3, 6, 8
1536Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection5.501.805, 8, 3, 6
1537Switching One-Versus-the-Rest Loss to Increase Logit Margins for Adversarial Robustness5.500.506, 5, 5, 6
1538Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots5.500.506, 5, 6, 5
1539Learning Invariant Features for Online Continual Learning5.501.808, 5, 3, 6
1540ODAM: Gradient-based Instance-Specific Visual Explanations for Object Detection5.500.506, 5, 5, 6
1541Unsupervised Object-Centric Learning with Bi-level Optimized Query Slot Attention5.501.808, 6, 3, 5
1542EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model5.500.505, 6, 6, 5
1543Smoothed-SGDmax: A Stability-Inspired Algorithm to Improve Adversarial Generalization5.500.506, 5, 5, 6
1544Learning to Generate All Feasible Actions5.501.808, 5, 6, 3
1545Empirical Study of Pre-training a Backbone for 3D Human Pose and Shape Estimation5.500.506, 5, 6, 5
1546Class Prototype-based Cleaner for Label Noise Learning5.502.503, 3, 8, 8
1547AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection5.501.803, 8, 6, 5
1548ILA-DA: Improving Transferability of Intermediate Level Attack with Data Augmentation5.501.806, 3, 8, 5
1549A Closer Look at the Calibration of Differentially Private Learners5.500.506, 5, 6, 5
1550Schema Inference for Interpretable Image Classification5.500.506, 5, 6, 5
1551Covariance-Robust Minimax Probability Machines for Algorithmic Recourse5.502.503, 8, 3, 8
1552Spiking Convolutional Neural Networks for Text Classification5.501.806, 8, 3, 5
1553Improving Language Model Pretraining with Text Structure Information5.501.803, 5, 8, 6
1554Cluster and Landmark Attributes Infused Graph Neural Networks for Link prediction5.500.506, 6, 5, 5
1555Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions5.500.506, 5, 5, 6
1556Average Sensitivity of Decision Tree Learning5.500.506, 6, 5, 5
1557Bridging the Gap Between Cascade and End-to-End Cross-modal Translation Models: A Zero-Shot Approach5.501.803, 6, 8, 5
1558Learning by Distilling Context5.501.803, 5, 6, 8
1559Structured Pruning of CNNs at Initialization5.500.506, 5, 5, 6
1560Generating Adversarial Examples with Task Oriented Multi-Objective Optimization5.501.803, 8, 5, 6
1561Neural Network Approximations of PDEs Beyond Linearity: Representational Perspective5.501.803, 5, 6, 8
1562Analytical Composition of Differential Privacy via the Edgeworth Accountant5.500.505, 5, 6, 6
1563Predictor-corrector algorithms for stochastic optimization under gradual distribution shift5.500.506, 5, 5, 6
1564Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation5.500.506, 5, 5, 6
1565Unicom: Universal and Compact Representation Learning for Image Retrieval5.500.506, 5, 5, 6
1566A unified optimization framework of ANN-SNN Conversion: towards optimal mapping from activation values to firing rates5.502.878, 5, 8, 1
1567Trading Information between Latents in Hierarchical Variational Autoencoders5.501.808, 5, 6, 3
1568Towards Skilled Population Curriculum for MARL5.500.505, 6, 5, 6
1569Bringing Saccades and Fixations into Self-supervised Video Representation Learning5.500.506, 6, 5, 5
1570Improve learning combining crowdsourced labels by weighting Areas Under the Margin5.500.505, 6, 5, 6
1571Neural Field Discovery Disentangles Equivariance in Interacting Dynamical Systems5.500.506, 6, 5, 5
1572An Optimal Transport Perspective on Unpaired Image Super-Resolution5.501.808, 6, 5, 3
1573Learning Heterogeneous Interaction Strengths by Trajectory Prediction with Graph Neural Network5.500.506, 5, 6, 5
1574Neural Volumetric Mesh Generator5.501.806, 3, 8, 5
1575Protein Representation Learning via Knowledge Enhanced Primary Structure Reasoning5.500.506, 5, 5, 6
1576LPMARL: Linear Programming based Implicit Task Assignment for Hierarchical Multi-agent Reinforcement Learning5.500.505, 5, 6, 6
1577Neural Frailty Machine: Beyond proportional hazard assumption in neural survival regressions5.500.505, 6, 5, 6
1578Robust Learning with Decoupled Meta Label Purifier5.501.806, 3, 5, 8
1579Basic Binary Convolution Unit for Binarized Image Restoration Network5.501.805, 8, 3, 6
1580Sweet Gradient Matters: Designing Consistent and Efficient Estimator for Zero-Shot Neural Architecture Search5.500.505, 6, 6, 5
1581Constrained Hierarchical Deep Reinforcement Learning with Differentiable Formal Specifications5.501.803, 5, 6, 8
1582Limitations of the NTK for Understanding Generalization in Deep Learning5.501.806, 8, 3, 5
1583Scalable Estimation of Nonparametric Markov Networks with Mixed-Type Data5.500.506, 5, 5, 6
1584Diffusion Probabilistic Modeling of Protein Backbones in 3D for the motif-scaffolding problem5.500.505, 6, 6, 5
1585Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V45.501.805, 8, 6, 3
1586A Unified Causal View of Domain Invariant Representation Learning5.500.506, 6, 5, 5
1587On the Robustness of Safe Reinforcement Learning under Observational Perturbations5.500.505, 6, 5, 6
1588Behind the Scenes of Gradient Descent: A Trajectory Analysis via Basis Function Decomposition5.500.505, 5, 6, 6
1589T2D: Spatiotemporal Feature Learning Based on Triple 2D Decomposition5.501.803, 5, 8, 6
1590Data-Free One-Shot Federated Learning Under Very High Statistical Heterogeneity5.500.506, 5, 5, 6
1591An Efficient Mean-field Approach to High-Order Markov Logic5.501.803, 6, 5, 8
1592Downstream Datasets Make Surprisingly Good Pretraining Corpora5.501.805, 6, 3, 8
1593Unleashing Mask: Explore the Intrinsic Out-of-distribution Detection Capability5.501.806, 8, 5, 3
1594Universal Speech Enhancement with Score-based Diffusion5.500.505, 6, 6, 5
1595CodeT: Code Generation with Generated Tests5.502.508, 3, 3, 8
1596AdaStride: Using Adaptive Strides in Sequential Data for Effective Downsampling5.500.506, 5, 5, 6
1597On the Universality of Langevin Diffusion for Private Euclidean (Convex) Optimization5.500.505, 5, 6, 6
1598What Knowledge gets Distilled in Knowledge Distillation?5.501.806, 8, 5, 3
1599Simplicial Embeddings in Self-Supervised Learning and Downstream Classification5.500.506, 5, 5, 6
1600Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representations5.500.505, 5, 6, 6
1601Context Autoencoder for Self-Supervised Representation Learning5.500.505, 5, 6, 6
1602Progressive Purification for Instance-Dependent Partial Label Learning5.501.803, 8, 5, 6
1603CFlowNets: Continuous control with Generative Flow Networks5.500.506, 5, 5, 6
1604Neural Radiance Fields with Geometric Consistency for Few-Shot Novel View Synthesis5.501.806, 3, 5, 8
1605Semi-supervised Community Detection via Structural Similarity Metrics5.501.808, 3, 5, 6
1606Multivariate Time-series Imputation with Disentangled Temporal Representations5.500.506, 6, 5, 5
1607LPT: Long-tailed Prompt Tuning for Image Classification5.500.506, 5, 6, 5
1608TopoZero: Digging into Topology Alignment on Zero-Shot Learning5.501.803, 6, 8, 5
1609Knowledge Distillation based Degradation Estimation for Blind Super-Resolution5.500.505, 5, 6, 6
1610Temporary feature collapse phenomenon in early learning of MLPs5.501.806, 8, 5, 3
1611Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer5.501.808, 5, 6, 3
1612Learning Lightweight Object Detectors via Progressive Knowledge Distillation5.500.506, 5, 5, 6
1613Twofer: Tackling Continual Domain Shift with Simultaneous Domain Generalization and Adaptation5.501.806, 5, 3, 8
1614VectorMapNet: End-to-end Vectorized HD Map Learning5.501.803, 8, 5, 6
1615Domain Generalization with Small Data5.501.808, 3, 5, 6
1616Hierarchical Prompting Improves Visual Recognition On Accuracy, Data Efficiency and Explainability5.500.506, 6, 5, 5
1617Decomposing Texture and Semantics for Out-of-distribution Detection5.500.506, 5, 5, 6
1618One Transformer Can Understand Both 2D & 3D Molecular Data5.501.805, 8, 3, 6
1619An Analysis of Information Bottlenecks5.501.808, 6, 3, 5
1620Everyone's Preference Changes Differently: Weighted Multi-Interest Retrieval Model5.501.806, 5, 8, 3
1621Hierarchical Relational Learning for Few-Shot Knowledge Graph Completion5.501.803, 5, 8, 6
1622Function-Consistent Feature Distillation5.501.806, 3, 8, 5
1623The Devil is in the Wrongly-classified Samples: Towards Unified Open-set Recognition5.501.808, 6, 5, 3
1624Domain Generalization via Independent Regularization from Early-branching Networks5.501.808, 6, 3, 5
1625DELTA: DEBIASED FULLY TEST-TIME ADAPTATION5.500.505, 6, 5, 6
1626Bit-Pruning: A Sparse Multiplication-Less Dot-Product5.501.803, 5, 8, 6
1627KNN-Diffusion: Image Generation via Large-Scale Retrieval5.500.505, 5, 6, 6
1628IS SYNTHETIC DATA FROM GENERATIVE MODELS READY FOR IMAGE RECOGNITION?5.500.505, 5, 6, 6
1629IDEAL: Query-Efficient Data-Free Learning from Black-Box Models5.501.808, 5, 6, 3
1630Succinct Compression: Lossless Compression for Fast and Memory-Efficient Deep Neural Network Inference5.502.503, 8, 3, 8
1631BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection5.500.505, 5, 6, 6
1632MaPLe: Multi-modal Prompt Learning5.501.805, 6, 8, 3
1633Achieve the Minimum Width of Neural Networks for Universal Approximation5.501.806, 3, 5, 8
1634Example-based Planning via Dual Gradient Fields5.501.803, 8, 5, 6
1635Protein structure generation via folding diffusion5.501.808, 3, 5, 6
1636MBrain: A Multi-channel Self-Supervised Learning Framework for Brain Signals5.401.623, 8, 6, 5, 5
1637KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding5.400.496, 5, 6, 5, 5
1638Infusing Lattice Symmetry Priors in Neural Networks Using Soft Attention Masks5.400.495, 6, 5, 5, 6
1639Panning for Gold in Federated Learning: Targeted Text Extraction under Arbitrarily Large-Scale Aggregation5.401.203, 6, 6, 6, 6
1640Empowering Graph Representation Learning with Test-Time Graph Transformation5.401.625, 6, 3, 8, 5
1641Maximum Likelihood Learning of Energy-Based Models for Simulation-Based Inference5.401.623, 8, 5, 5, 6
1642Prompt Tuning with Prompt-aligned Gradient for Vision-Language Models5.401.206, 6, 3, 6, 6
1643Evaluating Representations with Readout Model Switching5.401.628, 5, 6, 5, 3
1644Scaling Laws For Deep Learning Based Image Reconstruction5.401.626, 3, 5, 5, 8
1645PASHA: Efficient HPO and NAS with Progressive Resource Allocation5.401.628, 5, 6, 3, 5
1646Tackling Diverse Tasks via Cross-Modal Transfer Learning5.401.625, 5, 3, 6, 8
1647On the Interplay Between Misspecification and Sub-optimality Gap: From Linear Contextual Bandits to Linear MDPs5.400.495, 5, 6, 5, 6
1648LT-SNN: Self-Adaptive Spiking Neural Network for Event-based Classification and Object Detection5.402.248, 5, 3, 8, 3
1649Scaling Convex Neural Networks with Burer-Monteiro Factorization5.401.626, 5, 8, 3, 5
1650$rm A^2Q$: Aggregation-Aware Quantization for Graph Neural Networks5.401.626, 8, 5, 5, 3
1651Learning Dynamical Characteristics with Neural Operators for Data Assimilation5.401.628, 5, 3, 5, 6
1652Wav2Tok: Deep Sequence Tokenizer for Audio Retrieval5.401.625, 5, 3, 8, 6
1653Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information5.401.628, 5, 3, 5, 6
1654GNNDelete: A General Unlearning Strategy for Graph Neural Networks5.401.626, 3, 5, 8, 5
1655General Neural Gauge Fields5.400.495, 6, 5, 6, 5
1656Deep Dynamic AutoEncoder for Vision BERT Pretraining5.400.495, 6, 5, 5, 6
1657DiffMimic: Efficient Motion Mimicking with Differentiable Physics5.401.203, 6, 6, 6, 6
1658Do Not Train It: A Linear Neural Architecture Search of Graph Neural Networks5.400.495, 5, 6, 6, 5
1659ModelAngelo: Automated Model Building for Cryo-EM Maps5.401.626, 5, 3, 8, 5
1660UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers5.330.476, 5, 5
1661Convergence is Not Enough: Average-Case Performance of No-Regret Learning Dynamics5.332.058, 5, 3
1662Simple Spectral Graph Convolution from an Optimization Perspective5.330.476, 5, 5
1663Doing Fast Adaptation Fast: Conditionally Independent Deep Ensembles for Distribution Shifts5.330.475, 5, 6
1664RuDar: Weather Radar Dataset for Precipitation Nowcasting with Geographical and Seasonal Variability5.330.475, 6, 5
1665HyPHEN: A Hybrid Packing Method and Optimizations for Homomorphic Encryption-Based Neural Network5.330.476, 5, 5
1666Unveiling the sampling density in non-uniform geometric graphs5.330.475, 6, 5
1667Geometrically regularized autoencoders for non-Euclidean data5.330.476, 5, 5
1668Evolving Populations of Diverse RL Agents with MAP-Elites5.330.476, 5, 5
1669Mid-Vision Feedback for Convolutional Neural Networks5.332.058, 3, 5
1670Prefer to Classify: Improving Text Classifier via Pair-wise Preference Learning5.332.055, 8, 3
1671Editing models with task arithmetic5.330.475, 6, 5
1672Context-Aware Image Completion5.330.476, 5, 5
1673Architecture Matters in Continual Learning5.332.053, 8, 5
1674Efficient Data Subset Selection to Generalize Training Across Models: Transductive and Inductive Networks5.330.475, 6, 5
1675Raisin: Residual Algorithms for Versatile Offline Reinforcement Learning5.330.475, 5, 6
1676Learning Shareable Bases for Personalized Federated Image Classification5.330.476, 5, 5
1677Learning Mixture Models with Simultaneous Data Partitioning and Parameter Estimation5.330.475, 5, 6
1678Neural Bregman Divergences for Distance Learning5.332.055, 8, 3
1679Offline Reinforcement Learning from Heteroskedastic Data Via Support Constraints5.330.476, 5, 5
1680Bias Propagation in Federated Learning5.330.476, 5, 5
1681LUNA: Language as Continuing Anchors for Referring Expression Comprehension5.330.475, 6, 5
1682Many-Body Approximation for Tensors5.332.058, 3, 5
1683What do large networks memorize?5.330.475, 5, 6
1684Linear Mode Connectivity of Deep Neural Networks via Permutation Invariance and Renormalization5.332.055, 3, 8
1685Differentially Private Diffusion Models5.332.058, 5, 3
1686Teaching Algorithmic Reasoning via In-context Learning5.332.055, 3, 8
1687Instruction-Following Agents with Jointly Pre-Trained Vision-Language Models5.330.475, 6, 5
1688GPTQ: Accurate Quantization for Generative Pre-trained Transformers5.330.475, 5, 6
1689A new characterization of the edge of stability based on a sharpness measure aware of batch gradient distribution5.330.476, 5, 5
1690Continual Post-Training of Language Models5.332.058, 3, 5
1691Min-Max Multi-objective Bilevel Optimization with Applications in Robust Machine Learning5.330.475, 6, 5
1692Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus5.330.475, 6, 5
1693Data Subset Selection via Machine Teaching5.330.475, 6, 5
1694Elicitation Inference Optimization for Multi-Principal-Agent Alignment5.330.475, 6, 5
1695Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors5.330.476, 5, 5
1696Probability flow solution of the Fokker-Planck equation5.330.475, 6, 5
1697Recycling Scraps: Improving Private Learning by Leveraging Intermediate Checkpoints5.330.475, 6, 5
1698BC-IRL: Learning Generalizable Reward Functions from Demonstrations5.332.053, 5, 8
1699Provable Robustness against Wasserstein Distribution Shifts via Input Randomization5.330.475, 6, 5
1700Deep Learning From Crowdsourced Labels: Coupled Cross-Entropy Minimization, Identifiability, and Regularization5.330.476, 5, 5
1701A Kernel-Based View of Language Model Fine-Tuning5.330.476, 5, 5
1702Learning Multiobjective Program Through Online Learning5.332.053, 5, 8
1703ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret5.330.475, 5, 6
1704The Challenges of Exploration for Offline Reinforcement Learning5.330.475, 6, 5
1705Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach5.332.058, 5, 3
1706Accelerated Single-Call Methods for Constrained Min-Max Optimization5.332.053, 8, 5
1707Understanding the Complexity Gains of Contextual Multi-task RL with Curricula5.330.475, 6, 5
1708Expected Probabilistic Hierarchies5.330.475, 6, 5
1709SP2 : A Second Order Stochastic Polyak Method5.330.475, 6, 5
1710Improved Group Robustness via Classifier Retraining on Independent Splits5.330.475, 6, 5
1711Density Sketches for Sampling and Estimation5.330.475, 5, 6
1712Beyond Link Prediction: On Pre-Training Knowledge Graph Embeddings5.330.475, 6, 5
1713Univariate vs Multivariate Time Series Forecasting with Transformers5.330.476, 5, 5
1714On the optimization and generalization of overparameterized implicit neural networks5.330.475, 5, 6
1715Learning to Unlearn: Instance-wise Unlearning for Pre-trained Classifiers5.332.058, 5, 3
17163D Neural Embedding Likelihood for Robust Sim-to-Real Transfer in Inverse Graphics5.330.476, 5, 5
1717MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection5.330.476, 5, 5
1718Trimsformer: Trimming Transformer via Searching for Low-Rank Structure5.330.475, 6, 5
1719Towards a Unified Theoretical Understanding of Non-contrastive Learning via Rank Differential Mechanism5.330.475, 6, 5
1720AE-FLOW: Autoencoders with Normalizing Flows for Medical Images Anomaly Detection5.332.053, 5, 8
1721Causal Mean Field Multi-Agent Reinforcement Learning5.330.475, 5, 6
1722Towards Robust Model Watermark via Reducing Parametric Vulnerability5.332.053, 5, 8
1723On the Robustness of Dataset Inference5.332.053, 8, 5
1724Towards Conditionally Dependent Masked Language Models5.330.475, 6, 5
1725DAVA: Disentangling Adversarial Variational Autoencoder5.330.475, 6, 5
1726Online Low Rank Matrix Completion5.332.053, 8, 5
1727Keypoint Matching via Random Network Consensus5.332.053, 5, 8
1728Private and Efficient Meta-Learning with Low Rank and Sparse decomposition5.330.475, 5, 6
1729On discrete symmetries of robotics systems: A group-theoretic and data-driven analysis5.330.475, 5, 6
1730BO-Muse: A Human expert and AI teaming framework for accelerated experimental design5.330.476, 5, 5
1731Policy-Based Self-Competition for Planning Problems5.332.053, 5, 8
1732Bayesian Oracle for bounding information gain in neural encoding models5.330.475, 5, 6
1733Unsupervised Performance Predictor for Architecture Search5.330.475, 5, 6
1734Learning Reduced Fluid Dynamics5.332.053, 5, 8
1735Confident Sinkhorn Allocation for Pseudo-Labeling5.330.476, 5, 5
1736UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction5.332.053, 5, 8
1737UNDERSTANDING PURE CLIP GUIDANCE FOR VOXEL GRID NERF MODELS5.330.476, 5, 5
1738Learning to Predict Parameter for Unseen Data5.330.475, 5, 6
1739BinSGDM: Extreme One-Bit Quantization for Communication Efficient Large-Scale Distributed Training5.330.476, 5, 5
1740Free Lunch for Domain Adversarial Training: Environment Label Smoothing5.330.475, 6, 5
1741One-Vs-All AUC Maximization: an effective solution to the low-resource named entity recognition problem5.332.053, 5, 8
1742Learning to Extrapolate: A Transductive Approach5.332.055, 8, 3
1743Detecting and Mitigating Indirect Stereotypes in Word Embeddings5.330.475, 5, 6
1744ASGNN: Graph Neural Networks with Adaptive Structure5.330.475, 5, 6
1745Spatial reasoning as Object Graph Energy Minimization5.330.475, 5, 6
1746BAT-Chain: Bayesian-Aware Transport Chain for Topic Hierarchies Discovery5.330.476, 5, 5
1747Exact Representation of Sparse Networks with Symmetric Nonnegative Embeddings5.330.476, 5, 5
1748Neural DAG Scheduling via One-Shot Priority Sampling5.330.475, 6, 5
1749Bias Amplification Improves Worst-Group Accuracy without Group Information5.330.475, 5, 6
1750A CMDP-within-online framework for Meta-Safe Reinforcement Learning5.332.053, 5, 8
1751Conditional Permutation Invariant Flows5.330.475, 5, 6
1752Learned Neural Network Representations are Spread Diffusely with Redundancy5.330.475, 5, 6
1753Multi-Segmental Informational Coding for Self-Supervised Representation Learning5.330.476, 5, 5
1754Learning to Segment from Noisy Annotations: A Spatial Correction Approach5.330.476, 5, 5
1755DiP-GNN: Discriminative Pre-Training of Graph Neural Networks5.330.476, 5, 5
1756Faster Reinforcement Learning with Value Target Lower Bounding5.330.475, 6, 5
1757Quasi-optimal Learning with Continuous Treatments5.330.475, 6, 5
1758On Structural Expressive Power of Graph Transformers5.332.058, 5, 3
1759Learning Critically in Federated Learning with Noisy and Heterogeneous Clients5.330.475, 6, 5
1760Deep Evidential Reinforcement Learning for Dynamic Recommendations5.332.053, 8, 5
1761SuperWeight Ensembles: Automated Compositional Parameter Sharing Across Diverse Architechtures5.330.476, 5, 5
1762Robust Self-Supervised Learning with Lie Groups5.332.055, 3, 8
1763D4FT: A Deep Learning Approach to Kohn-Sham Density Functional Theory5.330.476, 5, 5
1764Differentially Private Optimization on Large Model at Small Cost5.330.475, 6, 5
1765Contrastive Value Learning: Implicit Models for Simple Offline RL5.332.053, 8, 5
1766Normalizing Flows for Interventional Density Estimation5.330.476, 5, 5
1767GuoFeng: A Discourse-aware Evaluation Benchmark for Language Understanding, Translation and Generation5.332.058, 3, 5
1768SpectraNet: multivariate forecasting and imputation under distribution shifts and missing data5.332.058, 5, 3
1769Benchmarking Constraint Inference in Inverse Reinforcement Learning5.330.475, 5, 6
1770Forward and Backward Lifelong Learning with Time-dependent Tasks5.330.475, 6, 5
1771Homeomorphism Alignment in Two Spaces for Unsupervised Domain Adaptation5.330.475, 5, 6
1772Warped Convolutional Networks: Bridge Homography to $mathfrak{sl}(3)$ algebra by Group Convolution5.332.053, 5, 8
1773FEAT: A general framework for Feature-aware Multivariate Time-series Representation Learning5.330.475, 5, 6
1774RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank5.330.475, 6, 5
1775Label-distribution-agnostic Ensemble Learning on Federated Long-tailed Data5.330.476, 5, 5
1776Masked Vector Quantization5.333.303, 3, 10
1777Measuring Image Complexity as a Discrete Hierarchy using MDL Clustering5.330.475, 5, 6
1778Agent Prioritization with Interpretable Relation for Trajectory Prediction5.330.475, 5, 6
1779Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition5.332.053, 5, 8
1780Latent State Marginalization as a Low-cost Approach to Improving Exploration5.330.475, 5, 6
1781Supernet Training for Federated Image Classification Under System Heterogeneity5.330.475, 6, 5
1782Generalizable Person Re-identification Without Demographics5.330.476, 5, 5
1783Behavior Prior Representation learning for Offline Reinforcement Learning5.332.053, 5, 8
1784How Does Adaptive Optimization Impact Local Neural Network Geometry?5.330.475, 6, 5
1785Concentric Ring Loss for Face Forgery Detection5.332.058, 3, 5
1786Representational Task Bias in Zero-shot Recognition at Scale5.330.476, 5, 5
1787Relational Curriculum Learning for Graph Neural Networks5.330.475, 6, 5
1788ACMP: Allen-Cahn Message Passing with Attractive and Repulsive Forces for Graph Neural Networks5.330.475, 6, 5
1789An Upper Bound for the Distribution Overlap Index and Its Applications5.330.476, 5, 5
1790Retrieval-based Controllable Molecule Generation5.330.476, 5, 5
1791Data Drift Correction via Time-varying Importance Weight Estimator5.330.475, 6, 5
1792Solving and Learning non-Markovian Stochastic Control problems in continuous-time with Neural RDEs5.330.476, 5, 5
1793Sequential Latent Variable Models for Few-Shot High-Dimensional Time-Series Forecasting5.330.475, 5, 6
1794On the Fast Convergence of Unstable Reinforcement Learning Problems5.330.475, 6, 5
1795Universal approximation and model compression for radial neural networks5.330.476, 5, 5
1796Learn Low-dimensional Shortest-path Representation of Large-scale and Complex Graphs5.330.475, 5, 6
1797Generalized Sum Pooling for Metric Learning5.330.476, 5, 5
1798Learning to Estimate Single-View Volumetric Flow Motions without 3D Supervision5.330.475, 5, 6
1799$Delta$-PINNs: physics-informed neural networks on complex geometries5.332.058, 5, 3
1800Temperature Schedules for self-supervised contrastive methods on long-tail data5.330.476, 5, 5
1801SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification5.332.053, 8, 5
1802Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup5.332.058, 3, 5
1803Identifying Weight-Variant Latent Causal Models5.331.495, 5, 8, 3, 6, 5
1804Can CNNs Be More Robust Than Transformers?5.332.058, 5, 3
1805Rethinking Graph Lottery Tickets: Graph Sparsity Matters5.330.476, 5, 5
1806On the Universal Approximation Property of Deep Fully Convolutional Neural Networks5.330.475, 5, 6
1807Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval5.330.476, 5, 5
1808Continual Learning In Low-coherence Subspace: A Strategy To Mitigate Learning Capacity Degradation5.330.475, 6, 5
1809GSCA: Global Spatial Correlation Attention5.330.476, 5, 5
1810Understanding Incremental Learning of Gradient Descent: A Fine-grained analysis of Matrix Sensing5.332.053, 5, 8
1811Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models5.330.476, 5, 5
1812Critical Sampling for Robust Evolution Behavior Learning of Unknown Dynamical Systems5.332.053, 8, 5
1813Effective Cross-instance Positive Relations for Generalized Category Discovery5.330.475, 5, 6
1814Assessing Model Out-of-distribution Generalization with Softmax Prediction Probability Baselines and A Correlation Method5.330.476, 5, 5
1815Progressive Compressed Auto-Encoder for Self-supervised Representation Learning5.331.116, 6, 6, 6, 3, 5
1816Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation5.330.475, 6, 5
1817Distribution Aware Metrics for Conditional Natural Language Generation5.330.475, 5, 6
1818Recommender Transformers with Behavior Pathways5.330.475, 6, 5
1819HNeRV: A Hybrid Neural Representation for Videos5.330.476, 5, 5
1820Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation5.330.476, 5, 5
1821Deep Physics-based Deformable Models for Efficient Shape Abstractions5.330.476, 5, 5
1822Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies5.330.476, 5, 5
1823Active Learning with Controllable Augmentation Induced Acquisition5.332.055, 8, 3
1824Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game5.330.475, 5, 6
1825Mind the Gap: Offline Policy Optimizaiton for Imperfect Rewards5.332.058, 5, 3
1826Time Series are Images: Vision Transformer for Irregularly Sampled Time Series5.332.058, 5, 3
1827Understanding Self-Supervised Pretraining with Part-Aware Representation Learning5.330.476, 5, 5
1828Volumetric Optimal Transportation by Fast Fourier Transform5.332.053, 8, 5
1829Robustness Exploration of Semantic Information in Adversarial Training5.330.475, 6, 5
1830Learning GFlowNets from partial episodes for improved convergence and stability5.330.475, 6, 5
1831Boosting Out-of-Distribution Detection with Multiple Pre-trained Models5.330.475, 6, 5
1832Confidence and Dispersity Speak: Characterising Prediction Matrix for Unsupervised Accuracy Estimation5.332.053, 5, 8
1833Molecular Geometry Pretraining with SE(3)-Invariant Denoising Distance Matching5.330.475, 5, 6
1834Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking5.330.475, 6, 5
1835Understanding the Training Dynamics in Federated Deep Learning via Aggregation Weight Optimization5.330.475, 5, 6
1836ONLINE RESTLESS BANDITS WITH UNOBSERVED STATES5.250.435, 5, 6, 5
1837Learning Representations for Reinforcement Learning with Hierarchical Forward Models5.251.303, 6, 6, 6
1838Randomized Sharpness-Aware Training for Boosting Computational Efficiency in Deep Learning5.251.795, 3, 5, 8
1839Light and Accurate: Neural Architecture Search via Two Constant Shared Weights Initialisations5.250.435, 6, 5, 5
1840Protein Sequence and Structure Co-Design with Equivariant Translation5.251.306, 6, 3, 6
1841Efficiently Meta-Learning for Robust Deep Networks without Prior Unbiased Set5.251.795, 8, 5, 3
1842Regression with Label Differential Privacy5.252.591, 6, 8, 6
1843Theoretical Study of Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward5.250.436, 5, 5, 5
1844Backpropagation through Combinatorial Algorithms: Identity with Projection Works5.251.793, 5, 5, 8
1845GradientMix: A Simple yet Effective Regularization for Large Batch Training5.250.435, 6, 5, 5
1846Towards Learning Implicit Symbolic Representation for Visual Reasoning5.250.435, 5, 6, 5
1847SKTformer: A Skeleton Transformer for Long Sequence Data5.251.306, 3, 6, 6
1848Specformer: Spectral Graph Neural Networks Meet Transformers5.250.435, 6, 5, 5
1849MetaP: How to Transfer Your Knowledge on Learning Hidden Physics5.250.435, 5, 6, 5
1850CommsVAE: Learning the brain's macroscale communication dynamics using coupled sequential VAEs5.250.435, 5, 5, 6
1851Long Term Fairness via Performative Distributionally Robust Optimization5.251.795, 3, 8, 5
1852Multi-View Masked Autoencoders for Visual Control5.250.435, 5, 6, 5
1853Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-Free RL5.251.798, 3, 5, 5
18543D-IntPhys: Learning 3D Visual Intuitive Physics for Fluids, Rigid Bodies, and Granular Materials5.252.8610, 3, 5, 3
1855Benchmarking Algorithms for Domain Generalization in Federated Learning5.250.436, 5, 5, 5
1856Continual Learning Based on Sub-Networks and Task Similarity5.250.435, 6, 5, 5
1857Heavy-tailed Noise Does Not Explain the Gap Between SGD and Adam, but Sign Descent Might5.251.306, 6, 3, 6
1858Efficient parametric approximations of neural net function space distance5.251.798, 5, 3, 5
1859What Spurious Features Can Pretrained Language Models Combat?5.250.435, 5, 6, 5
1860Cramming: Training a language model on a single GPU in one day5.250.435, 5, 5, 6
1861Probabilistic Categorical Adversarial Attack and Adversarial Training5.251.798, 5, 5, 3
1862Dissecting adaptive methods in GANs5.251.798, 5, 5, 3
1863Robustness for Free: Adversarially Robust Anomaly Detection Through Diffusion Model5.250.435, 6, 5, 5
1864ErrorAug: Making Errors to Find Errors in Semantic Segmentation5.250.436, 5, 5, 5
1865When is Offline Hyperparameter Selection Feasible for Reinforcement Learning?5.250.435, 5, 5, 6
1866Denoising Diffusion Samplers5.250.435, 6, 5, 5
1867Model-free Reinforcement Learning that Transfers Using Random Reward Features5.251.795, 3, 5, 8
1868Progressive Mix-Up for Few-Shot Supervised Multi-Source Domain Transfer5.250.435, 5, 6, 5
1869Brain-like representational straightening of natural movies in robust feedforward neural networks5.251.306, 3, 6, 6
1870Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks5.251.795, 5, 3, 8
1871Calibrating the Rigged Lottery: Making All Tickets Reliable5.251.798, 3, 5, 5
1872Open-Vocabulary Panoptic Segmentation MaskCLIP5.250.435, 6, 5, 5
1873Laser: Latent Set Representations for 3D Generative Modeling5.250.435, 5, 6, 5
1874Finding and only finding local Nash equilibria by both pretending to be a follower5.250.435, 6, 5, 5
1875Fake It Until You Make It : Towards Accurate Near-Distribution Novelty Detection5.251.306, 3, 6, 6
1876Generative Pretraining for Black-Box Optimization5.250.435, 6, 5, 5
1877The ethical ambiguity of AI data enrichment: Measuring gaps in research ethics norms and practices5.252.863, 5, 3, 10
1878Neural multi-event forecasting on spatio-temporal point processes using probabilistically enriched transformers5.251.795, 5, 3, 8
1879Detecting Small Query Graphs in A Large Graph via Neural Subgraph Search5.250.436, 5, 5, 5
1880Planning with Language Models through Iterative Energy Minimization5.251.306, 6, 3, 6
1881Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction5.250.435, 6, 5, 5
1882Joint-Predictive Representations for Multi-Agent Reinforcement Learning5.251.306, 6, 6, 3
1883PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Category Discovery5.250.435, 5, 5, 6
1884Learning implicit hidden Markov models using neural likelihood-free inference5.251.793, 5, 8, 5
1885Making Better Decision by Directly Planning in Continuous Control5.251.306, 6, 3, 6
1886Heterogeneous Neuronal and Synaptic Dynamics for Spike-Efficient Unsupervised Learning: Theory and Design Principles5.251.795, 8, 3, 5
1887Shuffled Transformers for Blind Training5.251.793, 5, 8, 5
1888Hardware-aware compression with Random Operation Access Specific Tile (ROAST) hashing5.250.435, 5, 6, 5
1889Neural Implicit Shape Editing using Boundary Sensitivity5.250.435, 5, 5, 6
1890Amortised Invariance Learning for Contrastive Self-Supervision5.251.795, 5, 3, 8
1891Generating Sequences by Learning to Self-Correct5.250.435, 5, 6, 5
1892An ensemble view on mixup5.251.793, 5, 8, 5
1893ULF: UNSUPERVISED LABELING FUNCTION CORRECTION USING CROSS-VALIDATION FOR WEAK SUPERVISION5.250.436, 5, 5, 5
1894Continual Zero-shot Learning through Semantically Guided Generative Random Walks5.251.795, 8, 3, 5
1895Self-Guided Diffusion Models5.250.436, 5, 5, 5
1896Stay Moral and Explore: Learn to Behave Morally in Text-based Games5.250.436, 5, 5, 5
1897Memory-Efficient Reinforcement Learning with Priority based on Surprise and On-policyness5.251.795, 5, 8, 3
1898Uncertainty-aware off policy learning5.251.793, 5, 8, 5
1899Analyzing diffusion as serial reproduction5.251.793, 5, 8, 5
1900Pseudo-label Training and Model Inertia in Neural Machine Translation5.251.795, 5, 8, 3
1901Understanding weight-magnitude hyperparameters in training binary networks5.250.435, 5, 6, 5
1902Graph Backup: Data Efficient Backup Exploiting Markovian Transitions5.250.435, 5, 6, 5
1903Adversarial Driving Policy Learning by Misunderstanding the Traffic Flow5.250.435, 5, 6, 5
1904Sequential Learning of Neural Networks for Prequential MDL5.250.436, 5, 5, 5
1905ReaKE: Contrastive Molecular Representation Learning with Chemical Synthetic Knowledge Graph5.250.436, 5, 5, 5
1906Parameter Averaging for SGD Stabilizes the Implicit Bias towards Flat Regions5.251.798, 5, 3, 5
1907A New Hierarchy of Expressivity for Graph Neural Networks5.250.435, 6, 5, 5
1908Lmser-pix2seq: Learning Stable Sketch Representations For Sketch Healing5.251.798, 5, 5, 3
1909Consolidator: Mergable Adapter with Group Connections for Vision Transformer5.250.435, 5, 6, 5
1910Explaining RL Decisions with Trajectories5.250.435, 5, 6, 5
1911ProtoGNN: Prototype-Assisted Message Passing Framework for Non-Homophilous Graphs5.250.435, 5, 6, 5
1912Two Birds, One Stone: An Equivalent Transformation for Hyper-relational Knowledge Graph Modeling5.251.798, 3, 5, 5
1913Generalization Bounds with Arbitrary Complexity Measures5.250.435, 5, 6, 5
1914On student-teacher deviations in distillation: does it pay to disobey?5.251.795, 8, 5, 3
1915Merging Models Pre-Trained on Different Features with Consensus Graph5.251.795, 5, 8, 3
1916CUTS: Neural Causal Discovery from Unstructured Time-Series Data5.250.435, 5, 5, 6
1917On the Importance of In-distribution Class Prior for Out-of-distribution Detection5.251.306, 3, 6, 6
1918Curved Data Representations in Deep Learning5.251.798, 5, 5, 3
1919Learning Binary Networks on Long-Tailed Distributions5.251.798, 5, 5, 3
1920Concealing Sensitive Samples for Enhanced Privacy in Federated Learning5.251.793, 5, 8, 5
1921Understanding Graph Contrastive Learning From A Statistical Perspective5.250.435, 5, 5, 6
1922Stochastic Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity5.251.306, 6, 3, 6
1923Label-free Concept Bottleneck Models5.250.435, 5, 5, 6
1924Push and Pull: Competing Feature-Prototype Interactions Improve Semi-supervised Semantic Segmentation5.250.435, 5, 5, 6
1925A computational framework to unify representation similarity and function in biological and artificial neural networks5.251.793, 8, 5, 5
1926Temporally Consistent Video Transformer for Long-Term Video Prediction5.250.435, 5, 5, 6
1927DITTO: Offline Imitation Learning with World Models5.250.436, 5, 5, 5
1928Disentangling the Mechanisms Behind Implicit Regularization in SGD5.251.303, 6, 6, 6
1929Provably Efficient Lifelong Reinforcement Learning with Linear Representation5.250.436, 5, 5, 5
1930Copula Conformal Prediction for Multi-step Time Series Forecasting5.251.303, 6, 6, 6
1931Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy5.250.435, 5, 5, 6
1932TrajGRU-Attention-ODE: Novel Spatiotemporal Predictive Models5.250.436, 5, 5, 5
1933Is a Caption Worth a Thousand Images? A Study on Representation Learning5.251.798, 5, 5, 3
1934Parameter-Efficient Fine-Tuning Design Spaces5.251.793, 8, 5, 5
1935Variational Latent Branching Model for Off-Policy Evaluation5.250.435, 5, 5, 6
1936Polarity is all you need to learn and transfer faster5.251.793, 5, 5, 8
1937On the Geometry of Reinforcement Learning in Continuous State and Action Spaces5.250.436, 5, 5, 5
1938AUGMENTING ZERO-SHOT DENSE RETRIEVERS WITH PLUG-IN MIXTURE-OF-MEMORIES5.250.436, 5, 5, 5
1939Perfectly Secure Steganography Using Minimum Entropy Coupling5.252.596, 8, 1, 6
1940Identifiability of Label Noise Transition Matrix5.250.435, 5, 6, 5
1941Towards Explaining Distribution Shifts5.250.436, 5, 5, 5
1942CAMA: A New Framework for Safe Multi-Agent Reinforcement Learning Using Constraint Augmentation5.250.435, 5, 5, 6
1943Visual Prompt Tuning For Test-time Domain Adaptation5.250.435, 5, 5, 6
1944ReD-GCN: Revisit the Depth of Graph Convolutional Network5.250.436, 5, 5, 5
1945Rethinking Positive Sampling for Contrastive Learning with Kernel5.250.435, 5, 5, 6
1946FaiREE: fair classification with finite-sample and distribution-free guarantee5.251.798, 5, 3, 5
1947Unbiased Stochastic Proximal Solver for Graph Neural Networks with Equilibrium States5.250.436, 5, 5, 5
1948On The Implicit Bias of Weight Decay in Shallow Univariate ReLU Networks5.251.798, 3, 5, 5
1949Improving Deep Policy Gradients with Value Function Search5.250.435, 5, 6, 5
1950Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection5.252.596, 8, 6, 1
1951Over-parameterized Model Optimization with Polyak-{L}ojasiewicz Condition5.251.795, 5, 3, 8
1952DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning5.250.435, 5, 6, 5
1953A Curriculum Perspective to Robust Loss Functions5.251.303, 6, 6, 6
1954Decoupled Training for Long-Tailed Classification With Stochastic Representations5.250.436, 5, 5, 5
1955IT-NAS: Integrating Lite-Transformer into NAS for Architecture Seletion5.251.306, 3, 6, 6
1956Simplicity bias in $1$-hidden layer neural networks5.250.435, 5, 5, 6
1957Memory Gym: Partially Observable Challenges to Memory-Based Agents5.251.795, 8, 5, 3
1958On the effectiveness of out-of-distribution data in self-supervised long-tail learning.5.250.435, 5, 6, 5
1959Vera Verto: Multimodal Hijacking Attack5.250.436, 5, 5, 5
1960Joint Attention-Driven Domain Fusion and Noise-Tolerant Learning for Multi-Source Domain Adaptation5.251.798, 3, 5, 5
1961Model Obfuscation for Securing Deployed Neural Networks5.251.795, 8, 3, 5
1962MultiViz: Towards Visualizing and Understanding Multimodal Models5.252.591, 6, 6, 8
1963Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN5.251.795, 8, 3, 5
1964New Insights for the Stability-Plasticity Dilemma in Online Continual Learning5.251.795, 8, 3, 5
1965Ti-MAE: Self-Supervised Masked Time Series Autoencoders5.250.435, 5, 5, 6
1966Are More Layers Beneficial to Graph Transformers?5.251.306, 6, 3, 6
1967Clean-image Backdoor: Attacking Multi-label Models with Poisoned Labels Only5.251.306, 6, 3, 6
1968Bandit Learning in Many-to-one Matching Markets with Uniqueness Conditions5.250.435, 6, 5, 5
1969Predictive Inference with Feature Conformal Prediction5.250.435, 5, 5, 6
1970OrthoReg: Improving Graph-regularized MLPs via Orthogonality Regularization5.250.435, 5, 6, 5
1971Personalized Semantics Excitation for Federated Image Classification5.251.798, 5, 5, 3
1972Intrinsic Motivation via Surprise Memory5.251.798, 3, 5, 5
1973TensorVAE: A Direct Generative Model for Molecular Conformation Generation driven by Novel Feature Engineering5.251.793, 5, 8, 5
1974MaskFusion: Feature Augmentation for Click-Through Rate Prediction via Input-adaptive Mask Fusion5.251.795, 8, 3, 5
1975NERDS: A General Framework to Train Camera Denoisers from Single Noisy Images5.251.303, 6, 6, 6
1976Coverage-centric Coreset Selection for High Pruning Rates5.250.435, 6, 5, 5
1977Chasing Better Deep Image Priors Between Over- and Under-parameterization5.250.436, 5, 5, 5
1978Data Valuation Without Training of a Model5.251.303, 6, 6, 6
1979RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning5.250.435, 5, 6, 5
1980Speculative Decoding: Lossless Speedup of Autoregressive Translation5.250.435, 6, 5, 5
1981Transformer Module Networks for Systematic Generalization in Visual Question Answering5.250.435, 5, 5, 6
1982Constructive TT-representation of the tensors given as index interaction functions with applications5.251.306, 6, 6, 3
1983VoGE: A Differentiable Volume Renderer using Gaussian Ellipsoids for Analysis-by-Synthesis5.251.795, 8, 3, 5
1984Unravel Structured Heterogeneity of Tasks in Meta-Reinforcement Learning via Exploratory Clustering5.250.436, 5, 5, 5
1985Find Your Friends: Personalized Federated Learning with the Right Collaborators5.251.306, 6, 6, 3
1986Equilibrium-finding via exploitability descent with learned best-response functions5.251.795, 8, 5, 3
1987Masked inverse folding with sequence transfer for protein representation learning5.250.436, 5, 5, 5
1988FedDAR: Federated Domain-Aware Representation Learning5.251.306, 6, 6, 3
1989Interval Bound Interpolation for Few-shot Learning with Few Tasks5.250.435, 5, 5, 6
1990ELRT: Towards Efficient Low-Rank Training for Compact Neural Networks5.250.435, 5, 5, 6
1991Tangential Wasserstein Projections5.251.303, 6, 6, 6
1992SYNG4ME: Model Evaluation using Synthetic Test Data5.250.436, 5, 5, 5
1993Long-Tailed Learning Requires Feature Learning5.250.435, 6, 5, 5
1994Revisiting Pretraining Objectives for Tabular Deep Learning5.251.795, 3, 5, 8
1995Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization5.251.798, 5, 5, 3
1996Relative Positional Encoding Family via Unitary Transformation5.251.303, 6, 6, 6
1997Continual Vision-Language Representaion Learning with Off-Diagonal Information5.251.795, 5, 3, 8
1998COFS: COntrollable Furniture layout Synthesis5.250.435, 6, 5, 5
1999A Functional Perspective on Multi-Layer Out-of-Distribution Detection5.250.435, 6, 5, 5
2000Active Learning with Partial Labels5.251.795, 8, 3, 5
2001Fed-CBS: Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction5.251.795, 8, 5, 3
2002Delving into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling5.251.795, 3, 8, 5
2003Enabling Probabilistic Inference on Large-Scale Spiking Neural Networks5.251.798, 5, 3, 5
2004A Closer Look at Dual Batch Normalization and Two-domain Hypothesis In Adversarial Training With Hybrid Samples5.250.435, 5, 5, 6
2005Communication-Efficient Federated Learning with Accelerated Client Gradient5.250.435, 6, 5, 5
2006Ranking-Enhanced Unsupervised Sentence Representation Learning5.251.793, 5, 8, 5
2007Revisiting Graph Adversarial Attack and Defense From a Data Distribution Perspective5.250.435, 5, 6, 5
2008On Fairness Measurement for Generative Models5.250.436, 5, 5, 5
2009Analyzing the Latent Space of GAN through Local Dimension Estimation5.251.303, 6, 6, 6
2010Neural Collaborative Filtering Bandits via Meta Learning5.251.798, 5, 5, 3
2011Decoupled Mixup for Data-efficient Learning5.250.435, 5, 5, 6
2012FAIRER: Fairness as Decision Rationale Alignment5.250.435, 5, 5, 6
2013Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients5.250.435, 6, 5, 5
2014Learning Continuous Grasping Function with a Dexterous Hand from Human Demonstrations5.251.795, 8, 5, 3
2015When Do Models Generalize? A Perspective From Data-Algorithm Compatibility5.251.303, 6, 6, 6
2016Learning PDE Solution Operator for Continuous Modeling of Time-Series5.250.435, 5, 5, 6
2017Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions5.251.793, 5, 5, 8
2018Neural Radiance Field Codebooks5.250.435, 5, 5, 6
2019Data-Efficient and Interpretable Tabular Anomaly Detection5.250.435, 6, 5, 5
2020The Impact of Approximation Errors on Warm-Start Reinforcement Learning: A Finite-time Analysis5.251.306, 6, 3, 6
20213D-Aware Video Generation5.251.795, 3, 8, 5
2022Correcting Data Distribution Mismatch in Offline Meta-Reinforcement Learning with Few-Shot Online Adaptation5.250.435, 5, 6, 5
2023Online Placebos for Class-incremental Learning5.251.798, 3, 5, 5
2024Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning5.251.306, 6, 6, 3
2025IEDR: A Context-aware Intrinsic and Extrinsic Disentangled Recommender System5.251.306, 6, 3, 6
2026Exploring Chemical Space with Score-based Out-of-distribution Generation5.251.798, 3, 5, 5
2027DFlow: Learning to Synthesize Better Optical Flow Datasets via a Differentiable Pipeline5.250.435, 5, 6, 5
2028NormSoftmax: Normalize the Input of Softmax to Accelerate and Stabilize Training5.250.436, 5, 5, 5
2029TimelyFL: Heterogeneity-aware Asynchronous Federated Learning with Adaptive Partial Training5.251.306, 6, 3, 6
2030Graph Domain Adaptation via Theory-Grounded Spectral Regularization5.251.306, 6, 3, 6
2031Cross Modal Domain Generalization for Query-based Video Segmentation5.251.793, 8, 5, 5
2032Language Model Pre-training with Linguistically Motivated Curriculum Learning5.250.435, 5, 5, 6
2033Your Denoising Implicit Model is a Sub-optimal Ensemble of Denoising Predictions5.250.435, 6, 5, 5
2034NOAH: A New Head Structure To Improve Deep Neural Networks For Image Classification5.250.436, 5, 5, 5
2035InPL: Pseudo-labeling the Inliers First for Imbalanced Semi-supervised Learning5.251.306, 3, 6, 6
2036Imitate Your Own Refinement: Knowledge Distillation Sheds Light on Efficient Image-to-Image Translation5.250.435, 6, 5, 5
2037Self-Supervised Set Representation Learning for Unsupervised Meta-Learning5.250.435, 6, 5, 5
2038Learning Specialized Activation Functions for Physics-informed Neural Networks5.251.793, 8, 5, 5
2039Dateformer: Transformer Extends Look-back Horizon to Predict Longer-term Time Series5.251.306, 6, 3, 6
2040Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Split Target Guided DeNoising5.250.436, 5, 5, 5
2041Reliability of CKA as a Similarity Measure in Deep Learning5.251.795, 5, 8, 3
2042Comfort Zone: A Vicinal Distribution for Regression Problems5.251.303, 6, 6, 6
2043Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant Learning5.250.435, 5, 6, 5
2044Self-Organizing Pathway Expansion for Non-Exemplar Incremental Learning5.250.436, 5, 5, 5
2045DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection5.252.598, 6, 1, 6
2046DDM$^2$: Self-Supervised Diffusion MRI Denoising with Generative Diffusion Models5.252.591, 6, 6, 8
2047Pareto Automatic Multi-Task Graph Representation Learning5.251.795, 8, 5, 3
2048Outlier Robust Adversarial Training5.251.798, 5, 3, 5
2049Sparse Tokens for Dense Prediction - The Medical Image Segmentation Case5.250.435, 5, 6, 5
2050NTK-SAP: Improving neural network pruning by aligning training dynamics5.251.306, 3, 6, 6
2051Discovering Distinctive ``Semantics'' in Super-Resolution Networks5.251.795, 8, 3, 5
2052BQ-NCO: Bisimulation Quotienting for Generalizable Neural Combinatorial Optimization5.251.793, 5, 5, 8
2053Distilling Cognitive Backdoor within an Image5.251.798, 5, 3, 5
20543D generation on ImageNet5.251.306, 3, 6, 6
2055Revisiting Higher-Order Gradient Methods for Multi-Agent Reinforcement Learning5.250.435, 5, 6, 5
2056DIVISION: Memory Efficient Training via Dual Activation Precision5.251.793, 5, 8, 5
2057CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable and Controllable Text-Guided Image Manipulation5.250.435, 5, 6, 5
2058Provable Adaptivity in Adam5.251.795, 3, 5, 8
2059De Novo Molecular Generation via Connection-aware Motif Mining5.251.795, 3, 5, 8
2060Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models5.250.436, 5, 5, 5
2061Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling5.251.306, 6, 3, 6
2062E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic Segmentation5.250.435, 5, 6, 5
2063CAN: A simple, efficient and scalable contrastive masked autoencoder framework for learning visual representations5.251.795, 5, 8, 3
2064Self-conditioned Embedding Diffusion for Text Generation5.250.435, 5, 5, 6
2065Towards a Unified View on Visual Parameter-Efficient Transfer Learning5.250.435, 5, 5, 6
2066BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation5.250.435, 5, 5, 6
2067Towards Sustainable Self-supervised Learning5.250.436, 5, 5, 5
2068Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features5.251.795, 3, 8, 5
2069Efficient Automatic Machine Learning via Design Graphs5.251.795, 5, 8, 3
2070Motion-inductive Self-supervised Object Discovery in Videos5.251.793, 5, 5, 8
2071SIMPLE: Specialized Model-Sample Matching for Domain Generalization5.251.798, 5, 3, 5
2072A Study of Causal Confusion in Preference-Based Reward Learning5.201.608, 5, 5, 5, 3
2073CodeT5Mix: A Pretrained Mixture of Encoder-decoder Transformers for Code Understanding and Generation5.201.176, 6, 6, 3, 5
2074TILDE-Q: a Transformation Invariant Loss Function for Time-Series Forecasting5.202.793, 6, 8, 8, 1
2075Efficient neural representation in the cognitive neuroscience domain: Manifold Capacity in One-vs-rest Recognition Limit5.201.946, 8, 3, 6, 3
2076Revisit Finetuning strategy for Few-Shot Learning to Strengthen the Equivariance of Emdeddings5.201.176, 6, 6, 3, 5
2077Lossy Image Compression with Conditional Diffusion Models5.200.405, 5, 6, 5, 5
2078Active Learning for Object Detection with Evidential Deep Learning and Hierarchical Uncertainty Aggregation5.201.176, 3, 6, 6, 5
2079Understanding and Mitigating Robust Overfitting through the Lens of Feature Dynamics5.201.176, 6, 3, 6, 5
2080Synchronized Contrastive Pruning for Efficient Self-Supervised Learning5.201.605, 8, 5, 3, 5
2081Faster federated optimization under second-order similarity5.200.405, 5, 6, 5, 5
2082Where to Go Next for Recommender Systems? ID- vs. Modality-based recommender models revisited5.201.603, 8, 5, 5, 5
2083Optimising 2D Pose Representation: Improving Accuracy, Stability and Generalisability inUnsupervised 2D-3D Human Pose Estimation5.201.603, 8, 5, 5, 5
2084Test-time Adaptation for Better Adversarial Robustness5.200.405, 5, 5, 5, 6
2085RGI: robust GAN-inversion for mask-free image inpainting and unsupervised pixel-wise anomaly detection5.201.173, 6, 6, 5, 6
2086MIMT: Masked Image Modeling Transformer for Video Compression5.200.405, 5, 5, 6, 5
2087On the Necessity of Disentangled Representations for Downstream Tasks5.201.176, 5, 6, 6, 3
2088Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient for Out-of-Distribution Generalization5.201.943, 6, 6, 3, 8
2089Edge-Varying Fourier Graph Network for Multivariate Time Series Forecasting5.200.405, 5, 6, 5, 5
2090How do Variational Autoencoders Learn? Insights from Representational Similarity5.201.608, 3, 5, 5, 5
2091Dilated convolution with learnable spacings5.201.176, 6, 3, 5, 6
2092Grassmannian Class Representation in Deep Learning5.201.173, 6, 5, 6, 6
2093SPI-GAN: Denoising Diffusion GANs with Straight-Path Interpolations5.171.775, 3, 8, 6, 3, 6
2094The Reward Hypothesis is False5.171.463, 5, 5, 8, 5, 5
2095A Study of Biologically Plausible Neural Network: the Role and Interactions of Brain-Inspired Mechanisms in Continual Learning5.002.128, 3, 6, 3
2096Proper Scoring Rules for Survival Analysis5.000.005, 5, 5
2097PPAT: Progressive Graph Pairwise Attention Network for Event Causality Identification5.000.005, 5, 5
2098Disentangled Feature Swapping Augmentation for Weakly Supervised Semantic Segmentation5.001.226, 3, 5, 6
2099Kinship Representation Learning with Face Componential Relation5.002.123, 6, 8, 3
2100Improved Training of Physics-Informed Neural Networks with Model Ensembles5.002.128, 6, 3, 3
2101RephraseTTS: Dynamic Length Text based Speech Insertion with Speaker Style Transfer5.001.225, 6, 6, 3
2102Beyond Reward: Offline Preference-guided Policy Optimization5.002.128, 3, 3, 6
2103Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study5.001.416, 6, 3
2104UiTTa: Online Test-Time Adaptation by User Interaction5.000.005, 5, 5, 5
2105Compression-aware Training of Neural Networks using Frank-Wolfe5.002.126, 3, 3, 8
2106MEDOE: A Multi-Expert Decoder and Output Ensemble Framework for Long-tailed Semantic Segmentation5.000.005, 5, 5, 5
2107TransFool: An Adversarial Attack against Neural Machine Translation Models5.001.223, 6, 6, 5
2108Denoising Differential Privacy in Split Learning5.001.223, 5, 6, 6
2109Extracting Meaningful Attention on Source Code: An Empirical Study of Developer and Neural Model Code Exploration5.001.106, 3, 5, 6, 5
2110Asynchronous Distributed Bilevel Optimization5.000.005, 5, 5
2111Confidence-Based Feature Imputation for Graphs with Partially Known Features5.001.416, 3, 6
2112Offline imitation learning by controlling the effective planning horizon5.001.226, 3, 5, 6
2113A Hierarchical Bayesian Approach to Federated Learning5.001.226, 6, 5, 3
2114On the Existence of a Trojaned Twin Model5.001.226, 3, 6, 5
2115Counterfactual Generation Under Confounding5.000.005, 5, 5, 5
2116FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation5.001.416, 3, 6
2117MA-BERT: Towards Matrix Arithmetic-only BERT Inference by Eliminating Complex Non-linear Functions5.000.005, 5, 5
2118Offline Reinforcement Learning via Weighted $f$-divergence5.000.005, 5, 5, 5
2119Revisiting and Improving FGSM Adversarial Training5.000.005, 5, 5, 5
2120TrojText: Test-time Invisible Textual Trojan Insertion5.001.226, 5, 6, 3
2121Robustness Guarantees for Adversarially Trained Neural Networks5.001.226, 5, 6, 3
2122Fast-PINN for Complex Geometry: Solving PDEs with Boundary Connectivity Loss5.001.223, 6, 6, 5
2123UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining5.001.226, 3, 5, 6
2124GNNInterpreter: A Probabilistic Generative Model-Level Explanation for Graph Neural Networks5.001.226, 3, 6, 5
2125On Pre-training Language Model for Antibody5.001.223, 6, 6, 5
2126L2B: Learning to Bootstrap for Combating Label Noise5.000.005, 5, 5
2127Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis5.001.225, 6, 6, 3
2128Differentially Private Algorithms for Smooth Nonconvex ERM5.001.226, 3, 6, 5
2129Answer Me if You Can: Debiasing Video Question Answering via Answering Unanswerable Questions5.001.226, 6, 3, 5
2130Learning Rewards and Skills to Follow Commands with a Data Efficient Visual-Audio Representation5.000.005, 5, 5
2131Auto-Encoding Goodness of Fit5.001.226, 6, 5, 3
2132Understanding the Covariance Structure of Convolutional Filters5.001.225, 6, 6, 3
2133Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation5.000.005, 5, 5, 5
2134Do We Really Need Graph Models for Skeleton-Based Action Recognition? A Topology-Agnostic Approach with Fully-Connected Networks5.000.005, 5, 5
2135On Representing Mixed-Integer Linear Programs by Graph Neural Networks5.002.556, 8, 1, 5
2136Dynamic Neural Network is All You Need: Understanding the Robustness of Dynamic Mechanisms in Neural Networks5.002.948, 1, 6
2137Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning5.001.226, 5, 3, 6
2138PINTO: Faithful Language Reasoning Using Prompted-Generated Rationales5.001.416, 3, 6
2139Unsupervised 3D Scene Representation Learning via Movable Object Inference5.001.225, 3, 6, 6
2140Similarity-Based Cooperation5.000.005, 5, 5, 5
2141Augmentation Component Analysis: Modeling Similarity via the Augmentation Overlaps5.001.225, 6, 3, 6
2142On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness5.000.005, 5, 5
2143A Picture of the Space of Typical Learning Tasks5.001.416, 3, 6
2144UNICO: Efficient Unified Hardware-Software Co-Optimization For Deep Neural Networks5.000.005, 5, 5, 5
2145DyG2Vec: Representation Learning for Dynamic Graphs With Self-supervision5.001.223, 6, 6, 5
2146Deep Watermarks for Attributing Generative Models5.001.416, 6, 3
2147Learning Latent Structural Causal Models5.002.458, 3, 3, 8, 3
2148S$^6$-DAMON: Bridging Self-Supervised Speech Models and Real-time Speech Recognition5.000.005, 5, 5
2149ORCA: Interpreting Prompted Language Models via Locating Supporting Evidence in the Ocean of Pretraining Data5.001.223, 6, 6, 5
2150FedTiny: Pruned Federated Learning Towards Specialized Tiny Models5.000.005, 5, 5, 5
2151Learning to represent and predict evolving visual signals via polar straightening5.000.005, 5, 5
2152Interpretable (meta)factorization of clinical questionnaires to identify general dimensions of psychopathology5.001.903, 3, 8, 6, 5
2153Attentive MLP for Non-Autoregressive Generation5.000.005, 5, 5
2154The Plug and Play of Language Models for Text-to-image Generation5.001.225, 6, 3, 6
2155A Score-Based Model for Learning Neural Wavefunctions5.001.226, 3, 5, 6
2156Multi-Grid Tensorized Fourier Neural Operator for High Resolution PDEs5.000.005, 5, 5
2157Dual Student Networks for Data-Free Model Stealing5.002.128, 3, 3, 6
2158Equal Improvability: A New Fairness Notion Considering the Long-term Impact5.001.225, 6, 3, 6
2159Target Conditioned Representation Independence (TCRI); from Domain-Invariant to Domain-General Representations5.001.225, 3, 6, 6
2160Multi-Task Option Learning and Discovery for Stochastic Path Planning5.001.225, 3, 6, 6
2161Bandwith Enables Generalization in Quantum Kernel Models5.002.123, 6, 8, 3
2162SpENCNN: Orchestrating Encoding and Sparsity for Fast Homomorphically Encrypted Neural Network Inference5.000.005, 5, 5
2163Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning5.001.223, 6, 6, 5
2164Transformers Implement First-Order Logic with Majority Quantifiers5.001.908, 3, 6, 5, 3
2165FedX: Federated Learning for Compositional Pairwise Risk Optimization5.001.413, 6, 6
2166Multi-Sample Contrastive Neural Topic Model as Multi-Task Learning5.002.123, 8, 3, 6
2167Towards Fair Classification against Poisoning Attacks5.000.005, 5, 5
2168Fed-Cor: Federated Correlation Test with Secure Aggregation5.001.413, 6, 6
2169Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments5.002.123, 3, 6, 8
2170Plansformer: Generating Multi-Domain Symbolic Plans using Transformers5.001.223, 6, 6, 5
2171Multi-Environment Pretraining Enables Transfer to Action Limited Datasets5.001.906, 3, 5, 3, 8
2172Fast Sampling of Diffusion Models with Exponential Integrator5.001.226, 6, 5, 3
2173Movement-to-Action Transformer Networks for Temporal Action Proposal Generation5.002.123, 3, 6, 8
2174Interpretations of Domain Adaptations via Layer Variational Analysis5.000.005, 5, 5
2175Progressive Prompts: Continual Learning for Language Models without Forgetting5.001.225, 6, 3, 6
2176Multiple sequence alignment as a sequence-to-sequence learning problem5.001.416, 3, 6
2177Semi-Supervised Single Domain Generalization with Label-Free Adversarial Data Augmentation5.000.005, 5, 5, 5
2178Mitigating Propagation Failures in PINNs using Evolutionary Sampling5.001.416, 3, 6
2179Exploring perceptual straightness in learned visual representations5.000.005, 5, 5
2180Is Forgetting Less a Good Inductive Bias for Forward Transfer?5.000.005, 5, 5, 5
2181Simulating Environments for Evaluating Scarce Resource Allocation Policies5.002.558, 6, 5, 1
2182Revisiting Curiosity for Exploration in Procedurally Generated Environments5.002.453, 8, 3, 3, 8
2183The Power of Feel-Good Thompson Sampling: A Unified Framework for Linear Bandits5.000.005, 5, 5
2184Reward Design with Language Models5.001.226, 6, 3, 5
2185DSI++: Updating Transformer Memory with New Documents5.001.226, 5, 6, 3
2186The Game of Hidden Rules: A New Challenge for Machine Learning5.001.416, 6, 3
2187In-Time Refining Optimization Trajectories Toward Improved Robust Generalization5.000.005, 5, 5
2188Speed Up Iterative Non-Autoregressive Transformers by Distilling Multiple Steps5.000.005, 5, 5
2189When Rigid Coherency Hurts: Distributional Coherency Regularization for Probabilistic Hierarchical Time Series Forecasting5.002.558, 6, 1, 5
2190MolJET: Multimodal Joint Embedding Transformer for Conditional de novo Molecular Design and Multi-Property Optimization5.002.453, 3, 3, 8, 8
2191$O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games5.001.416, 3, 6
2192Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise5.001.223, 6, 6, 5
2193Explainable Machine Learning Predictions for the Long-term Performance of Brain-Computer Interfaces5.002.128, 3, 6, 3
2194Federated Learning from Small Datasets5.001.105, 6, 5, 6, 3
2195Contrastive introspection (ConSpec) to rapidly identify invariant steps for success5.001.223, 6, 5, 6
2196Panoptically guided Image Inpainting with Image-level and Object-level Semantic Discriminators5.001.225, 6, 3, 6
2197REM: Routing Entropy Minimization for Capsule Networks5.001.223, 6, 6, 5
2198Variational Classification5.000.005, 5, 5
2199ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond5.001.225, 6, 6, 3
2200Learning Robust Representations via Nuisance-extended Information Bottleneck5.000.005, 5, 5
2201Understanding Train-Validation Split in Meta-Learning with Neural Networks5.001.226, 3, 5, 6
2202Blessing from Experts: Super Reinforcement Learning in Confounded Environments5.001.416, 6, 3
2203DP-SGD-LF: Improving Utility under Differentially Private Learning via Layer Freezing5.001.416, 3, 6
2204A Simulation-based Framework for Robust Federated Learning to Training-time Attacks5.000.005, 5, 5, 5
2205PALM: Preference-based Adversarial Manipulation against Deep Reinforcement Learning5.001.106, 5, 3, 6, 5
2206Multi-Hypothesis 3D human pose estimation metrics favor miscalibrated distributions5.001.226, 6, 3, 5
2207Flatter, Faster: Scaling Momentum for Optimal Speedup of SGD5.001.413, 6, 6
2208SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration5.002.123, 6, 8, 3
2209AlphaFold Distillation for Improved Inverse Protein Folding5.002.126, 3, 8, 3
2210A Cognitive-inspired Multi-Module Architecture for Continual Learning5.000.005, 5, 5, 5
2211Masked Siamese ConvNets: Towards an Effective Masking Strategy for General-purpose Siamese Networks5.000.005, 5, 5
2212Training Normalizing Flows from Dependent Data5.001.416, 6, 3
2213Autoregressive Conditional Neural Processes5.001.416, 3, 6
2214Islands of Confidence: Robust Neural Network Classification with Uncertainty Quantification5.000.005, 5, 5
2215Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics5.001.413, 6, 6
2216Renamer: A Transformer Architecture In-variant to Variable Renaming5.001.413, 6, 6
2217Learning a Domain-Agnostic Policy through Adversarial Representation Matching for Cross-Domain Policy Transfer5.001.416, 3, 6
2218Enforcing Delayed-Impact Fairness Guarantees5.000.005, 5, 5
2219Towards Reliable Link Prediction with Robust Graph Information Bottleneck5.001.226, 6, 5, 3
2220UNICORN: A Unified Backdoor Trigger Inversion Framework5.001.413, 6, 6
2221Contrastive Meta-Learning for Partially Observable Few-Shot Learning5.001.226, 3, 6, 5
2222Analyzing Transformers in Embedding Space5.002.128, 3, 3, 6
2223Simplicity bias leads to amplified performance disparities5.000.005, 5, 5, 5
2224Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection5.001.225, 3, 6, 6
2225Distributed Inference and Fine-tuning of Large Language Models Over The Internet5.000.005, 5, 5, 5
2226Irregularity Reflection Neural Network for Time Series Forecasting5.001.416, 6, 3
2227Interpreting Class Conditional GANs with Channel Awareness5.000.005, 5, 5
2228Graph MLP-Mixer5.000.005, 5, 5, 5
2229Fine-grained Few-shot Recognition by Deep Object Parsing5.001.226, 3, 5, 6
2230Learning to Solve Constraint Satisfaction Problems with Recurrent Transformers5.002.123, 3, 8, 6
2231Learning Fast and Slow for Time Series Forecasting5.001.416, 3, 6
2232Holistic Adversarially Robust Pruning5.001.225, 6, 3, 6
2233Text-Guided Diffusion Image Style Transfer with Contrastive Loss Fine-tuning5.000.005, 5, 5
2234Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling5.000.005, 5, 5
2235Prescribed Safety Performance Imitation Learning from A Single Expert Dataset5.000.005, 5, 5, 5
2236Modality Complementariness: Towards Understanding Multi-modal Robustness5.002.126, 3, 3, 8
2237No-regret Learning in Repeated First-Price Auctions with Budget Constraints5.001.733, 5, 5, 6, 3, 8
2238Robustness of Unsupervised Representation Learning without Labels5.001.226, 3, 6, 5
2239Generative Spoken Language Model based on continuous word-sized audio tokens5.000.005, 5, 5, 5
2240Better with Less: Data-Active Pre-training of Graph Neural Networks5.002.123, 6, 8, 3
2241Generalization error bounds for Neural Networks with ReLU activation5.000.005, 5, 5, 5
2242Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL5.001.413, 6, 6
2243Graphics Capsule: Learning hierarchical 3D representations from 2D images and its application on human faces5.001.005, 6, 5, 6, 5, 3
2244Group-wise Verifiable Distributed Computing for Machine Learning under Adversarial Attacks5.002.126, 3, 8, 3
2245Uncertainty-oriented Order Learning for Facial Beauty Prediction5.001.223, 5, 6, 6
2246Revisiting Uncertainty Estimation for Node Classification: New Benchmark and Insights5.000.005, 5, 5
2247SoTeacher: Toward Student-oriented Teacher Network Training for Knowledge Distillation5.001.225, 6, 6, 3
2248GuardHFL: Privacy Guardian for Heterogeneous Federated Learning5.001.413, 6, 6
2249Unsupervised 3d object learning through neuron activity aware plasticity5.001.416, 3, 6
2250Unsupervised Learning of Structured Representations via Closed-Loop Transcription5.001.226, 6, 3, 5
2251DETRDistill: A Simple Knowledge Distillation Framework for DETR-Families5.001.226, 3, 6, 5
2252Multi-Layered 3D Garments Animation5.000.005, 5, 5
2253When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning5.001.226, 6, 5, 3
2254Efficient debiasing with contrastive weight pruning5.000.005, 5, 5
2255Global Nash Equilibrium in a Class of Nonconvex N-player Games5.000.005, 5, 5, 5
2256Task-Agnostic Online Meta-Learning in Non-stationary Environments5.001.105, 5, 3, 6, 6
2257Task Ambiguity in Humans and Language Models5.001.416, 3, 6
2258Tensor Decompositions For Temporal Knowledge Graph Completion with Time Perspective5.000.005, 5, 5
2259Restoration based Generative Models5.001.226, 5, 3, 6
2260GAPS: Few-Shot Incremental Semantic Segmentation via Guided Copy-Paste Synthesis5.000.005, 5, 5
2261The Eigenlearning Framework: A Conservation Law Perspective on Kernel Ridge Regression and Wide Neural Networks5.001.226, 6, 5, 3
2262Generative Gradual Domain Adaptation with Optimal Transport5.001.226, 3, 5, 6
2263Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery5.000.005, 5, 5
2264Precautionary Unfairness in Self-Supervised Contrastive Pre-training5.000.005, 5, 5, 5
2265VEHICLE-INFRASTRUCTURE COOPERATIVE 3D DETECTION VIA FEATURE FLOW PREDICTION5.001.223, 6, 5, 6
2266Mesh-Independent Operator Learning for PDEs using Set Representations5.000.005, 5, 5
2267FlexRound: Learnable Rounding by Element-wise Division for Post-Training Quantization5.000.005, 5, 5, 5
2268LA-BALD: An Information-Theoretic Image Labeling Task Sampler5.001.226, 3, 5, 6
2269Anchor Sampling for Federated Learning with Partial Client Participation5.001.416, 3, 6
2270What do Vision Transformers Learn? A Visual Exploration5.000.005, 5, 5, 5
2271Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency5.001.223, 6, 5, 6
2272An efficient encoder-decoder architecture with top-down attention for speech separation5.001.413, 6, 6
2273Rethinking Identity in Knowledge Graph Embedding5.001.226, 6, 5, 3
2274Energy-based Predictive Representation for Reinforcement Learning5.002.123, 6, 8, 3
2275Exclusive Supermask Subnetwork Training for Continual Learning5.001.223, 6, 6, 5
2276Dual personalization for federated recommendation on devices5.001.226, 3, 6, 5
2277Time-Transformer AAE: Connecting Temporal Convolutional Networks and Transformer for Time Series Generation5.001.223, 5, 6, 6
2278Autoencoding Hyperbolic Representation for Adversarial Generation5.001.416, 6, 3
2279RLSBench: A Large-Scale Empirical Study of Domain Adaptation Under Relaxed Label Shift5.001.226, 5, 6, 3
2280Deep Bayesian Active Learning for Accelerating Stochastic Simulation5.001.413, 6, 6
2281On $mathcal{O}(1/K)$ Convergence and Low Sample Complexity for Single-Timescale Policy Evaluation with Nonlinear Function Approximation5.001.226, 3, 5, 6
2282Generating Features with Increased Crop-Related Diversity for Few-shot Object Detection5.001.226, 6, 3, 5
2283A Theoretical Understanding of Vision Transformers: Learning, Generalization, and Sample Complexity5.001.223, 5, 6, 6
2284Skill-Based Reinforcement Learning with Intrinsic Reward Matching5.001.223, 6, 6, 5
2285Assessing Neural Network Robustness via Adversarial Pivotal Tuning of Real Images5.000.005, 5, 5
2286Actionable Recourse Guided by User Preference5.001.413, 6, 6
2287Lipschitz regularized gradient flows and latent generative particles5.001.226, 3, 5, 6
2288Constraining Representations Yields Models That Know What They Don't Know5.001.416, 3, 6
2289Learning Controllable Adaptive Simulation for Multi-scale Physics5.001.223, 5, 6, 6
2290Posthoc Privacy guarantees for neural network queries5.001.416, 3, 6
2291Discretization Invariant Learning on Neural Fields5.001.226, 3, 5, 6
2292Global Counterfactual Explanations Are Reliable Or Efficient, But Not Both5.002.285, 1, 8, 6, 5
2293Agnostic Learning of General ReLU Activation Using Gradient Descent5.001.413, 6, 6
2294SlenderGNN: Accurate, Robust, and Interpretable GNN, and the Reasons for its Success5.001.223, 5, 6, 6
2295Noise$^+$2Noise: Co-taught De-noising Autoencoders for Time-Series Data5.001.226, 6, 5, 3
2296Neural Constraint Inference: Inferring Energy Constraints in Interacting Systems5.001.226, 3, 6, 5
2297Cortically motivated recurrence enables task extrapolation5.001.226, 5, 3, 6
2298Countering the Attack-Defense Complexity Gap for Robust Classifiers5.001.416, 6, 3
2299Tier Balancing: Towards Dynamic Fairness over Underlying Causal Factors5.001.226, 6, 3, 5
2300Peaks2Image: Reconstructing fMRI Statistical Maps from Peaks5.000.005, 5, 5
2301ContraSim -- A Similarity Measure Based on Contrastive Learning5.002.128, 6, 3, 3
2302Prefix Conditioning Unifies Language and Label Supervision5.001.226, 3, 5, 6
2303Discovering Latent Knowledge in Language Models Without Supervision5.001.225, 6, 3, 6
2304Learning Intuitive Policies Using Action Features5.001.416, 3, 6
2305Private Data Stream Analysis for Universal Symmetric Norm Estimation5.002.123, 8, 6, 3
2306Leveraging Incompatibility to Defend Against Backdoor Poisoning5.001.226, 5, 3, 6
2307Scaling Laws for a Multi-Agent Reinforcement Learning Model5.001.226, 6, 3, 5
2308Federated Learning with Openset Noisy Labels5.000.005, 5, 5, 5
2309Bi-Stride Multi-Scale Graph Neural Network for Mesh-Based Physical Simulation5.001.226, 3, 6, 5
2310Offline Policy Comparison with Confidence: Benchmarks and Baselines5.001.226, 6, 5, 3
2311Asymmetric Certified Robustness via Feature-Convex Neural Networks5.001.226, 3, 6, 5
2312Learning Efficient Models From Few Labels By Distillation From Multiple Tasks5.000.005, 5, 5
2313Do Perceptually Aligned Gradients Imply Robustness?5.001.106, 5, 3, 5, 6
2314Hard-Meta-Dataset++: Towards Understanding Few-Shot Performance on Difficult Tasks5.001.223, 6, 6, 5
2315Sharper Analysis of Sparsely Activated Wide Neural Networks with Trainable Biases5.001.223, 5, 6, 6
2316Generalization Properties of Retrieval-based Models5.001.226, 3, 6, 5
2317Semi-Variance Reduction for Fair Federated Learning5.001.226, 5, 6, 3
2318Siamese DETR5.000.005, 5, 5, 5
2319How Predictors Affect Search Strategies in Neural Architecture Search?5.000.005, 5, 5, 5
2320Incomplete to complete multiphysics forecasting - a hybrid approach for learning unknown phenomena5.002.123, 6, 8, 3
2321Gradient-based optimization is not necessary for generalization in neural networks5.001.416, 3, 6
2322Mitigating Memorization of Noisy Labels via Regularization between Representations5.001.906, 3, 3, 8, 5
2323Temporal Coherent Test Time Optimization for Robust Video Classification5.001.416, 3, 6
2324Non-parametric Outlier Synthesis5.001.413, 6, 6
2325Population-Based Reinforcement Learning for Combinatorial Optimization Problems5.000.005, 5, 5
2326Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations5.001.226, 6, 3, 5
2327Data Pricing Mechanism Based on Property Rights Compensation Distribution5.000.005, 5, 5
2328Traversing Between Modes in Function Space for Fast Ensembling5.000.005, 5, 5, 5
2329Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning5.000.005, 5, 5, 5
2330When are smooth-ReLUs ReLU-like?5.000.005, 5, 5
2331Learning to mine approximate network motifs5.000.005, 5, 5, 5
2332Accelerating Guided Diffusion Sampling with Splitting Numerical Methods5.001.225, 6, 3, 6
2333oViT: An Accurate Second-Order Pruning Framework for Vision Transformers5.000.005, 5, 5
2334TOAST: Topological Algorithm for Singularity Tracking5.001.416, 6, 3
2335Simple and Scalable Nearest Neighbor Machine Translation5.001.225, 6, 3, 6
2336Topic and Hyperbolic Transformer to Handle Multi-modal Dependencies5.000.005, 5, 5
2337Name Your Colour For the Task: Artificially Discover Colour Naming via Colour Quantisation Transformer5.001.223, 5, 6, 6
2338Symmetrical SyncMap for Imbalanced General Chunking Problems5.000.005, 5, 5, 5
2339Optimising Event-Driven Spiking Neural Network with Regularisation and Cutoff5.001.105, 6, 5, 6, 3
2340How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?5.001.226, 5, 3, 6
2341On the Expressive Equivalence Between Graph Convolution and Attention Models5.003.088, 3, 8, 1
2342Continual Learning via Adaptive Neuron Selection5.002.123, 3, 6, 8
2343Exact Group Fairness Regularization via Classwise Robust Optimization5.001.225, 6, 6, 3
2344Pairwise Confidence Difference on Unlabeled Data is Sufficient for Binary Classification5.001.416, 6, 3
2345Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning5.001.223, 6, 6, 5
2346Variance Reduction is an Antidote to Byzantines: Better Rates, Weaker Assumptions and Communication Compression as a Cherry on the Top5.002.285, 1, 5, 6, 8
2347Open Set Recognition by Mitigating Prompt Bias5.001.226, 6, 5, 3
2348Momentum Tracking: Momentum Acceleration for Decentralized Deep Learning on Heterogeneous Data5.000.005, 5, 5, 5
2349Deep Graph-Level Orthogonal Hypersphere Compression for Anomaly Detection5.001.226, 6, 3, 5
2350Gradient Deconfliction via Orthogonal Projections onto Subspaces For Multi-task Learning5.001.106, 3, 5, 5, 6
2351On the Importance of the Policy Structure in Offline Reinforcement Learning5.001.226, 3, 6, 5
2352Exact manifold Gaussian Variational Bayes5.001.226, 3, 6, 5
2353LMSeg: Language-guided Multi-dataset Segmentation5.001.416, 3, 6
2354In Search of Smooth Minima for Purifying Backdoor in Deep Neural Networks5.000.005, 5, 5
2355Deep Learning-based Source Code Complexity Prediction5.001.226, 5, 6, 3
2356Improving Explanation Reliability through Group Attribution5.001.226, 3, 6, 5
2357Finite-time Analysis of Single-timescale Actor-Critic on Linear Quadratic Regulator5.001.416, 6, 3
2358Towards Boosting the Open-Domain Chatbot with Human Feedback5.001.103, 5, 6, 5, 6
2359SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication5.001.225, 6, 3, 6
2360Multiscale Multimodal Transformer for Multimodal Action Recognition5.000.005, 5, 5
23613EF: Class-Incremental Learning via Efficient Energy-Based Expansion and Fusion5.001.106, 5, 3, 5, 6
2362Important Channel Tuning5.001.225, 3, 6, 6
2363Rethinking the Structure of Stochastic Gradients: Empirical and Statistical Evidence5.000.005, 5, 5
2364Clustering-Assisted Foreground and Background Separation for Weakly-supervised Temporal Action Localization5.001.223, 6, 6, 5
2365Offline Reinforcement Learning with Differential Privacy5.001.416, 6, 3
2366Policy Architectures for Compositional Generalization in Control5.002.123, 8, 6, 3
2367Lower Bounds for Differentially Private ERM: Unconstrained and Non-Euclidean5.000.005, 5, 5
2368Explainable Recommender with Geometric Information Bottleneck5.000.005, 5, 5
2369In-Context Policy Iteration5.001.226, 5, 3, 6
2370Learning Control Policies for Region Stabilization in Stochastic Systems5.000.005, 5, 5, 5
2371Semantic Video Synthesis from Video Scene Graphs5.001.223, 6, 5, 6
2372Convolutions are competitive with transformers for protein sequence pretraining5.001.416, 3, 6
2373Learning differentiable solvers for systems with hard constraints5.002.128, 3, 3, 6
2374CEPD: Co-Exploring Pruning and Decomposition for Compact DNN Models5.000.005, 5, 5, 5, 5
2375Causal discovery from conditionally stationary time series5.001.225, 3, 6, 6
2376Spatio-temporal Self-Attention for Egocentric 3D Pose Estimation5.001.416, 3, 6
2377RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation5.000.005, 5, 5
2378Multi-Agent Policy Transfer via Task Relationship Modeling5.001.225, 6, 3, 6
2379Distributionally Robust Post-hoc Classifiers under Prior Shifts5.001.416, 6, 3
2380Cross-Quality Few-Shot Transfer for Alloy Yield Strength Prediction: A New Material Science Benchmark and An Integrated Optimization Framework5.001.413, 6, 6
2381LEARNING THE SPECTROGRAM TEMPORAL RESOLUTION FOR AUDIO CLASSIFICATION5.001.413, 6, 6
2382Inducing Gaussian Process Networks5.000.005, 5, 5
2383DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images5.002.123, 3, 6, 8
2384Take One Gram of Neural Features, Get Enhanced Group Robustness5.001.223, 6, 6, 5
2385FaceMAE: Privacy-Preserving Face Recognition via Masked Autoencoders5.001.105, 6, 5, 3, 6
2386What can be learnt with wide convolutional neural networks?5.001.416, 6, 3
2387Logit Clipping for Robust Learning against Label Noise5.002.123, 8, 6, 3
2388FedCL: Critical Learning Periods-aware Adaptive Client Selection in Federated Learning5.000.005, 5, 5, 5
2389Decentralized Online Bandit Optimization on Directed Graphs with Regret Bounds5.002.123, 3, 8, 6
2390BED: Boundary-Enhanced Decoder for Chinese Word Segmentation5.000.005, 5, 5, 5
2391SYNC: SAFETY-AWARE NEURAL CONTROL FOR STABILIZING STOCHASTIC DELAY-DIFFERENTIAL EQUATIONS5.000.005, 5, 5
2392Reinforcement learning for instance segmentation with high-level priors5.000.005, 5, 5
2393Mutual Information-guided Knowledge Transfer for Open-World Semi-Supervised Learning5.001.226, 3, 6, 5
2394DIMENSION-REDUCED ADAPTIVE GRADIENT METHOD5.000.005, 5, 5, 5
2395Online Policy Optimization for Robust MDP5.001.223, 6, 5, 6
2396Revisiting Feature Acquisition Bias for Few-Shot Fine-Grained Image Classification5.001.223, 6, 5, 6
2397Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias5.001.225, 6, 6, 3
2398Generalization bounds and algorithms for estimating the effect of multiple treatments and dosage5.000.005, 5, 5, 5
2399On the optimal precision of GANs5.001.103, 5, 5, 6, 6
2400Prompt Generation Networks for Efficient Adaptation of Frozen Vision Transformers5.000.005, 5, 5, 5
2401How Normalization and Weight Decay Can Affect SGD? Insights from a Simple Normalized Model5.000.005, 5, 5, 5
2402DCAPS: Dual Cross-Attention Coupled with Stabilizer for Few-Shot Common Action Localization5.001.226, 6, 3, 5
2403Adapting Pre-trained Language Models for Quantum Natural Language Processing5.000.005, 5, 5
2404CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving5.002.128, 3, 6, 3
2405PathFusion: Path-consistent Lidar-Camera Deep Feature Fusion5.000.005, 5, 5
2406Less is More: Identifying the Cherry on the Cake for Dynamic Networks5.000.005, 5, 5, 5
2407HRBP: Hardware-friendly Regrouping towards Block-wise Pruning for Sparse Training5.000.005, 5, 5, 5
2408HIVE: HIerarchical Volume Encoding for Neural Implicit Surface Reconstruction5.001.226, 3, 5, 6
2409Improving Adversarial Transferability with Worst-case Aware Attacks5.000.005, 5, 5, 5
2410Curved Representation Space of Vision Transformers5.001.225, 6, 6, 3
2411Self-Architectural Knowledge Distillation for Spiking Neural Networks5.000.005, 5, 5, 5
2412Federated Semi-supervised Learning with Dual Regulator5.001.413, 6, 6
2413Cross-modal Graph Contrastive Learning with Cellular Images5.002.123, 3, 8, 6
2414ContraGen: Effective Contrastive Learning For Causal Language Model5.001.225, 3, 6, 6
2415Beyond Single Path Integrated Gradients for Reliable Input Attribution via Randomized Path Sampling5.001.225, 3, 6, 6
2416Decoupled and Patch-based Contrastive Learning for Long-tailed Visual Recognition5.001.106, 5, 6, 5, 3
2417The Geometry of Self-supervised Learning Models and its Impact on Transfer Learning5.001.413, 6, 6
2418Rethink Depth Separation with Intra-layer Links5.001.225, 6, 3, 6
2419Unsupervised Model Selection for Time Series Anomaly Detection5.001.225, 3, 6, 6
2420Deep Active Anomaly Detection With Diverse Queries5.001.416, 3, 6
2421Augmentation Backdoors5.000.005, 5, 5
2422Compact Bilinear Pooling via General Bilinear Projection5.001.416, 3, 6
2423Stochastic Gradient Methods with Preconditioned Updates5.000.005, 5, 5
2424Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts5.001.413, 6, 6
2425Neural Decoding of Visual Imagery via Hierarchical Variational Autoencoders5.003.393, 6, 1, 10
2426Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders5.001.225, 6, 3, 6
2427Revisiting Domain Randomization Via Relaxed State-Adversarial Policy Optimization5.001.226, 6, 3, 5
2428Consistent Targets Provide Better Supervision in Semi-supervised Object Detection5.001.226, 5, 6, 3
2429Multi-Agent Sequential Decision-Making via Communication5.001.226, 6, 3, 5
2430EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion5.000.005, 5, 5
2431Single-level Adversarial Data Synthesis based on Neural Tangent Kernels5.002.123, 3, 8, 6
2432Unified Algorithms for RL with Decision-Estimation Coefficients: No-Regret, PAC, and Reward-Free Learning5.000.005, 5, 5
2433Evaluating Fairness Without Sensitive Attributes: A Framework Using Only Auxiliary Models5.001.226, 6, 5, 3
2434Parallel Deep Neural Networks Have Zero Duality Gap5.002.123, 8, 6, 3
2435Causal RL Agents for Out-of-distribution Generalization5.001.416, 6, 3
2436Make Memory Buffer Stronger in Continual Learning: A Continuous Neural Transformation Approach5.000.005, 5, 5, 5
2437Initial Value Problem Enhanced Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks5.001.416, 6, 3
2438Global Context Vision Transformers5.001.225, 6, 3, 6
2439Highway Reinforcement Learning5.001.226, 3, 6, 5
2440Rememory-Based SimSiam for Unsupervised Continual Learning5.001.226, 3, 5, 6
2441Pruning with Output Error Minimization for Producing Efficient Neural Networks5.000.005, 5, 5, 5
2442DREAM: Domain-free Reverse Engineering Attributes of Black-box Model5.001.226, 6, 3, 5
2443Approximate Vanishing Ideal Computations at Scale5.001.416, 6, 3
2444Exploiting Spatial Separability for Deep Learning Multichannel Speech Enhancement with an Align-and-Filter Network5.001.226, 5, 3, 6
2445CausalAgents: A Robustness Benchmark for Motion Forecasting Using Causal Relationships5.001.105, 3, 6, 5, 6
2446Critic Sequential Monte Carlo5.001.226, 5, 3, 6
2447Learning to Take a Break: Sustainable Optimization of Long-Term User Engagement5.001.416, 6, 3
2448Laziness, Barren Plateau, and Noises in Machine Learning5.001.226, 6, 3, 5
2449Towards Online Real-Time Memory-based Video Inpainting Transformers5.001.223, 6, 6, 5
2450Gated Class-Attention with Cascaded Feature Drift Compensation for Exemplar-free Continual Learning of Vision Transformers5.001.225, 6, 6, 3
2451Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training5.001.416, 3, 6
2452TPC-NAS: Sub-Five-Minute Neural Architecture Search for Image Classification, Object-Detection, and Super-Resolution5.000.005, 5, 5, 5
2453Mutual Information Regularized Offline Reinforcement Learning5.001.223, 5, 6, 6
2454Visual Timing For Sound Source Depth Estimation in the Wild5.001.226, 3, 6, 5
2455Subclass-balancing Contrastive Learning for Long-tailed Recognition5.001.226, 5, 3, 6
2456Learning Robust Goal Space with Hypothetical Analogy-Making5.001.226, 6, 3, 5
2457Learning Disentanglement in Autoencoders through Euler Encoding5.001.223, 6, 5, 6
2458$mathrm{R}^2$-VOS: Robust Referring Video Object Segmentation via Relational Cycle Consistency5.000.005, 5, 5, 5
2459Lossless Filter Pruning via Adaptive Clustering for Convolutional Neural Networks5.000.005, 5, 5, 5
2460Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors5.001.105, 5, 6, 6, 3
2461Denoising Masked Autoencoders are Certifiable Robust Vision Learners5.002.126, 8, 3, 3
2462Neural Prompt Search5.000.005, 5, 5, 5
2463Few-Shot Transferable Robust Representation Learning via Bilevel Attacks5.001.225, 6, 3, 6
2464Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation5.000.005, 5, 5
2465Learning to Linearize Deep Neural Networks for Secure and Efficient Private Inference5.001.416, 6, 3
2466TempCLR: Temporal Alignment Representation with Contrastive Learning5.001.223, 5, 6, 6
2467The Power of Regularization in Solving Extensive-Form Games5.000.005, 5, 5, 5
2468Neural Topic Modeling with Embedding Clustering Regularization5.001.223, 5, 6, 6
2469MLPInit: Embarrassingly Simple GNN Training Acceleration with MLP Initialization5.002.128, 6, 3, 3
2470Promoting Semantic Connectivity: Dual Nearest Neighbors Contrastive Learning for Unsupervised Domain Generalization5.001.223, 5, 6, 6
2471Towards Equivariant Graph Contrastive Learning via Cross-Graph Augmentation5.002.123, 8, 6, 3
2472One Ring to Bring Them All: Model Adaptation under Domain and Category Shift5.001.413, 6, 6
2473On the Importance of Architectures and Hyperparameters for Fairness in Face Recognition5.000.005, 5, 5, 5
2474Evaluating natural language processing models with generalization metrics that do not need access to any training or testing data5.001.226, 6, 5, 3
2475The Effects of Nonlinearity on Approximation Capacity of Recurrent Neural Networks5.002.555, 8, 1, 6
2476Curiosity-Driven Unsupervised Data Collection for Offline Reinforcement Learning5.001.226, 5, 6, 3
2477Understanding and Bridging the Modality Gap for Speech Translation5.001.223, 6, 6, 5
2478MIA: A Framework for Certified Robustness of Time-Series Classification and Forecasting Against Temporally-Localized Perturbations5.000.005, 5, 5
2479Spike Calibration: Bridging the Gap between ANNs and SNNs in ANN-SNN Conversion5.002.555, 6, 8, 1
2480Split and Merge Proxy: pre-training protein-protein contact prediction by mining rich information from monomer data5.001.226, 5, 6, 3
2481Adversarial Counterfactual Environment Model Learning5.001.413, 6, 6
2482PointDP: Diffusion-driven Purification against 3D Adversarial Point Clouds5.001.223, 5, 6, 6
2483DeSCo: Towards Scalable Deep Subgraph Counting5.001.413, 6, 6
2484Supervised Contrastive Regression5.001.226, 5, 6, 3
2485Provable Benefits of Representational Transfer in Reinforcement Learning5.001.416, 3, 6
2486Set Discrimination Contrastive Learning5.000.005, 5, 5, 5
2487A Class-Aware Representation Refinement Framework for Graph Classification5.000.005, 5, 5, 5
2488An information-theoretic approach to unsupervised keypoint representation learning5.001.226, 5, 3, 6
2489A simple but effective and efficient global modeling paradigm for image restoration5.002.126, 8, 3, 3
2490ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation5.001.226, 6, 3, 5
2491A Close Look at Token Mixer: From Attention to Convolution5.000.005, 5, 5
2492MiSAL: Active Learning for Every Budget5.002.128, 3, 6, 3
2493SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series5.001.413, 6, 6
2494CLIP-FLOW: CONTRASTIVE LEARNING WITH ITERATIVE PSEUDO LABELING FOR OPTICAL FLOW5.000.005, 5, 5
2495Bidirectional Learning for Offline Model-based Biological Sequence Design5.000.005, 5, 5
2496AQUILA: Communication Efficient Federated Learning with Adaptive Quantization of Lazily-Aggregated Gradients5.001.223, 6, 6, 5
2497Multi-User Reinforcement Learning with Low Rank Rewards5.001.103, 5, 5, 6, 6
2498Bayesian Robust Graph Contrastive Learning5.000.005, 5, 5, 5
2499SoundNeRirF: Receiver-to-Receiver Sound Neural Room Impulse Response Field5.001.416, 6, 3
2500Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization5.001.226, 6, 5, 3
2501Sparse Misinformation Detector5.000.005, 5, 5
2502Trainability Preserving Neural Pruning5.001.226, 3, 5, 6
2503Harnessing Out-Of-Distribution Examples via Augmenting Content and Style5.001.225, 6, 3, 6
2504A Unified Framework of Soft Threshold Pruning5.001.416, 6, 3
2505STViT: Semantic Tokens for Efficient Global and Local Vision Transformers5.001.223, 6, 5, 6
2506Expanding Datasets With Guided Imagination5.002.123, 6, 8, 3
2507Communication Efficient Fair Federated Recommender System5.001.225, 3, 6, 6
2508Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment5.000.005, 5, 5
2509Multi-Domain Long-Tailed Learning by Augmenting Disentangled Representations5.001.226, 5, 6, 3
2510Mesh-free Eulerian Physics-Informed Neural Networks4.831.346, 3, 6, 3, 6, 5
2511Show and Write: Entity-aware Article Generation with Image Information4.831.343, 6, 6, 3, 6, 5
2512Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression4.831.675, 8, 3, 5, 3, 5
2513Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance4.831.343, 6, 3, 5, 6, 6
2514Implicit Neural Spatial Representations for Time-dependent PDEs4.831.346, 5, 6, 3, 6, 3
2515Benchmarking and Improving Robustness of 3D Point Cloud Recognition against Common Corruptions4.831.675, 5, 8, 5, 3, 3
2516Adaptive IMLE for Few-shot Image Synthesis4.801.476, 6, 3, 3, 6
2517Curriculum-inspired Training for Selective Neural Networks4.800.986, 5, 5, 5, 3
2518Actor-Critic Alignment for Offline-to-Online Reinforcement Learning4.800.985, 5, 3, 5, 6
2519Learning Deep Operator Networks: The Benefits of Over-Parameterization4.801.833, 3, 5, 5, 8
2520A distinct unsupervised reference model from the environment helps continual learning4.800.985, 5, 6, 5, 3
2521Gradient Gating for Deep Multi-Rate Learning on Graphs4.800.985, 3, 5, 6, 5
2522Self-Supervised Extreme Compression of Gigapixel Images4.800.985, 5, 6, 3, 5
2523Evaluating Robustness of Cooperative MARL: A Model-based Approach4.800.983, 5, 5, 5, 6
2524Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations4.801.476, 6, 3, 3, 6
2525An alternative approach to train neural networks using monotone variational inequality4.800.986, 5, 5, 3, 5
2526Risk-aware Bayesian RL for Cautious Exploration4.802.713, 3, 10, 5, 3
2527Attention Enables Zero Approximation Error4.800.985, 5, 3, 6, 5
2528The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels4.800.985, 3, 6, 5, 5
2529Efficient Personalized Federated Learning via Sparse Model-Adaptation4.800.986, 3, 5, 5, 5
2530Deformable Graph Transformer4.800.986, 5, 5, 5, 3
2531Data-efficient Supervised Learning is Powerful for Neural Combinatorial Optimization4.800.983, 6, 5, 5, 5
2532Entropy-Regularized Model-Based Offline Reinforcement Learning4.800.986, 3, 5, 5, 5
2533KITE: A Kernel-based Improved Transferability Estimation Method4.800.985, 6, 5, 3, 5
2534FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training4.800.985, 5, 5, 3, 6
2535Sensitivity-aware Visual Parameter-efficient Tuning4.800.985, 5, 6, 3, 5
2536Variational Imbalanced Regression4.801.945, 6, 6, 6, 1
2537MotifExplainer: a Motif-based Graph Neural Network Explainer4.800.985, 5, 3, 5, 6
2538QCRS: Improve Randomized Smoothing using Quasi-Concave Optimization4.800.985, 6, 3, 5, 5
2539Self-attentive Rationalization for Graph Contrastive Learning4.800.985, 6, 3, 5, 5
2540NeuralStagger: accelerating physics constrained neural PDE solver with spatial-temporal decomposition4.751.096, 5, 3, 5
2541Latent Linear ODEs with Neural Kalman Filtering for Irregular Time Series Forecasting4.751.095, 3, 5, 6
2542Learning with Non-Uniform Label Noise: A Cluster-Dependent Semi-Supervised Approach4.751.095, 6, 3, 5
2543SWRM: Similarity Window Reweighting and Margins for Long-Tailed Recognition4.751.095, 6, 5, 3
2544Supervised Q-Learning can be a Strong Baseline for Continuous Control4.751.095, 6, 3, 5
2545Self-Supervised Off-Policy Ranking via Crowd Layer4.751.096, 3, 5, 5
2546Incremental Predictive Coding: A Parallel and Fully Automatic Learning Algorithm4.751.093, 5, 6, 5
2547When and Why Is Pretraining Object-Centric Representations Good for Reinforcement Learning?4.751.093, 6, 5, 5
2548Contrastive Representation Learning for Multi-scale Spatial Scenes4.752.498, 5, 5, 1
2549Exploiting Personalized Invariance for Better Out-of-distribution Generalization in Federated Learning4.751.096, 5, 5, 3
2550Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management4.751.095, 3, 6, 5
2551Adaptive Computation with Elastic Input Sequence4.751.093, 6, 5, 5
2552Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?4.751.095, 3, 6, 5
2553Contrastive Learning of Molecular Representation with Fragmented Views4.752.055, 3, 3, 8
2554Contextualized Generative Retrieval4.751.093, 5, 6, 5
2555Discrete State-Action Abstraction via the Successor Representation4.752.053, 8, 3, 5
2556MiDAS: Multi-integrated Domain Adaptive Supervision for Fake News Detection4.751.093, 5, 6, 5
2557Walking the Tightrope: An Investigation of the Convolutional Autoencoder Bottleneck4.751.095, 5, 6, 3
2558The Role of Pre-training Data in Transfer Learning4.751.095, 5, 6, 3
2559Limits of Algorithmic Stability for Distributional Generalization4.752.053, 5, 8, 3
2560VQR: Automated Software Vulnerability Repair Through Vulnerability Queries4.751.095, 6, 5, 3
2561Fully Online Meta Learning4.752.498, 5, 1, 5
2562What Do We Maximize in Self-Supervised Learning And Why Does Generalization Emerge?4.751.096, 3, 5, 5
2563Sufficient Subgraph Embedding Memory for Continual Graph Representation Learning4.752.053, 8, 5, 3
2564Iterative Task-adaptive Pretraining for Unsupervised Word Alignment4.751.093, 5, 6, 5
2565Pretraining One Language Model for All With the Text-To-Text Framework Using Model-Generated Signals4.751.093, 6, 5, 5
2566TOWARD RELIABLE NEURAL SPECIFICATIONS4.752.053, 5, 8, 3
2567Pyramidal Denoising Diffusion Probabilistic Models4.751.093, 6, 5, 5
2568Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning4.751.095, 5, 6, 3
2569An Analytic Framework for Robust Training of Differentiable Hypothesis4.751.095, 6, 5, 3
2570Supervised Metric Learning for Retrieval via Contextual Similarity Optimization4.752.053, 8, 5, 3
2571How Can Deep Learning Performs Deep (Hierarchical) Learning4.751.093, 6, 5, 5
2572Sequential Brick Assembly with Efficient Constraint Satisfaction4.751.093, 5, 5, 6
2573Augmentation Curriculum Learning For Generalization in RL4.751.095, 6, 5, 3
2574Using the Training History to Detect and Prevent Overfitting in Deep Learning Models4.751.095, 5, 6, 3
2575CORE-PERIPHERY PRINCIPLE GUIDED REDESIGN OF SELF-ATTENTION IN TRANSFORMERS4.751.093, 5, 6, 5
2576How Hard is Trojan Detection in DNNs? Fooling Detectors With Evasive Trojans4.751.093, 5, 6, 5
2577Less Is More: Training on Low-Fidelity Images Improves Robustness to Adversarial Attacks4.751.093, 5, 5, 6
2578A Differentiable Loss Function for Learning Heuristics in A*4.752.058, 3, 3, 5
2579AsymQ: Asymmetric Q-loss to mitigate overestimation bias in off-policy reinforcement learning4.752.055, 3, 8, 3
2580Balance is Essence: Accelerating Sparse Training via Adaptive Gradient Correction4.751.095, 3, 6, 5
2581TEXTCRAFT: ZERO-SHOT GENERATION OF HIGH FIDELITY AND DIVERSE SHAPES FROM TEXT4.751.095, 5, 3, 6
2582Transformer-based World Models Are Happy With 100k Interactions4.752.058, 3, 3, 5
2583Robust Federated Learning with Majority Adversaries via Projection-based Re-weighting4.751.095, 5, 6, 3
2584Resource Efficient Self-Supervised Learning for Speech Recognition4.751.096, 5, 5, 3
2585HyperTime: Implicit Neural Representations for Time Series Generation4.751.095, 6, 5, 3
2586Unsupervised Pretraining for Neural Value Approximation4.752.055, 3, 8, 3
2587MALIBO: Meta-Learning for Likelihood-free Bayesian Optimization4.751.095, 5, 3, 6
2588Asynchronous Message Passing: A new Framework for Learning in Graphs4.751.095, 3, 6, 5
2589From Adaptive Query Release to Machine Unlearning4.751.096, 3, 5, 5
2590Meta-Learning Black-Box Optimization via Black-Box Optimization4.751.095, 5, 6, 3
2591Optimal Membership Inference Bounds for Adaptive Composition of Sampled Gaussian Mechanisms4.752.058, 5, 3, 3
2592SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling4.751.095, 5, 6, 3
2593Data Feedback Loops: Model-driven Amplification of Dataset Biases4.751.093, 6, 5, 5
2594A Large Scale Sample Complexity Analysis of Neural Policies in the Low-Data Regime4.752.058, 3, 3, 5
2595Action Matching: A Variational Method for Learning Stochastic Dynamics from Samples4.751.095, 5, 6, 3
2596An Empirical Study on the Efficacy of Deep Active Learning Techniques4.751.096, 5, 3, 5
2597EF21-P and Friends: Improved Theoretical Communication Complexity for Distributed Optimization with Bidirectional Compression4.752.491, 8, 5, 5
2598Unleash Model Capacity for Universal Dense Retrieval by Task Specialty Optimization4.751.095, 5, 3, 6
2599Key Design Choices for Double-transfer in Source-free Unsupervised Domain Adaptation4.751.096, 5, 3, 5
2600$Phi$-DVAE: Learning Physically Interpretable Representations with Nonlinear Filtering4.751.096, 5, 3, 5
2601Rethinking Uniformity in Self-Supervised Representation Learning4.751.095, 6, 5, 3
2602Self-Supervised Learning of Maximum Manifold Capacity Representations4.751.095, 3, 6, 5
2603PMI-guided Masking Strategy to Enable Few-shot Learning for Genomic Applications4.752.055, 3, 8, 3
2604Fast Bayesian Updates for Deep Learning with a Use Case in Active Learning4.751.095, 5, 6, 3
2605FP_AINet: Fusion Prototype with Adaptive Induction Network for Few-Shot Learning4.751.093, 6, 5, 5
2606DCT-DiffStride: Differentiable Strides with Real-Valued Data4.751.095, 6, 5, 3
2607Removing Structured Noise with Diffusion Models4.752.053, 8, 3, 5
2608Closed-loop Transcription via Convolutional Sparse Coding4.751.095, 5, 6, 3
2609MC-SSL: Towards Multi-Concept Self-Supervised Learning4.751.093, 5, 6, 5
2610Latent Hierarchical Imitation Learning for Stochastic Environments4.752.058, 5, 3, 3
2611Efficient Discovery of Dynamical Laws in Symbolic Form4.752.058, 3, 5, 3
2612Human-AI Coordination via Human-Regularized Search and Learning4.752.058, 3, 3, 5
2613Inducing Meaningful Units from Character Sequences with Dynamic Capacity Slot Attention4.751.095, 5, 6, 3
2614CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations4.753.033, 10, 3, 3
2615Adaptive Smoothing Gradient Learning for Spiking Neural Networks4.752.058, 3, 3, 5
2616Going Beyond Approximation: Encoding Constraints for Explainable Multi-hop Inference via Differentiable Combinatorial Solvers4.751.095, 5, 3, 6
2617DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention4.751.093, 6, 5, 5
2618Client-agnostic Learning and Zero-shot Adaptation for Federated Domain Generalization4.751.095, 6, 5, 3
2619Prompt-Based Metric Learning for Few-Shot NER4.751.095, 6, 3, 5
2620MetaPhysiCa: Causality-aware Robustness to OOD Initial Conditions in Physics-informed Machine Learning4.751.095, 6, 5, 3
2621InteriorSim: A Photorealistic Simulator for Embodied AI4.751.095, 3, 5, 6
2622Spatial Entropy as an Inductive Bias for Vision Transformers4.751.095, 6, 5, 3
2623A Simple Framework for Low-Resolution Detection with High-resolution Knowledge4.751.093, 5, 6, 5
2624Zero-Label Prompt Selection4.751.095, 3, 5, 6
2625Adversarial Text to Continuous Image Generation4.751.095, 5, 6, 3
2626A GNN-Guided Predict-and-Search Framework for Mixed-Integer Linear Programming4.751.093, 6, 5, 5
2627A Weight Variation-Aware Training Method for Hardware Neuromorphic Chips4.751.096, 5, 5, 3
2628Hybrid-Regressive Neural Machine Translation4.751.093, 5, 6, 5
2629PET-NeuS: Positional Encoding Triplanes for Neural Surfaces4.752.055, 8, 3, 3
2630Effective Offline Reinforcement Learning via Conservative State Value Estimation4.752.058, 3, 5, 3
2631Visually-augmented pretrained language models for NLP Tasks without Images4.751.093, 5, 6, 5
2632Cold Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator4.751.095, 5, 3, 6
2633Out-of-Domain Intent Detection Considering Multi-turn Dialogue Contexts4.751.093, 5, 5, 6
2634$epsilon$-Invariant Hierarchical Reinforcement Learning for Building Generalizable Policy4.751.095, 5, 6, 3
2635CCIL: Context-conditioned imitation learning for urban driving4.751.095, 6, 5, 3
2636Conditional Policy Similarity: An Overlooked Factor in Zero-Shot Coordination4.751.093, 5, 5, 6
2637ECLAD: Extracting Concepts with Local Aggregated Descriptors4.751.095, 3, 5, 6
2638So-TVAE: Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting4.751.095, 3, 5, 6
2639SDAC: Efficient Safe Reinforcement Learning with Low-Biased Distributional Actor-Critic4.751.095, 3, 5, 6
2640Prompt Tuning for Graph Neural Networks4.752.058, 3, 5, 3
2641Neural Unbalanced Optimal Transport via Cycle-Consistent Semi-Couplings4.751.093, 5, 6, 5
2642Learning to Boost Resilience of Complex Networks via Neural Edge Rewiring4.752.058, 3, 5, 3
2643Adversarial Robustness based on Randomized Smoothing in Quantum Machine Learning4.751.093, 6, 5, 5
2644Linear Convergence of Decentralized FedAvg for Non-Convex Objectives: The Interpolation Regime4.751.095, 3, 5, 6
2645Rethinking Missing Modality Learning: From a Decoding View4.751.095, 3, 5, 6
2646Meta-Weighted Language Model Tuning for Augmentation-Enhanced Few-Shot Learning4.751.095, 5, 3, 6
2647Graph-informed Neural Point Process With Monotonic Nets4.751.095, 6, 3, 5
2648Learning to Decouple Complex System for Sequential Data4.752.058, 5, 3, 3
2649Efficient Large-scale Transformer Training via Random and Layerwise Token Dropping4.751.093, 5, 5, 6
2650Hypothetical Training for Robust Machine Reading Comprehension of Tabular Context4.752.055, 3, 3, 8
2651On the Efficacy of Server-Aided Federated Learning against Partial Client Participation4.751.095, 6, 5, 3
2652Toxicity in Multilingual Machine Translation at Scale4.752.058, 5, 3, 3
2653Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds4.751.093, 5, 5, 6
2654Adaptive Sparse Softmax: An Effective and Efficient Softmax Variant for Text Classification4.751.093, 5, 6, 5
2655Continuous Goal Sampling: A Simple Technique to Accelerate Automatic Curriculum Learning4.751.096, 3, 5, 5
2656Towards Better Selective Classification4.752.053, 3, 5, 8
2657Offline Equilibrium Finding4.751.095, 5, 6, 3
2658Brainformers: Trading Simplicity for Efficiency4.751.093, 6, 5, 5
2659Effective Self-Supervised Transformers For Sparse Time Series Data4.751.096, 5, 3, 5
2660Efficient Shapley Values Estimation by Amortization for Text Classification4.752.058, 3, 5, 3
2661Precision Collaboration for Federated Learning4.751.093, 5, 5, 6
2662Offline RL of the Underlying MDP from Heterogeneous Data Sources4.751.093, 5, 6, 5
2663On the Importance of Calibration in Semi-supervised Learning4.751.095, 5, 6, 3
2664Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs4.751.096, 3, 5, 5
2665Fast Adaptation via Human Diagnosis of Task Distribution Shift4.751.093, 5, 6, 5
2666Shortcut Learning Through the Lens of Early Training Dynamics4.752.171, 6, 6, 6
2667Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures4.752.058, 3, 3, 5
2668EmbedDistill: A geometric knowledge distillation for information retrieval4.751.095, 5, 3, 6
2669Learning from Labeled Images and Unlabeled Videos for Video Segmentation4.752.055, 8, 3, 3
2670REV: Information-Theoretic Evaluation of Free-Text Rationales4.751.095, 3, 5, 6
2671Uncertainty-Driven Exploration for Generalization in Reinforcement Learning4.751.093, 5, 6, 5
2672Adaptive Parametric Prototype Learning for Cross-Domain Few-Shot Classification4.751.093, 5, 5, 6
2673Building compact representations for image-language learning4.752.058, 3, 5, 3
2674HEAV: Hierarchical Ensembling of Augmented Views for Image Captioning4.751.093, 5, 5, 6
2675Dynamic Pretraining of Vision-Language Models4.751.095, 6, 3, 5
2676Epistemological Bias As a Means for the Automated Detection of Injustices in News Media4.752.053, 8, 3, 5
2677Have Missing Data? Make It Miss More! Imputing Tabular Data with Masked Autoencoding4.751.095, 5, 3, 6
2678Federated Self-supervised Learning for Heterogeneous Clients4.751.095, 6, 5, 3
2679Waveformer: Linear-Time Attention with Forward and Backward Wavelet Transform4.751.093, 6, 5, 5
2680Semantic Image Manipulation with Background-guided Internal Learning4.751.095, 5, 3, 6
2681Reconciling Security and Communication Efficiency in Federated Learning4.751.095, 5, 3, 6
2682Noise Injection Node Regularization for Robust Learning4.751.095, 3, 5, 6
2683Taming the Long Tail of Deep Probabilistic Forecasting4.751.095, 3, 6, 5
2684Risk Control for Online Learning Models4.752.053, 8, 5, 3
2685Perturbation Analysis of Neural Collapse4.751.095, 3, 6, 5
2686Leveraging the Third Dimension in Contrastive Learning4.751.096, 5, 5, 3
2687Learning Top-k Classification with Label Ranking4.751.095, 6, 5, 3
2688Language-Aware Soft Prompting for Vision & Language Foundation Models4.751.096, 5, 3, 5
2689Theoretical Characterization of How Neural Network Pruning Affects its Generalization4.751.096, 3, 5, 5
2690Collaborative Symmetricity Exploitation for Offline Learning of Hardware Design Solver4.751.096, 5, 3, 5
2691Policy Expansion for Bridging Offline-to-Online Reinforcement Learning4.751.095, 3, 6, 5
2692Prosody-TTS: Self-Supervised Prosody Pretraining with Latent Diffusion For Text-to-Speech4.751.095, 5, 3, 6
2693Confounder Identification-free Causal Visual Feature Learning4.752.491, 5, 5, 8
2694A Neural Mean Embedding Approach for Back-door and Front-door Adjustment4.752.491, 5, 5, 8
2695Multi-View Independent Component Analysis with Shared and Individual Sources4.752.053, 8, 3, 5
2696Label-Efficient Online Continual Object Detection in Streaming Video4.751.095, 3, 5, 6
2697Multi-Agent Multi-Game Entity Transformer4.751.093, 5, 6, 5
2698RealSinger: Ultra-Realistic Singing Voice Generation via Stochastic Differential Equations4.752.053, 3, 8, 5
2699On the Role of Self-supervision in Deep Multi-view Clustering4.752.053, 8, 5, 3
2700Skill Machines: Temporal Logic Composition in Reinforcement Learning4.751.095, 3, 5, 6
2701Learning Basic Interpretable Factors from Temporal Signals via Physics Symmetry4.751.095, 5, 6, 3
2702Can Single-Pass Contrastive Learning Work for Both Homophilic and Heterophilic Graph?4.752.053, 8, 5, 3
2703Dynamical Equations With Bottom-up Self-Organizing Properties Learn Accurate Dynamical Hierarchies Without Any Loss Function4.751.095, 3, 5, 6
2704Video Scene Graph Generation from Single-Frame Weak Supervision4.751.096, 5, 3, 5
2705Contrastive Consistent Representation Distillation4.751.096, 5, 5, 3
2706CLEEGN: A Convolutional Neural Network for Plug-and-Play Automatic EEG Reconstruction4.751.093, 5, 6, 5
2707Unified neural representation model for physical and conceptual spaces4.752.058, 3, 3, 5
2708Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models4.751.095, 3, 6, 5
2709What's Behind the Mask: Estimating Uncertainty in Image-to-Image Problems4.751.096, 5, 3, 5
2710Style Balancing and Test-Time Style Shifting for Domain Generalization4.751.093, 5, 6, 5
2711Least Disagree Metric-based Active Learning4.751.093, 6, 5, 5
2712Selective Classifier Ensemble4.751.096, 3, 5, 5
2713Few-Shot Anomaly Detection on Industrial Images through Contrastive Fine-Tuning4.751.095, 5, 3, 6
2714On the robustness of self-supervised models for generative spoken language modeling4.751.096, 5, 3, 5
2715Multi-Level Contrastive Learning for Dense Prediction Task4.751.095, 5, 6, 3
2716ETSformer: Exponential Smoothing Transformers for Time-series Forecasting4.751.095, 6, 5, 3
2717Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization4.751.096, 5, 5, 3
2718SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data4.752.055, 3, 3, 8
2719Scalable 3D Object-centric Learning4.751.096, 3, 5, 5
2720Analysis of Error Feedback in Compressed Federated Non-Convex Optimization4.751.095, 6, 5, 3
2721Causal Proxy Models For Concept-Based Model Explanations4.751.095, 3, 6, 5
2722Graph Contrastive Learning Under Heterophily: Utilizing Graph Filters to Generate Graph Views4.752.055, 3, 8, 3
2723Output Distribution over the Entire Input Space: A Novel Perspective to Understand Neural Networks4.751.095, 6, 3, 5
2724Decentralized Robust V-learning for Solving Markov Games with Model Uncertainty4.751.095, 6, 3, 5
2725A Unified Framework for Comparing Learning Algorithms4.751.095, 6, 3, 5
2726KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal4.751.096, 5, 3, 5
2727Reward-free Policy Learning through Active Human Involvement4.752.053, 5, 8, 3
2728Robust Attention for Contextual Biased Visual Recognition4.751.095, 5, 6, 3
2729Complex-Target-Guided Open-Domain Conversation based on offline reinforcement learning4.752.055, 8, 3, 3
2730ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3D4.751.096, 3, 5, 5
2731Don't Throw Your Old Policies Away: Knowledge-based Policy Recycling Protects Against Adversarial Attacks4.752.058, 3, 3, 5
2732Contrastive Adversarial Loss for Point Cloud Reconstruction4.751.093, 6, 5, 5
2733Ahead-of-Time P-Tuning4.751.096, 3, 5, 5
2734SimST: A GNN-Free Spatio-Temporal Learning Framework for Traffic Forecasting4.751.096, 5, 5, 3
2735Social and environmental impact of recent developments in machine learning on biology and chemistry research4.752.055, 3, 8, 3
2736Environment Partitioning For Invariant Learning By Decorrelation4.751.093, 5, 6, 5
2737Style Spectroscope: Improve Interpretability and Controllability through Fourier Analysis4.752.058, 5, 3, 3
2738Cascaded Teaching Transformers with Data Reweighting for Long Sequence Time-series Forecasting4.751.093, 5, 6, 5
2739Hazard Gradient Penalty for Survival Analysis4.751.093, 5, 5, 6
2740Reach the Remote Neighbors: Dual-Encoding Transformer for Graphs4.751.095, 5, 6, 3
2741Only For You: Deep Neural Anti-Forwarding Watermark Preserves Image Privacy4.751.095, 6, 3, 5
2742PromptCast: A New Prompt-based Learning Paradigm for Time Series Forecasting4.752.058, 3, 5, 3
2743Revealing Single Frame Bias for Video-and-Language Learning4.751.095, 6, 3, 5
2744Union Subgraph Neural Networks4.751.096, 5, 5, 3
2745NEW TRAINING FRAMEWORK FOR SPEECH ENHANCEMENT USING REAL NOISY SPEECH4.752.055, 3, 3, 8
2746Dataset Condensation with Latent Space Knowledge Factorization and Sharing4.751.095, 5, 3, 6
2747Can GNNs Learn Heuristic Information for Link Prediction?4.751.093, 6, 5, 5
2748Spatial Attention Kinetic Networks with E(n)-Equivariance4.751.095, 6, 5, 3
2749Human Pose Estimation in the Dark4.751.095, 6, 3, 5
2750ETAD: A Sampling-Based Approach for Efficient Temporal Action Detection4.751.093, 5, 5, 6
2751HierBatching: Locality-Aware Out-of-Core Training of Graph Neural Networks4.751.093, 5, 5, 6
2752Bias Mitigation Framework for Intersectional Subgroups in Neural Networks4.752.058, 5, 3, 3
2753HyperQuery: A Framework for Higher Order Link Prediction4.751.096, 5, 5, 3
2754Tiny Adapters for Vision Transformers4.751.095, 5, 6, 3
2755Proximal Curriculum for Reinforcement Learning Agents4.751.095, 5, 3, 6
2756Random Weight Factorization improves the training of Continuous Neural Representations4.752.058, 5, 3, 3
2757Improving group robustness under noisy labels using predictive uncertainty4.751.095, 3, 6, 5
2758MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection4.751.093, 5, 5, 6
2759Your Neighbors Are Communicating: Towards Powerful and Scalable Graph Neural Networks4.751.096, 5, 5, 3
2760Fair Attribute Completion on Graph with Missing Attributes4.751.096, 3, 5, 5
2761Improving Generalization with Domain Convex Game4.751.093, 5, 5, 6
2762Edge Wasserstein Distance Loss for Oriented Object Detection4.751.096, 5, 5, 3
2763StyleGenes: Discrete and Efficient Latent Distributions for GANs4.752.055, 3, 3, 8
2764ZERO: A Large-scale Chinese Cross-modal Benchmark with a New Vision-Language Framework4.752.055, 3, 3, 8
2765SinGRAV: Learning a Generative Radiance Volume from a Single Natural Scene4.752.053, 5, 8, 3
2766ConBaT: Control Barrier Transformer for Safety-Critical Policy Learning4.751.095, 6, 5, 3
2767Reinforced Sample Reweighting Policy for Semi-supervised Learning4.751.093, 6, 5, 5
2768Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning4.751.095, 3, 6, 5
2769TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second4.751.095, 3, 5, 6
2770Friends to Help: Saving Federated Learning from Client Dropout4.751.093, 5, 6, 5
2771GraphPNAS: Learning Distribution of Good Neural Architectures via Deep Graph Generative Models4.751.095, 5, 6, 3
2772Interpretability with full complexity by constraining feature information4.751.095, 6, 3, 5
2773Stealing and Defending Transformer-based Encoders4.751.093, 6, 5, 5
2774Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution4.751.095, 3, 5, 6
2775Token-Label Alignment for Vision Transformers4.751.095, 5, 3, 6
2776Efficient Covariance Estimation for Sparsified Functional Data4.751.093, 5, 5, 6
2777Does Continual Learning Equally Forget All Parameters?4.752.176, 1, 6, 6
2778EAGLE: Large-scale Learning of Turbulent Fluid Dynamics with Mesh Transformers4.751.093, 5, 5, 6
2779Cross-Domain Autonomous Driving Perception using Contrastive Appearance Adaptation4.751.095, 3, 5, 6
2780On The Inadequacy of Optimizing Alignment and Uniformity in Contrastive Learning of Sentence Representations4.751.095, 3, 5, 6
2781Approximated Anomalous Diffusion: Gaussian Mixture Score-based Generative Models4.752.053, 5, 3, 8
2782AutoSKDBERT: Learn to Stochastically Distill BERT4.751.095, 5, 3, 6
2783An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models4.751.095, 5, 6, 3
2784Unsupervised Learning of Causal Relationships from Unstructured Data4.752.058, 5, 3, 3
2785Parameterized projected Bellman operator4.751.095, 5, 3, 6
2786Examining the Value of Neural Filter Pruning -- Retrospect and Prospect4.751.096, 5, 5, 3
2787Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program4.751.095, 5, 6, 3
2788DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training4.751.096, 3, 5, 5
2789Measuring Asymmetric Gradient Discrepancy in Parallel Continual Learning4.751.095, 6, 3, 5
2790Design of the topology for contrastive visual-textual alignment4.751.093, 5, 6, 5
2791Defactorization Transformer: Modeling Long Range Dependency with Local Window Cost4.751.095, 6, 5, 3
2792Multi-Modal Few-Shot Temporal Action Detection4.751.095, 6, 3, 5
2793In the ZONE: Measuring difficulty and progression in curriculum generation4.751.093, 5, 5, 6
2794Dual-Domain Diffusion Based Progressive Style Rendering towards Semantic Structure Preservation4.671.253, 5, 6
2795Mini-batch $k$-means terminates within $O(d/epsilon)$ iterations4.671.253, 5, 6
2796Functional Risk Minimization4.671.256, 5, 3
2797Causal Inference for Knowledge Graph Completion4.671.253, 6, 5
2798Rethinking Metric Based Contrastive Learning Method’s Generalization Capability4.671.256, 5, 3
2799Enriching Online Knowledge Distillation with Specialist Ensemble4.671.253, 5, 6
2800Variational Learning ISTA4.671.253, 6, 5
2801Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning4.671.255, 6, 3
2802FedGC: An Accurate and Efficient Federated Learning under Gradient Constraint for Heterogeneous Data4.671.256, 5, 3
2803MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers4.671.255, 3, 6
2804Some Practical Concerns and Solutions for Using Pretrained Representation in Industrial Systems4.671.255, 3, 6
2805Efficient Bayesian Optimization with Deep Kernel Learning and Transformer Pre-trained on Muliple Heterogeneous Datasets4.671.255, 3, 6
2806Untangling Effect and Side Effect: Consistent Causal Inference in Non-Targeted Trials4.671.256, 5, 3
2807Pseudometric guided online query and update for offline reinforcement learning4.671.256, 3, 5
2808Convergence Analysis of Split Learning on Non-IID Data4.671.255, 6, 3
2809Do Not Blindly Imitate the Teacher: Loss Perturbation for Knowledge Distillation4.672.363, 3, 8
2810Beyond Deep Learning: An Evolutionary Feature Engineering Approach to Tabular Data Classification4.671.255, 3, 6
2811Is margin all you need? An extensive empirical study of active learning on tabular data4.671.253, 6, 5
2812MolEBM: Molecule Generation and Design by Latent Space Energy-Based Modeling4.671.253, 6, 5
2813How Does Self-supervised Learning Work? A Representation Learning Perspective4.671.255, 6, 3
2814A Reproducible and Realistic Evaluation of Partial Domain Adaptation Methods4.671.253, 5, 6
2815Accelerated Training via Principled Methods for Incrementally Growing Neural Networks4.671.255, 6, 3
2816Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization4.671.255, 3, 6
2817System identification of neural systems: If we got it right, would we know?4.672.368, 3, 3
2818Axiomatic Explainer Locality With Optimal Transport4.671.253, 5, 6
2819Progressive Knowledge Distillation: Constructing Ensembles for Efficient Inference4.671.253, 5, 6
2820Blockwise self-supervised learning with Barlow Twins4.671.253, 6, 5
2821Achieving Communication-Efficient Policy Evaluation for Multi-Agent Reinforcement Learning: Local TD-Steps or Batching?4.671.253, 5, 6
2822Two-Tailed Averaging: Anytime Adaptive Once-in-a-while Optimal Iterate Averaging for Stochastic Optimization4.672.368, 3, 3
2823Replay Buffer with Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning4.671.256, 3, 5
2824DECODING LAYER SALIENCY IN TRANSFORMERS4.671.253, 5, 6
2825Decision Transformer under Random Frame Dropping4.671.253, 5, 6
2826On the Importance of Contrastive Loss in Multimodal Learning4.671.253, 6, 5
2827Generative Adversarial Federated Model4.671.256, 5, 3
2828EENet: Learning to Early Exit for Adaptive Inference4.671.256, 3, 5
2829Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue4.671.253, 6, 5
2830Continual Learning with Soft-Masking of Parameter-Level Gradient Flow4.671.255, 3, 6
2831Unsupervised Adaptation for Fairness under Covariate Shift4.672.368, 3, 3
2832Towards convergence to Nash equilibria in two-team zero-sum games4.671.255, 3, 6
2833Towards Understanding How Machines Can Learn Causal Overhypotheses4.671.255, 3, 6
2834The Union of Manifolds Hypothesis4.672.363, 8, 3
2835P2PRISM - Peer to peer learning with individual prism for secure aggregation4.671.253, 6, 5
2836Few-shot Backdoor Attacks via Neural Tangent Kernels4.671.256, 5, 3
2837MMVAE+: Enhancing the Generative Quality of Multimodal VAEs without Compromises4.671.255, 6, 3
2838$ell$Gym: Natural Language Visual Reasoning with Reinforcement Learning4.671.253, 5, 6
2839Towards Antisymmetric Neural Ansatz Separation4.671.253, 6, 5
2840Optimal Scalarizations for Provable Multiobjective Optimization4.671.255, 6, 3
2841A new photoreceptor-inspired CNN layer enables deep learning models of retina to generalize across lighting conditions4.671.253, 6, 5
2842Deep Probabilistic Time Series Forecasting over Long Horizons4.672.363, 8, 3
2843AN OPERATOR NORM BASED PASSIVE FILTER PRUNING METHOD FOR EFFICIENT CNNS4.671.255, 3, 6
2844Weighted Regularization for Efficient Neural Network Compression4.672.368, 3, 3
2845HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE4.671.256, 3, 5
2846Learning Privacy-Preserving Graph Embeddings Against Sensitive Attributes Inference4.671.255, 3, 6
2847Finding Generalization Measures by Contrasting Signal and Noise4.671.255, 6, 3
2848Learning Dictionaries over Datasets through Wasserstein Barycenters4.671.256, 5, 3
2849KeyCLD: Learning Constrained Lagrangian Dynamics in Keypoint Coordinates from Images4.671.253, 5, 6
2850Score Matching via Differentiable Physics4.671.253, 5, 6
2851Short-Term Memory Convolutions4.671.253, 5, 6
2852Unbiased Decisions Reduce Regret: Adversarial Optimism for the Bank Loan Problem4.671.255, 3, 6
2853Diversity of Generated Unlabeled Data Matters for Few-shot Hypothesis Adaptation4.672.363, 8, 3
2854CAKE: CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain Adaptation4.671.255, 6, 3
2855How to Keep Cool While Training4.671.253, 5, 6
2856Model-Based Decentralized Policy Optimization4.671.256, 3, 5
2857Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction4.671.255, 6, 3
2858Pruning by Active Attention Manipulation4.671.256, 3, 5
2859On Threshold Functions in Learning to Generate Feasible Solutions of Mixed Integer Programs4.672.363, 3, 8
2860Closed Boundary Learning for NLP Classification Tasks with the Universum Class4.671.255, 3, 6
2861UNREAL: Unlabeled Nodes Retrieval and Labeling for Heavily-imbalanced Node Classification4.671.253, 6, 5
2862GRAPHSENSOR: A Graph Attention Network for Time-Series Sensor Data4.671.256, 5, 3
2863CRISP: Curriculum inducing Primitive Informed Subgoal Prediction for Hierarchical Reinforcement Learning4.671.256, 5, 3
2864An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation4.671.253, 5, 6
2865NeuralEQ: Neural-Network-Based Equalizer for High-Speed Wireline Communication4.671.255, 6, 3
2866Network Controllability Perspectives on Graph Representation4.671.253, 6, 5
2867VARIATIONAL ADAPTIVE GRAPH TRANSFORMER FOR MULTIVARIATE TIME SERIES MODELING4.671.256, 5, 3
2868Large Language Models Can Self-improve4.672.363, 3, 8
2869COMBAT: Alternated Training for Near-Perfect Clean-Label Backdoor Attacks4.671.256, 3, 5
2870Safe Reinforcement Learning with Contrastive Risk Prediction4.671.256, 3, 5
2871Imbalanced Lifelong Learning with AUC Maximization4.671.255, 3, 6
2872MoCa: Cognitive Scaffolding for Language Models in Causal and Moral Judgment Tasks4.672.363, 3, 8
2873Lattice Convolutional Networks for Learning Ground States of Quantum Many-Body Systems4.672.363, 8, 3
2874Learning to Optimize Quasi-Newton Methods4.671.253, 5, 6
2875An Adaptive Policy to Employ Sharpness-Aware Minimization4.671.256, 3, 5
2876Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning4.671.256, 5, 3
2877Latent Bottlenecked Attentive Neural Processes4.671.253, 5, 6
2878VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment4.671.253, 5, 6
2879Annealed Training for Combinatorial Optimization on Graphs4.671.255, 3, 6
2880A Novel Fast Exact Subproblem Solver for Stochastic Quasi-Newton Cubic Regularized Optimization4.671.255, 3, 6
2881On the Mysterious Optimization Geometry of Deep Neural Networks4.671.255, 3, 6
2882On the Implicit Bias Towards Depth Minimization in Deep Neural Networks4.671.255, 3, 6
2883Quantum 3D graph structure learning with applications to molecule computing4.671.256, 5, 3
2884Score-based Generative 3D Mesh Modeling4.671.253, 5, 6
2885Why Self Attention is Natural for Sequence-to-Sequence Problems? A Perspective from Symmetries4.671.255, 6, 3
2886Large Learning Rate Matters for Non-Convex Optimization4.671.255, 6, 3
2887Value-Based Membership Inference Attack on Actor-Critic Reinforcement Learning4.671.255, 6, 3
2888FOCUS: Fairness via Agent-Awareness for Federated Learning on Heterogeneous Data4.671.253, 5, 6
2889RainProof: An Umbrella to Shield Text Generator from Out-Of-Distribution Data4.671.255, 6, 3
2890PerFedMask: Personalized Federated Learning with Optimized Masking Vectors4.671.255, 3, 6
2891Neural Implicit Manifold Learning for Topology-Aware Generative Modelling4.671.256, 3, 5
2892Characterizing neural representation of cognitively-inspired deep RL agents during an evidence accumulation task4.671.255, 3, 6
2893Rule-based policy regularization for reinforcement learning-based building control4.671.253, 6, 5
2894Deep Dependency Networks for Action Classification in Video4.671.253, 5, 6
2895Structural Adversarial Objectives for Self-Supervised Representation Learning4.671.255, 6, 3
2896Defending against Reconstruction attacks using Rényi Differential Privacy4.671.255, 6, 3
2897Abstracting Imperfect Information Away from Two-Player Zero-Sum Games4.671.253, 5, 6
2898Black-Box Adversarial Attack Guided by Model Behavior for Programming Pre-trained Language Models4.671.255, 3, 6
2899Joint Embedding Self-Supervised Learning in the Kernel Regime4.671.256, 5, 3
2900SeedGNN: Graph Neural Network for Supervised Seeded Graph Matching4.671.253, 6, 5
2901Variational Counterfactual Prediction under Runtime Domain Corruption4.671.255, 6, 3
2902Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger4.671.256, 5, 3
2903ELBO-ing Stein Mixtures4.672.363, 3, 8
2904Breaking the Curse of Dimensionality for Parametric Elliptic PDEs4.673.861, 3, 10
2905Accelerated Riemannian Optimization: Handling Constraints to Bound Geometric Penalties4.671.255, 6, 3
2906Volumetric Disentanglement for 3D Scene Manipulation4.671.253, 5, 6
2907DEEP ACCURATE SOLVER FOR THE GEODESIC PROBLEM4.672.363, 8, 3
2908Signal to Sequence Attention-Based Multiple Instance Network for Segmentation Free Inference of RNA Modifications4.671.255, 6, 3
2909Global-Local Bayesian Transformer for Semantic Correspondence4.671.255, 6, 3
2910Deep Graph-Level Clustering Using Pseudo-Label-Guided Mutual Information Maximization Network4.671.253, 5, 6
2911Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories4.671.255, 3, 6
2912Semi-Implicit Variational Inference via Score Matching4.671.256, 5, 3
2913Non-equispaced Fourier Neural Solvers for PDEs4.671.253, 5, 6
2914Group-oriented Cooperation in Multi-Agent Reinforcement Learning4.671.253, 6, 5
2915Horizon-Free Reinforcement Learning for Latent Markov Decision Processes4.671.255, 3, 6
2916Estimating Riemannian Metric with Noise-Contaminated Intrinsic Distance4.672.363, 3, 8
2917EMP: Effective Multidimensional Persistence for Graph Representation Learning4.671.256, 5, 3
2918Self-Adaptive Perturbation Radii for Adversarial Training4.671.253, 5, 6
2919Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning4.671.253, 5, 6
2920EM-Network: Learning Better Latent Variable for Sequence-to-Sequence Models4.671.253, 5, 6
2921HotProtein: A Novel Framework for Protein Thermostability Prediction and Editing4.671.255, 3, 6
2922On the Neural Tangent Kernel of Equilibrium Models4.671.253, 6, 5
2923HYPERPRUNING: EFFICIENT PRUNING THROUGH LYAPUNOV METRIC HYPERSEARCH4.671.253, 6, 5
2924Minimum Curvature Manifold Learning4.671.255, 6, 3
2925Min-Max Zero-Shot Multi-Label Classification4.671.253, 6, 5
2926Generated Graph Detection4.671.256, 3, 5
2927Quantum Fourier Networks for solving Parametric PDEs4.671.256, 3, 5
2928ADVERSARIALLY BALANCED REPRESENTATION FOR CONTINUOUS TREATMENT EFFECT ESTIMATION4.671.256, 5, 3
2929D-CIPHER: Discovery of Closed-form Partial Differential Equations4.672.363, 3, 8
2930Learning with MISELBO: The Mixture Cookbook4.671.253, 5, 6
2931Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes4.671.255, 6, 3
2932Analyzing the Effects of Classifier Lipschitzness on Explainers4.671.255, 6, 3
2933Enhance Local Consistency for Free: A Multi-Step Inertial Momentum Approach4.671.255, 3, 6
2934Robust Constrained Reinforcement Learning4.671.253, 5, 6
2935CorruptEncoder: Data Poisoning Based Backdoor Attacks to Contrastive Learning4.671.253, 5, 6
2936Revitalize Region Feature for Democratizing Video-language Pre-training of Retrieval4.671.256, 3, 5
2937Byzantine-robust Decentralized Learning via ClippedGossip4.671.256, 3, 5
2938Towards the Out-of-Distribution Generalization of Contrastive Self-Supervised Learning4.671.255, 6, 3
2939ColoristaNet for Photorealistic Video Style Transfer4.671.253, 5, 6
2940Low-complexity Deep Video Compression with A Distributed Coding Architecture4.671.256, 5, 3
2941Property Inference Attacks Against t-SNE Plots4.671.253, 5, 6
2942D4AM: A General Denoising Framework for Downstream Acoustic Models4.671.255, 6, 3
2943Saliency-guided Vision Transformer for Few-shot Keypoint Detection4.671.256, 5, 3
2944Holistically Explainable Vision Transformers4.671.255, 3, 6
2945Instance-wise Batch Label Restoration via Gradients in Federated Learning4.671.253, 6, 5
2946GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation4.671.255, 3, 6
2947Simultaneously Learning Stochastic and Adversarial Markov Decision Process with Linear Function Approximation4.671.255, 6, 3
2948Gated Domain Units for Multi-source Domain Generalization4.671.255, 6, 3
2949Bag of Tricks for FGSM Adversarial Training4.671.253, 5, 6
2950Exploring interactions between modalities for deepfake detection4.671.255, 6, 6, 3, 5, 3
2951A Causal Approach to Detecting Multivariate Time-series Anomalies and Root Causes4.671.256, 5, 3
2952A Closer Look at Self-supervised Lightweight Vision Transformers4.671.256, 5, 3
2953Exploring the Generalizability of CNNs via Activated Representational Substitution4.671.256, 3, 5
2954FedFA: Federated Learning with Feature Alignment for Heterogeneous Data4.671.256, 5, 3
2955MABA-Net: Masked Additive Binary Activation Network4.671.255, 3, 6
2956Quantum-Inspired Tensorized Embedding with Application to Node Representation Learning4.672.363, 8, 3
2957Federated Learning of Large Models at the Edge via Principal Sub-Model Training4.671.256, 5, 3
2958Sharper Rates and Flexible Framework for Nonconvex SGD with Client and Data Sampling4.671.253, 6, 5
2959Rademacher Complexity Over $mathcal{H} Delta mathcal{H}$ Class for Adversarially Robust Domain Adaptation4.671.253, 6, 5
2960Differentially Private Dataset Condensation4.671.253, 6, 5
2961Dynamics-inspired Neuromorphic Representation Learning4.672.363, 3, 8
2962Radial Spike and Slab Bayesian Neural Networks for Sparse Data in Ransomware Attacks4.671.256, 5, 3
2963Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks4.671.253, 5, 6
2964Receding Neuron Importances for Structured Pruning4.671.256, 3, 5
2965CONTINUAL MODEL EVOLVEMENT WITH INNER-PRODUCT RESTRICTION4.671.256, 5, 3
2966PREF: Phasorial Embedding Fields for Compact Neural Representations4.671.256, 3, 5
2967FedPSE: Personalized Sparsification with Element-wise Aggregation for Federated Learning4.671.253, 6, 5
2968Multigraph Topology Design for Cross-Silo Federated Learning4.671.253, 6, 5
2969Exploit Unlabeled Data on the Server! Federated Learning via Uncertainty-aware Ensemble Distillation and Self-Supervision4.671.253, 5, 6
2970Parallel Federated Learning over Heterogeneous Devices4.671.255, 3, 6
2971Mugs: A Multi-Granular Self-Supervised Learning Framework4.672.363, 8, 3
2972Grafting Vision Transformers4.671.256, 3, 5
2973PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction4.671.256, 3, 5
2974NIERT: Accurate Numerical Interpolation through Unifying Scattered Data Representations using Transformer Encoder4.671.255, 3, 6
2975Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets4.671.253, 6, 5
2976Manifold Characteristics That Predict Downstream Task Performance4.671.255, 3, 6
2977Improved Fully Quantized Training via Rectifying Batch Normalization4.671.255, 3, 6
2978Lottery Aware Sparsity Hunting: Enabling Federated Learning on Resource-Limited Edge4.671.253, 6, 5
2979Phase transition for detecting a small community in a large network4.671.253, 6, 5
2980Zipper: Decoupling the tradeoff Between Robustness and Accuracy4.671.256, 3, 5
2981Learning Visual Representation with Synthetic Images and Topologically-defined Labels4.671.253, 6, 5
2982A prototype-oriented clustering for domain shift with source privacy4.671.255, 6, 3
2983FADE: Enabling Large-Scale Federated Adversarial Training on Resource-Constrained Edge Devices4.671.253, 6, 5
2984Temporal Relevance Analysis for Video Action Models4.671.253, 5, 6
2985Towards Understanding Convergence and Generalization of AdamW4.671.255, 3, 6
2986Learning from Interval-valued Data4.672.363, 3, 8
2987Efficient Hyperdimensional Computing4.671.255, 6, 3
2988Auxiliary task discovery through generate and test4.671.255, 3, 6
2989Categorial Grammar Induction as a Compositionality Measure for Emergent Languages in Signaling Games4.671.253, 6, 5
2990Exploring Neural Network Representational Similarity using Filter Subspaces4.671.256, 5, 3
2991Probing into Overfitting for Video Recognition4.671.256, 3, 5
2992Universal Unlearnable Examples: Cluster-wise Perturbations without Label-consistency4.671.256, 5, 3
2993Interpretable Single/Multi-label Text Classification with Unsupervised Constituent-label alignments4.671.253, 6, 5
2994Functional Relation Field: A Model-Agnostic Framework for Multivariate Time Series Forecasting4.671.255, 6, 3
2995Generalized Category Discovery via Adaptive GMMs without Knowing the Class Number4.671.256, 3, 5
2996A Mutual Information Duality Algorithm for Multi-Agent Specialization4.621.323, 3, 5, 6, 6, 3, 6, 5
2997Graph Mixup with Soft Alignments4.601.363, 6, 6, 3, 5
2998Emergence of shared sensory-motor graphical language from visual input4.601.363, 6, 3, 5, 6
2999Temporal Dynamics Aware Adversarial Attacks On Discrete-Time Graph Models4.601.851, 5, 6, 6, 5
3000Escaping saddle points in zeroth-order optimization: two function evaluations suffice4.601.366, 5, 3, 6, 3
3001Variational Causal Dynamics: Discovering Modular World Models from Interventions4.601.366, 3, 6, 3, 5
3002Feed-Forward Latent Domain Adaptation4.602.063, 3, 3, 6, 8
3003Test-time Adaptation for Segmentation via Image Synthesis4.601.363, 6, 6, 3, 5
3004Similarity of Neural Architectures Based on Input Gradient Transferability4.602.425, 3, 1, 6, 8
3005Equivariant Descriptor Fields: SE(3)-Equivariant Energy-Based Models for End-to-End Visual Robotic Manipulation Learning4.601.363, 3, 5, 6, 6
3006Look in The Mirror: Molecular Graph Contrastive Learning with Line Graph4.602.063, 8, 3, 3, 6
3007Linear convergence for natural policy gradient with log-linear policy parametrization4.600.805, 5, 5, 5, 3
3008Chopping Formers is what you need in Vision4.601.363, 6, 6, 3, 5
3009Variance Covariance Regularization Enforces Pairwise Independence in Self-Supervised Representations4.601.363, 6, 3, 5, 6
3010Multi-Label Knowledge Distillation4.602.063, 3, 6, 8, 3
3011FrAug: Frequency Domain Augmentation for Time Series Forecasting4.600.803, 5, 5, 5, 5
3012Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity4.601.363, 6, 3, 6, 5
3013A Unimodal, Uncertainty-Aware Deep Learning Approach for Ordinal Regression4.600.805, 5, 3, 5, 5
3014Does Dataset Lottery Ticket Hypothesis Exist?4.601.363, 3, 6, 6, 5
3015Exploring The Capacity Mismatch Problem in Knowledge Distillation from the View of Soft Labels4.600.805, 3, 5, 5, 5
3016Revisiting Residual Networks for Adversarial Robustness4.600.805, 3, 5, 5, 5
3017QFuture: Learning Future Expectations in Multi-Agent Reinforcement Learning4.601.366, 3, 6, 3, 5
3018Free Bits: Platform-Aware Latency Optimization of Mixed-Precision Neural Networks for Edge Deployment4.500.875, 5, 5, 3
3019DELTA: Diverse Client Sampling for Fasting Federated Learning4.501.506, 6, 3, 3
3020Grounded Contrastive Learning for Open-world Semantic Segmentation4.500.875, 5, 3, 5
3021Batch Normalization and Bounded Activation Functions4.500.875, 5, 5, 3
3022Deep Equilibrium Non-Autoregressive Sequence Learning4.500.875, 3, 5, 5
3023On the Adversarial Robustness against Natural Weather Perturbations4.500.875, 3, 5, 5
3024Optimistic Exploration in Reinforcement Learning Using Symbolic Model Estimates4.501.506, 3, 3, 6
3025Rényi Supervised Contrastive Learning for Transferable Representation4.500.875, 3, 5, 5
3026Topology Matters in Fair Graph Learning: a Theoretical Pilot Study4.501.503, 3, 6, 6
3027Beyond the injective assumption in causal representation learning4.501.506, 3, 6, 3
3028Approximation ability of Transformer networks for functions with various smoothness of Besov spaces: error analysis and token extraction4.500.873, 5, 5, 5
3029Reinforcement Logic Rule Learning for Temporal Point Processes4.501.506, 3, 3, 6
3030UNDERSTANDING HTML WITH LARGE LANGUAGE MODELS4.500.875, 5, 3, 5
3031Semi-Autoregressive Energy Flows: Towards Determinant-Free Training of Normalizing Flows4.501.506, 3, 6, 3
3032ACE-EM: Boosted ab initio Cryo-EM 3D Reconstruction with Asymmetric Complementary Autoencoder4.501.506, 6, 3, 3
3033A Fast, Well-Founded Approximation to the Empirical Neural Tangent Kernel4.500.875, 5, 5, 3
3034Towards Unsupervised Time Series Representation Learning: A Decomposition Perspective4.501.506, 6, 3, 3
3035Steerable Equivariant Representation Learning4.500.875, 3, 5, 5
3036Federated Learning with Heterogeneous Label Noise: A Dual Structure Approach4.500.875, 3, 5, 5
3037Spatiotemporal Modeling of Multivariate Signals with Graph Neural Networks and Structured State Space Models4.500.875, 5, 5, 3
3038Domain-Invariant Auxiliary Learning for Robust Few-Shot Predictions from Noisy Data4.501.503, 3, 6, 6
3039ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning4.500.875, 5, 3, 5
3040ProtFIM: Fill-in-Middle Protein Sequence Design via Protein Language Models4.500.873, 5, 5, 5
3041MUG: Interactive Multimodal Grounding on User Interfaces4.500.873, 5, 5, 5
3042SIMPLE: A Gradient Estimator for k-Subset Sampling4.501.506, 3, 3, 6
3043Greedy Information Maximization for Online Feature Selection4.501.126, 5, 3, 3, 5, 5
3044Cross-Domain Few-Shot Relation Extraction via Representation Learning and Domain Adaptation4.500.875, 5, 5, 3
3045Koopman Operator Learning for Accelerating Quantum Optimization and Machine Learning4.501.506, 3, 6, 3
3046Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property4.501.503, 6, 6, 3
3047Variable Compositionality Reliably Emerges in Neural Networks4.500.873, 5, 5, 5
3048Causally-guided Regularization of Graph Attention improves Generalizability4.500.873, 5, 5, 5
3049A Simple Approach for State-Action Abstraction using a Learned MDP Homomorphism4.501.506, 3, 3, 6
3050Optimal Transport-Based Supervised Graph Summarization4.501.503, 3, 6, 6
3051Double Wins: Boosting Accuracy and Efficiency of Graph Neural Networks by Reliable Knowledge Distillation4.501.506, 3, 6, 3
3052Motif-based Graph Representation Learning with Application to Chemical Molecules4.500.875, 3, 5, 5
3053Beam Tree Recursive Cells4.500.875, 5, 3, 5
3054Cross-Silo Training of Differentially Private Models with Secure Multiparty Computation4.501.503, 6, 6, 3
3055Illusory Adversarial Attacks on Sequential Decision-Makers and Countermeasures4.500.875, 5, 3, 5
3056Catastrophic overfitting is a bug but it is caused by features4.501.506, 3, 6, 3
3057Robust Universal Adversarial Perturbations4.500.875, 3, 5, 5
3058SARNET: SARCASM VS TRUE-HATE DETECTION NETWORK4.500.875, 5, 5, 3
3059On Gradient Descent Convergence beyond the Edge of Stability4.500.875, 3, 5, 5
3060Robustifying Language Models via Adversarial Training with Masked Gradient4.500.875, 5, 5, 3
3061Convexifying Transformers: Improving optimization and understanding of transformer networks4.500.875, 5, 3, 5
3062TimeSeAD: Benchmarking Deep Time-Series Anomaly Detection4.500.875, 5, 5, 3
3063Towards Multi-spatiotemporal-scale Generalized PDE Modeling4.500.875, 3, 5, 5
3064REST: REtrieve & Self-Train for generative action recognition4.500.873, 5, 5, 5
3065Internet-augmented language models through few-shot prompting for open-domain question answering4.501.506, 6, 3, 3
3066Generalized Belief Transport4.502.065, 6, 6, 1
3067Maximal Correlation-Based Post-Nonlinear Learning for Bivariate Causal Discovery4.501.506, 6, 3, 3
3068Interactive Sequential Generative Models4.501.503, 6, 3, 6
3069Relaxed Attention for Transformer Models4.500.875, 5, 3, 5
3070Vector Quantization and Shifting: Exploiting Latent Properties to Optimize Neural Codecs4.501.506, 3, 3, 6
3071MARLlib: Extending RLlib for Multi-agent Reinforcement Learning4.500.875, 3, 5, 5
3072Energy Consumption-Aware Tabular Benchmarks for Neural Architecture Search4.500.873, 5, 5, 5
3073Delve into the Layer Choice of BP-based Attribution Explanations4.500.875, 3, 5, 5
3074Query The Agent: Improving Sample Efficiency Through Epistemic Uncertainty Estimation4.500.875, 5, 3, 5
3075Cold Posteriors through PAC-Bayes4.500.875, 3, 5, 5
3076Toward Effective Deep Reinforcement Learning for 3D Robotic Manipulation: End-to-End Learning from Multimodal Raw Sensory Data4.500.875, 3, 5, 5
3077ChemAlgebra : Algebraic Reasoning on Chemical Reactions4.501.506, 3, 3, 6
3078Improving Adversarial Robustness via Frequency Regularization4.500.875, 3, 5, 5
3079$omega$GNNs: Deep Graph Neural Networks Enhanced by Multiple Propagation Operators4.500.875, 5, 3, 5
3080Learning from Asymmetrically-corrupted Data in Regression for Sensor Magnitude4.502.066, 1, 6, 5
3081Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation4.500.875, 3, 5, 5
3082Adversarial Causal Augmentation for Graph Covariate Shift4.501.506, 3, 3, 6
3083On the Robustness of Randomized Ensembles to Adversarial Perturbations4.501.506, 6, 3, 3
3084Deep Transformer Q-Networks for Partially Observable Reinforcement Learning4.502.066, 6, 5, 1
3085Visual Expertise and the Log-Polar Transform Explain Image Inversion Effects4.500.873, 5, 5, 5
3086Neural Semi-Counterfactual Risk Minimization4.502.698, 6, 3, 1
3087FedDebias: Reducing the Local Learning Bias Improves Federated Learning on Heterogeneous Data4.500.875, 3, 5, 5
3088Best Possible Q-Learning4.501.503, 6, 6, 3
3089Self-Supervised Logit Adjustment4.500.875, 5, 3, 5
3090Leaves: Learning Views for Time-Series Data in Contrastive Learning4.500.873, 5, 5, 5
3091DeepGuiser: Learning to Disguise Neural Architectures for Impeding Adversarial Transfer Attacks4.501.503, 6, 3, 6
3092The Cost of Privacy in Fair Machine Learning4.500.873, 5, 5, 5
3093When Majorities Prevent Learning: Eliminating Bias to Improve Worst-group and Out-of-distribution Generalization4.500.873, 5, 5, 5
3094Fairness-Aware Model-Based Multi-Agent Reinforcement Learning for Traffic Signal Control4.500.875, 5, 5, 3
3095Learning Unified Representations for Multi-Resolution Face Recognition4.500.875, 3, 5, 5
3096Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution4.501.506, 3, 3, 6
3097Adaptive Weight Decay: On The Fly Weight Decay Tuning for Improving Robustness4.500.875, 3, 5, 5
3098Machine Unlearning of Federated Clusters4.501.506, 3, 3, 6
3099Link Prediction with Non-Contrastive Learning4.500.873, 5, 5, 5
3100Goal-Space Planning with Subgoal Models4.500.875, 5, 5, 3
3101Learning Unsupervised Forward Models from Object Keypoints4.500.873, 5, 5, 5
3102Meta Temporal Point Processes4.500.873, 5, 5, 5
3103DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability4.500.873, 5, 5, 5
3104OTCOP: Learning optimal transport maps via constraint optimizations4.501.506, 6, 3, 3
3105Graduated Non-Convexity for Robust Self-Trained Language Understanding4.501.503, 6, 6, 3
3106SemSup-XC: Semantic Supervision for Extreme Classification4.500.875, 5, 5, 3
3107Wide Graph Neural Network4.502.066, 5, 1, 6
3108Integrating Episodic and Global Novelty Bonuses for Efficient Exploration4.500.875, 3, 5, 5
3109Dynamics-aware Skill Generation from Behaviourally Diverse Demonstrations4.501.506, 3, 6, 3
3110Calibrating Transformers via Sparse Gaussian Processes4.501.503, 6, 3, 6
3111Multimodal Open-Vocabulary Video Classification via Vision and Language Models4.501.506, 6, 3, 3
3112When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning4.500.875, 5, 3, 5
3113Domain-Unified Prompt Representations for Source-Free Domain Generalization4.500.875, 5, 3, 5
3114Disentangling Learning Representations with Density Estimation4.500.875, 5, 3, 5
3115A Risk-Averse Equilibrium for Multi-Agent Systems4.501.506, 3, 6, 3
3116A Learning Based Hypothesis Test for Harmful Covariate Shift4.500.875, 5, 3, 5
3117On the Relationship Between Adversarial Robustness and Decision Region in Deep Neural Networks4.500.875, 3, 5, 5
3118Noether Embeddings: Fast Temporal Association Mining4.500.875, 5, 5, 3
3119Poisson Process for Bayesian Optimization4.500.875, 5, 5, 3
3120Where prior learning can and can't work in unsupervised inverse problems4.501.506, 6, 3, 3
3121Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training4.501.506, 3, 3, 6
3122An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning Systems4.502.061, 6, 6, 5
3123Schedule-Robust Online Continual Learning4.500.873, 5, 5, 5
3124Contrastive Hierarchical Clustering4.500.873, 5, 5, 5
3125On Incremental Learning with Long Short Term Strategy4.500.875, 5, 5, 3
3126ESP: Exponential Smoothing on Perturbations for Increasing Robustness to Data Corruptions4.500.875, 5, 5, 3
3127Multiple Invertible and Equivariant Transformation for Disentanglement in VAEs4.500.875, 5, 3, 5
3128Revisiting Fast Adversarial Training4.500.875, 5, 3, 5
3129Bayesian semi-supervised learning with a principled likelihood from a generative model of data curation4.500.875, 5, 3, 5
3130Deep High-Frequency Extrapolation for Neuronal Spike Restoration4.501.503, 3, 6, 6
3131Emergent Communication with Attention4.500.875, 3, 5, 5
3132Black-box Knowledge Distillation4.500.873, 5, 5, 5
3133Self-Consistent Learning: Cooperation between Generators and Discriminators4.502.061, 5, 6, 6
3134Personalized Decentralized Bilevel Optimization over Stochastic and Directed Networks4.500.875, 3, 5, 5
3135Data-Free Continual Graph Learning4.501.506, 3, 3, 6
3136Can you Trust your Disentanglement?4.502.698, 6, 3, 1
3137Visual Reinforcement Learning with Self-Supervised 3D Representations4.501.506, 6, 3, 3
3138CUSTOMIZING PRE-TRAINED DIFFUSION MODELS FOR YOUR OWN DATA4.500.875, 5, 3, 5
3139Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data4.500.875, 5, 3, 5
3140Adversarially Robust Neural Lyapunov Control4.500.875, 5, 5, 3
3141Domain-Specific Risk Minimization for Out-of-Distribution Generalization4.500.875, 5, 5, 3
3142Temporally-Weighted Spike Encoding for Event-based Object Detection and Classification4.501.503, 3, 6, 6
3143SimA: Simple Softmax-free Attention For Vision Transformers4.500.873, 5, 5, 5
3144What does a platypus look like? Generating customized prompts for zero-shot image classification4.501.506, 3, 3, 6
3145Hyperbolic Contrastive Learning for Visual Representations beyond Objects4.500.875, 3, 5, 5
3146Hybrid RL: Using both offline and online data can make RL efficient4.502.061, 5, 6, 6
3147Scalable and Privacy-enhanced Graph Generative Model for Graph Neural Networks4.501.503, 6, 6, 3
3148Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization4.502.061, 6, 6, 5
3149Heterogeneous Continual Learning4.500.873, 5, 5, 5
3150Momentum Diminishes the Effect of Spectral Bias in Physics-Informed Neural Networks4.502.693, 1, 8, 6
3151SeqSHAP: Subsequence Level Shapley Value Explanations for Sequential Predictions4.500.875, 5, 3, 5
3152Group-level Brain Decoding with Deep Learning4.500.873, 5, 5, 5
3153Learning Inductive Object-Centric Slot Initialization via Clustering4.500.875, 5, 3, 5
3154The Continuous CNN: from Task-Specific to Unified CNN Architecture4.501.503, 3, 6, 6
3155Pixel-Aligned Non-parametric Hand Mesh Reconstruction4.500.875, 5, 3, 5
3156Is the Deep Model Representation Sparse and Symbolic with Causal Patterns?4.500.873, 5, 5, 5
3157TransformMix: Learning Transformation and Mixing Strategies for Sample-mixing Data Augmentation4.500.873, 5, 5, 5
3158Disentangled Knowledge Transfer: A New Perspective for Personalized Federated Learning4.500.873, 5, 5, 5
3159DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization4.500.873, 5, 5, 5
3160Defense against Backdoor Attacks via Identifying and Purifying Bad Neurons4.500.875, 5, 3, 5
3161DSP: Dynamic Semantic Prototype for Generative Zero-Shot Learning4.500.875, 5, 5, 3
3162Topic Aware Transformer: Domain Shift for Unconditional Text Generation Model4.501.506, 6, 3, 3
3163Extracting Expert's Goals by What-if Interpretable Modeling4.500.873, 5, 5, 5
3164Improving Molecular Pretraining with Complementary Featurizations4.501.506, 3, 6, 3
3165AutoSparse: Towards Automated Sparse Training4.501.125, 5, 3, 3, 5, 6
3166PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets4.502.691, 3, 8, 6
3167Bootstrap Motion Forecasting With Self-Consistent Constraints4.500.875, 3, 5, 5
3168Learning to Split for Automatic Bias Detection4.501.503, 6, 3, 6
3169Physics-empowered Molecular Representation Learning4.500.873, 5, 5, 5
3170MINI: Mining Implicit Novel Instances for Few-Shot Object Detection4.500.875, 5, 3, 5
3171FedGSNR: Accelerating Federated Learning on Non-IID Data via Maximum Gradient Signal to Noise Ratio4.501.506, 3, 3, 6
3172Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning4.500.875, 3, 5, 5
3173Light-weight probing of unsupervised representations for Reinforcement Learning4.501.506, 3, 3, 6
3174Tackling the Retrieval Trilemma with Cross-Modal Indexing4.500.873, 5, 5, 5
3175Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models4.501.503, 6, 6, 3
3176Shot Retrieval and Assembly with Text Script for Video Montage Generation4.501.503, 6, 3, 6
3177Margin-based Neural Network Watermarking4.500.875, 5, 3, 5
3178Revisiting Global Pooling through the Lens of Optimal Transport4.500.875, 5, 3, 5
3179Towards Expressive Graph Representations for Graph Neural Networks4.500.875, 3, 5, 5
3180Efficient, Stable, and Analytic Differentiation of the Sinkhorn Loss4.501.503, 6, 6, 3
3181Dynamical Isometry for Residual Networks4.501.506, 3, 3, 6
3182Deep Learning meets Nonparametric Regression: Are Weight-Decayed DNNs Locally Adaptive?4.500.873, 5, 5, 5
3183Minibatch Stochastic Three Points Method for Unconstrained Smooth Minimization4.500.875, 3, 5, 5
3184Learning Symbolic Rules for Reasoning in Quasi-Natural Language4.500.875, 3, 5, 5
3185Least-to-Most Prompting Enables Complex Reasoning in Large Language Models4.502.066, 1, 6, 5
3186Approximate Bayesian Inference with Stein Functional Variational Gradient Descent4.500.875, 3, 5, 5
3187It Takes Two: Masked Appearance-Motion Modeling for Self-Supervised Video Transformer Pre-Training4.500.875, 3, 5, 5
3188In-the-wild Pretrained Models Are Good Feature Extractors for Video Quality Assessment4.500.875, 5, 3, 5
3189Mitigating Forgetting in Online Continual Learning via Contrasting Semantically Distinct Augmentations4.500.875, 5, 3, 5
3190Contextual Symbolic Policy For Meta-Reinforcement Learning4.500.875, 3, 5, 5
3191Node Classification Beyond Homophily: Towards a General Solution4.501.506, 3, 3, 6
3192Decouple Graph Neural Networks: Train Multiple Simple GNNs Simultaneously Instead of One4.500.875, 5, 3, 5
3193On the Effectiveness of Adapting Pre-trained Transformer Models via Adversarial Noise4.500.873, 5, 5, 5
3194Federated Learning in Non-IID Settings Aided by Differentially Private Synthetic Data4.500.873, 5, 5, 5
3195A UNIFIED VIEW OF FINDING AND TRANSFORMING WINNING LOTTERY TICKETS4.501.506, 3, 3, 6
3196Revisiting Group Robustness: Class-specific Scaling is All You Need4.501.503, 3, 6, 6
3197DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models4.500.875, 5, 3, 5
3198Semi-Supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data4.500.873, 5, 5, 5
3199Gamma Sampling: Fine-grained Controlling Language Models without Training4.500.875, 5, 5, 3
3200Contrastive Continuity on Augmentation Stability Rehearsal for Continual Self-Supervised Learning4.501.506, 3, 3, 6
3201Uncertainty Calibration via Knowledge Flow under Long-tailed Distribution4.500.875, 3, 5, 5
3202$1times1$ Convolution is All You Need for Image Super-Resolution4.500.873, 5, 5, 5
3203Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos4.500.873, 5, 5, 5
3204ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing4.500.875, 3, 5, 5
3205Parameter Averaging for Feature Ranking4.500.875, 5, 3, 5
3206Smooth-Reduce: Leveraging Patches for Improved Certified Robustness4.501.506, 3, 6, 3
3207Stochastic Differentially Private and Fair Learning4.500.873, 5, 5, 5
3208SegNeRF: 3D Part Segmentation with Neural Radiance Fields4.500.873, 5, 5, 5
3209Faster Neural Architecture 'Search' for Deep Image Prior4.500.875, 5, 3, 5
3210Object Localization helps Action Recognition Models Adapt to New Environments4.500.875, 3, 5, 5
3211Is Self-Supervised Contrastive Learning More Robust Than Supervised Learning?4.500.873, 5, 5, 5
3212Correcting the Sub-optimal Bit Allocation4.502.698, 1, 6, 3
3213Partial transportability for domain generalization4.501.503, 3, 6, 6
3214Quasi-Conservative Score-based Generative Models4.500.873, 5, 5, 5
3215Neural Attention Memory4.501.506, 6, 3, 3
3216Mimic before Reconstruct: Enhance Masked Autoencoders with Feature Mimicking4.500.873, 5, 5, 5
3217Meta Optimal Transport4.500.875, 3, 5, 5
3218Backpropagation Path Search On Adversarial Transferability4.500.875, 5, 3, 5
3219Efficient Exploration via Fragmentation and Recall4.500.875, 5, 5, 3
3220CLEP: Exploiting Edge Partitioning for Graph Contrastive Learning4.401.968, 5, 3, 3, 3
3221Behavior Proximal Policy Optimization4.401.205, 3, 6, 5, 3
3222Fairness via Adversarial Attribute Neighbourhood Robust Learning4.401.203, 5, 6, 5, 3
3223Representation Power of Graph Convolutions : Neural Tangent Kernel Analysis4.401.963, 5, 3, 3, 8
3224Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training4.401.965, 3, 8, 3, 3
3225End-to-end Invariance Learning with Relational Inductive Biases in Multi-Object Robotic Manipulation4.401.205, 6, 5, 3, 3
3226Homotopy-based training of NeuralODEs for accurate dynamics discovery4.401.203, 5, 3, 6, 5
3227Learning To Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning4.401.205, 6, 3, 5, 3
3228Robustify Transformers with Robust Kernel Density Estimation4.401.203, 6, 5, 3, 5
3229Rethinking Knowledge Distillation with Raw Features for Semantic Segmentation4.401.745, 6, 1, 5, 5
3230M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation4.401.205, 3, 3, 6, 5
3231Node Importance Specific Meta Learning in Graph Neural Networks4.401.205, 5, 6, 3, 3
3232Self-supervised Speech Enhancement using Multi-Modal Data4.401.203, 5, 6, 3, 5
3233Contrastive Graph Few-Shot Learning4.401.206, 5, 3, 5, 3
3234DropAut: Automatic Dropout Approaches to learn and adapt Drop Rates4.401.205, 6, 3, 5, 3
3235MUTUAL EXCLUSIVE MODULATOR FOR LONG-TAILED RECOGNITION4.401.206, 5, 3, 3, 5
3236Conditional Invariances for Conformer Invariant Protein Representations4.401.203, 6, 5, 3, 5
3237HOYER REGULARIZER IS ALL YOU NEED FOR EXTREMELY SPARSE SPIKING NEURAL NETWORKS4.401.205, 6, 3, 3, 5
3238Breaking Beyond COCO Object Detection4.401.203, 5, 3, 6, 5
3239A Deep Conjugate Direction Method for Iteratively Solving Linear Systems4.401.963, 3, 5, 3, 8
3240MixMask: Revisiting Masked Siamese Self-supervised Learning in Asymmetric Distance4.401.203, 5, 3, 6, 5
3241Topology-aware robust optimization4.401.203, 5, 5, 3, 6
3242Decoupling Concept Bottleneck Model4.401.203, 5, 5, 3, 6
3243Active Topological Mapping by Metric-Free Exploration via Task and Motion Imitation4.401.203, 3, 5, 5, 6
3244pFedKT: Personalized Federated Learning via Knowledge Transfer4.330.945, 5, 3
3245Deep Reinforcement Learning based Insight Selection Policy4.330.945, 3, 5
3246Coreset for Rational Functions4.330.945, 5, 3
3247Enabling Equation Learning with the Bayesian Model Evidence via systematic $R^2$-elimination4.330.945, 3, 5
3248PTUnifier: Pseudo Tokens as Paradigm Unifiers in Medical Vision-and-Language Pre-training4.330.945, 5, 3
3249Improving the Calibration of Fine-tuned Language Models via Denoising Variational Auto-Encoders4.330.945, 3, 5
3250SELCOR: Self-Correction for Weakly Supervised Learning4.330.945, 5, 3
3251An Experiment Design Paradigm using Joint Feature Selection and Task Optimization4.330.943, 5, 5
3252Intra-Instance VICReg: Bag of Self-Supervised Image Patch Embedding Explains the Performance4.330.943, 5, 5
3253Deep Latent State Space Models for Time-Series Generation4.330.945, 3, 5
3254Covariance Matrix Adaptation MAP-Annealing4.330.943, 5, 5
3255AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers4.330.945, 3, 5
3256Kuiper: Moderated Asynchronous Federated Learning on Heterogeneous Mobile Devices with Non-IID Data4.330.943, 5, 5
3257A Computationally Efficient Sparsified Online Newton Method4.330.943, 5, 5
3258MILE: Memory-Interactive Learning Engine for Solving Mathematical Problems4.330.945, 5, 3
3259Outlier-Robust Group Inference via Gradient Space Clustering4.330.945, 3, 5
3260The Vendi Score: A Diversity Evaluation Metric for Machine Learning4.330.945, 5, 3
3261Designing and Using Goal-Conditioned Tools4.330.945, 5, 3
3262Gradient Preconditioning for Non-Lipschitz smooth Nonconvex Optimization4.330.945, 5, 3
3263BertNet: Harvesting Knowledge Graphs from Pretrained Language Models4.330.945, 3, 5
32643D Surface Reconstruction in the Wild by Deforming Shape Priors from Synthetic Data4.330.945, 5, 3
3265Linkless Link Prediction via Relational Distillation4.330.945, 3, 5
3266Efficient Proxy for NAS is Extensible Now4.330.945, 3, 5
3267DIGEST: FAST AND COMMUNICATION EFFICIENT DECENTRALIZED LEARNING WITH LOCAL UPDATES4.330.945, 3, 5
3268Learning to Improve Code Efficiency4.330.945, 3, 5
3269Aging with GRACE: Lifelong Model Editing with Key-Value Adaptors4.330.945, 5, 3
3270Contrastive Vision Transformer for Self-supervised Out-of-distribution Detection4.330.943, 5, 5
3271Selection Collider Bias in Large Language Models4.330.945, 3, 5
3272Mind the Privacy Budget: How Generative Models Spend their Privacy Budgets4.330.945, 3, 5
3273MAD for Robust Reinforcement Learning in Machine Translation4.330.943, 5, 5
3274Zero-Shot Retrieval with Search Agents and Hybrid Environments4.330.945, 5, 3
3275Learning the Visualness of Text Using Large Vision-Language Models4.330.945, 5, 3
3276Explanation Uncertainty with Decision Boundary Awareness4.330.943, 5, 5
3277Do We Really Need Labels for Backdoor Defense?4.330.945, 5, 3
3278Non-Gaussian Process Regression4.330.945, 5, 3
3279The Adversarial Regulation of the Temporal Difference Loss Costs More Than Expected4.330.945, 3, 5
3280A Subspace Correction Method for ReLU Neural Networks for Solving PDEs4.330.943, 5, 5
3281$mathcal{O}$-GNN: incorporating ring priors into molecular modeling4.330.943, 5, 5
3282Graph Contrastive Learning with Model Perturbation4.330.945, 5, 3
3283Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models4.330.943, 5, 5
3284Highly Parallel Deep Ensemble Learning4.330.943, 5, 5
3285Brain2GAN; Reconstructing perceived faces from the primate brain via StyleGAN34.330.943, 5, 5
3286Learning to Cooperate and Communicate Over Imperfect Channels4.330.943, 5, 5
3287Towards Federated Learning of Deep Graph Neural Networks4.330.943, 5, 5
3288Hidden Markov Mixture of Gaussian Process Functional Regression: Utilizing Multi-Scale Structure for Time-Series Forecasting4.330.943, 5, 5
3289Multivariate Time Series Forecasting By Graph Attention Networks With Theoretical Guarantees4.330.945, 5, 3
3290Hierarchical Prototypes for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning4.330.943, 5, 5
3291Learning to Register Unbalanced Point Pairs4.332.366, 6, 1
3292Thinking fourth dimensionally: Treating Time as a Random Variable in EBMs4.330.945, 3, 5
3293FedProp: Cross-client Label Propagation for Federated Semi-supervised Learning4.330.943, 5, 5
3294Scalable Multi-Modal Continual Meta-Learning4.330.945, 3, 5
3295Does Structural Information have been Fully Exploited in Graph Data?4.330.943, 5, 5
3296DeepGRAND: Deep Graph Neural Diffusion4.330.945, 3, 5
3297ASIF: coupled data turns unimodal models to multimodal without training4.330.943, 5, 5
3298Two-Dimensional Weisfeiler-Lehman Graph Neural Networks for Link Prediction4.330.945, 5, 3
3299Object Detection with OOD Generalizable Neural Architecture Search4.330.943, 5, 5
3300Inverse Learning with Extremely Sparse Feedback for Recommendation4.330.945, 3, 5
3301CLUTR: Curriculum Learning via Unsupervised Task Representation Learning4.330.945, 5, 3
3302Robust Quantity-Aware Aggregation for Federated Learning4.330.943, 5, 5
3303Local Distance Preserving Auto-encoders using Continuous k-Nearest Neighbours Graphs4.330.945, 5, 3
3304PADDLES: Phase-Amplitude Spectrum Disentangled Early Stopping for Learning with Noisy Labels4.330.945, 3, 5
3305Textless Phrase Structure Induction from Visually-Grounded Speech4.330.943, 5, 5
3306On Regularization for Explaining Graph Neural Networks: An Information Theory Perspective4.332.366, 1, 6
3307COMNET : CORTICAL MODULES ARE POWERFUL4.330.945, 3, 5
3308Intrinsic Computational Complexity of Equivariant Neural Networks4.330.945, 3, 5
3309Weakly-Supervised Domain Adaptation in Federated Learning4.330.943, 5, 5
3310Text and Patterns: For Effective Chain of Thought It Takes Two to Tango4.330.945, 3, 5
3311Unlearning with Fisher Masking4.330.945, 5, 3
3312How Weakly Supervised Information helps Contrastive Learning4.330.945, 3, 5
3313Adaptive Kernel Selection for Convolutional Neural Network4.330.943, 5, 5
3314Online Min-max Optimization: Nonconvexity, Nonstationarity, and Dynamic Regret4.330.945, 3, 5
3315Treatment Effect Estimation with Collider Bias and Confounding Bias4.330.945, 3, 5
3316Upcycled-FL: Improving Accuracy and Privacy with Less Computation in Federated Learning4.330.943, 5, 5
3317Unsupervised Manifold Linearizing and Clustering4.330.945, 5, 3
3318Towards Class-Balanced Transductive Few-Shot Learning4.330.943, 5, 5
3319Eigenvalue Initialisation and Regularisation for Koopman Autoencoders4.330.945, 5, 3
3320A Quasistatic Derivation of Optimization Algorithms' Exploration on Minima Manifolds4.330.943, 5, 5
3321A Deep Learning Framework for Musical Acoustics Simulations4.330.943, 5, 5
3322Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale4.330.945, 3, 5
3323Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections4.332.361, 6, 6
3324uGLAD: A deep learning model to recover conditional independence graphs4.330.945, 3, 5
3325Graph in Graph Neural Network4.330.943, 5, 5
3326Generative Adversarial Training for Neural Combinatorial Optimization Models4.330.945, 5, 3
3327Spatially Resolved Temporal Networks: Online Unsupervised Representation Learning of High Frequency Time Series4.330.945, 5, 3
3328How does overparametrization affect performance on minority groups?4.330.945, 3, 5
3329MSQ-BioBERT: Ambiguity Resolution to Enhance BioBERT Medical Question-Answering4.330.945, 3, 5
3330G-CEALS: Gaussian Cluster Embedding in Autoencoder Latent Space for Tabular Data Representation4.330.945, 3, 5
3331Performance Disparities Between Accents in Automatic Speech Recognition4.330.943, 5, 5
3332Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge4.330.945, 5, 3
3333Adversarial Attack Detection Under Realistic Constraints4.330.945, 3, 5
3334Towards Estimating Transferability using Hard Subsets4.330.945, 5, 3
3335Trust Your $nabla$: Gradient-based Intervention Targeting for Causal Discovery4.330.945, 5, 3
3336Efficient Point Cloud Geometry Compression Through Neighborhood Point Transformer4.330.945, 5, 3
3337Uncovering the Effectiveness of Calibration on Open Intent Classification4.330.943, 5, 5
3338Lossy Compression with Gaussian Diffusion4.330.945, 5, 3
3339Deep Generative Wasserstein Gradient Flows4.330.945, 3, 5
3340DISCO-DANCE: Learning to Discover Skills with Guidance4.330.943, 5, 5
3341Lightweight Uncertainty for Offline Reinforcement Learning via Bayesian Posterior4.330.945, 5, 3
3342GPR-Net: Multi-view Layout Estimation via a Geometry-aware Panorama Registration Network4.330.945, 3, 5
3343Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios4.330.945, 3, 5
3344Deep Causal Generative Modeling for Tabular Data Imputation and Intervention4.330.945, 5, 3
3345Semantic Category Discovery with Vision-language Representations4.330.945, 3, 5
3346Non-Parametric State-Space Models: Identifiability, Estimation and Forecasting4.330.945, 3, 5
3347FedDM: Iterative Distribution Matching for Communication-Efficient Federated Learning4.330.945, 3, 5
3348Grounding High Dimensional Representation Similarity by Comparing Decodability and Network Performance4.330.943, 5, 5
3349Likelihood adjusted semidefinite programs for clustering heterogeneous data4.330.943, 5, 5
3350Hybrid and Collaborative Passage Reranking4.330.945, 3, 5
3351Few-Shot Learning with Representative Global Prototype4.330.945, 3, 5
3352Causal Knowledge Transfer from Task Affinity4.330.945, 5, 3
3353Hybrid Federated Learning for Feature & Sample Heterogeneity: Algorithms and Implementation4.330.943, 5, 5
3354RelationCLIP: Training-free Fine-grained Visual and Language Concept Matching4.330.943, 5, 5
3355Thinking Two Moves Ahead: Anticipating Other Users Improves Backdoor Attacks in Federated Learning4.330.943, 5, 5
3356Progressive Transformation Learning For Leveraging Virtual Images in Training4.330.945, 3, 5
3357Neighborhood Gradient Clustering: An Efficient Decentralized Learning Method for Non-IID Data Distributions4.330.943, 5, 5
3358Predicting Drug Repurposing Candidates and Their Mechanisms from A Biomedical Knowledge Graph4.330.945, 5, 3
3359Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees4.330.945, 5, 3
3360Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL4.330.945, 5, 3
3361NeuralPCG: Learning Preconditioner for Solving Partial Differential Equations with Graph Neural Network4.330.943, 5, 5
3362Parameter-varying neural ordinary differential equations with partition-of-unity networks4.330.945, 5, 3
3363OoD-Control: Out-of-Distribution Generalization for Adaptive UAV Flight Control4.330.943, 5, 5
3364VLG: General Video Recognition with Web Textual Knowledge4.330.945, 3, 5
3365Take 5: Interpretable Image Classification with a Handful of Features4.330.945, 3, 5
3366M$^3$Video: Masked Motion Modeling for Self-Supervised Video Representation Learning4.330.945, 3, 5
3367A New Paradigm for Federated Structure Non-IID Subgraph Learning4.330.945, 3, 5
3368Fine-Grained Image Retrieval with Neighbor-Attention Label Correction4.330.945, 3, 5
3369Provable Unsupervised Data Sharing for Offline Reinforcement Learning4.330.945, 5, 3
3370Boosting Discriminative Visual Representation Learning with Scenario-Agnostic Mixup4.330.943, 5, 5
3371AutoDisc: Automatic Distillation Schedule for Large Language Model Compression4.330.943, 5, 5
3372E$^2$: Entropy Discrimination and Energy Optimization for Source-free Universal Domain Adaptation4.330.945, 3, 5
3373Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Bone Shape Reconstruction4.330.945, 5, 3
3374AdaWAC: Adaptively Weighted Augmentation Consistency Regularization for Volumetric Medical Image Segmentation4.330.945, 3, 5
3375Implicit Offline Reinforcement Learning via Supervised Learning4.330.945, 5, 3
3376Learnable Visual Words for Interpreting Image Recognition Models4.330.945, 3, 5
3377PIPS: Path Integral Stochastic Optimal Control for Path Sampling in Molecular Dynamics4.330.943, 5, 5
3378Visual Transformation Telling4.330.945, 3, 5
3379Rethinking the Training Shot Number in Robust Model-Agnostic Meta-Learning4.330.945, 3, 5
3380OpenFE: Automated Feature Generation beyond Expert-level Performance4.330.943, 5, 5
3381Learning to Count Everything: Transformer-based Trackers are Strong Baselines for Class Agnostic Counting4.330.945, 3, 5
3382Optimal Neural Network Approximation of Wasserstein Gradient Direction via Convex Optimization4.330.943, 5, 5
3383DELVING INTO THE HIERARCHICAL STRUCTURE FOR EFFICIENT LARGE-SCALE BI-LEVEL LEARNING4.330.943, 5, 5
3384Towards predicting dynamic stability of power grids with Graph Neural Networks4.330.945, 5, 3
3385ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical Imaging4.330.943, 5, 5
3386Structural Generalization of Visual Imitation Learning with Position-Invariant Regularization4.330.945, 5, 3
3387Generative Model Based Noise Robust Training for Unsupervised Domain Adaptation4.330.943, 5, 5
3388Triangle Inequality for Inverse Optimal Control4.330.943, 5, 5
3389CAMVR: Context-Adaptive Multi-View Representation Learning for Dense Retrieval4.330.943, 5, 5
3390BIL: Bandit Inference Learning for Online Representational Similarity Test4.330.943, 5, 5
3391Spatially constrained Adversarial Attack Detection and Localization in the Representation Space of Optical Flow Networks4.330.943, 5, 5
3392Learn the Time to Learn: Replay Scheduling in Continual Learning4.330.945, 3, 5
3393Coordinate and Generalize: A Unified Framework for Audio-Visual Zero-Shot Learning4.330.943, 5, 5
3394Iterative Relaxing Gradient Projection for Continual Learning4.330.945, 5, 3
3395Private GANs, Revisited4.330.945, 3, 5
3396FEW-SHOT NODE PROMPT TUNING4.330.943, 5, 5
3397Unfixed Bias Iterator: A New Iterative Format4.330.945, 3, 5
3398MS3: A Multimodal Supervised Pretrained Model for Semantic Segmentation4.330.945, 5, 3
3399Unified Vision and Language Prompt Learning4.330.945, 5, 3
3400Module-wise Training of Residual Networks via the Minimizing Movement Scheme4.330.945, 5, 3
3401On the Dynamics under the Averaged Sample Margin Loss and Beyond4.332.361, 6, 6
3402Learning a 3D-Aware Encoder for Style-based Generative Radiance Field4.330.945, 3, 5
3403TT-NF: Tensor Train Neural Fields4.330.945, 3, 5
3404Reward Learning with Trees: Methods and Evaluation4.330.943, 5, 5
3405HyperFeel: An Efficient Federated Learning Framework Using Hyperdimensional Computing4.330.943, 5, 5
3406Learning to aggregate: A parameterized aggregator to debias aggregation for cross-device federated learning4.251.306, 3, 5, 3
3407Long-horizon video prediction using a dynamic latent hierarchy4.251.303, 3, 5, 6
3408Gene finding revisited: improved robustness through structured decoding from learning embeddings4.252.598, 3, 5, 1
3409Towards a Complete Theory of Neural Networks with Few Neurons4.251.303, 6, 3, 5
3410Gradient-Based Transfer Learning4.251.303, 3, 5, 6
3411FLOP: Tasks for Fitness Landscapes Of Protein families using sequence- and structure-based representations4.251.303, 5, 6, 3
3412Diversity Boosted Learning for Domain Generalization with a Large Number of Domains4.251.305, 6, 3, 3
3413The guide and the explorer: smart agents for resource-limited iterated batch reinforcement learning4.251.306, 5, 3, 3
3414Smooth image-to-image translations with latent space interpolations4.251.305, 3, 6, 3
3415Protein Sequence Design in a Latent Space via Model-based Reinforcement Learning4.252.173, 3, 3, 8
3416On the convergence of SGD under the over-parameter setting4.251.921, 6, 5, 5
3417Exphormer: Scaling Graph Transformers with Expander Graphs4.251.305, 3, 3, 6
3418Challenging Common Assumptions about Catastrophic Forgetting4.251.303, 6, 5, 3
3419How to fine-tune vision models with SGD4.251.303, 5, 3, 6
3420Machine Learning Force Fields with Data Cost Aware Training4.251.303, 6, 3, 5
3421A Probabilistic Framework For Modular Continual Learning4.251.303, 3, 5, 6
3422Automatic Data Augmentation via Invariance-Constrained Learning4.251.303, 5, 6, 3
3423NEURAL HAMILTONIAN FLOWS IN GRAPH NEURAL NETWORKS4.251.303, 3, 5, 6
3424Finding Private Bugs: Debugging Implementations of Differentially Private Stochastic Gradient Descent4.251.303, 5, 6, 3
3425Robust Generative Flows on Reliable Image Reconstruction without Training Data4.251.305, 3, 6, 3
3426Boomerang: Local sampling on image manifolds using diffusion models4.252.173, 3, 8, 3
3427Latent Topology Induction for Understanding Contextualized Representations4.251.925, 1, 6, 5
3428Adaptive Anchor for Robust Keypoint Localization4.251.926, 1, 5, 5
3429Getting away with more network pruning: From sparsity to geometry and linear regions4.252.591, 8, 3, 5
3430Faster Hyperparameter Search for GNNs via Calibrated Dataset Condensation4.251.303, 5, 6, 3
3431High-dimensional Continuum Armed and High-dimensional Contextual Bandit: with Applications to Assortment and Pricing4.251.305, 3, 3, 6
3432Do Summarization Models Synthesize?4.251.303, 5, 3, 6
3433$beta$-Stochastic Sign SGD: A Byzantine Resilient and Differentially Private Gradient Compressor for Federated Learning4.251.303, 5, 6, 3
3434Graph Fourier MMD for signals on data graphs4.251.306, 3, 5, 3
3435Proportional Multicalibration4.251.305, 3, 3, 6
3436Effectively Modeling Time Series with Simple Discrete State Spaces4.252.173, 3, 3, 8
3437Tabular Deep Learning when $d gg n$ by Using an Auxiliary Knowledge Graph4.252.591, 3, 5, 8
3438Preserving In-Context Learning Ability in Large Language Model Fine-tuning4.251.306, 3, 5, 3
3439Meta-Learning with Explicit Task Information4.252.598, 5, 1, 3
3440Differentiable Channel Selection for Self-Attention4.251.306, 3, 3, 5
3441Membership Inference Attacks Against Text-to-image Generation Models4.251.306, 5, 3, 3
3442Fair Graph Message Passing with Transparency4.251.306, 5, 3, 3
3443DeepReShape: Redesigning Neural Networks for Private Inference4.251.303, 3, 5, 6
3444Learning to reason with relational abstractions4.251.303, 5, 3, 6
3445General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States4.251.303, 6, 5, 3
3446Does the Half Adversarial Robustness Represent the Whole? It Depends... A Theoretical Perspective of Subnetwork Robustness4.251.303, 6, 3, 5
3447Few-Shot Incremental Learning Using HyperTransformers4.251.305, 3, 3, 6
3448Graph schemas as abstractions for transfer learning, inference, and planning4.251.305, 6, 3, 3
3449Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits4.251.305, 3, 6, 3
3450Efficient One-Shot Neural Architecture Search With Progressive Choice Freezing Evolutionary Search4.252.173, 8, 3, 3
3451GraphEditor: An Efficient Graph Representation Learning and Unlearning Approach4.251.303, 3, 6, 5
3452Towards a More Rigorous Science of Blindspot Discovery in Image Models4.251.303, 3, 6, 5
3453Self-supervised video pretraining yields strong image representations4.251.303, 3, 5, 6
3454Loop Unrolled Shallow Equilibrium Regularizer (LUSER) - A Memory-Efficient Inverse Problem Solver4.251.306, 3, 3, 5
3455FedLite: Improving Communication Efficiency in Federated Split Learning4.251.303, 6, 5, 3
3456Reinforcement Learning for Bandits with Continuous Actions and Large Context Spaces4.251.305, 3, 3, 6
3457How to Enable Uncertainty Estimation in Proximal Policy Optimization4.251.303, 5, 6, 3
3458Training Equilibria in Reinforcement Learning4.251.305, 6, 3, 3
3459Planning with Large Language Models for Code Generation4.252.173, 3, 8, 3
3460Conformal Prediction is Robust to Label Noise4.251.303, 6, 5, 3
3461MyoDex: Generalizable Representations for Dexterous Physiological Manipulation4.251.306, 5, 3, 3
3462On the Expressive Power of Geometric Graph Neural Networks4.252.173, 8, 3, 3
3463CLMIU: Commonsense Learning in Multimodal Image Understanding.4.251.305, 3, 3, 6
3464TOWARDS AN OBJECTIVE EVALUATION OF THE TRUSTWORTHINESS OF CLASSIFIERS4.252.591, 3, 8, 5
3465Direct-Effect Risk Minimization4.251.303, 6, 5, 3
3466Predicting Out-of-Domain Generalization with Local Manifold Smoothness4.252.173, 8, 3, 3
3467Burstormer: Burst Image Restoration and Enhancement Transformer4.252.173, 3, 8, 3
3468$sigma$Reparam: Stable Transformer Training with Spectral Reparametrization4.252.173, 3, 8, 3
3469Federated Learning on Adaptively Weighted Nodes by Bilevel Optimization4.251.306, 5, 3, 3
3470MultiQuan RDP: Rate-Distortion-Perception Coding via Offset Quantizers4.251.303, 5, 6, 3
3471Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training4.251.306, 3, 3, 5
3472CLAS: Central Latent Action Spaces for Coordinated Multi-Robot Manipulation4.251.303, 6, 3, 5
3473Sample-efficient multi-objective molecular optimization with GFlowNets4.252.593, 8, 5, 1
3474A Simple Nadaraya-Watson Head for Explainable and Calibrated Classification4.251.303, 5, 3, 6
3475Conditional Execution Of Cascaded Models Improves The Accuracy-Efficiency Trade-Off4.252.173, 3, 8, 3
3476DynaMS: Dyanmic Margin Selection for Efficient Deep Learning4.251.303, 3, 6, 5
3477Dimensionless instance segmentation by learning graph representations of point clouds4.252.173, 8, 3, 3
3478Semantic Prior for Weakly Supervised Class-Incremental Segmentation4.251.305, 3, 3, 6
3479Biological Factor Regulatory Neural Network4.251.303, 6, 3, 5
3480Differentiable Logic Programming for Probabilistic Reasoning4.251.306, 3, 5, 3
3481Graph Neural Networks as Gradient Flows: understanding graph convolutions via energy4.251.306, 3, 3, 5
3482Memory Learning of Multivariate Asynchronous Time Series4.251.305, 6, 3, 3
3483Improving Generative Flow Networks with Path Regularization4.251.305, 3, 6, 3
3484Calibration for Decision Making via Empirical Risk Minimization4.251.305, 3, 3, 6
3485Contextual Transformer for Offline Reinforcement Learning4.251.305, 3, 3, 6
3486Improving Continual Learning by Accurate Gradient Reconstructions of the Past4.251.306, 3, 5, 3
3487FairGrad: Fairness Aware Gradient Descent4.251.303, 6, 3, 5
3488A Mathematical Framework for Characterizing Dependency Structures of Multimodal Learning4.251.926, 1, 5, 5
3489Unbiased Representation of Electronic Health Records for Patient Outcome Prediction4.251.303, 5, 6, 3
3490Class-wise Visual Explanations for Deep Neural Networks4.251.305, 6, 3, 3
3491Identification of the Adversary from a Single Adversarial Example4.251.303, 3, 5, 6
3492A HIERARCHICAL FRAGMENT-BASED MODEL FOR 3D DRUG-LIKE MOLECULE GENERATION4.251.305, 6, 3, 3
3493Poisoning Generative Models to Promote Catastrophic Forgetting4.251.306, 5, 3, 3
3494Equivariant Disentangled Transformation for Domain Generalization under Combination Shift4.251.303, 5, 3, 6
3495Deep Contrastive Learning Approximates Ensembles of One-Class SVMs with Neural Tangent Kernels4.251.305, 6, 3, 3
3496Limitations of Piecewise Linearity for Efficient Robustness Certification4.251.306, 3, 5, 3
3497Leveraged Asymmetric Loss with Disambiguation for Multi-label Recognition with One-Positive Annotations4.251.303, 3, 5, 6
3498A Semantic Hierarchical Graph Neural Network for Text Classification4.252.178, 3, 3, 3
3499DROP: Conservative Model-based Optimization for Offline Reinforcement Learning4.251.303, 5, 3, 6
3500Semi-Supervised Segmentation-Guided Tumor-Aware Generative Adversarial Network for Multi-Modality Brain Tumor Translation4.251.305, 3, 6, 3
3501HSVC: Transformer-based Hierarchical Distillation for Software Vulnerability Classification4.251.305, 3, 6, 3
3502Oracles and Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning4.251.306, 3, 5, 3
3503What Deep Representations Should We Learn? -- A Neural Collapse Perspective4.251.303, 6, 3, 5
3504Towards Adversarially Robust Deepfake Detection: An Ensemble Approach4.252.173, 3, 3, 8
3505AlphaDesign: A graph protein design method and benchmark on AlphaFold DB4.251.925, 1, 6, 5
3506A Scalable and Exact Gaussian Process Sampler via Kernel Packets4.251.305, 3, 6, 3
3507Model ChangeLists: Characterizing Changes in ML Prediction APIs4.251.303, 5, 6, 3
3508Towards Large Scale Transfer Learning for Differentially Private Image Classification4.251.305, 6, 3, 3
3509Mixed Federated Learning: Joint Decentralized and Centralized Learning4.251.303, 6, 5, 3
3510Toward Discovering Options that Achieve Faster Planning4.251.306, 3, 3, 5
3511Stable Optimization of Gaussian Likelihoods4.251.305, 3, 6, 3
3512Efficient Sequence Packing without Cross-contamination: Accelerating Large Language Models without Impacting Performance4.251.306, 5, 3, 3
3513Evaluating Counterfactual Explainers4.251.303, 5, 3, 6
3514A Reinforcement Learning Approach to Estimating Long-term Treatment Effects4.251.306, 3, 3, 5
3515Conceptual SCAN: Learning With and About Rules4.251.305, 6, 3, 3
3516Unsupervised learning of features and object boundaries from local prediction4.251.303, 3, 5, 6
3517On the Activation Function Dependence of the Spectral Bias of Neural Networks4.251.305, 3, 6, 3
3518MERMADE: $K$-shot Robust Adaptive Mechanism Design via Model-Based Meta-Learning4.251.303, 5, 3, 6
3519Unpacking Large Language Models with Conceptual Consistency4.252.178, 3, 3, 3
3520StarGraph: Knowledge Representation Learning based on Incomplete Two-hop Subgraph4.252.173, 3, 8, 3
3521Memory-efficient Trajectory Matching for Scalable Dataset Distillation4.251.303, 6, 3, 5
3522Attentional Context Alignment for Multimodal Sequential Learning4.251.305, 3, 3, 6
3523REAP: A Large-Scale Realistic Adversarial Patch Benchmark4.251.306, 5, 3, 3
3524Federated Training of Dual Encoding Models on Small Non-IID Client Datasets4.251.305, 6, 3, 3
3525REDUCING OVERSMOOTHING IN GRAPH NEURAL NETWORKS BY CHANGING THE ACTIVATION FUNCTION4.251.303, 3, 5, 6
3526Multitask Reinforcement Learning by Optimizing Neural Pathways4.251.303, 5, 6, 3
3527Input Perturbation Reduces Exposure Bias in Diffusion Models4.251.303, 3, 6, 5
3528RangeAugment: Efficient Online Augmentation with Range Learning4.252.173, 3, 3, 8
3529Privacy-Preserving Vision Transformer on Permutation-Encrypted Images4.251.925, 1, 5, 6
3530FastDiff 2: Dually Incorporating GANs into Diffusion Models for High-Quality Speech Synthesis4.251.305, 6, 3, 3
3531On the Convergence and Calibration of Deep Learning with Differential Privacy4.251.305, 6, 3, 3
3532Critical Batch Size Minimizes Stochastic First-Order Oracle Complexity of Deep Learning Optimizer using Hyperparameters Close to One4.251.306, 5, 3, 3
3533Restricted Generative Projection for One-Class Classification and Anomaly detection4.251.305, 3, 3, 6
3534learning hierarchical multi-agent cooperation with long short-term intention4.251.306, 3, 3, 5
3535Pixel-Level Task Helps Pruned Network Transfer to Downstream Tasks4.251.305, 3, 3, 6
3536Efficient block contrastive learning via parameter-free meta-node approximation4.251.306, 3, 5, 3
3537Improving Model Consistency of Decentralized Federated Learning via Sharpness Aware Minimization and Multiple Gossip Approaches4.251.303, 3, 5, 6
3538Supplementing Domain Knowledge to BERT with Semi-structured Information of Documents4.251.305, 3, 3, 6
3539Window Projection Features are All You Need for Time Series Anomaly Detection4.251.303, 3, 6, 5
3540Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes4.251.303, 6, 3, 5
3541MetaFS: An Effective Wrapper Feature Selection via Meta Learning4.251.303, 3, 6, 5
3542A Time-Consistency Curriculum for Learning from Instance-Dependent Noisy Labels4.251.303, 6, 5, 3
3543Learning Object Affordance with Contact and Grasp Generation4.251.303, 5, 6, 3
3544Benchmarking Approximate k-Nearest Neighbour Search for Big High Dimensional Dynamic Data4.251.303, 6, 5, 3
3545Bias Mimicking: A Simple Sampling Approach for Bias Mitigation4.251.303, 5, 3, 6
3546From Coarse to Fine-grained Concept based Discrimination for Phrase Detection4.251.303, 6, 3, 5
3547k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy4.251.305, 3, 6, 3
3548Randomized Smoothing with Masked Inference for Adversarially Robust NLP Systems4.251.306, 3, 5, 3
3549A Data-Based Perspective on Transfer Learning4.251.303, 5, 6, 3
3550GeONet: a neural operator for learning the Wasserstein geodesic4.251.303, 3, 6, 5
3551The Convergence Rate of SGD's Final Iterate: Analysis on Dimension Dependence4.251.303, 6, 5, 3
3552FAME: Fast Adaptive Moment Estimation based on Triple Exponential Moving Average4.252.173, 8, 3, 3
3553No Double Descent in PCA: Training and Pre-Training in High Dimensions4.251.303, 5, 3, 6
3554To be robust and to be fair: aligning fairness with robustness4.252.178, 3, 3, 3
3555Fair Clustering via Equalized Confidence4.251.306, 3, 3, 5
3556Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning4.251.305, 3, 3, 6
3557Improving Information Retention in Large Scale Online Continual Learning4.251.303, 6, 3, 5
3558ON INJECTING NOISE DURING INFERENCE4.251.303, 6, 3, 5
3559Uncertainty-based Multi-Task Data Sharing for Offline Reinforcement Learning4.251.303, 3, 6, 5
3560Differentiable Meta-Logical Programming4.251.303, 5, 3, 6
3561Regularizing hard examples improves robustness4.251.303, 3, 5, 6
3562From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models4.251.305, 6, 3, 3
3563High probability error bounds of SGD in unbounded domain4.251.306, 3, 3, 5
3564MAXENT LOSS: CONSTRAINED MAXIMUM ENTROPY FOR CALIBRATING DEEP NEURAL NETWORKS4.251.303, 5, 3, 6
3565Efficient and Stealthy Backdoor Attack Triggers are Close at Hand4.251.303, 3, 5, 6
3566Teaching Others is Teaching Yourself Regularization For Controllable Language Models4.251.303, 3, 5, 6
3567On Intriguing Layer-Wise Properties of Robust Overfitting in Adversarial Training4.251.303, 5, 3, 6
3568Uncertainty-Aware Meta-Learning for Multimodal Task Distributions4.251.305, 3, 6, 3
3569Federated Learning for Inference at Anytime and Anywhere4.251.303, 5, 6, 3
3570Low-Rank Graph Neural Networks Inspired by the Weak-balance Theory in Social Networks4.251.303, 5, 3, 6
3571Node-Level Membership Inference Attacks Against Graph Neural Networks4.251.303, 6, 5, 3
3572Holding Monotonic Improvement and Generality for Multi-Agent Proximal Policy Optimization4.252.173, 3, 8, 3
3573Towards the gradient adjustment by loss status for Neural Network Optimization4.251.305, 6, 3, 3
3574Linear Video Transformer with Feature Fixation4.251.303, 3, 6, 5
3575Local Coefficient Optimization in Federated Learning4.251.303, 3, 6, 5
3576DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning4.251.303, 3, 6, 5
3577Why pseudo-label based algorithm is effective? --from the perspective of pseudo-labeled data4.251.303, 5, 3, 6
3578RbX: Region-based explanations of prediction models4.251.303, 5, 3, 6
3579Motif-induced Graph Normalization4.251.305, 6, 3, 3
3580Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks4.251.303, 3, 6, 5
3581Evaluation of Attribution Explanations without Ground Truth4.251.303, 5, 6, 3
3582Going Deeper with Spiking Neurons: Towards Binary Outputs of Deep Logic Spiking Neural Network4.252.591, 8, 5, 3
3583Correcting Three Existing Beliefs on Mutual Information in Contrastive Learning4.251.305, 6, 3, 3
3584Batch Normalization Is Blind to the First and Second Derivatives of the Loss w.r.t. Features4.251.925, 1, 6, 5
3585Node Number Awareness Representation for Graph Similarity Learning4.251.303, 5, 6, 3
3586Improving the Transferability of Adversarial Attacks through Experienced Precise Nesterov Momentum4.251.303, 5, 6, 3
3587Sparse Random Networks for Communication-Efficient Federated Learning4.251.305, 3, 6, 3
3588WaveMix-Lite: A Resource-efficient Neural Network for Image Analysis4.251.303, 5, 6, 3
3589Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask4.251.305, 6, 3, 3
3590Imposing conservation properties in deep dynamics modeling via contrastive learning4.251.303, 5, 3, 6
3591Accumulative Poisoning Defense with Memorization Discrepancy4.251.305, 6, 3, 3
3592S^2-Transformer for Mask-Aware Hyperspectral Image Reconstruction4.251.306, 3, 3, 5
3593A deep top-down approach to hierarchically coherent probabilistic forecasting4.251.303, 3, 6, 5
3594Smart Multi-tenant Federated Learning4.252.173, 8, 3, 3
3595Accelerating Inverse Reinforcement Learning with Expert Bootstrapping4.251.303, 3, 6, 5
3596Intepreting & Improving Pretrained Language Models: A Probabilistic Conceptual Approach4.252.178, 3, 3, 3
3597Efficient Trojan Injection: 90% Attack Success Rate Using 0.04% Poisoned Samples4.251.305, 3, 3, 6
3598Multi-Dataset Multi-Task Framework for Learning Molecules and Protein-target Interactions Properties4.251.306, 3, 3, 5
3599Deep Ensembles for Graphs with Higher-order Dependencies4.251.306, 3, 5, 3
3600MEGAN: Multi Explanation Graph Attention Network4.252.173, 8, 3, 3
3601Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes4.251.921, 5, 6, 5
3602FedREP: A Byzantine-Robust, Communication-Efficient and Privacy-Preserving Framework for Federated Learning4.251.303, 5, 6, 3
3603Targeted Adversarial Self-Supervised Learning4.251.303, 6, 3, 5
3604Triplet Similarity Learning on Concordance Constraint4.251.303, 3, 5, 6
3605Robust Transfer Learning Based on Minimax Principle4.251.303, 5, 6, 3
3606Interpreting Neural Networks Through the Lens of Heat Flow4.251.303, 5, 3, 6
3607DCE: Offline Reinforcement Learning With Double Conservative Estimates4.251.303, 5, 3, 6
3608Efficient Surrogate Gradients for Training Spiking Neural Networks4.251.303, 5, 3, 6
3609Extreme Masking for Learning Instance and Distributed Visual Representations4.252.593, 8, 5, 1
3610Leveraging Hierarchical Structure for Multi-Domain Active Learning with Theoretical Guarantees4.251.306, 3, 5, 3
3611Configuring Mixed-Integer Linear Programming Solvers with Deep Metric Learning4.252.178, 3, 3, 3
3612Graph Neural Bandits4.251.303, 6, 5, 3
3613Deep Power Laws for Hyperparameter Optimization4.251.303, 6, 3, 5
3614Prompt-Matched Semantic Segmentation4.251.303, 3, 5, 6
3615GeoVeX: Geospatial Vectors with Hexagonal Convolutional Autoencoders4.251.303, 6, 5, 3
3616Feature Synchronization in Backdoor Attacks4.251.306, 3, 3, 5
3617MMTSA: Multi-Modal Temporal Segment Attention Network for Efficient Human Activity Recognition4.251.305, 3, 6, 3
3618Look Back When Surprised: Stabilizing Reverse Experience Replay for Neural Approximation4.252.593, 5, 8, 1
3619Multiscale Neural Operator: Learning Fast and Grid-independent PDE Solvers4.251.303, 6, 5, 3
3620A Massively Parallel Benchmark for Safe Dexterous Manipulation4.251.303, 5, 6, 3
3621Rethinking the Explanation of Graph Neural Network via Non-parametric Subgraph Matching4.252.173, 8, 3, 3
3622Q-Match: Self-Supervised Learning For Tabular Data by Matching Distributions Induced by a Queue4.251.303, 3, 6, 5
3623Voting from Nearest Tasks: Meta-Vote Pruning of Pretrained Models for Downstream Tasks4.251.303, 5, 3, 6
3624Momentum in Momentum for Adaptive Optimization4.252.178, 3, 3, 3
3625NICO++: Towards Better Benchmarking for Domain Generalization4.251.305, 3, 6, 3
3626Gradient Norm Regularizer Seeks Flat Minima and Improves Generalization4.251.303, 3, 5, 6
3627Calibrating Multimodal Learning4.251.305, 3, 6, 3
3628Token Turing Machines4.251.303, 5, 6, 3
3629Cutting Long Gradient Flows: Decoupling End-to-End Backpropagation Based on Supervised Contrastive Learning4.251.303, 5, 3, 6
3630ThinkSum: Probabilistic reasoning over sets using large language models4.252.178, 3, 3, 3
3631Model-agnostic Measure of Generalization Difficulty4.252.173, 3, 3, 8
3632Hedge Your Actions: Flexible Reinforcement Learning for Complex Action Spaces4.252.591, 3, 5, 8
3633Online Learning for Obstacle Avoidance4.201.943, 6, 6, 5, 1
3634FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels4.200.983, 5, 5, 5, 3
3635Game-Theoretic Understanding of Misclassification4.201.943, 5, 6, 6, 1
3636Improving Vision Attention with Random Walk Graph Kernel4.200.985, 5, 3, 3, 5
3637Lifting the Curse of Capacity Gap in Distilling Large Language Models4.200.983, 5, 5, 3, 5
3638Semi-supervised learning of partial differential operators and dynamical flows4.200.983, 5, 5, 3, 5
3639Language Models Can See: Plugging Visual Controls in Text Generation4.201.473, 3, 3, 6, 6
3640Logic-aware Pre-training of Language Models4.201.601, 5, 5, 5, 5
3641Towards Discovering Neural Architectures from Scratch4.201.476, 3, 6, 3, 3
3642Neural Autoregressive Refinement for Self-Supervised Outlier Detection beyond Images4.171.675, 5, 5, 1, 6, 3
3643Data Leakage in Tabular Federated Learning4.001.416, 3, 3
3644Towards Robust Online Dialogue Response Generation4.001.003, 5, 5, 3
3645MolBART: Generative Masked Language Models for Molecular Representations4.001.003, 5, 3, 5
3646Formal Specifications from Natural Language4.001.005, 3, 3, 5
3647Pseudo-Differential Integral Operator for Learning Solution Operators of Partial Differential Equations4.001.003, 3, 5, 5
3648A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration4.001.005, 3, 5, 3
3649Moment Distributionally Robust Probabilistic Supervised Learning4.001.003, 5, 5, 3
3650Accelerating spiking neural network training using the $d$-block model4.001.263, 3, 6, 5, 3
3651RG: OUT-OF-DISTRIBUTION DETECTION WITH REACTIVATE GRADNORM4.001.003, 5, 5, 3
3652Breaking Large Language Model-based Code Generation4.001.413, 6, 3
3653Proximal Validation Protocol4.001.003, 5, 3, 5
3654AUTOMATIC CURRICULUM FOR UNSUPERVISED REIN- FORCEMENT LEARNING4.002.161, 5, 6
3655On Representation Learning in the First Layer of Deep CNNs and the Dynamics of Gradient Descent4.001.005, 3, 5, 3
3656Learning Layered Implicit Model for 3D Avatar Clothing Representation4.001.003, 5, 5, 3
3657Generalizable Multi-Relational Graph Representation Learning: A Message Intervention Approach4.003.101, 10, 3, 3, 3
3658Explicitly Maintaining Diverse Playing Styles in Self-Play4.001.413, 6, 3
3659Label Similarity Aware Contrastive Learning4.001.005, 5, 3, 3
3660Incompatibility between Deterministic Policy and Generative Adversarial Imitation Learning4.001.263, 3, 6, 3, 5
3661CAT: Collaborative Adversarial Training4.001.005, 3, 3, 5
3662Therbligs in Action: Video Understanding through Motion Primitives4.001.005, 3, 3, 5
3663DEFENDING BACKDOOR ATTACKS VIA ROBUSTNESS AGAINST NOISY LABEL4.001.005, 3, 5, 3
3664Efficient, probabilistic analysis of combinatorial neural codes4.001.413, 6, 3
3665Simple and Deep Graph Attention Networks4.001.005, 3, 5, 3
3666GNN Domain Adaptation using Optimal Transport4.001.003, 5, 5, 3
3667An Integrated Multi-Label Multi-Modal Framework in Deep Metric Learning4.001.416, 3, 3
3668Autoregressive Graph Network for Learning Multi-step Physics4.001.003, 3, 5, 5
3669Layer-wise Balanced Activation Mechanism4.001.003, 5, 3, 5
3670Neural Integral Equations4.001.416, 3, 3
3671Consistent Data Distribution Sampling for Large-scale Retrieval4.001.003, 5, 3, 5
3672Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness4.001.266, 3, 3, 3, 5
3673Dynamics Model Based Adversarial Training For Competitive Reinforcement Learning4.001.005, 3, 3, 5
3674A Stable and Scalable Method for Solving Initial Value PDEs with Neural Networks4.001.413, 3, 6
3675CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets4.001.005, 3, 3, 5
3676Forgetful causal masking makes causal language models better zero-shot learners4.002.121, 6, 6, 3
3677Marich: A Query-efficient & Online Model Extraction Attack using Public Data4.001.413, 3, 6
3678Connecting representation and generation via masked vision-language transformer4.001.005, 3, 5, 3
3679Current Anomaly Detectors are Anomalous: On Semantic Treatment of OOD Inputs4.001.005, 3, 3, 5
3680Event-former: A Self-supervised Learning Paradigm for Temporal Point Processes4.002.123, 1, 6, 6
3681Controllable Concept Transfer of Intermediate Representations4.001.003, 3, 5, 5
3682SaiT: Sparse Vision Transformers through Adaptive Token Pruning4.001.005, 3, 3, 5
3683Differentiable Rendering with Reparameterized Volume Sampling4.001.003, 3, 5, 5
3684Just Avoid Robust Inaccuracy: Boosting Robustness Without Sacrificing Accuracy4.001.413, 6, 3
3685Invariant Aggregator for Defending against Federated Backdoor Attacks4.001.005, 3, 5, 3
3686UNDERSTANDING THE ROLE OF POSITIONAL ENCODINGS IN SENTENCE REPRESENTATIONS4.001.003, 5, 3, 5
3687Attribution Scores are Redundant: Explaining Feature Contribution By Trajectories4.001.263, 3, 6, 5, 3
3688Neural Networks as Paths through the Space of Representations4.001.003, 3, 5, 5
3689From Points to Functions: Infinite-dimensional Representations in Diffusion Models4.001.005, 5, 3, 3
3690ESC: A Benchmark For Multi-Domain End-to-End Speech Recognition4.001.005, 3, 5, 3
3691Towards Dynamic Sparsification by Iterative Prune-Grow LookAheads4.001.416, 3, 3
3692Skill Decision Transformer4.001.003, 3, 5, 5
36933D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction4.001.413, 3, 6
3694Synthetic Pre-Training Tasks for Neural Machine Translation4.001.005, 3, 5, 3
3695Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function4.001.003, 5, 3, 5
3696A $2$-parameter Persistence Layer for Learning4.001.003, 5, 5, 3
3697UniS-MMC: Learning Unimodality-supervised Multimodal Contrastive Representations4.001.003, 3, 5, 5
3698NAG-GS: semi-implicit, accelerated and robust stochastic optimizer.4.001.003, 5, 3, 5
3699Adversarial Policies Beat Professional-Level Go AIs4.001.413, 6, 3
3700Pre-train Graph Neural Networks for Brain Network Analysis4.001.003, 5, 3, 5
3701Unscented Autoencoder4.002.121, 3, 6, 6
3702AQuaMaM: An Autoregressive, Quaternion Manifold Model for Rapidly Estimating Complex SO(3) Distributions4.001.413, 3, 6
3703Multi-Objective GFlowNets4.001.413, 6, 3
3704A Scalable Training Strategy for Blind Multi-Distribution Noise Removal4.001.005, 5, 3, 3
3705Triplet learning of task representations in latent space for continual learning4.001.003, 5, 3, 5
3706The Robustness Limits of SoTA Vision Models to Natural Variation4.001.005, 3, 3, 5
3707DLP: Data-Driven Label-Poisoning Backdoor Attack4.001.003, 5, 5, 3
3708ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech4.001.003, 3, 5, 5
3709Semantic Transformation-based Data Augmentation for Few-Shot Learning4.001.413, 6, 3
3710COC curve: operating neural networks at high accuracy and low manual effort4.001.416, 3, 3
3711Wide Attention is the Way Forward for Transformers4.001.005, 5, 3, 3
3712Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning4.001.005, 3, 3, 5
3713SAGE: Semantic-Aware Global Explanations for Named Entity Recognition4.001.265, 3, 6, 3, 3
3714On the Forward Invariance of Neural ODEs4.002.123, 1, 6, 6
3715Learning Debiased Representations via Conditional Attribute Interpolation4.001.413, 6, 3
3716Multi-stationary point losses for robust model4.002.941, 3, 8
3717Learning Stackelberg Equilibria and Applications to Economic Design Games4.002.123, 1, 6, 6
3718Personalized federated composite learning with forward-backward envelopes4.001.005, 3, 3, 5
3719Attention Based Models for Cell Type Classification on Single-Cell RNA-Seq Data4.001.003, 5, 3, 5
3720Robust and accelerated single-spike spiking neural network training with applicability to challenging temporal tasks4.001.005, 3, 3, 5
3721Annealed Fisher Implicit Sampler4.001.003, 5, 5, 3
3722Differentiable and transportable structure learning4.001.003, 3, 5, 5
3723SeKron: A Decomposition Method Supporting Many Factorization Structures4.002.161, 6, 5
3724Deep Class Conditional Gaussians for Continual Learning4.001.413, 6, 3
3725On Feature Diversity in Energy-based Models4.001.795, 5, 1, 6, 3
3726How does Uncertainty-aware Sample-selection Help Decision against Action Noise?4.001.413, 3, 6
3727QuAFL: Federated Averaging Made Asynchronous and Communication-Efficient4.001.003, 5, 3, 5
3728Targeted Attacks on Timeseries Forecasting4.001.003, 3, 5, 5
3729Flareon: Stealthy Backdoor Injection via Poisoned Augmentation4.001.413, 3, 6
3730Multi-Head State Space Model for Sequence Modeling4.002.123, 6, 1, 6
3731Rewiring with Positional Encodings for GNNs4.001.005, 3, 3, 5
3732Gated Inference Network: Inferencing and Learning State-Space Models4.001.416, 3, 3
3733Optimizing Spca-based Continual Learning: A Theoretical Approach4.002.126, 3, 1, 6
3734Learning Task Agnostic Temporal Consistency Correction4.001.003, 5, 5, 3
3735Transformers with Multiresolution Attention Heads4.001.413, 6, 3
3736Reinforcement Learning using a Molecular Fragment Based Approach for Reaction Discovery4.001.263, 3, 3, 6, 5
3737Invariance Makes a Difference: Disentangling the Role of Invariance and Equivariance in Representations4.001.413, 3, 6
3738Learning DAGs from Fourier-Sparse Data4.001.003, 5, 3, 5
3739Momentum Boosted Episodic Memory for Improving Learning in Long-Tailed RL Environments4.001.005, 5, 3, 3
3740Neural Image Compression with a Diffusion-based Decoder4.001.413, 3, 6
3741Caption supervision enables robust learners: a controlled study of distributionally robust model training4.001.796, 1, 5, 3, 5
3742Pessimistic Policy Iteration for Offline Reinforcement Learning4.001.263, 6, 3, 3, 5
3743Prototypical Context-aware Dynamics Generalization for High-dimensional Model-based Reinforcement Learning4.001.003, 3, 5, 5
3744Efficient Hyperparameter Optimization Through Tensor Completion4.001.005, 3, 5, 3
3745UTS: When Monotonic Value Factorisation Meets Non-monotonic and Stochastic Targets4.001.413, 3, 6
3746Learning Rotation-Equivariant Features for Visual Correspondence4.001.005, 3, 5, 3
3747PAVI: Plate-Amortized Variational Inference4.001.003, 3, 5, 5
3748Multimodal Masked Autoencoders Learn Transferable Representations4.001.003, 3, 5, 5
3749Test-Time AutoEval with Supporting Self-supervision4.001.005, 3, 3, 5
3750MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning4.001.005, 3, 5, 3
3751Partial Differential Equation-Regularized Neural Networks: An Application to Image Classification4.001.003, 5, 5, 3
3752On Nullspace of Vision Transformers and What Does it Tell Us?4.001.005, 3, 5, 3
3753Which is Better for Learning with Noisy Labels: The Semi-supervised Method or Modeling Label Noise?4.001.003, 5, 3, 5
3754FACS: FAST ADAPTIVE CHANNEL SQUEEZING4.001.003, 5, 5, 3
3755DYNAMIC ENSEMBLE FOR PROBABILISTIC TIME- SERIES FORECASTING VIA DEEP REINFORCEMENT LEARNING4.001.005, 3, 5, 3
3756Understanding Pruning at Initialization: An Effective Node-Path Balancing Perspective4.001.003, 5, 3, 5
3757Oracle-oriented Robustness: Robust Image Model Evaluation with Pretrained Models as Surrogate Oracle4.001.003, 3, 5, 5
3758Mitigating Demographic Bias of Federated Learning Models via Global Domain Smoothing4.002.165, 6, 1
3759Analysis of differentially private synthetic data: a general measurement error approach4.001.005, 3, 5, 3
3760Counterfactual Contrastive Learning for Robust Text Classification4.001.003, 5, 3, 5
3761Which Invariance Should We Transfer? A Causal Minimax Learning Approach4.001.003, 5, 3, 5
3762Graph Contrastive Learning with Reinforced Augmentation4.001.003, 5, 5, 3
3763Trusted Aggregation (TAG): Model Filtering Backdoor Defense In Federated Learning4.001.005, 5, 3, 3
3764BiViT: Exploring Binary Vision Transformers4.001.416, 3, 3
3765LVQ-VAE:End-to-end Hyperprior-based Variational Image Compression with Lattice Vector Quantization4.001.003, 3, 5, 5
3766Towards Solving Industrial Sequential Decision-making Tasks under Near-predictable Dynamics via Reinforcement Learning: an Implicit Corrective Value Estimation Approach4.001.003, 3, 5, 5
3767The Graph Learning Attention Mechanism: Learnable Sparsification Without Heuristics4.001.003, 5, 3, 5
3768On Convergence of Federated Averaging Langevin Dynamics4.001.413, 6, 3
3769BYPASSING THE STABILITY-PLASTICITY TRADEOFF TO REDUCE PREDICTIVE CHURN4.002.371, 8, 3, 5, 3
3770Learning Object-Centric Dynamic Modes from Video and Emerging Properties4.001.003, 5, 5, 3
3771Invertible normalizing flow neural networks by JKO scheme4.001.005, 3, 3, 5
3772Towards Causal Concepts for Explaining Language Models4.001.003, 3, 5, 5
3773Leveraging Human Features at Test-Time4.001.413, 3, 6
3774SaMoE: Parameter Efficient MoE Language Models via Self-Adaptive Expert Combination4.001.413, 3, 6
3775Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size4.001.005, 3, 5, 3
3776Learning from Others: Similarity-based Regularization for Mitigating Artifacts4.001.005, 5, 3, 3
3777Red PANDA: Disambiguating Anomaly Detection by Removing Nuisance Factors4.002.126, 1, 6, 3
3778Taming Policy Constrained Offline Reinforcement Learning for Non-expert Demonstrations4.001.005, 5, 3, 3
3779Internal Purity: A Differential Entropy based Internal Validation Index for Clustering Validation4.001.003, 5, 3, 5
3780PromptSum: Planning with Mixed Prompts for Parameter-Efficient Controllable Abstractive Summarization4.001.003, 5, 3, 5
3781A Theory of Equivalence-Preserving Program Embeddings4.001.003, 5, 3, 5
3782Formal Interpretability with Merlin-Arthur Classifiers4.001.005, 5, 3, 3
3783How deep convolutional neural networks lose spatial information with training4.001.413, 6, 3
3784gGN: learning to represent nodes in directed graphs as low-rank Gaussian distributions4.001.005, 5, 3, 3
3785Provable Sharpness-Aware Minimization with Adaptive Learning Rate4.001.003, 5, 5, 3
3786Beyond re-balancing: distributionally robust augmentation against class-conditional distribution shift in long-tailed recognition4.001.005, 3, 5, 3
3787Offline Communication Learning with Multi-source Datasets4.001.005, 5, 3, 3
3788Training via Confidence Ranking4.001.003, 5, 3, 5
3789Computational Doob h-transforms for Online Filtering of Discretely Observed Diffusions4.001.735, 5, 1, 5
3790Reconciling feature sharing and multiple predictions with MIMO Vision Transformers4.001.005, 3, 3, 5
3791$Q$-learning with regularization converges with non-linear non-stationary features4.001.413, 6, 3
3792Backdoor or Feature? A New Perspective on Data Poisoning4.001.003, 5, 5, 3
3793SpeedyZero: Mastering Atari with Limited Data and Time4.001.413, 3, 6
3794Source-Target Coordinated Training with Multi-head Hybrid-Attention for Domain Adaptive Semantic Segmentation4.001.005, 3, 3, 5
3795Revisiting Activation Function Design for Improving Adversarial Robustness at Scale4.001.005, 5, 3, 3
3796What Does Vision Supervision Bring to Language Models? A Case Study of CLIP4.001.005, 3, 5, 3
3797Learning to Counter: Stochastic Feature-based Learning for Diverse Counterfactual Explanations4.001.005, 3, 5, 3
3798Exploiting Certified Defences to Attack Randomised Smoothing4.001.005, 3, 5, 3
3799How and Why We Detect Distribution Shift: Critical Analysis of Methods and Benchmarks4.001.413, 3, 6
3800$textrm{D}^3textrm{Former}$: Debiased Dual Distilled Transformer for Incremental Learning4.001.003, 5, 3, 5
3801Score-Based Graph Generative Modeling with Self-Guided Latent Diffusion4.001.005, 3, 3, 5
3802BrGANs: Stabilizing GANs' Training Process with Brownian Motion Control4.001.005, 5, 3, 3
3803Unfair geometries: exactly solvable data model with fairness implications4.001.005, 3, 3, 5
3804ExtraMix: Extrapolatable Data Augmentation for Regression using Generative Models4.001.005, 5, 3, 3
3805Learning Combinatorial Node Labeling Algorithms4.001.005, 3, 3, 5
3806PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer4.001.003, 5, 5, 3
3807Addressing Variable Dependency in GNN-based SAT Solving4.001.003, 5, 5, 3
3808Adversarial Examples Guided Pseudo-label Refinement for Decentralized Domain Adaptation4.001.005, 3, 5, 3
3809Molecule Generation for Target Receptor Binding via Continuous Normalizing Flows4.001.003, 5, 5, 3
3810Lost Domain Generalization Is a Natural Consequence of Lack of Training Domains4.001.413, 6, 3
3811ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading4.002.166, 5, 1
3812OCD: Learning to Overfit with Conditional Diffusion Models4.001.003, 5, 5, 3
3813Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives4.001.005, 3, 3, 5
3814$z$-SignFedAvg: A Unified Stochastic Sign-based Compression for Federated Learning4.001.416, 3, 3
3815DECN: Evolution Inspired Deep Convolution Network for Black-box Optimization4.001.263, 5, 6, 3, 3
3816Multi-Treatment Effect Estimation with Proxy: Contrastive Learning and Rank Weighting4.001.003, 5, 5, 3
3817DeepTime: Deep Time-index Meta-learning for Non-stationary Time-series Forecasting4.001.003, 5, 5, 3
3818Efficient Method for Bi-level Optimization with Non-smooth Lower-Level Problem4.001.003, 5, 3, 5
3819Learning an Invertible Output Mapping Can Mitigate Simplicity Bias in Neural Networks4.001.003, 3, 5, 5
3820Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk4.001.003, 5, 5, 3
3821MaskConver: A Universal Panoptic and Semantic Segmentation Model with Pure Convolutions4.001.003, 3, 5, 5
3822AxBERT: An Explainable Chinese Spelling Correction Method Driven by Associative Knowledge Network4.001.005, 5, 3, 3
3823Towards Efficient Posterior Sampling in Deep Neural Networks via Symmetry Removal4.002.003, 3, 8, 3, 3
3824Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations4.001.003, 3, 5, 5
3825Knowledge-Driven New Drug Recommendation4.001.003, 5, 5, 3
3826Contrastive Prompt Tuning Improves Generalization in Vision-Language Models4.001.005, 3, 5, 3
3827On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs4.001.416, 3, 3
3828Robust Reinforcement Learning with Distributional Risk-averse formulation4.001.003, 5, 5, 3
3829Model-based Value Exploration in Actor-critic Deep Reinforcement Learning4.001.005, 5, 3, 3
3830Adversarial Detector for Decision Tree Ensembles Using Representation Learning4.001.005, 3, 3, 5
3831'Why did the Model Fail?': Attributing Model Performance Changes to Distribution Shifts4.001.003, 5, 3, 5
3832Imitation Improvement Learning for Large-scale Capacitated Vehicle Routing Problems4.001.005, 5, 3, 3
3833Points2NeRF: Generating Neural Radiance Fields from 3D point cloud4.001.003, 5, 5, 3
3834DEEPER-GXX: DEEPENING ARBITRARY GNNS4.001.003, 3, 5, 5
3835Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings4.001.005, 3, 3, 5
3836HyperMAML: Few-Shot Adaptation of Deep Models with Hypernetworks4.001.005, 3, 5, 3
3837EIT: Enhanced Interactive Transformer for Sequence Generation4.001.003, 5, 3, 5
3838Local Attention Layers for Vision Transformers4.001.005, 5, 3, 3
3839Neural Discrete Reinforcement Learning4.001.005, 3, 3, 5
3840Memory-Augmented Variational Adaptation for Online Few-Shot Segmentation4.001.003, 3, 5, 5
3841QUANTILE-LSTM: A ROBUST LSTM FOR ANOMALY DETECTION4.001.005, 3, 3, 5
3842Auto-Encoding Adversarial Imitation Learning4.001.003, 5, 3, 5
3843BiTAT: Neural Network Binarization with Task-Dependent Aggregated Transformation4.001.003, 5, 5, 3
3844Constrained Reinforcement Learning for Safety-Critical Tasks via Scenario-Based Programming4.001.413, 3, 6
3845Physically Plausible and Conservative Solutions to Navier-Stokes Equations Using Physics-Informed CNNs4.001.413, 6, 3
3846Does Federated Learning Really Need Backpropagation?4.001.416, 3, 3
3847Specialization of Sub-paths for Adaptive Depth Networks4.001.005, 3, 5, 3
3848Closing the Performance Gap between Cumbersome and Lightweight Contrastive Models4.001.263, 3, 6, 5, 3
3849MAGA: Modeling a Group Action4.001.003, 3, 5, 5
3850Recursion of Thought: Divide and Conquer Reasoning with Language Models4.002.948, 1, 3
3851Geo-NN: An End-to-End Framework for Geodesic Mean Estimation on the Manifold of Symmetric Positive Definite Matrices4.001.413, 3, 6
3852Progressive Image Synthesis from Semantics to Details with Denoising Diffusion GAN4.001.005, 5, 3, 3
3853Learning large-scale Kernel Networks4.001.005, 3, 3, 5
3854Path Regularization: A Convexity and Sparsity Inducing Regularization for Parallel ReLU Networks4.001.416, 3, 3
3855MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning4.001.003, 5, 5, 3
3856MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition4.001.003, 5, 3, 5
3857MQSP: Micro-Query Sequence Parallelism for Linearly Scaling Long Sequence Transformer4.001.005, 3, 3, 5
3858Schrödinger's FP: Training Neural Networks with Dynamic Floating-Point Containers4.001.005, 3, 3, 5
3859Continual Learning with Group-wise Neuron Normalization4.001.005, 3, 3, 5
3860Sparse Hyperbolic Representation Learning4.001.413, 6, 3
3861Universal embodied intelligence: learning from crowd, recognizing the world, and reinforced with experience4.002.121, 6, 6, 3
3862LAMDA: Latent mapping for domain adaption of image generators4.001.005, 3, 5, 3
3863Novel Class Discovery under Unreliable Sampling4.001.416, 3, 3
3864Teach me how to Interpolate a Myriad of Embeddings4.001.413, 3, 6
3865Interventional Rationalization4.001.003, 3, 5, 5
3866Diverse, Difficult, and Odd Instances (D2O): A New Test Set for Object Classification4.001.003, 5, 3, 5
3867Effective dimension of machine learning models4.001.003, 5, 3, 5
3868A theory of representation learning in neural networks gives a deep generalisation of kernel methods4.001.413, 6, 3
3869A spatiotemporal graph neural network with multi granularity for air quality prediction4.001.413, 3, 6
3870OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions4.001.003, 3, 5, 5
3871How you start matters for generalization4.001.413, 3, 6
3872PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework4.001.005, 5, 3, 3
3873Dimensionality-Varying Diffusion Process4.001.003, 3, 5, 5
3874Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents4.001.005, 3, 5, 3
3875On Storage Neural Network Augmented Approximate Nearest Neighbor Search4.001.003, 5, 3, 5
3876Sample Importance in SGD Training4.001.003, 5, 3, 5
3877Critical Learning Periods Augmented Model Poisoning Attacks to Byzantine-Robust Federated Learning4.001.003, 3, 5, 5
3878Individual Fairness of Data Provider Regarding Privacy Risk and Gain4.001.005, 3, 3, 5
3879Semi-supervised Node Classification with Imbalanced Receptive Field4.001.003, 5, 5, 3
3880CEREAL: Few-Sample Clustering Evaluation4.001.005, 3, 3, 5
3881Computational-Unidentifiability in Representation for Fair Downstream Tasks4.001.416, 3, 3
3882Accelerating Federated Learning Convergence via Opportunistic Mobile Relaying4.001.416, 3, 3
3883Learning Control Lyapunov Functions For High-dimensional Unknown Systems using Guided Iterative State Space Exploration4.001.005, 3, 3, 5
3884Universal Mini-Batch Consistency for Set Encoding Functions4.001.005, 5, 3, 3
3885Soundness and Completeness: An Algorithmic Perspective on Evaluation of Feature Attribution4.001.003, 5, 3, 5
3886Improving Differentially-Private Deep Learning with Gradients Index Pruning4.001.263, 5, 6, 3, 3
3887Shuffle Gaussian Mechanism for Differential Privacy4.001.416, 3, 3
3888MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning4.001.003, 5, 3, 5
3889Distributional Reinforcement Learning via Sinkhorn Iterations4.001.003, 5, 3, 5
3890MLM with Global Co-occurrence4.001.003, 5, 5, 3
3891Breaking Correlation Shift via Conditional Invariant Regularizer4.001.005, 5, 3, 3
3892How Powerful is Implicit Denoising in Graph Neural Networks4.002.126, 1, 3, 6
3893ChemSpacE: Interpretable and Interactive Chemical Space Exploration4.001.005, 3, 5, 3
3894Probing into the Fine-grained Manifestation in Multi-modal Image Synthesis4.001.416, 3, 3
3895Eliminating Catastrophic Overfitting Via Abnormal Adversarial Examples Regularization4.001.003, 3, 5, 5
3896Factor Learning Portfolio Optimization Informed by Continuous-Time Finance Models4.001.416, 3, 3
3897Closing the Gap Between SVRG and TD-SVRG with Gradient Splitting4.001.735, 1, 5, 5
3898Sorted eigenvalue comparison $d_{mathsf{Eig}}$: A simple alternative to $d_{mathsf{FID}}$4.001.003, 5, 3, 5
3899Never Revisit: Continuous Exploration in Multi-Agent Reinforcement Learning4.001.003, 5, 5, 3
3900SepRep-Net: Multi-source Free Domain Adaptation via Model Separation and Reparameterization4.001.005, 3, 3, 5
3901Generalizability of Adversarial Robustness Under Distribution Shifts4.001.003, 5, 3, 5
3902Uncertainty-Driven Active Vision for Implicit Scene Reconstruction4.001.003, 3, 5, 5
3903Spurious Local Minima Provably Exist for Deep Convolutional Neural Networks4.001.003, 3, 5, 5
3904Graph Contrastive Learning with Personalized Augmentation4.001.003, 5, 5, 3
3905Variational Reparametrized Policy Learning with Differentiable Physics4.001.413, 3, 6
3906Stable, Efficient, and Flexible Monotone Operator Implicit Graph Neural Networks4.001.416, 3, 3
3907LSAP: Rethinking Inversion Fidelity, Perception and Editability in GAN Latent Space4.001.005, 3, 3, 5
3908Neural Sorting Networks with Error-Free Differentiable Swap Functions4.001.413, 3, 6
3909SWORD: Demystify the Secrets of Open-world Instance Recognition4.001.416, 3, 3
3910Learning Antidote Data to Individual Unfairness4.001.003, 5, 3, 5
3911TiDAL: Learning Training Dynamics for Active Learning4.001.416, 3, 3
3912CompletionFormer: Depth Completion with Convolutions and Vision Transformers4.002.121, 3, 6, 6
3913Demystifying the Optimization and Generalization of Deep PAC-Bayesian Learning4.001.003, 3, 5, 5
3914Nearing or Surpassing: Overall Evaluation of Human-Machine Dynamic Vision Ability4.001.413, 3, 6
3915Learn to Know Unknowns: A Bionic Memory Network for Unsupervised Anomaly Detection4.001.003, 5, 3, 5
3916Double dynamic sparse training for GANs4.001.003, 3, 5, 5
3917Improving Corruption Robustness with Adversarial Feature Alignment Transformers4.001.413, 6, 3
3918Slimmable Networks for Contrastive Self-supervised Learning4.001.003, 3, 5, 5
3919TEAS: Exploiting Spiking Activity for Temporal-wise Adaptive Spiking Neural Networks4.001.005, 3, 3, 5
3920Exploring Visual Interpretability for Contrastive Language-Image Pretraining4.001.263, 3, 5, 3, 6
3921BiBench: Benchmarking and Analyzing Network Binarization4.001.416, 3, 3
3922Identifying Phase Transition Thresholds of Permuted Linear Regression via Message Passing3.801.941, 6, 6, 3, 3
3923Speech denoising by listening to noise3.800.983, 3, 5, 3, 5
3924Knowledge-Grounded Reinforcement Learning3.800.983, 3, 5, 5, 3
3925Auditing Fairness Online through Interactive Refinement3.800.983, 5, 5, 3, 3
3926G-Censor: Graph Contrastive Learning with Task-Oriented Counterfactual Views3.800.983, 5, 5, 3, 3
3927GLASU: A Communication-Efficient Algorithm for Federated Learning with Vertically Distributed Graph Data3.800.983, 5, 3, 3, 5
3928MODULAR FEDERATED CONTRASTIVE LEARNING WITH PEER NORMALIZATION3.800.983, 3, 3, 5, 5
3929SwinZS3: Zero-Shot Semantic Segmentation with a Swin Transformer3.751.921, 5, 3, 6
3930Thresholded Lexicographic Ordered Multi-Objective Reinforcement Learning3.751.303, 3, 3, 6
3931xTrimoABFold: Improving Antibody Structure Prediction without Multiple Sequence Alignments3.751.923, 6, 5, 1
3932Gandalf : Data Augmentation is all you need for Extreme Classification3.751.306, 3, 3, 3
3933Model-based Unknown Input Estimation via Partially Observable Markov Decision Processes3.751.925, 1, 6, 3
3934Help Me Explore: Combining Autotelic and Social Learning via Active Goal Queries3.751.925, 6, 3, 1
3935Learning to reason over visual objects3.751.303, 3, 6, 3
3936VER: Learning Natural Language Representations for Verbalizing Entities and Relations3.751.303, 3, 3, 6
3937Training Neural Networks with Low-Precision Model Memory3.751.303, 6, 3, 3
3938FoveaTer: Foveated Transformer for Image Classification3.751.921, 3, 5, 6
3939TG-Gen: A Deep Generative Model Framework for Temporal Graphs3.751.303, 6, 3, 3
3940Comparing Human and Machine Bias in Face Recognition3.751.303, 3, 6, 3
3941Finding the smallest tree in the forest: Monte Carlo Forest Search for UNSAT solving3.751.303, 3, 6, 3
3942Predictive Coding with Approximate Laplace Monte Carlo3.751.303, 6, 3, 3
3943The Ultimate Combo: Boosting Adversarial Example Transferability by Composing Data Augmentations3.751.303, 3, 6, 3
3944Improving Aspect Ratio Distribution Fairness in Detector Pretraining via Cooperating RPN’s3.751.923, 6, 5, 1
3945How to Do a Vocab Swap? A Study of Embedding Replacement for Pre-trained Transformers3.751.303, 3, 3, 6
3946UnDiMix: Hard Negative Sampling Strategies for Contrastive Representation Learning3.751.921, 3, 6, 5
3947Exploring Connections Between Memorization And Membership Inference3.751.306, 3, 3, 3
3948FedAvg Converges to Zero Training Loss Linearly: The Power of Overparameterized Multi-Layer Neural Networks3.751.303, 3, 3, 6
3949ResFed: Communication Efficient Federated Learning by Transmitting Deep Compressed Residuals3.751.303, 3, 3, 6
3950Multi-instance Interactive Segmentation with Self-Supervised Transformer3.751.303, 3, 6, 3
3951CLUSTERBERT: MULTI-STAGE FINE-TUNING OF TRANSFORMERS FOR DEEP TEXT CLUSTERING3.751.303, 3, 6, 3
3952Distilling Pre-trained Knowledge in Chemical Reactions for Molecular Property Prediction3.751.303, 3, 3, 6
3953Batch Normalization Explained3.751.303, 6, 3, 3
3954CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration3.751.303, 3, 3, 6
3955RoCourseNet: Distributionally Robust Training of a Prediction Aware Recourse Model3.751.303, 3, 3, 6
3956Global-Scale Species Mapping From Crowdsourced Data3.751.303, 3, 3, 6
3957Learning Robust Kernel Ensembles with Kernel Average Pooling3.751.303, 3, 6, 3
3958Harnessing Client Drift with Decoupled Gradient Dissimilarity3.751.303, 6, 3, 3
3959VQ-TR: Vector Quantized Attention for Time Series Forecasting3.751.925, 3, 6, 1
3960Emergent collective intelligence from massive-agent cooperation and competition3.751.923, 6, 5, 1
3961CLIP model is an Efficient Continual Learner3.751.303, 3, 6, 3
3962GAML: geometry-aware meta-learning via a fully adaptive preconditioner3.751.926, 1, 3, 5
3963Graph Neural Networks for Aerodynamic Flow Reconstruction from Sparse Sensing3.751.303, 3, 6, 3
3964Revisiting the Activation Function for Federated Image Classification3.751.921, 3, 6, 5
3965Route, Interpret, Repeat: Blurring the Line Between Posthoc Explainability and Interpretable Models3.751.923, 5, 1, 6
3966Analysis of Radio Localiser Networks under Distribution Shift3.751.303, 3, 3, 6
3967Bayesian Optimal Experimental Design for the Survey Bandit Setting3.751.306, 3, 3, 3
3968Pathfinding Neural Cellular Automata3.751.303, 3, 6, 3
3969Unleashing the Potential of Data Sharing in Ensemble Deep Reinforcement Learning3.751.923, 5, 6, 1
3970K-SAM: Sharpness-Aware Minimization at the Speed of SGD3.751.303, 3, 6, 3
3971A Simple Unsupervised Data Depth-based Method to Detect Adversarial Images3.752.593, 1, 8, 3
3972Counterfactual Memorization in Neural Language Models3.751.303, 3, 6, 3
3973Safer Reinforcement Learning with Counterexample-guided Offline Training3.751.303, 3, 3, 6
3974Enhancing Cross-Category Learning in Recommendation Systems with Multi-Layer Embedding Training3.751.306, 3, 3, 3
3975Populating memory in Continual Learning with Consistency Aware Sampling3.751.303, 3, 3, 6
3976System Identification as a Reinforcement Learning Problem3.751.925, 3, 1, 6
3977Projected Latent Distillation for Data-Agnostic Consolidation in Multi-Agent Continual Learning3.751.303, 3, 6, 3
3978Domain Generalization in Regression3.751.303, 6, 3, 3
3979Latent-space disentanglement with untrained generator networks allows to isolate different motion types in video data3.751.921, 3, 6, 5
3980FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder3.751.303, 6, 3, 3
3981Learning Sampling Policy to Achieve Fewer Queries for Zeroth-Order Optimization3.751.925, 6, 3, 1
3982Learning Graph Neural Network Topologies3.751.303, 6, 3, 3
3983Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning3.751.303, 6, 3, 3
3984Deep Generative Model based Rate-Distortion for Image Downscaling Assessment3.751.303, 3, 6, 3
3985Impact of the Last Fully Connected Layer on Out-of-distribution Detection3.751.303, 6, 3, 3
3986Optformer: Beyond Transformer for Black-box Optimization3.751.303, 3, 6, 3
3987Group-Equivariant Transformers Without Positional Encoding3.751.303, 6, 3, 3
3988Beyond Counting Linear Regions of Neural Networks, Simple Linear Regions Dominate!3.751.303, 6, 3, 3
3989Local Stochastic Bilevel Optimization with Momentum-Based Variance Reduction3.751.303, 6, 3, 3
3990FixEval: Execution-based Evaluation of Program Fixes for Competitive Programming Problems3.751.303, 6, 3, 3
3991Learning with Instance-Dependent Label Noise: Balancing Accuracy and Fairness3.751.303, 3, 6, 3
3992VC Theoretical Explanation of Double Descent3.751.303, 3, 3, 6
3993Distraction is All You Need For Fairness3.751.303, 6, 3, 3
3994Formal Conceptual Views in Neural Networks3.751.306, 3, 3, 3
3995Understanding Masked Image Modeling via Learning Occlusion Invariant Feature3.752.955, 8, 1, 1
3996RegQ: Convergent Q-Learning with Linear Function Approximation using Regularization3.751.923, 1, 5, 6
3997Fast 6D Object Pose Refinement via Implicit Surface Representation Driven Optimization3.751.303, 3, 6, 3
3998Variation-based Cause Effect Identification3.751.306, 3, 3, 3
3999Physics-Regularized Stereo Matching for Depth Estimation3.752.591, 8, 3, 3
4000Additive Poisson Process: Learning Intensity of Higher-Order Interaction in Poisson Processes3.751.306, 3, 3, 3
4001Hyperbolic Binary Neural Network3.751.926, 1, 5, 3
4002Training Instability and Disharmony Between ReLU and Batch Normalization3.751.303, 3, 3, 6
4003The Biased Artist: Exploiting Cultural Biases via Homoglyphs in Text-Guided Image Generation Models3.751.925, 3, 1, 6
4004Semantic Grouping Network for Audio Source Separation3.751.925, 1, 6, 3
4005On Stability and Generalization of Bilevel Optimization Problems3.751.921, 6, 3, 5
4006Do Spiking Neural Networks Learn Similar Representation with Artificial Neural Networks? A Pilot Study on SNN Representation3.751.303, 3, 6, 3
4007Learning to Perturb for Contrastive Learning of Unsupervised Sentence Representations3.670.943, 3, 5
4008A Hybrid Framework for Generating A Country-scale Synthetic Population3.670.943, 3, 5
4009Pocket-specific 3D Molecule Generation by Fragment-based Autoregressive Diffusion Models3.670.943, 5, 3
4010Graph Spline Networks for Efficient Continuous Simulation of Dynamical Systems3.670.943, 5, 3
4011AMA: Asymptotic Midpoint Augmentation for Margin Balancing and Moderate Broadening3.670.943, 5, 3
4012Towards A Unified Neural Architecture for Visual Recognition and Reasoning3.670.945, 3, 3
4013Estimating Treatment Effects using Neurosymbolic Program Synthesis3.670.943, 3, 5
4014Boosting Drug-Target Affinity Prediction from Nearest Neighbors3.670.943, 3, 5
4015Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm3.670.943, 3, 5
4016PBES: PCA Based Exemplar Sampling Algorithm for Continual Learning3.670.943, 3, 5
4017Multi-scale Sinusoidal Embeddings Enable Learning on High Resolution Mass Spectrometry Data3.671.895, 5, 1
4018Protecting DNN from Evasion Attacks using Ensemble of High Focal Diversity3.670.943, 3, 5
4019Giving Robots a Hand: Broadening Generalization via Hand-Centric Human Video Demonstrations3.670.943, 3, 5
4020No Pairs Left Behind: Improving Metric Learning with Regularized Triplet Objective3.670.943, 5, 3
4021Matrix factorization under the constraint of connectivity between observed and source data ~ Muscle synergy analysis based on connectivity between muscle and brain activities ~3.670.943, 5, 3
4022VISION TRANSFORMER FOR MULTIVARIATE TIME- SERIES CLASSIFICATION (VITMTSC)3.670.945, 3, 3
4023Factors Influencing Generalization in Chaotic Dynamical Systems3.670.943, 3, 5
4024Query by Self3.670.945, 3, 3
4025Graph Neural Networks Are More Powerful Than we Think3.670.943, 5, 3
4026On a Benefit of Masked Language Model Pretraining: Robustness to Simplicity Bias3.670.943, 5, 3
4027Improving Subgraph Representation Learning via Multi-View Augmentation3.670.943, 5, 3
4028CrystalBox: Efficient Model-Agnostic Explanations for Deep RL Controllers3.670.945, 3, 3
4029Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction3.670.943, 3, 5
4030RankMe: Assessing the Downstream Performance of Pretrained Self-Supervised Representations by Their Rank3.670.943, 3, 5
4031Soft Diffusion: Score Matching For General Corruptions3.670.943, 3, 5
4032Online Continual Learning with Feedforward Adaptation3.670.943, 5, 3
4033Learning parsimonious dynamics for generalization in reinforcement learning3.670.945, 3, 3
4034Homotopy Learning of Parametric Solutions to Constrained Optimization Problems3.670.943, 3, 5
4035Re-calibrated Wasserstein GAN for large-scale imputation with informative missing3.670.943, 5, 3
4036Domain Invariant Q-Learning for model-free robust continuous control under visual distractions3.670.943, 3, 5
4037Learning Useful Representations for Shifting Tasks and Distributions3.670.943, 5, 3
4038Architectural Backdoors in Neural Networks3.670.943, 3, 5
4039Can we achieve robustness from data alone?3.670.943, 3, 5
4040Perceptual Grouping in Vision-Language Models3.670.943, 3, 5
4041A Deep Dive into Dataset Imbalance and Bias in Face Identification3.670.943, 3, 5
4042Causally Constrained Data Synthesis For Private Data Release3.670.943, 3, 5
4043A simple Training-Free Method for Rejection Option3.670.943, 5, 3
4044Reducing the Capacity Gap via Spherical Knowledge Distillation3.671.895, 5, 1
4045Time Series Subsequence Anomaly Detection via Graph Neural Networks3.670.945, 3, 3
4046Improving Generalization of Motor-Imagery Brainwave Decoding via Dynamic Convolutions3.671.895, 5, 1
4047Bridging between Pool- and Stream-Based Active Learning with Temporal Data Coherence3.671.895, 1, 5
4048SYNC: Efficient Neural Code Search Through Structurally Guided Hard Negative Curricula3.670.945, 3, 3
4049Semi-parametric Prompt-Generation for Model Editing3.670.945, 3, 3
4050Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation3.670.945, 3, 3
4051Fourier PINNs: From Strong Boundary Conditions to Adaptive Fourier Bases3.670.943, 3, 5
4052Exploring Methods for Parsing Movie Scripts - Feature Extraction for Further Social Injustice Analysis3.670.945, 3, 3
4053Conceptual Behavior and Human-Likeness in Vision-and-Language Models3.670.943, 3, 5
4054Quantization-aware Policy Distillation (QPD)3.670.943, 5, 3
4055Active Learning at the ImageNet Scale3.670.943, 5, 3
4056Automatic Curriculum Generation for Reinforcement Learning in Zero-Sum Games3.670.945, 3, 3
4057Language Modeling Using Tensor Trains3.671.891, 5, 5
4058Would decentralization hurt generalization?3.671.495, 3, 1, 5, 3, 5
4059Tackling Imbalanced Class in Federated Learning via Class Distribution Estimation3.670.945, 3, 3
4060Solving Math Word Problems with Process-based and Outcome-based Feedback3.670.943, 3, 5
4061Few-shot Lifelong Reinforcement Learning with Generalization Guarantees: An Empirical PAC-Bayes Approach3.670.943, 3, 5
4062SEQuence-rPPG: A Fast BVP Signal Extraction Method From Frame Sequences3.670.943, 3, 5
4063Linearised Implicit Variational Inference3.670.943, 3, 5
4064Learning Interpretable Neural Discrete Representation for Time Series Classification3.670.943, 5, 3
4065SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data3.670.943, 5, 3
4066Perturbation Defocusing for Adversarial Defense3.671.895, 1, 5
4067Preserving Semantics in Textual Adversarial Attacks3.670.943, 5, 3
4068A Decomposition Based Dual Projection Model for Multivariate Time Series Forecasting and Anomaly Detection3.670.943, 5, 3
4069FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization3.670.943, 5, 3
4070Cyclophobic Reinforcement Learning3.670.943, 3, 5
4071Dynamic-Aware GANs: Time-Series Generation with Handy Self-Supervision3.670.943, 5, 3
4072Learning System Dynamics from Sensory Input under Optimal Control Principles3.670.943, 5, 3
4073On the Shortcut Learning in Multilingual Neural Machine Translation3.670.943, 3, 5
4074I Speak, You Verify: Toward Trustworthy Neural Program Synthesis3.671.895, 1, 5
4075ACQL: An Adaptive Conservative Q-Learning Framework for Offline Reinforcement Learning3.670.945, 3, 3
4076Efficient Controllable Generation with Guarantee3.671.895, 1, 5
4077Weak Supervision Variational Auto-Encoder3.670.945, 3, 3
4078Extending graph transformers with quantum computed aggregation3.670.945, 3, 3
4079Self-Supervised SVDE from Videos with Depth Variance to Shifted Positional Information3.670.943, 3, 5
4080TransLog: A Unified Transformer-based Framework for Log Anomaly Detection3.670.943, 3, 5
4081Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning3.670.943, 3, 5
4082Continuous Monte Carlo Graph Search3.670.943, 3, 5
4083Backdoor Mitigation by Correcting Activation Distribution Alteration3.670.943, 5, 3
4084Pose Transfer using a Single Spatial Transformation3.671.895, 1, 5
4085How Distinguishable Are Vocoder Models? Analyzing Vocoder Fingerprints for Fake Audio3.670.943, 3, 5
4086Holographic-(V)AE: an end-to-end SO(3)-Equivariant (Variational) Autoencoder in Fourier Space3.670.943, 5, 3
4087Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity of Neural Networks3.670.945, 3, 3
4088Robust Multi-Agent Reinforcement Learning against Adversaries on Observation3.670.945, 3, 3
4089Self-supervised Learning for Cell Segmentation and Quantification in Digital Pathology Images3.670.943, 5, 3
4090Learning to Generate Pseudo Anomalies3.670.945, 3, 3
4091Scalable feature selection via sparse learnable masks3.670.943, 3, 5
4092Dataset Projection: Finding Target-aligned Subsets of Auxiliary Data3.670.943, 5, 3
4093Decentralized Federated Learning via Overlapping Data Augmentation3.670.943, 5, 3
4094An interpretable contrastive logical knowledge learning method for sentiment analysis3.670.943, 3, 5
4095Training image classifiers using Semi-Weak Label Data3.670.945, 3, 3
4096Magnum: Tackling High-Dimensional Structures with Self-Organization3.671.891, 5, 5
4097Vector Quantized Wasserstein Auto-Encoder3.670.943, 5, 3
4098A Sample Based Method for Understanding The Decisions of Neural Networks Semantically3.670.945, 3, 3
4099Deep Biological Pathway Informed Pathology-Genomic Multimodal Survival Prediction3.670.943, 3, 5
4100Explaining Patterns in Data with Language Models via Interpretable Autoprompting3.670.943, 5, 3
4101Neural DAEs: Constrained neural networks3.670.943, 3, 5
4102Variance Double-Down: The Small Batch Size Anomaly in Multistep Deep Reinforcement Learning3.670.945, 3, 3
4103Adversarial Representation Learning for Canonical Correlation Analysis3.670.943, 5, 3
4104Explaining Image Classification through Knowledge-aware Neuron Interpretation3.670.943, 3, 5
4105PointConvFormer: Revenge of the Point-Based Convolution3.670.943, 3, 5
4106Recurrent Real-valued Neural Autoregressive Density Estimator for Online Density Estimation and Classification of Streaming Data3.670.943, 3, 3, 5, 3, 5
4107StructViT: Learning Correlation Structures for Vision Transformers3.670.945, 3, 3
4108Stationary Deep Reinforcement Learning with Quantum K-spin Hamiltonian Equation3.670.943, 3, 5
4109Interpolating Compressed Parameter Subspaces3.670.943, 3, 5
4110Multi-Modality Alone is Not Enough: Generating Scene Graphs using Cross-Relation-Modality Tokens3.670.943, 3, 5
4111Clustering and Ordering Variable-Sized Sets: The Catalog Problem3.670.945, 3, 3
4112KerDEQ: Optimization induced Deep Equilibrium models via Gaussian Kernel3.670.943, 5, 3
4113TCNL: Transparent and Controllable Network Learning Via Embedding Human-Guided Concepts3.670.945, 3, 3
4114From ChebNet to ChebGibbsNet3.670.945, 3, 3
4115Towards Understanding Robust Memorization in Adversarial Training3.670.943, 3, 5
4116FV-MgNet: Fully Connected V-cycle MgNet for Interpretable Time Series Forecasting3.670.943, 3, 5
4117FFCV: Accelerating Training by Removing Data Bottlenecks3.670.943, 5, 3
4118Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding3.670.943, 3, 5
4119Uncertainty and Traffic Light Aware Pedestrian Crossing Intention Prediction3.670.945, 3, 3
4120Token-level Fitting Issues of Seq2seq Models3.670.943, 5, 3
4121Worst-case Few-shot Evaluation: Are Neural Networks Robust Few-shot Learners?3.671.891, 5, 5
4122Leveraging Online Semantic Point Fusion for 3D-Aware Object Goal Navigation3.671.895, 5, 1
4123Robust Manifold Estimation Approach for Evaluating Fidelity and Diversity3.670.943, 5, 3
4124CAPE: Channel-Attention-Based PDE Parameter Embeddings for SciML3.670.943, 3, 5
4125Solving Partial Label Learning Problem with Multi-Agent Reinforcement Learning3.670.945, 3, 3
4126SDT: Specific Domain Training in Domain Generalization3.670.943, 5, 3
4127Is Class Incremental Learning Truly Learning Representations Continually?3.670.943, 3, 5
4128Understanding Adversarial Transferability in Federated Learning3.670.943, 3, 5
4129Attribute Alignment and Enhancement for Generalized Zero-Shot Learning3.670.943, 5, 3
4130Unified Probabilistic Modeling of Image Aesthetic Rating Distributions towards Measuring Subjectivity3.670.943, 5, 3
4131Analyzing adversarial robustness of vision transformers against spatial and spectral attacks3.670.943, 5, 3
4132The Progressive Alignment-aware Multimodal Fusion with Easy2hard Strategy for Multimodal Neural Machine Translation3.670.945, 3, 3
4133CacheGNN: Enhancing Graph Neural Networks with Global Information Caching3.670.943, 3, 5
4134Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning3.670.943, 5, 3
4135Towards Identification of Microaggressions in real-life and Scripted conversations, using Context-Aware Machine Learning Techniques.3.670.945, 3, 3
4136When does Bias Transfer in Transfer Learning?3.670.943, 5, 3
4137Towards Realtime Distributed Virtual Flow Meter via Compressed Continual Learning3.670.943, 3, 5
4138Robust Neural ODEs via Contractivity-promoting Regularization3.670.943, 5, 3
4139A Robust Stacking Framework for Training Deep Graph Models with Multifaceted Node Features3.670.943, 5, 3
4140Learning Diverse and Effective Policies with Non-Markovian Rewards3.670.943, 3, 5
4141BAMBI: Vertical Federated Bilevel Optimization with Privacy-Preserving and Computation Efficiency3.670.943, 5, 3
4142MULTILEVEL XAI: VISUAL AND LINGUISTIC BONDED EXPLANATIONS3.670.945, 3, 3
4143miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings3.670.943, 3, 5
4144When Few-shot Meets Cross-domain Object Detection: Learning Instance-level Class Prototypes for Knowledge Transfer3.670.945, 3, 3
4145Unsupervised Threshold Learning with '$L$'-trend Prior For Visual Anomaly Detection3.670.945, 3, 3
4146Grouped self-attention mechanism for a memory-efficient Transformer3.670.943, 5, 3
4147Synergistic Neuromorphic Federated Learning with ANN-SNN Conversion For Privacy Protection3.670.943, 3, 5
4148Time Series Anomaly Detection via Hypothesis Testing for Dynamical Systems3.671.895, 1, 5
4149Identifying Latent Causal Content for Multi-Source Domain Adaptation3.670.945, 3, 3
4150NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants3.670.943, 3, 5
4151On the Difficulties of Video Summarization: Structure and Subjectivity3.670.943, 5, 3
4152Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer3.670.943, 3, 5
4153Personalized Subgraph Federated Learning3.670.945, 3, 3
4154Adversarial Learned Fair Representations using Dampening and Stacking3.670.943, 3, 5
4155On the Importance of Pretrained Knowledge Distillation for 3D Object Detection3.670.943, 3, 5
4156Harnessing spectral representations for subgraph alignment3.670.945, 3, 3
4157Mixed-Precision Inference Quantization: Problem Resetting, Mapping math concept and Branch&bound methods3.670.943, 3, 5
4158Partial Advantage Estimator for Proximal Policy Optimization3.670.943, 5, 3
4159PatchBlender: A Motion Prior for Video Transformers3.670.943, 3, 5
4160Similarity and Generalization: from Noise to Corruption3.670.943, 5, 3
4161A Generalized EigenGame With Extensions to Deep Multiview Representation Learning3.670.943, 5, 3
4162Offline Model-Based Reinforcement Learning with Causal Structure3.670.943, 5, 3
4163Temporal Label Smoothing for Early Prediction of Adverse Events3.670.943, 3, 5
4164What's Wrong with the Robustness of Object Detectors?3.671.895, 1, 5
4165Corruption Depth: Analysis of DNN depth for Misclassification3.670.945, 3, 3
4166How Does Value Distribution in Distributional Reinforcement Learning Help Optimization?3.670.943, 5, 3
4167Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking3.670.943, 3, 5
4168GOAT: A Global Transformer on Large-scale Graphs3.670.943, 5, 3
4169HeatDETR: Hardware-Efficient DETR with Device-Adaptive Thinning3.670.943, 3, 5
4170An Incremental Learning Approach for Sustainable Regional Isolation and Integration3.670.943, 5, 3
4171Very Large Scale Multi-Agent Reinforcement Learning with Graph Attention Mean Field3.670.943, 5, 3
4172Representation Mutual Learning for End-to-End Weakly-Supervised Semantic Segmentation3.670.943, 3, 5
4173Consistent and Truthful Interpretation with Fourier Analysis3.670.945, 3, 3
4174GENERALIZED MATRIX LOCAL LOW RANK REPRESENTATION BY RANDOM PROJECTION AND SUBMATRIX PROPAGATION3.670.943, 3, 5
4175Formulating and Proving the Trend of DNNs Learning Simple Concepts3.670.945, 3, 3
4176Selective Classification Via Neural Network Training Dynamics3.670.945, 3, 3
4177FlexPose: Pose Distribution Adaptation with Few-shot Guidance3.670.943, 3, 5
4178Structure-Sensitive Graph Dictionary Embedding for Graph Classification3.670.943, 5, 3
4179Variational Autoencoders with Decremental Information Bottleneck for Disentanglement3.670.943, 3, 5
4180FreeSeg: Free Mask from Interpretable Contrastive Language-Image Pretraining for Semantic Segmentation3.670.943, 3, 5
4181(LA)YER-NEIGH(BOR) SAMPLING: DEFUSING NEIGHBORHOOD EXPLOSION3.670.945, 3, 3
4182Feint in Multi-Player Games3.671.895, 5, 1
4183Metro: Memory-Enhanced Transformer for Retrosynthetic Planning via Reaction Tree3.670.943, 3, 5
4184Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing3.670.943, 5, 3
4185Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies3.601.203, 6, 3, 3, 3
4186SPC-Net: A New Scalable Point Cloud Compression Framework for Both Machine and Human Vision Tasks3.601.203, 3, 6, 3, 3
4187Addressing High-dimensional Continuous Action Space via Decomposed Discrete Policy-Critic3.601.206, 3, 3, 3, 3
4188Fully Continuous Gated Recurrent Units For processing Time Series3.601.743, 6, 5, 1, 3
4189Why Adversarial Training of ReLU Networks Is Difficult?3.601.743, 3, 5, 1, 6
4190Machine Learning from Explanations3.501.663, 5, 5, 1
4191Transformer needs NMDA receptor nonlinearity for long-term memory3.500.873, 3, 3, 5
4192Rethinking the Value of Prompt Learning for Vision-Language Models3.500.873, 3, 3, 5
4193Towards Performance-maximizing Network Pruning via Global Channel Attention3.500.873, 3, 3, 5
4194Object-Centric Learning with Slot Mixture Models3.500.873, 3, 5, 3
4195RISC-V MICROARCHITECTURE EXPLORATION VIA REINFORCEMENT LEARNING3.500.873, 3, 3, 5
4196How (Un)Fair is Text Summarization?3.500.875, 3, 3, 3
4197Simulating Task-Free Continual Learning Streams From Existing Datasets3.500.873, 3, 5, 3
4198Attention Flows for General Transformers3.500.873, 3, 3, 5
4199Group-Disentangling Conditional Shift3.500.873, 3, 3, 5
4200Distance VS. Coordinate: Distance Based Embedding Improves Model Generalization for Routing Problems3.500.873, 3, 3, 5
4201Text2Model: Model Induction for Zero-shot Generalization Using Task Descriptions3.500.873, 3, 5, 3
4202Opportunistic Actor-Critic (OPAC) with Clipped Triple Q-learning3.500.875, 3, 3, 3
4203On Information Maximisation in Multi-View Self-Supervised Learning3.501.663, 1, 5, 5
4204SRBGCN: Tangent space-Free Lorentz Transformations for Graph Feature Learning3.500.875, 3, 3, 3
4205Mirror Training for Input Convex Neural Network3.501.665, 3, 5, 1
4206A Benchmark Dataset for Learning from Label Proportions3.500.875, 3, 3, 3
4207Don’t Bet on Sparsity: Designing Brain-inspired Distance-preserving Encoder3.500.873, 3, 5, 3
4208Learned Nearest-Class-Mean for Biased Representations in Long-Tailed Recognition3.500.873, 3, 3, 5
4209DYNAMIC BATCH NORM STATISTICS UPDATE FOR NATURAL ROBUSTNESS3.500.873, 5, 3, 3
4210MixBin: Towards Budgeted Binarization3.500.873, 3, 3, 5
4211Corruption-free Single-view Self-supervised Learning on Graphs3.500.875, 3, 3, 3
4212Quasiconvex Shallow Neural Network3.500.873, 5, 3, 3
4213Text-Conditioned Graph Generation Using Discrete Graph Variational Autoencoders3.500.873, 5, 3, 3
4214Diffusion-based point cloud generation with smoothness constraints3.500.875, 3, 3, 3
4215Towards Out-of-Distribution Adversarial Robustness3.500.873, 3, 3, 5
4216Learning to perceive objects by prediction3.500.873, 3, 5, 3
4217Why do Models with Conditional Computation Learn Suboptimal Solutions?3.500.873, 5, 3, 3
4218Divide-and-Cluster: Spatial Decomposition Based Hierarchical Clustering3.500.873, 3, 3, 5
4219Fast Yet Effective Graph Unlearning through Influence Analysis3.500.873, 3, 3, 5
4220TI-VAE: A temporally independent VAE with applications to latent factor learning in neuroimaging3.500.873, 5, 3, 3
4221On Representation Learning Under Class Imbalance3.500.873, 3, 5, 3
4222Efficient Stochastic Optimization for Attacking Randomness Involved Inference3.500.873, 3, 3, 5
4223GLINKX: A Scalable Unified Framework For Homophilous and Heterophilous Graphs3.500.873, 5, 3, 3
4224Graph Neural Networks as Multi-View Learning3.500.875, 3, 3, 3
4225A Retrieve-and-Read Framework for Knowledge Graph Reasoning3.500.873, 3, 5, 3
4226FLGAME: A Game-theoretic Defense against Backdoor Attacks In Federated Learning3.501.665, 1, 5, 3
4227High-Precision Regressors for Particle Physics3.501.661, 5, 5, 3
4228Fine-Tuning Offline Policies With Optimistic Action Selection3.500.873, 3, 5, 3
4229Test-Time Training on Video Streams3.501.665, 3, 5, 1
4230The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses3.500.873, 3, 3, 5
4231CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data3.501.665, 5, 3, 1
4232Semi-supervised consistency regularization for accurate cell type fraction and gene expression estimation3.500.873, 3, 5, 3
4233Pareto Rank-Preserving Supernetwork for HW-NAS3.500.873, 3, 5, 3
4234PGASL: Predictive and Generative Adversarial Semi-supervised Learning for imbalanced data3.500.873, 5, 3, 3
4235MaxMin-Novelty: Maximizing Novelty via Minimizing the State-Action Values in Deep Reinforcement Learning3.501.661, 3, 5, 5
4236Handling Covariate Shifts in Federated Learning with Generalization Guarantees3.500.873, 3, 5, 3
4237Hierarchical Neural Program Synthesis3.500.875, 3, 3, 3
4238SPIDER: Searching Personalized Neural Architecture for Federated Learning3.500.873, 5, 3, 3
4239Robust Graph Representation Learning via Predictive Coding3.500.875, 3, 3, 3
4240Brain Signal Generation and Data Augmentation with a Single-Step Diffusion Probabilistic Model3.501.661, 5, 3, 5
4241Bounded Attacks and Robustness in Image Transform Domains3.500.875, 3, 3, 3
4242Efficient Exploration using Model-Based Quality-Diversity with Gradients3.500.873, 3, 5, 3
4243Distinguishing Feature Model for Ranking From Pairwise Comparisons3.501.663, 5, 1, 5
4244Applying Second Order Optimization to Deep Transformers with Parameter-Efficient Tuning3.500.873, 3, 5, 3
4245Mask-tuning: Towards Improving Pre-trained Language Models' Generalization3.500.873, 5, 3, 3
4246Tiered Pruning for Efficient Differentialble Inference-Aware Neural Architecture Search3.500.873, 5, 3, 3
4247Spurious Features in Continual Learning3.500.875, 3, 3, 3
4248Why Did This Model Forecast This Future? Information-Theoretic Temporal Saliency for Counterfactual Explanations of Probabilistic Forecasts3.500.873, 3, 5, 3
4249Topological Data Analysis-Deep Learning Framework for Predicting Cancer Phenotypes3.501.661, 3, 5, 5
4250Reprogramming Large Pretrained Language Models for Antibody Sequence Infilling3.500.875, 3, 3, 3
4251Differentially Private Conditional Text Generation For Synthetic Data Production3.500.873, 5, 3, 3
4252FUN: Filter-based Unlearnable Datasets3.500.873, 5, 3, 3
4253GMML is All you Need3.500.873, 5, 3, 3
4254Variational Pseudo Labels for Meta Test-time Adaptation3.500.875, 3, 3, 3
4255Continuously Parameterized Mixture Models3.500.873, 3, 5, 3
4256DP-InstaHide: Data Augmentations Provably Enhance Guarantees Against Dataset Manipulations3.500.875, 3, 3, 3
4257Affinity-VAE for clustering and classification of objects in multidimensional image data3.500.873, 5, 3, 3
4258Guided Safe Shooting: model based reinforcement learning with safety constraints3.500.873, 3, 5, 3
4259Counterfactual Explanation via Search in Gaussian Mixture Distributed Latent Space3.500.873, 5, 3, 3
4260AGREE: A Simple Aggregator of Detectors’ Decisions3.500.873, 3, 5, 3
4261Prompt Injection: Parameterization of Fixed Inputs3.500.875, 3, 3, 3
4262LEXA: Language-agnostic Cross-consistency Training for Question Answering Tasks3.500.873, 3, 5, 3
4263RulE: Neural-Symbolic Knowledge Graph Reasoning with Rule Embedding3.500.873, 3, 3, 5
4264Improving the generalization ability of the chaotic time-series classification models by residual component extraction3.501.665, 1, 3, 5
4265Consciousness-Aware Multi-Agent Reinforcement Learning3.501.661, 5, 3, 5
4266Pseudo-Edge: Semi-Supervised Link Prediction with Graph Neural Networks3.500.873, 3, 3, 5
4267Can Fair Federated Learning reduce the need for personalization?3.500.873, 3, 3, 5
4268Dynamical Signatures of Learning in Recurrent Networks3.500.873, 5, 3, 3
4269Preventing Mode Collapse When Imitating Latent Policies from Observations3.500.875, 3, 3, 3
4270Compositional Image Generation and Manipulation with Latent Diffusion Models3.500.875, 3, 3, 3
4271Cross-Protein Wasserstein Transformer for Protein-Protein Interactions3.500.873, 3, 5, 3
4272Inverse Optimal Transport with Application to Contrastive Learning3.500.873, 3, 3, 5
4273Demystifying black-box DNN training processes through Concept-Monitor3.501.663, 1, 5, 5
4274Improving the Estimation of Instance-dependent Transition Matrix by using Self-supervised Learning3.501.661, 5, 3, 5
4275A general differentially private learning framework for decentralized data3.500.873, 3, 3, 5
4276ReG-NAS: Graph Neural Network Architecture Search using Regression Proxy Task3.500.873, 5, 3, 3
4277MaskNeRF: Masked Neural Radiance Fields for Sparse View Synthesis3.500.873, 5, 3, 3
4278Penalizing the High-likelihood: A Novel Sampling Method for Open-ended Neural Text Generation via Inverse Probability Weighting3.500.873, 3, 3, 5
4279Injecting Image Details into CLIP's Feature Space3.500.873, 3, 3, 5
4280OCIM : Object-centric Compositional Imagination for Visual Abstract Reasoning3.500.873, 5, 3, 3
4281SplitMixer: Fat Trimmed From MLP-like Models3.501.661, 3, 5, 5
4282Effectively Clarify Confusion via Visualized Aggregation and Separation of Deep Representation3.500.873, 3, 3, 5
4283The Impact of Neighborhood Distribution in Graph Convolutional Networks3.500.873, 3, 3, 5
4284Structural Code Representation Learning for Auto-Vectorization3.500.873, 3, 5, 3
4285MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features3.500.873, 3, 5, 3
4286Learning-Based Radiomic Prediction of Type 2 Diabetes Mellitus Using Image-Derived Phenotypes3.500.875, 3, 3, 3
4287Revisiting Instance-Reweighted Adversarial Training3.500.873, 3, 3, 5
4288Examining the Difference Among Transformers and CNNs with Explanation Methods3.501.663, 1, 5, 5
4289Few-Shot Text Classification with Dual Contrastive Consistency Training3.500.873, 3, 3, 5
4290Capsa: A Unified Framework for Quantifying Risk in Deep Neural Networks3.500.873, 3, 3, 5
4291Self-supervised Continual Learning based on Batch-mode Novelty Detection3.500.873, 3, 3, 5
4292A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games3.501.661, 5, 5, 3
4293TRIDE: A Temporal, Robust, and Informative Data Augmentation Framework for Disease Progression Modeling3.500.873, 5, 3, 3
4294Approximate Conditional Coverage via Neural Model Approximations3.500.873, 5, 3, 3
4295Towards Representative Subset Selection for Self-Supervised Speech Recognition3.500.873, 5, 3, 3
4296Learning to Act through Activation Function Optimization in Random Networks3.500.875, 3, 3, 3
4297Representation Learning via Consistent Assignment of Views over Random Partitions3.500.873, 3, 3, 5
4298PRANC: Pseudo RAndom Networks for Compacting deep models3.500.873, 3, 3, 5
4299Biological connectomes as a representation for the architecture of artificial neural networks3.501.665, 5, 1, 3
4300Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation3.500.875, 3, 3, 3
4301Task Regularized Hybrid Knowledge Distillation For Continual Object Detection3.500.875, 3, 3, 3
4302GOING BEYOND 1-WL EXPRESSIVE POWER WITH 1-LAYER GRAPH NEURAL NETWORKS3.500.873, 3, 3, 5
4303Backdoors Stuck At The Frontdoor: Multi-Agent Backdoor Attacks That Backfire3.501.661, 5, 5, 3
4304Less is More: Rethinking Few-Shot Learning and Recurrent Neural Nets3.500.873, 5, 3, 3
4305When Neural ODEs meet Neural Operators3.500.873, 5, 3, 3
4306Reducing Forgetting In Federated Learning with Truncated Cross-Entropy3.500.873, 5, 3, 3
4307FedEED: Efficient Federated Distillation with Ensemble of Aggregated Models3.500.873, 3, 5, 3
4308A Simple, Yet Effective Approach to Finding Biases in Code Generation3.500.873, 5, 3, 3
4309Surrogate Gradient Design for LIF networks3.500.873, 3, 3, 5
4310The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks3.500.873, 5, 3, 3
4311Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces3.502.501, 6, 1, 6
4312Linear Scalarization for Byzantine-Robust Learning on non-IID data3.500.873, 3, 3, 5
4313Planning With Uncertainty: Deep Exploration in Model-Based Reinforcement Learning3.500.873, 3, 3, 5
4314A Hierarchical Hyper-rectangle Mass Model for Fine-grained Entity Typing3.500.873, 5, 3, 3
4315Enhancing the Transferability of Adversarial Examples via a Few Queries and Fuzzy Domain Eliminating3.501.661, 5, 3, 5
4316Towards Information-Theoretic Pattern Mining in Time Series3.501.661, 5, 5, 3
4317SuperMarioDomains: Generalizing to Domains with Evolving Graphics3.500.873, 3, 5, 3
4318AIA: learn to design greedy algorithm for NP-complete problems using neural networks3.500.873, 3, 3, 5
4319AVT: Audio-Video Transformer for Multimodal Action Recognition3.500.873, 3, 5, 3
4320Accelerating Adaptive Federated Optimization with Local Gossip Communications3.500.873, 5, 3, 3
4321On the Complexity of Bayesian Generalization3.500.873, 3, 3, 5
4322Compound Tokens: Channel Fusion for Vision-Language Representation Learning3.500.875, 3, 3, 3
4323Are vision transformers more robust than CNNs for Backdoor attacks?3.500.875, 3, 3, 3
4324Fair Federated Learning via Bounded Group Loss3.500.873, 5, 3, 3
4325Target-Free Ligand Scoring via One-Shot Learning3.500.873, 3, 3, 5
4326Beyond Traditional Transfer Learning: Co-finetuning for Action Localisation3.500.873, 3, 5, 3
4327Neural Embeddings for Text3.500.873, 3, 3, 5
4328Tessellated Neural Networks: A Robust Defence against Adversarial Attacks3.500.873, 3, 3, 5
4329Deep Reinforcement learning on Adaptive Pairwise Critic and Asymptotic Actor3.500.873, 5, 3, 3
4330Causal Inference via Nonlinear Variable Decorrelation in Healthcare3.500.873, 5, 3, 3
4331DoE2Vec: Representation Learning for Exploratory Landscape Analysis3.500.873, 5, 3, 3
4332Test-time recalibration of conformal predictors under distribution shift based on unlabeled examples3.500.875, 3, 3, 3
4333Newton Losses: Efficiently Including Second-Order Information into Gradient Descent3.500.873, 5, 3, 3
4334When is Adversarial Robustness Transferable?3.500.873, 3, 5, 3
4335On the Connection between Fisher's Criterion and Shannon's Capacity: Theoretical Concepts and Implementation3.500.873, 5, 3, 3
4336Neural Operator Variational Inference based on Regularized Stein Discrepancy for Deep Gaussian Processes3.500.873, 3, 3, 5
4337Self Check-in: Tight Privacy Amplification for Practical Distributed Learning3.501.661, 5, 5, 3
4338Understanding Catastrophic Overfitting in Fast Adversarial Training From a Non-robust Feature Perspective3.500.873, 3, 3, 5
4339TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective3.500.873, 3, 3, 5
4340Generative Multi-Flow Networks: Centralized, Independent and Conservation3.500.873, 3, 5, 3
4341motifNet: Functional motif interactions discovered in mRNA sequences with implicit neural representation learning3.500.873, 3, 3, 5
4342Rethinking Data Augmentation for Improving Transferable Targeted Attacks3.500.875, 3, 3, 3
4343ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets3.500.873, 3, 3, 5
4344Cali-NCE: Boosting Cross-modal Video Representation Learning with Calibrated Alignment3.500.873, 5, 3, 3
4345Strength-Adaptive Adversarial Training3.500.875, 3, 3, 3
4346Deep Deformation Based on Feature-Constraint for 3D Human Mesh Correspondence3.500.875, 3, 3, 3
4347An Improved Baseline for Masked Contrastive Learning3.501.661, 5, 5, 3
4348Towards Generalized Combinatorial Solvers via Reward Adjustment Policy Optimization3.501.661, 3, 5, 5
4349Revisiting Embeddings for Graph Neural Networks3.500.873, 5, 3, 3
4350Empirical analysis of representation learning and exploration in neural kernel bandits3.500.873, 3, 3, 5
4351EMO: Episodic Memory Optimization for Few-Shot Meta-Learning3.500.873, 5, 3, 3
4352Explainability of deep reinforcement learning algorithms in robotic domains by using Layer-wise Relevance Propagation3.500.873, 5, 3, 3
4353High Dimensional Bayesian Optimization with Reinforced Transformer Deep Kernels3.500.873, 3, 3, 5
4354Latent Offline Distributional Actor-Critic3.500.875, 3, 3, 3
4355Leveraging variational autoencoders for multiple data imputation3.501.665, 1, 3, 5
4356Rethinking Learning Dynamics in RL using Adversarial Networks3.500.873, 5, 3, 3
4357Is Stochastic Gradient Descent Near Optimal?3.500.875, 3, 3, 3
4358Elastic Mean-Teacher Distillation Mitigates the Continual Learning Stability Gap3.500.873, 3, 3, 5
4359FONDUE: an Algorithm to Find the Optimal Dimensionality of the Latent Representations of Variational Autoencoders3.500.873, 5, 3, 3
4360Interpreting Distributional Reinforcement Learning: A Regularization Perspective3.500.873, 3, 3, 5
4361Global Hardest Example Mining with Prototype-based Triplet Loss3.500.873, 3, 3, 5
4362MGMA: Mesh Graph Masked Autoencoders for Self-supervised Learning on 3D Shape3.500.873, 3, 3, 5
4363Improving the Latent Space of Image Style Transfer3.500.875, 3, 3, 3
4364Language-Guided Artistic Style Transfer Using the Latent Space of DALL-E3.500.873, 3, 5, 3
4365Out-of-distribution Detection with Diffusion-based Neighborhood3.500.873, 3, 3, 5
4366SELF-SUPERVISED PRETRAINING FOR DIFFERENTIALLY PRIVATE LEARNING3.500.873, 3, 5, 3
4367Learning Axis-Aligned Decision Trees with Gradient Descent3.500.875, 3, 3, 3
4368A Fairness Analysis on Differentially Private Aggregation of Teacher Ensembles3.500.873, 5, 3, 3
4369Domain Specific Denoising Diffusion Probabilistic Models for Brain Dynamics3.501.663, 5, 5, 1
4370A Simple and Provable Method to Adapt Pre-trained Model across Domains with Few Samples3.500.875, 3, 3, 3
4371EyeDAS: Securing Perception of Autonomous Cars Against the Stereoblindness Syndrome3.501.665, 1, 5, 3
4372Hardware-restriction-aware training (HRAT) for memristor neural networks3.500.873, 3, 5, 3
4373ViTKD: Practical Guidelines for ViT Feature Knowledge Distillation3.500.873, 5, 3, 3
4374Sharpness-aware Quantization for Deep Neural Networks3.500.873, 3, 5, 3
4375DOTIN: Dropping Out Task-Irrelevant Nodes for GNNs3.500.875, 3, 3, 3
4376GraphCG: Unsupervised Discovery of Steerable Factors in Graphs3.500.873, 5, 3, 3
4377Rethinking Knowledge Distillation via Cross-Entropy3.500.873, 3, 5, 3
4378Progressive Mixup Augmented Teacher-Student Learning for Unsupervised Domain Adaptation3.400.803, 3, 3, 5, 3
4379On Making Graph Continual Learning Easy, Fool-Proof, and Extensive: a Benchmark Framework and Scenarios3.401.503, 3, 5, 1, 5
4380Off Policy Average Reward Actor Critic with Deterministic Policy Search3.401.501, 3, 3, 5, 5
4381Rethinking Deep Spiking Neural Networks: A Multi-Layer Perceptron Approach3.400.805, 3, 3, 3, 3
4382Cooperative Adversarial Learning via Closed-Loop Transcription3.401.505, 1, 3, 3, 5
4383Dealing with missing data using attention and latent space regularization3.400.803, 5, 3, 3, 3
4384Revisiting Information-Based Clustering with Pseudo-Posterior Models3.332.051, 6, 3
4385BiasPAD: A Bias-Progressive Auto-Debiasing Framework3.332.053, 1, 6
4386Are Graph Attention Networks Attentive Enough? Rethinking Graph Attention by Capturing Homophily and Heterophily3.332.053, 6, 1
4387Human alignment of neural network representations3.332.053, 1, 6
4388ON COMPLEX-DOMAIN CNN REPRESENTATIONS FOR CLASSIFYING REAL/COMPLEX-VALUED DATA3.332.056, 1, 3
4389Enhancing Robustness of Deep Networks Based on a Two-phase Model of Their Training with Noisy Labels3.332.053, 1, 6
4390How Erdös and Rényi Win the Lottery3.332.056, 3, 1
4391Convergence Rate of Primal-Dual Approach to Constrained Reinforcement Learning with Softmax Policy3.251.796, 3, 1, 3
4392Towards biologically plausible Dreaming and Planning3.251.791, 3, 6, 3
4393On the Convergence of Federated Deep AUC Maximization3.251.791, 6, 3, 3
4394Who are playing the games?3.251.796, 3, 1, 3
4395Post-mortem on a deep learning contest: a Simpson’s paradox and the complementary roles of scale metrics versus shape metrics3.251.793, 3, 1, 6
4396Complete Likelihood Objective for Latent Variable Models3.252.861, 3, 1, 8
4397Meta-Learning via Classifier(-free) Guidance3.251.791, 3, 6, 3
4398Marginal Probability Explanation: A Saliency Map with Closed-loop Validation3.252.281, 5, 6, 1
4399Representation Interference Suppression via Non-linear Value Factorization for Indecomposable Markov Games3.252.281, 5, 6, 1
4400Rank-1 Matrix Completion with Gradient Descent and Small Random Initialization3.251.791, 3, 6, 3
4401Exploring semantic information in disease: Simple Data Augmentation Techniques for Chinese Disease Normalization3.251.793, 1, 6, 3
4402The Curse of Low Task Diversity: On the Failure of Transfer Learning to Outperform MAML and their Empirical Equivalence3.251.793, 1, 3, 6
4403Contrastive Unsupervised Learning of World Model with Invariant Causal Features3.251.793, 1, 3, 6
4404Certification of Attribution Robustness for Euclidean Distance and Cosine Similarity Measure3.251.791, 3, 6, 3
4405Quark: A Gradient-Free Quantum Learning Framework for Classification Tasks3.251.793, 6, 1, 3
4406On the Impact of Adversarially Robust Models on Algorithmic Recourse3.251.793, 1, 6, 3
4407Link Prediction without Graph Neural Networks3.251.791, 6, 3, 3
4408scFormer: a universal representation learning approach for single-cell data using transformers3.251.793, 6, 1, 3
4409The Crossword Puzzle: Simplifying Deep Neural Network Pruning with Fabulous Coordinates3.202.046, 5, 1, 1, 3
4410Suppression helps: Lateral Inhibition-inspired Convolutional Neural Network for Image Classification3.001.411, 3, 5, 3
4411Detecting Out-of-Distribution Data with Semi-supervised Graph “Feature' Networks3.001.413, 3, 1, 5
4412Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation3.000.003, 3, 3
4413Towards scalable and non-IID robust Hierarchical Federated Learning via Label-driven Knowledge Aggregator3.000.003, 3, 3
4414Loss Adapted Plasticity: Learning From Data With Unreliable Sources3.000.003, 3, 3
4415Online black-box adaptation to label-shift in the presence of conditional-shift3.000.003, 3, 3, 3
4416Improving Protein Interaction Prediction using Pretrained Structure Embedding3.000.003, 3, 3, 3
4417Versatile Energy-Based Models for High Energy Physics3.001.413, 5, 3, 1
4418Mixture of Basis for Interpretable Continual Learning with Distribution Shifts3.001.413, 1, 5, 3
4419Scrunch: Preventing sensitive property inference through privacy-preserving representation learning3.000.003, 3, 3
4420GM-VAE: Representation Learning with VAE on Gaussian Manifold3.000.003, 3, 3
4421Learning Test Time Augmentation with Cascade Loss Prediction3.000.003, 3, 3, 3
4422Optimizing Data-Flow in Binary Neural Networks3.000.003, 3, 3, 3
4423Neural Representations in Multi-Task Learning guided by Task-Dependent Contexts3.000.003, 3, 3, 3
4424Multi Task Learning of Different Class Label Representations for Stronger Models3.001.413, 3, 1, 5
4425Oscillation Neural Ordinary Differential Equations3.000.003, 3, 3
4426Noise Transforms Feed-Forward Networks into Sparse Coding Networks3.000.003, 3, 3, 3
4427Robust attributions require rethinking robustness metrics3.001.263, 3, 3, 5, 1
4428Atomized Deep Learning Models3.000.003, 3, 3, 3, 3
4429How Should I Plan? A Performance Comparison of Decision-Time vs. Background Planning3.001.415, 1, 3, 3
4430Towards Diverse Perspective Learning with Switch over Multiple Temporal Pooling3.001.631, 3, 5
4431Probe Into Multi-agent Adversarial Reinforcement Learning through Mean-Field Optimal Control3.001.413, 1, 5, 3
4432LEARNING DYNAMIC ABSTRACT REPRESENTATIONS FOR SAMPLE-EFFICIENT REINFORCEMENT LEARNING3.000.003, 3, 3
4433Boosting Adversarial Training with Masked Adaptive Ensemble3.000.003, 3, 3, 3
4434Disentangled Conditional Variational Autoencoder for Unsupervised Anomaly Detection3.000.003, 3, 3, 3
4435Protecting Bidder Information in Neural Auctions3.000.003, 3, 3
4436META-LEARNING FOR UNSUPERVISED OUTLIER DETECTION WITH OPTIMAL TRANSPORT3.001.415, 1, 3, 3
4437ADVL: Adaptive Distillation for Vision-Language Tasks3.000.003, 3, 3
4438Learning Arborescence with An Efficient Inference Algorithm3.000.003, 3, 3
4439Cross-Domain Self-Supervised Deep Learning for Robust Alzheimer's Disease Progression Modeling3.001.413, 3, 1, 5
4440Systematic Generalization and Emergent Structures in Transformers Trained on Structured Tasks3.001.635, 1, 3
4441DeepDFA: Dataflow Analysis-Guided Efficient Graph Learning for Vulnerability Detection3.000.003, 3, 3, 3
4442Spatial Reasoning Network for Zero-shot Constrained Scene Generation3.001.635, 1, 3
4443Optimal control neural networks for data-driven discovery of gradient flows.3.000.003, 3, 3, 3
4444NOTELA: A Generalizable Method for Source Free Domain Adaptation3.000.003, 3, 3, 3
4445Federated Representation Learning via Maximal Coding Rate Reduction3.001.411, 3, 5, 3
4446Memory Efficient Dynamic Sparse Training3.000.003, 3, 3, 3
4447Temporal Change Sensitive Representation for Reinforcement Learing3.000.003, 3, 3, 3
4448TKIL: Tangent Kernel Optimization for Class Balanced Incremental Learning3.000.003, 3, 3, 3
4449A Framework for Comprehensive Evaluations of Graph Neural Network based Community Detection using Node Clustering3.000.003, 3, 3
4450Improving the Strength of Human-Like Models in Chess3.000.003, 3, 3, 3
4451Domain Transfer with Large Dynamics Shift in Offline Reinforcement Learning3.000.003, 3, 3
4452Real Data Distributions Prefer Simplicity and So Do Our Models: Why Machine Learning and Model Selection Are Possible3.000.003, 3, 3, 3
4453Continual Active Learning3.001.413, 5, 3, 1
4454Pessimistic Model-Based Actor-Critic for Offline Reinforcement Learning: Theory and Algorithms3.000.003, 3, 3, 3
4455Improving Adversarial Robustness of Deep Neural Networks via Self-adaptive Margin Defense3.000.003, 3, 3
4456Knowledge Cascade: Reverse Knowledge Distillation3.000.003, 3, 3
4457Membership Leakage in Pre-trained Language Models3.001.633, 1, 5
4458An Exploration of Conditioning Methods in Graph Neural Networks3.000.003, 3, 3
4459Robust Policy Optimization in Deep Reinforcement Learning3.000.003, 3, 3, 3
4460EiX-GNN : Concept-level eigencentrality explainer for graph neural networks3.001.411, 5, 3, 3
4461The Minimal Feature Removal Problem in Neural Networks3.001.633, 5, 1
4462Continuous Depth Recurrent Neural Differential Equations3.000.003, 3, 3
4463Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning3.000.003, 3, 3, 3
4464Progressive Data Dropout: An Adaptive Training Strategy for Large-Scale Supervised Learning3.000.003, 3, 3, 3
4465Towards a Mathematics Formalisation Assistant using Large Language Models3.001.413, 1, 5, 3
4466Learning Portable Skills by Identifying Generalizing Features with an Attention-Based Ensemble3.000.003, 3, 3
4467Data dependent frequency sensitivity of convolutional neural networks3.000.003, 3, 3
4468Is end-to-end learning enough for fitness activity recognition?3.000.943, 3, 3, 5, 3, 3, 3, 1, 3
4469Forget to Learn (F2L): Rethinking Replay Loss in Unsupervised Continuous Domain Adaptation3.001.633, 5, 1
4470Single SMPC Invocation DPHelmet: Differentially Private Distributed Learning on a Large Scale3.000.003, 3, 3, 3
4471Robust Exploration via Clustering-based Online Density Estimation3.000.003, 3, 3
4472Using semantic distance for diverse and sample efficient genetic programming3.001.631, 3, 5
4473Soft Sampling for Efficient Training of Deep Neural Networks on Massive Data3.000.003, 3, 3
4474Improving Adversarial Robustness by Contrastive Guided Diffusion Process3.000.003, 3, 3
4475Revealing Dominant Eigendirections via Spectral Non-Robustness Analysis in the Deep Reinforcement Learning Policy Manifold3.000.003, 3, 3, 3, 3
4476Enhanced Spatio-Temporal Image Encoding for Online Human Activity Recognition3.000.003, 3, 3, 3
4477SmilesFormer: Language Model for Molecular Design3.001.631, 3, 5
4478A NEW PARADIGM FOR CROSS-MODALITY PERSON RE-IDENTIFICATION3.000.003, 3, 3, 3
4479Using Planning to Improve Semantic Parsing of Instructional Texts3.001.413, 3, 1, 5
4480Model Stealing Attacks Against Vision-Language Models3.001.415, 1, 3, 3
4481Improved Stein Variational Gradient Descent with Importance Weights3.001.633, 5, 1
4482Reducing Communication Entropy in Multi-Agent Reinforcement Learning3.000.003, 3, 3, 3
4483Time-Myopic Go-Explore: Learning A State Representation for the Go-Explore Paradigm3.000.003, 3, 3, 3
4484Physics Model-based Autoencoding for Magnetic Resonance Fingerprinting3.000.003, 3, 3, 3
4485Lightweight Equivariant Graph Representation Learning for Protein Engineering3.001.631, 3, 5
4486Optimizing Connectivity through Network Gradients for the Restricted Machine3.000.003, 3, 3
4487QUIC-FL: : Quick Unbiased Compression for Federated Learning3.000.003, 3, 3
4488FedMEKT: Split Multimodal Embedding Knowledge Transfer in Federated Learning3.000.003, 3, 3, 3
4489End-to-End Speech Synthesis Based on Deep Conditional Schrödinger Bridges3.001.413, 5, 1, 3
4490CCT: Cross-consistency training for Clone Detection and Code Search Tasks3.001.415, 3, 3, 1
4491GraphVF: Controllable Protein-Specific 3D Molecule Generation with Variational Flow3.000.003, 3, 3, 3
4492Comparing Auxiliary Tasks for Learning Representations for Reinforcement Learning3.000.003, 3, 3, 3
4493UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion3.000.003, 3, 3
4494Server Aggregation as Linear Regression: Reformulation for Federated Learning3.000.003, 3, 3, 3
4495The Effective coalitions of Shapley value For Integrated Gradients3.000.003, 3, 3
4496Tree-structure segmentation for logistic regression3.000.003, 3, 3
4497Learning to solve the Hidden Clique Problem with Graph Neural Networks3.000.003, 3, 3
4498PREDICTION OF TOURISM FLOW WITH SPARSE DATA INCORPORATING TOURIST GEOLOCATIONS3.000.003, 3, 3, 3
4499Meta-learning with Auto-generated Tasks for Predicting Human Behaviour in Normal Form Games3.001.415, 3, 3, 1
4500Decentralized Policy Optimization3.000.003, 3, 3
4501Image Segmentation using Transfer Learning with DeepLabv3 to Facilitate Photogrammetric Limb Scanning3.000.003, 3, 3
4502Augmentative Topology Agents For Open-Ended Learning3.000.003, 3, 3, 3
4503Revisiting Over-smoothing in Graph Neural Networks3.000.003, 3, 3, 3
4504StepGCN: Step-oriented Graph Convolutional Networks in Representation Learning3.000.003, 3, 3
4505Gradient-based Algorithms for Pessimistic Bilevel Optimization3.000.003, 3, 3
4506ENHANCING THE PRIVACY OF FEDERATED LEARNING THROUGH DATA SYNTHESIS3.001.411, 3, 3, 5
4507The Emergence of Prototypicality: Unsupervised Feature Learning in Hyperbolic Space3.000.003, 3, 3, 3
4508Coordinated Strategy Identification Multi-Agent Reinforcement Learning3.000.003, 3, 3
4509Evaluating Robustness of Generative Models with Adversarial Networks3.000.003, 3, 3
4510Approximating How Single Head Attention Learns3.000.003, 3, 3
4511MVP: Multi-task Supervised Pre-training for Natural Language Generation3.001.635, 1, 3
4512Improving Inductive Link Prediction through Learning Generalizable Node Representations3.001.413, 3, 1, 5
4513ATTRIBUTES RECONSTRUCTION IN HETEROGENEOUS NETWORKS VIA GRAPH AUGMENTATION3.001.635, 1, 3
4514HAS IT REALLY IMPROVED? KNOWLEDGE GRAPH BASED SEPARATION AND FUSION FOR RECOMMENDATION3.000.003, 3, 3
4515On Assimilating Learned Views in Contrastive Learning3.000.003, 3, 3, 3
4516Block-Diagonal Structure Learning for Subspace Clustering3.000.003, 3, 3
4517Thrust: Adaptively Propels Large Language Models with External Knowledge3.001.413, 3, 5, 1
4518SGD and Weight Decay Provably Induce a Low-Rank Bias in Neural Networks3.001.411, 3, 3, 5
4519Transfer Learning with Context-aware Feature Compensation3.000.003, 3, 3
4520TuneUp: A Training Strategy for Improving Generalization of Graph Neural Networks3.000.003, 3, 3, 3
4521Logical view on fairness of a binary classification task3.001.631, 5, 3
4522Active Sampling for Node Attribute Completion on Graphs3.001.413, 3, 1, 5
4523Emb-GAM: an Interpretable and Efficient Predictor using Pre-trained Language Models3.001.633, 1, 5
4524FedCUAU: Clustered Federated Learning using weight divergence3.000.003, 3, 3
4525A Probabilistic Approach to Self-Supervised Learning using Cyclical Stochastic Gradient MCMC3.000.003, 3, 3
4526Tabular Data to Image Generation: Benchmark Data, Approaches, and Evaluation3.000.003, 3, 3
4527Representing Latent Dimensions Using Compressed Number Lines3.001.631, 5, 3
4528Deep Invertible Approximation of Topologically Rich Maps between Manifolds3.001.413, 1, 5, 3
4529Neural Graphical Models3.000.003, 3, 3, 3
4530Meta-learning from demonstrations improves compositional generalization3.000.003, 3, 3, 3
4531Communication-Optimal Distributed Graph Clustering under Duplication Models3.001.631, 3, 5
4532LSTM-BASED-AUTO-BI-LSTM for Remaining Useful Life (RUL) Prediction: the first round of test results3.000.003, 3, 3
4533ModReduce: A Multi-Knowledge Distillation Framework with Online Learning3.001.413, 5, 3, 1
4534Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning3.000.003, 3, 3, 3, 3
4535Isometric Representations in Neural Networks Improve Robustness3.000.003, 3, 3, 3
4536CBP-QSNN: Spiking Neural Networks Quantized Using Constrained Backpropagation3.000.003, 3, 3
4537Disentangled (Un)Controllable Features3.000.003, 3, 3, 3
4538CWATR: Generating Richer Captions with Object Attributes3.000.003, 3, 3
4539QUANTIZATION AWARE FACTORIZATION FOR DEEP NEURAL NETWORK COMPRESSION3.000.003, 3, 3, 3
4540Fairness of Federated Learning with Dynamic Participants3.000.003, 3, 3
4541Context and History Aware Other-Shaping3.001.413, 1, 3, 5
4542SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation3.000.003, 3, 3
4543Bidirectional global to local attention for deep metric learning.3.001.413, 5, 1, 3
4544Class Interference of Deep Networks3.001.633, 5, 1
4545Bi-Level Dynamic Parameter Sharing among Individuals and Teams for Promoting Collaborations in Multi-Agent Reinforcement Learning3.000.003, 3, 3, 3
4546Uplift Modelling based on Graph Neural Network Combined with Causal Knowledge3.000.003, 3, 3
4547SynMotor: A Benchmark Suite for Object Attribute Regression and Multi-task Learning3.001.631, 3, 5
4548Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning3.000.003, 3, 3
4549Signs in the Lottery: Structural Similarities Between Winning Tickets3.001.413, 1, 5, 3
4550To be private and robust: Differentially Private Optimizers Can Learn Adversarially Robust Models3.001.265, 3, 1, 3, 3
4551Fine-Grained Source Code Vulnerability Detection via Graph Neural Networks3.001.411, 3, 5, 3
4552APLA: Class-imbalanced Semi-supervised Learning with Adapative Pseudo-labeling and Loss Adjustment3.001.631, 3, 5
4553Hypernetwork approach to Bayesian MAML3.000.003, 3, 3
4554Deep Leakage from Model in Federated Learning3.001.633, 5, 1
4555Existence of a bad local minimum of neural networks with general smooth activation functions3.000.003, 3, 3, 3
4556ADVERSARY-AWARE PARTIAL LABEL LEARNING WITH LABEL DISTILLATION3.001.413, 1, 3, 5
4557Identical Initialization: A Universal Approach to Fast and Stable Training of Neural Networks3.000.003, 3, 3, 3
4558Detecting Backdoor Attacks via Layer-wise Feature Analysis3.000.003, 3, 3
4559Neural Layered Min-sum Decoders for Algebraic Codes3.000.003, 3, 3
4560The Importance of Suppressing Complete Reconstruction in Autoencoders for Unsupervised Outlier Detection3.000.003, 3, 3, 3
4561CENTROID-BASED JOINT REPRESENTATION FOR HUMAN POSE ESTIMATION AND INSTANCE SEGMENTATION3.001.633, 1, 5
4562Leveraging Hard Negative Priors for Automatic Medical Report Generation3.000.003, 3, 3, 3
4563MULTI-VIEW DEEP EVIDENTIAL FUSION NEURAL NETWORK FOR ASSESSMENT OF SCREENING MAMMOGRAMS3.001.413, 5, 3, 1
4564Probable Dataset Searching Method with Uncertain Dataset Information in Adjusting Architecture Hyper Parameter3.000.003, 3, 3
4565Scaled Neural Multiplicative Model for Tractable Optimization3.001.631, 5, 3
4566On the Power-Law Hessian Spectra in Deep Learning3.000.003, 3, 3
4567Theoretical generalization bounds for improving the efficiency of deep online training3.000.003, 3, 3, 3
4568A Representation Bottleneck of Bayesian Neural Networks3.000.003, 3, 3
4569LAU: A novel two-parameter learnable Logmoid Activation Unit3.001.631, 3, 5
4570N-Student Learning: An Approach to Model Uncertainty and Combat Overfitting3.000.003, 3, 3
4571Better handling unlabeled entity problem using PU-learning and negative sampling3.000.003, 3, 3, 3
4572Communication-Efficient and Drift-Robust Federated Learning via Elastic Net3.000.003, 3, 3, 3
4573Partition Matters in Learning and Learning-to-Learn Implicit Neural Representations3.000.003, 3, 3, 3
4574Substructured Graph Convolution for Non-overlapping Graph Decomposition3.000.003, 3, 3
4575Inverse Kernel Decomposition3.000.003, 3, 3, 3
4576An Investigation of Domain Generalization with Rademacher Complexity3.000.003, 3, 3, 3
4577ProGen2: Exploring the Boundaries of Protein Language Models3.000.003, 3, 3
4578Convergence of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss3.000.003, 3, 3, 3
4579Spotting Expressivity Bottlenecks and Fixing Them Optimally3.000.003, 3, 3, 3
4580Diffusing Graph Attention3.000.003, 3, 3, 3
4581AdaptFSP: Adaptive Fictitious Self Play3.000.003, 3, 3, 3
4582An Intrinsic Dimension Perspective of Transformers for Sequential Modeling3.001.411, 3, 3, 5
4583TabDDPM: Modelling Tabular Data with Diffusion Models3.001.413, 1, 5, 3
4584ErGOT: entropy-regularized graph optimal transport3.000.003, 3, 3, 3
4585Considering Layerwise Importance in the Lottery Ticket Hypothesis3.000.003, 3, 3
4586Memory of Unimaginable Outcomes in Experience Replay3.000.003, 3, 3, 3
4587Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting3.000.003, 3, 3
4588TGP: Explainable Temporal Graph Neural Networks for Personalized Recommendation3.001.411, 3, 5, 3
4589Efficient Policy Space Response Oracles3.001.633, 5, 1
4590RetinexUTV: ROBUST RETINEX MODEL WITH UNFOLDING TOTAL VARIATION3.001.413, 1, 3, 5
4591Learning in Compressed Domain via Knowledge Transfer3.000.003, 3, 3, 3
4592Generative Recorrupted-to-Recorrupted: An Unsupervised Image Denoising Network for Arbitrary Noise Distribution3.001.413, 1, 5, 3
4593Protective Label Enhancement for Label Privacy3.001.631, 3, 5
4594Low-Entropy Features Hurt Out-of-Distribution Performance3.000.003, 3, 3, 3
4595Determinant regularization for Deep Metric Learning3.001.413, 1, 3, 5
4596Learning to Communicate using Contrastive Learning3.000.003, 3, 3
4597Flexible Relation Preserving for Adversarial Training3.001.633, 1, 5
4598Joint Spatiotemporal Attention for Mortality Prediction of Patients with Long COVID3.000.003, 3, 3
4599PA-LoFTR: Local Feature Matching with 3D Position-Aware Transformer3.000.003, 3, 3
4600Explaining Representation Bottlenecks of Convolutional Decoder Networks3.000.003, 3, 3, 3
4601Divide and conquer policy for efficient GAN training3.001.413, 3, 5, 1
4602TaylorNet: A Taylor-Driven Generic Neural Architecture3.000.003, 3, 3
4603Coupling Semi-supervised Learning with Reinforcement Learning for Better Decision Making -- An application to Cryo-EM Data Collection3.000.003, 3, 3
4604ProtoVAE: Using Prototypical Networks for Unsupervised Disentanglement3.000.003, 3, 3, 3
4605Abstract Visual Reasoning by Self-supervised Contrastive Learning3.000.003, 3, 3, 3
4606i-MAE: Are Latent Representations in Masked Autoencoders Linearly Separable?3.000.003, 3, 3
4607Mixed-Precision Inference Quantization: Radically Towards Faster inference speed, Lower Storage requirement, and Lower Loss3.001.631, 3, 5
4608Leveraging Double Descent for Scientific Data Analysis: Face-Based Social Behavior as a Case Study3.001.413, 1, 3, 5
4609Deep Duplex Learning for Weak Supervision3.001.413, 1, 3, 5
4610Fast Test-Time Adaptation Using Hints3.000.003, 3, 3
4611Gradient Properties of Hard Thresholding Operator3.001.411, 3, 3, 5
4612Accurate and Efficient Soma Reconstruction in a Full Adult Fly Brain3.001.635, 1, 3
4613An Encryption Framework for Pre-Trained Neural Networks3.001.631, 3, 5
4614Wasserstein Fair Autoencoders3.001.633, 5, 1
4615Low-Rank Winograd Transformation for 3D Convolutional Neural Networks3.000.003, 3, 3, 3
4616Structure-based Drug Design with Equivariant Diffusion Models3.000.003, 3, 3, 3
4617Deep reinforced active learning for multi-class image classification3.000.003, 3, 3
4618Big Learning: A Universal Machine Learning Paradigm?3.001.635, 1, 3
4619Interpretable Out-of-Distribution Detection using Pattern Identification3.000.003, 3, 3
4620NASiam: Efficient Representation Learning using Neural Architecture Search for Siamese Networks3.001.411, 3, 3, 5
4621On a Built-in Conflict between Deep Learning and Systematic Generalization3.001.415, 1, 3, 3
4622Block-level Stiffness Analysis of Residual Networks3.000.003, 3, 3
4623Explainable Artificial Intelligence: Reaping the Fruits of Decision Trees3.001.415, 1, 3, 3
4624Hard Regularization to Prevent Collapse in Online Deep Clustering without Data Augmentation3.001.413, 3, 1, 5
46253D-Scene-Entities: Using Phrase-to-3D-Object Correspondences for Richer Visio-Linguistic Models in 3D Scenes3.000.003, 3, 3, 3
4626MultiWave: Multiresolution Deep Architectures through Wavelet Decomposition for Multivariate Timeseries Forecasting and Prediction3.000.003, 3, 3, 3
4627Shared Knowledge Lifelong Learning3.000.003, 3, 3, 3
4628WeightRelay: Efficient Heterogenous Federated Learning on Time Series3.000.003, 3, 3, 3
4629Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders3.000.003, 3, 3
4630Generaling Multimodal Variational Methods to Sets3.001.411, 3, 3, 5
4631Training A Multi-stage Deep Classifier with Feedback Signals3.001.631, 5, 3
4632Normalized Activation Function: Toward Better Convergence3.000.003, 3, 3, 3
4633Hybrid Neuro-Symbolic Reasoning based on Multimodal Fusion3.001.413, 1, 5, 3
4634Distilling Text-Image Foundation Models3.000.003, 3, 3, 3
4635Refining Visual Representation for Generalized Zero-Shot Recognition through Implicit-Semantics-Guided Metric Learning3.000.003, 3, 3, 3
4636A MULTI-SCALE STRUCTURE-PRESERVING HETEROLOGOUS IMAGE TRANSFORMATION ALGORITHM BASED ON CONDITIONAL ADVERSARIAL NETWORK LEARNING3.000.003, 3, 3
4637When do Convolutional Neural Networks Stop Learning?2.801.833, 3, 6, 1, 1
4638Universal Graph Neural Networks without Message Passing2.802.231, 5, 6, 1, 1
4639Understanding ReLU Network Robustness Through Test Set Certification Performance2.752.051, 1, 6, 3
4640Sparsity by Redundancy: Solving $L_1$ with a Simple Reparametrization2.752.051, 6, 1, 3
4641Self-Programming Artificial Intelligence Using Code-Generating Language Models2.600.803, 3, 3, 3, 1
4642Exploring Generalization of Non-Contrastive self-supervised Learning2.600.803, 3, 3, 1, 3
4643Quantized Disentangled Representations for Object-Centric Visual Tasks2.500.873, 1, 3, 3
4644HOW SAMPLING AFFECTS TRAINING: AN EFFECTIVE SAMPLING THEORY STUDY FOR LONG-TAILED IMAGE CLASSIFICATION2.500.871, 3, 3, 3
4645Farsighter: Efficient Multi-step Exploration for Deep Reinforcement Learning2.500.873, 3, 3, 1
4646CLASSIFICATION OF INCOMPLETE DATA USING AUGMENTED MLP2.500.873, 3, 1, 3
4647Correspondences between word learning in children and captioning models2.500.873, 3, 1, 3
4648Stabilized training of joint energy-based models and its practical applications2.500.873, 3, 1, 3
4649Robustness Evaluation Using Local Substitute Networks2.500.873, 3, 1, 3
4650An Empirical Study of the Neural Contextual Bandit Algorithms2.500.871, 3, 3, 3
4651Global View For GCN: Why Go Deep When You Can Be Shallow?2.501.663, 1, 5, 1
4652BIG-Graph: Brain Imaging Genetics by Graph Neural Network2.500.871, 3, 3, 3
4653Combining pretrained speech and text encoders for spoken language processing2.500.873, 3, 3, 1
4654Image Emotion Recognition using Cognitive Contextual Summarization Framework2.500.873, 3, 3, 1
4655FedPD: Defying data heterogeneity through privacy distillation2.500.871, 3, 3, 3
4656Multivariate Gaussian Representation of Previous Tasks for Continual Learning2.500.871, 3, 3, 3
4657Automatic Dictionary Generation: Could Brothers Grimm Create a Dictionary with BERT?2.500.871, 3, 3, 3
4658Indoor Localisation for Detecting Medication Use in Parkinson's Disease2.500.871, 3, 3, 3
4659Skill Graph for Real-world Quadrupedal Robot Reinforcement Learning2.500.873, 3, 1, 3
4660Hierarchical Multi-Resolution Graph Generation Networks2.500.873, 1, 3, 3
4661TT-Rules: Extracting & Optimizing Exact Rules of a CNN-Based Model - Application to Fairness2.500.873, 3, 1, 3
4662A sampling framework for value-based reinforcement learning2.500.871, 3, 3, 3
4663Change Detection for bi-temporal images classification based on Siamese Variational AutoEncoder and Transfer Learning2.500.873, 1, 3, 3
4664Coarse-to-fine Knowledge Graph Domain Adaptation based on Distantly-supervised Iterative Training2.500.871, 3, 3, 3
4665Representing Multi-view Time-series Graph Structures for Multivariate Long-term Time-series Forecasting2.501.661, 3, 5, 1
4666Automaton Distillation: A Neuro-Symbolic Transfer Learning Approach for Deep RL2.500.871, 3, 3, 3
4667Point-based Molecular Representation Learning from Conformers2.501.661, 5, 1, 3
4668Inferring Causal Relations between Temporal Events2.500.871, 3, 3, 3
4669On the Nonconvex Convergence of SGD2.500.873, 1, 3, 3
4670Comparative Analysis between Vision Transformers and CNNs from the view of Neuroscience2.500.873, 1, 3, 3
4671A Robustly and Effectively Optimized Pretraining Approach for Masked Autoencoder2.500.871, 3, 3, 3
4672Transmission Dynamics of Hepatitis B: Analysis and Control2.500.873, 3, 1, 3
4673Enhancement and Numerical Assessment of Novel SARS-CoV-2 Virus Transmission Model2.500.873, 3, 1, 3
4674DEEAPR: Controllable Depth Enhancement via Adaptive Parametric Feature Rotation2.500.873, 3, 3, 1
4675BinaryVQA: A Versatile Dataset to Push the Limits of VQA Models2.500.873, 1, 3, 3
4676Causal Information Bottleneck Boosts Adversarial Robustness of Deep Neural Network2.501.661, 3, 1, 5
4677Go-Explore with a guide: Speeding up search in sparse reward settings with goal-directed intrinsic rewards2.500.871, 3, 3, 3
4678Exploring Over-smoothing in Graph Attention Networks from the Markov Chain Perspective2.500.873, 3, 1, 3
4679Multiple output samples for each input in a single-output Gaussian process2.500.873, 3, 3, 1
4680Supervised Random Feature Regression via Projection Pursuit2.330.943, 1, 3
4681Geometry Problem Solving based on Counterfactual Evolutionary Reasoning2.330.943, 1, 3
4682Improve distance metric learning by learning positions of class centers2.330.943, 3, 1
4683MCTransformer: Combining Transformers And Monte-Carlo Tree Search For Offline Reinforcement Learning2.330.943, 1, 3
4684NOVEL FEATURE REPRESENTATION STRATEGIES FOR TIME SERIES FORECASTING WITH PREDICTED FUTURE COVARIATES2.330.943, 1, 3
4685CNN Compression and Search Using Set Transformations with Width Modifiers on Network Architectures2.330.941, 3, 3
4686Discerning Hydroclimatic Behavior with a Deep Convolutional Residual Regressive Neural Network2.330.943, 3, 1
4687Multi-scale Attention for Diabetic Retinopathy Detection in Retinal Fundus Images2.330.943, 3, 1
4688PES: Probabilistic Exponential Smoothing for Time Series Forecasting2.330.941, 3, 3
4689The batch size can affect inference results2.330.943, 1, 3
4690Multi-Reward Fusion: Learning from Other Policies by Distilling2.330.943, 1, 3
4691Break the Wall Between Homophily and Heterophily for Graph Representation Learning2.330.943, 3, 1
4692SC2EGSet: StarCraft II Esport Replay and Game-state Dataset2.330.943, 1, 3
4693Structural Privacy in Graphs2.330.943, 3, 1
4694Personalized Federated Hypernetworks for Privacy Preservation in Multi-Task Reinforcement Learning2.330.943, 3, 1
4695Uncertainty Guided Depth Fusion for Spike Camera2.330.943, 3, 1
4696$$CONVOLUTION AND POOLING OPERATION MODULE WITH ADAPTIVE STRIDE PROCESSING EFFEC$$2.331.895, 1, 1
4697Towards Global Optimality in Cooperative MARL with Sequential Transformation2.330.941, 3, 3
4698Towards Controllable Policy through Goal-Masked Transformers2.330.943, 3, 1
4699Monkeypox with Cross Infection Hypothesis via Epidemiological Mode2.330.943, 3, 1
4700MANDERA: Malicious Node Detection in Federated Learning via Ranking2.330.943, 1, 3
4701C3PO: Learning to Achieve Arbitrary Goals via Massively Entropic Pretraining2.330.941, 3, 3
4702SAE: Estimation for Transition Matrix in Annotation Algorithms2.330.943, 1, 3
4703Do We Really Achieve Fairness with Explicit Sensitive Attributes?2.330.941, 3, 3
4704Rethinking Backdoor Data Poisoning Attacks in the Context of Semi-Supervised Learning2.330.941, 3, 3
4705CoGANs: Collaborative Generative Adversarial Networks2.330.943, 3, 1
4706S-SOLVER: Numerically Stable Adaptive Step Size Solver for Neural ODEs2.331.891, 1, 5
4707Probing for Correlations of Causal Facts: Large Language Models and Causality2.252.171, 1, 1, 6
4708CI-VAE: a Class-Informed Deep Variational Autoencoder for Enhanced Class-Specific Data Interpolation2.252.171, 1, 6, 1
4709Improved Gradient Descent Optimization Algorithm based on Inverse Model-Parameter Difference2.001.001, 3, 1, 3
4710Emergence of Exploration in Policy Gradient Reinforcement Learning via Resetting2.001.001, 3, 1, 3
4711Counterfactual Vision-Language Data Synthesis with Intra-Sample Contrast Learning2.001.003, 3, 1, 1
4712Shallow Learning In Materio.2.001.003, 1, 1, 3
4713Improving Accuracy and Explainability of Online Handwriting Recognition2.001.001, 3, 1, 3
4714ESEAD: An Enhanced Simple Ensemble and Distillation Framework for Natural Language Processing2.001.003, 3, 1, 1
4715Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment2.001.001, 1, 3, 3
4716'I pick you choose': Joint human-algorithm decision making in multi-armed bandits2.001.003, 1, 1, 3
4717Unsupervised Non-Parametric Signal Separation Using Bayesian Neural Networks2.001.003, 1, 1, 3
4718Re-Benchmarking Out-of-Distribution Detection in Deep Neural Networks2.001.003, 1, 1, 3
4719Smooth Mathematical Functions from Compact Neural Networks2.001.003, 1, 3, 1
4720Online Reinforcement Learning via Posterior Sampling of Policy2.001.001, 1, 3, 3
4721Comparing semantic and morphological analogy completion in word embeddings2.001.001, 3, 1, 3
4722Co-Evolution As More Than a Scalable Alternative for Multi-Agent Reinforcement Learning2.001.003, 3, 1, 1
4723Self-Paced Learning Enhanced Physics-informed Neural Networks for Solving Partial Differential Equations2.001.001, 3, 3, 1
4724Searching optimal adjustment features for treatment effect estimation2.001.003, 3, 1, 1
4725Feature-Driven Talking Face Generation with StyleGAN22.001.001, 3, 1, 3
4726GENERATIVE OF ORIGIN MODEL DISTRIBUTION MASKED WITH EMOTIONS AND TOPICS DISTRIBUTION IN HYBRID METHOD2.001.003, 1, 1, 3
4727MESSAGENET: MESSAGE CLASSIFICATION USING NATURAL LANGUAGE PROCESSING AND META-DATA2.001.001, 3, 1, 3
4728Semi-connected Joint Entity Recognition and Relation Extraction of Contextual Entities in Family History Records2.001.001, 3, 3, 1
4729An Empirical Study on Anomaly detection Using Density Based and Representative Based Clustering algorithms2.001.003, 3, 1, 1
4730Tree Structure LSTM for Chinese Named Entity Recognition2.001.001, 1, 3, 3
4731MixQuant: A Quantization Bit-width Search that Can Optimize the Performance of your Quantization Method2.001.003, 3, 1, 1
4732The GANfather: Controllable generation of malicious activity to expose detection weaknesses and improve defence systems.1.670.941, 1, 3
4733Vectorial Graph Convolutional Networks1.670.943, 1, 1
4734Learning Discriminative Representations for Chromosome Classification with Small Datasets1.670.941, 1, 3
4735REPRESENTATIVE PROTOTYPE WITH CONSTRASTIVE LEARNING FOR SEMI-SUPENVISED FEW-SHOT CLASSIFICATION1.670.941, 1, 3
4736Adaptive Gradient Methods with Local Guarantees1.670.941, 1, 3
4737Predicting Antimicrobial MICs for Nontyphoidal Salmonella Using Multitask Representations Learning1.670.941, 3, 1
4738Convergence of the mini-batch SIHT algorithm1.670.941, 1, 3
4739Partial Output Norm: Mitigating the Model Output Blow-up Effect of Cross Entropy Loss1.500.873, 1, 1, 1
4740State Decomposition for Model-free Partially observable Markov Decision Process1.500.871, 3, 1, 1
4741Recurrent Back-Projection Generative Adversarial Network for Video Super Resolution1.500.871, 1, 3, 1
4742Ensemble Homomorphic Encrypted Data Classification1.500.873, 1, 1, 1
4743The Use of Open-Source Boards for Data Collection and Machine Learning in Remote Deployments1.500.871, 3, 1, 1
4744Speeding up Policy Optimization with Vanishing Hypothesis and Variable Mini-Batch Size1.500.871, 1, 1, 3
4745URVoice: An Akl-Toussaint/ Graham- Sklansky Approach towards Convex Hull Computation for Sign Language Interpretation1.500.871, 3, 1, 1
4746Generalization Mechanics in Deep Learning1.500.871, 3, 1, 1
4747Fusion of Deep Transfer Learning with Mixed convolution network1.500.871, 3, 1, 1
4748Evaluating Weakly Supervised Object Localization Methods Right? A Study on Heatmap-based XAI and Neural Backed Decision Tree1.500.871, 1, 1, 3
4749Quantum reinforcement learning1.000.001, 1, 1, 1
4750Manipulating Multi-agent Navigation Task via Emergent Communications1.000.001, 1, 1
4751A comparison of dataset distillation and active learning in text classification1.000.001, 1, 1
4752Activation Function: Absolute Function,One Function Behaves more Individualized1.000.001, 1, 1, 1
4753Rotation Invariant Quantization for Model Compression1.000.001, 1, 1