| General Frameworks for Conditional Two-Sample Testing | Oct 22, 2024 | Domain AdaptationFairness | CodeCode Available | 0 |
| Building Conformal Prediction Intervals with Approximate Message Passing | Oct 21, 2024 | BenchmarkingConformal Prediction | CodeCode Available | 0 |
| Reward Maximization for Pure Exploration: Minimax Optimal Good Arm Identification for Nonparametric Multi-Armed Bandits | Oct 21, 2024 | Multi-Armed Banditsvalid | —Unverified | 0 |
| Distribution Learning with Valid Outputs Beyond the Worst-Case | Oct 21, 2024 | valid | —Unverified | 0 |
| Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer? | Oct 20, 2024 | Question Answeringvalid | CodeCode Available | 0 |
| Asymptotic Time-Uniform Inference for Parameters in Averaged Stochastic Approximation | Oct 19, 2024 | valid | —Unverified | 0 |
| SylloBio-NLI: Evaluating Large Language Models on Biomedical Syllogistic Reasoning | Oct 18, 2024 | Natural Language Inferencescientific discovery | —Unverified | 0 |
| You Shall Know a Tool by the Traces it Leaves: The Predictability of Sentiment Analysis Tools | Oct 18, 2024 | Sentiment AnalysisSentiment Classification | —Unverified | 0 |
| Critical Questions Generation: Motivation and Challenges | Oct 18, 2024 | Misinformationvalid | CodeCode Available | 0 |
| Byzantine-Resilient Output Optimization of Multiagent via Self-Triggered Hybrid Detection Approach | Oct 17, 2024 | Distributed Optimizationvalid | —Unverified | 0 |
| GraphSCENE: On-Demand Critical Scenario Generation for Autonomous Vehicles in Simulation | Oct 17, 2024 | Autonomous VehiclesGraph Neural Network | —Unverified | 0 |
| Generative Conformal Prediction with Vectorized Non-Conformity Scores | Oct 17, 2024 | Autonomous DrivingConformal Prediction | —Unverified | 0 |
| Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Oct 16, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Unsupervised Training of Diffusion Models for Feasible Solution Generation in Neural Combinatorial Optimization | Oct 15, 2024 | Combinatorial OptimizationScheduling | —Unverified | 0 |
| Aggregation Trees | Oct 15, 2024 | valid | —Unverified | 0 |
| DeltaDock: A Unified Framework for Accurate, Efficient, and Physically Reliable Molecular Docking | Oct 15, 2024 | Blind DockingDrug Design | CodeCode Available | 1 |
| 3D-Prover: Diversity Driven Theorem Proving With Determinantal Point Processes | Oct 14, 2024 | Automated Theorem ProvingDiversity | —Unverified | 0 |
| FormalAlign: Automated Alignment Evaluation for Autoformalization | Oct 14, 2024 | Mathematical Proofsvalid | CodeCode Available | 1 |
| Single Ground Truth Is Not Enough: Add Linguistic Variability to Aspect-based Sentiment Analysis Evaluation | Oct 13, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | —Unverified | 0 |
| Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation | Oct 12, 2024 | InformativenessRetrieval | CodeCode Available | 1 |
| : An Instruction-tuned model for English Language Proficiency Assessments | Oct 12, 2024 | valid | —Unverified | 0 |
| Natural Language Counterfactual Explanations for Graphs Using Large Language Models | Oct 11, 2024 | counterfactualExplainable artificial intelligence | CodeCode Available | 0 |
| Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos | Oct 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SimpleStrat: Diversifying Language Model Generation with Stratification | Oct 11, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Learning Representations of Instruments for Partial Identification of Treatment Effects | Oct 11, 2024 | Causal InferenceDecision Making | CodeCode Available | 0 |