| FormalAlign: Automated Alignment Evaluation for Autoformalization | Oct 14, 2024 | Mathematical Proofsvalid | CodeCode Available | 1 |
| Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation | Oct 12, 2024 | InformativenessRetrieval | CodeCode Available | 1 |
| End-to-End Conformal Calibration for Optimization Under Uncertainty | Sep 30, 2024 | Conformal PredictionDecision Making | CodeCode Available | 1 |
| Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent | Sep 17, 2024 | GSM8KQuestion Answering | CodeCode Available | 1 |
| SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps | Sep 16, 2024 | DenoisingDepth Completion | CodeCode Available | 1 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Unconfident LLM Annotations Be Used for Confident Conclusions? | Aug 27, 2024 | valid | CodeCode Available | 1 |
| AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and Results | Aug 21, 2024 | Image Manipulationvalid | CodeCode Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Uncertainty Quantification of Surrogate Models using Conformal Prediction | Aug 19, 2024 | Conformal PredictionPrediction | CodeCode Available | 1 |
| Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative Driving | Aug 1, 2024 | Conformal PredictionData Integration | CodeCode Available | 1 |
| Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees | Jul 12, 2024 | Graph GenerationProperty Prediction | CodeCode Available | 1 |
| Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search | Jul 8, 2024 | Retrosynthesisvalid | CodeCode Available | 1 |
| Benchmarking structure-based three-dimensional molecular generative models using GenBench3D: ligand conformation quality matters | Jul 5, 2024 | Benchmarkingvalid | CodeCode Available | 1 |
| Combining Neural Networks and Symbolic Regression for Analytical Lyapunov Function Discovery | Jun 21, 2024 | regressionSymbolic Regression | CodeCode Available | 1 |
| SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Collaboration | Jun 19, 2024 | SQL ParsingText to SQL | CodeCode Available | 1 |
| Ask-before-Plan: Proactive Language Agents for Real-World Planning | Jun 18, 2024 | Decision Makingvalid | CodeCode Available | 1 |
| Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions | Jun 14, 2024 | Few-Shot Semantic SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| Conformal Load Prediction with Transductive Graph Autoencoders | Jun 12, 2024 | Conformal PredictionGraph Neural Network | CodeCode Available | 1 |
| Comparing Experimental and Nonexperimental Methods: What Lessons Have We Learned Four Decades After LaLonde (1986)? | Jun 2, 2024 | valid | CodeCode Available | 1 |
| Latent Fingerprint Matching via Dense Minutia Descriptor | May 2, 2024 | valid | CodeCode Available | 1 |
| Forcing Diffuse Distributions out of Language Models | Apr 16, 2024 | Dataset GenerationDiversity | CodeCode Available | 1 |
| NTIRE 2024 Challenge on Image Super-Resolution (4): Methods and Results | Apr 15, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 1 |
| Resolve Domain Conflicts for Generalizable Remote Physiological Measurement | Apr 11, 2024 | AttributeEmotion Recognition | CodeCode Available | 1 |