| Mixed-curvature decision trees and random forests | Oct 3, 2024 | Link Predictionregression | CodeCode Available | 2 |
| Towards Comprehensive Detection of Chinese Harmful Memes | Oct 3, 2024 | | CodeCode Available | 2 |
| AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML | Oct 3, 2024 | AutoMLCode Generation | CodeCode Available | 2 |
| PnP-Flow: Plug-and-Play Image Restoration with Flow Matching | Oct 3, 2024 | DeblurringDenoising | CodeCode Available | 2 |
| Curvature Diversity-Driven Deformation and Domain Alignment for Point Cloud | Oct 3, 2024 | DiversityDomain Adaptation | CodeCode Available | 2 |
| A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond | Oct 3, 2024 | MambaMedical Image Analysis | CodeCode Available | 2 |
| CodeJudge: Evaluating Code Generation with Large Language Models | Oct 3, 2024 | Code Generation | CodeCode Available | 2 |
| CAnDOIT: Causal Discovery with Observational and Interventional Data from Time-Series | Oct 3, 2024 | Causal DiscoveryTime Series | CodeCode Available | 2 |
| Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations | Oct 3, 2024 | Zero Shot Segmentation | CodeCode Available | 2 |
| NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator | Oct 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations | Oct 3, 2024 | | CodeCode Available | 2 |
| MiraGe: Editable 2D Images using Gaussian Splatting | Oct 2, 2024 | | CodeCode Available | 2 |
| 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Oct 2, 2024 | 3DGS3D Object Detection | CodeCode Available | 2 |
| From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging | Oct 2, 2024 | Auto DebuggingBug fixing | CodeCode Available | 2 |
| Interpretable Contrastive Monte Carlo Tree Search Reasoning | Oct 2, 2024 | | CodeCode Available | 2 |
| Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | Oct 2, 2024 | | CodeCode Available | 2 |
| Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy | Oct 2, 2024 | Motion PlanningRobot Manipulation | CodeCode Available | 2 |
| FlipAttack: Jailbreak LLMs via Flipping | Oct 2, 2024 | | CodeCode Available | 2 |
| Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Oct 2, 2024 | Mixture-of-ExpertsNavigate | CodeCode Available | 2 |
| Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion? | Oct 2, 2024 | Code CompletionCode Generation | CodeCode Available | 2 |
| A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Oct 2, 2024 | Image GenerationQuantization | CodeCode Available | 2 |
| Selective Aggregation for Low-Rank Adaptation in Federated Learning | Oct 2, 2024 | Federated LearningGeneral Knowledge | CodeCode Available | 2 |
| VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment | Oct 2, 2024 | GSM8KMath | CodeCode Available | 2 |
| Peeling Back the Layers: An In-Depth Evaluation of Encoder Architectures in Neural News Recommenders | Oct 2, 2024 | Model SelectionNews Recommendation | CodeCode Available | 2 |
| EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics | Oct 1, 2024 | | CodeCode Available | 2 |
| Generative causal testing to bridge data-driven models and scientific theories in language neuroscience | Oct 1, 2024 | | CodeCode Available | 2 |
| PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Oct 1, 2024 | 3D Anomaly DetectionAnomaly Detection | CodeCode Available | 2 |
| GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving | Oct 1, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Oct 1, 2024 | Emotional Speech SynthesisSpeech Synthesis | CodeCode Available | 2 |
| MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Oct 1, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 2 |
| CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Oct 1, 2024 | 3DGSSimultaneous Localization and Mapping | CodeCode Available | 2 |
| Uncertainty Modelling and Robust Observer Synthesis using the Koopman Operator | Oct 1, 2024 | | CodeCode Available | 2 |
| Recent Advances in Speech Language Models: A Survey | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models | Sep 30, 2024 | BenchmarkingContinual Learning | CodeCode Available | 2 |
| LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation | Sep 30, 2024 | AttributeCollaborative Filtering | CodeCode Available | 2 |
| FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows" | Sep 30, 2024 | counterfactualHallucination | CodeCode Available | 2 |
| PerCo (SD): Open Perceptual Compression | Sep 30, 2024 | AttributeImage Compression | CodeCode Available | 2 |
| Frequency Adaptive Normalization For Non-stationary Time Series Forecasting | Sep 30, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning | Sep 30, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes | Sep 30, 2024 | Objectobject-detection | CodeCode Available | 2 |
| DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Sep 30, 2024 | 3D Object Detection3D Semantic Occupancy Prediction | CodeCode Available | 2 |
| ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities | Sep 30, 2024 | Decision Making | CodeCode Available | 2 |
| KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head | Sep 30, 2024 | | CodeCode Available | 2 |
| RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models | Sep 30, 2024 | Contrastive Learning | CodeCode Available | 2 |
| DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data | Sep 30, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Melody-Guided Music Generation | Sep 30, 2024 | cross-modal alignmentMusic Generation | CodeCode Available | 2 |
| Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation | Sep 30, 2024 | Cross-Modal RetrievalDynamic Time Warping | CodeCode Available | 2 |
| End-to-end Piano Performance-MIDI to Score Conversion with Transformers | Sep 30, 2024 | | CodeCode Available | 2 |
| Towards Robust Multimodal Sentiment Analysis with Incomplete Data | Sep 30, 2024 | Multimodal Sentiment AnalysisSentiment Analysis | CodeCode Available | 2 |