| Efficient compilation of expressive problem space specifications to neural network solvers | Jan 24, 2024 | | CodeCode Available | 2 |
| ChatterBox: Multi-round Multimodal Referring and Grounding | Jan 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ADMap: Anti-disturbance framework for reconstructing online vectorized HD map | Jan 24, 2024 | Autonomous Driving | CodeCode Available | 2 |
| WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing | Jan 24, 2024 | Activity Recognition | CodeCode Available | 2 |
| Can AI Assistants Know What They Don't Know? | Jan 24, 2024 | MathOpen-Domain Question Answering | CodeCode Available | 2 |
| Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Jan 24, 2024 | Image SegmentationIn-Context Learning | CodeCode Available | 2 |
| Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models | Jan 23, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 2 |
| Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Jan 23, 2024 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| Coverage Axis++: Efficient Inner Point Selection for 3D Shape Skeletonization | Jan 23, 2024 | | CodeCode Available | 2 |
| Neural deformation fields for template-based reconstruction of cortical surfaces from MRI | Jan 23, 2024 | | CodeCode Available | 2 |
| DiffMoog: a Differentiable Modular Synthesizer for Sound Matching | Jan 23, 2024 | Audio Synthesis | CodeCode Available | 2 |
| PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation | Jan 23, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| SGTR+: End-to-end Scene Graph Generation with Transformer | Jan 23, 2024 | graph constructionGraph Generation | CodeCode Available | 2 |
| SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI | Jan 23, 2024 | MRI segmentationSegmentation | CodeCode Available | 2 |
| Shift-ConvNets: Small Convolutional Kernel with Large Kernel Effects | Jan 23, 2024 | | CodeCode Available | 2 |
| In-Context Language Learning: Architectures and Algorithms | Jan 23, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation with 3D Gaussian Splatting | Jan 23, 2024 | | CodeCode Available | 2 |
| ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation | Jan 23, 2024 | Anomaly LocalizationAnomaly Segmentation | CodeCode Available | 2 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies | Jan 23, 2024 | Autonomous Driving | CodeCode Available | 2 |
| Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Jan 22, 2024 | Document Layout AnalysisDocument Summarization | CodeCode Available | 2 |
| CloSe: A 3D Clothing Segmentation Dataset and Model | Jan 22, 2024 | Continual Learningmodel | CodeCode Available | 2 |
| PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety | Jan 22, 2024 | | CodeCode Available | 2 |
| Graph Condensation: A Survey | Jan 22, 2024 | FairnessGraph Generation | CodeCode Available | 2 |
| SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese | Jan 22, 2024 | DiversityGSM8K | CodeCode Available | 2 |
| Detecting Multimedia Generated by Large AI Models: A Survey | Jan 22, 2024 | Survey | CodeCode Available | 2 |
| EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting | Jan 21, 2024 | 3D Reconstruction | CodeCode Available | 2 |
| General Flow as Foundation Affordance for Scalable Robot Learning | Jan 21, 2024 | Prediction | CodeCode Available | 2 |
| With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Jan 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ColorVideoVDP: A visual difference predictor for image, video and display distortions | Jan 21, 2024 | Video Compression | CodeCode Available | 2 |
| SEBERTNets: Sequence Enhanced BERT Networks for Event Entity Extraction Tasks Oriented to the Finance Field | Jan 21, 2024 | Asset ManagementEvent Extraction | CodeCode Available | 2 |
| STICKERCONV: Generating Multimodal Empathetic Responses from Scratch | Jan 20, 2024 | 2kEmpathetic Response Generation | CodeCode Available | 2 |
| Make-A-Shape: a Ten-Million-scale 3D Shape Model | Jan 20, 2024 | | CodeCode Available | 2 |
| BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models | Jan 20, 2024 | Backdoor Attack | CodeCode Available | 2 |
| PartIR: Composing SPMD Partitioning Strategies for Machine Learning | Jan 20, 2024 | | CodeCode Available | 2 |
| Pixel-Wise Recognition for Holistic Surgical Scene Understanding | Jan 20, 2024 | Scene UnderstandingSegmentation | CodeCode Available | 2 |
| CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents | Jan 19, 2024 | Decision Making | CodeCode Available | 2 |
| Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Jan 19, 2024 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| Symbol as Points: Panoptic Symbol Spotting via Point-based Representation | Jan 19, 2024 | Point Cloud SegmentationVector Graphics | CodeCode Available | 2 |
| Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms | Jan 19, 2024 | Data AugmentationDiversity | CodeCode Available | 2 |
| Exploring Color Invariance through Image-Level Ensemble Learning | Jan 19, 2024 | Data AugmentationEnsemble Learning | CodeCode Available | 2 |
| Equivariant Graph Neural Operator for Modeling 3D Dynamics | Jan 19, 2024 | Operator learning | CodeCode Available | 2 |
| LangBridge: Multilingual Reasoning Without Multilingual Supervision | Jan 19, 2024 | Code CompletionLogical Reasoning | CodeCode Available | 2 |
| MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Removal then Selection: A Coarse-to-Fine Fusion Perspective for RGB-Infrared Object Detection | Jan 19, 2024 | Multispectral Object DetectionObject | CodeCode Available | 2 |
| Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Jan 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| ZnTrack -- Data as Code | Jan 19, 2024 | Management | CodeCode Available | 2 |
| A Survey on Hardware Accelerators for Large Language Models | Jan 18, 2024 | Survey | CodeCode Available | 2 |
| Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap | Jan 18, 2024 | Code GenerationEvolutionary Algorithms | CodeCode Available | 2 |
| A Survey on Learning from Graphs with Heterophily: Recent Advances and Future Directions | Jan 18, 2024 | Graph LearningSurvey | CodeCode Available | 2 |