| MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery | Jul 10, 2024 | Vulnerability Detection | Code Available | 2 |
| Exploiting Scale-Variant Attention for Segmenting Small Medical Objects | Jul 10, 2024 | Cell Segmentation, MRI segmentation | Code Available | 2 |
| Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift | Jul 10, 2024 | Change Detection, Disaster Response | Code Available | 2 |
| IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Jul 10, 2024 | Decoder, Image Segmentation | Code Available | 2 |
| ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting | Jul 10, 2024 | Time Series, Time Series Analysis | Code Available | 2 |
| Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications | Jul 9, 2024 | Deep Reinforcement Learning | Code Available | 2 |
| Vision language models are blind: Failing to translate detailed visual features into words | Jul 9, 2024 | | Code Available | 2 |
| RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models | Jul 9, 2024 | Decoder, Scheduling | Code Available | 2 |
| ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Jul 9, 2024 | Image Generation, Text to Image Generation | Code Available | 2 |
| Decomposition Betters Tracking Everything Everywhere | Jul 9, 2024 | Motion Estimation, Point Tracking | Code Available | 2 |
| Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Jul 9, 2024 | Chart Understanding, Language Modeling | Code Available | 2 |
| FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Jul 9, 2024 | Decision Making | Code Available | 2 |
| Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems | Jul 9, 2024 | | Code Available | 2 |
| Hyperion - A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | Jul 9, 2024 | Simultaneous Localization and Mapping | Code Available | 2 |
| FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation | Jul 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Jul 9, 2024 | Benchmarking, Conditional Image Generation | Code Available | 2 |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Jul 9, 2024 | Articles, Hallucination | Code Available | 2 |
| LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Jul 9, 2024 | 3D Reconstruction, Autonomous Navigation | Code Available | 2 |
| ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement | Jul 9, 2024 | Attribute, Disentanglement | Code Available | 2 |
| Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention | Jul 9, 2024 | Autonomous Driving, Decoder | Code Available | 2 |
| Exploring the Causality of End-to-End Autonomous Driving | Jul 9, 2024 | Autonomous Driving, counterfactual | Code Available | 2 |
| Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning | Jul 9, 2024 | Image Generation, Sentence | Code Available | 2 |
| Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis | Jul 9, 2024 | | Code Available | 2 |
| BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Jul 8, 2024 | Autonomous Driving, Decoder | Code Available | 2 |
| WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering | Jul 8, 2024 | Diagnostic, Generative Visual Question Answering | Code Available | 2 |