| Exploiting sparse structures and synergy designs to advance situational awareness of electrical power grid | Dec 19, 2024 | | CodeCode Available | 1 |
| MRWeb: An Exploration of Generating Multi-Page Resource-Aware Web Code from UI Designs | Dec 19, 2024 | | CodeCode Available | 1 |
| Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion | Dec 19, 2024 | Object | CodeCode Available | 1 |
| Multi-Level Embedding and Alignment Network with Consistency and Invariance Learning for Cross-View Geo-Localization | Dec 19, 2024 | geo-localization | CodeCode Available | 1 |
| STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning | Dec 19, 2024 | Dynamic Time WarpingMulti-Task Learning | CodeCode Available | 1 |
| RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response | Dec 19, 2024 | Denoising | CodeCode Available | 1 |
| DS^2-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis | Dec 19, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization | Dec 19, 2024 | Contrastive LearningDecision Making | CodeCode Available | 1 |
| Large-scale School Mapping using Weakly Supervised Deep Learning for Universal School Connectivity | Dec 19, 2024 | | CodeCode Available | 1 |
| TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network | Dec 19, 2024 | Depth CompletionTransparent objects | CodeCode Available | 1 |
| Cirbo: A New Tool for Boolean Circuit Analysis and Synthesis | Dec 19, 2024 | | CodeCode Available | 1 |
| PhotoHolmes: a Python library for forgery detection in digital images | Dec 19, 2024 | | CodeCode Available | 1 |
| Alignment-Free RGB-T Salient Object Detection: A Large-scale Dataset and Progressive Correlation Network | Dec 19, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| On Verbalized Confidence Scores for LLMs | Dec 19, 2024 | Uncertainty Quantification | CodeCode Available | 1 |
| PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Dec 19, 2024 | LIDAR Semantic SegmentationScene Understanding | CodeCode Available | 1 |
| Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CwA-T: A Channelwise AutoEncoder with Transformer for EEG Abnormality Detection | Dec 19, 2024 | Anomaly DetectionEEG | CodeCode Available | 1 |
| TOMG-Bench: Evaluating LLMs on Text-based Open Molecule Generation | Dec 19, 2024 | BenchmarkingDescription-guided molecule generation | CodeCode Available | 1 |
| Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient Self-Supervised Video Hashing with Selective State Spaces | Dec 19, 2024 | DecoderMamba | CodeCode Available | 1 |
| Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Dec 19, 2024 | Video Generation | CodeCode Available | 1 |
| Automatic Spectral Calibration of Hyperspectral Images:Method, Dataset and Benchmark | Dec 19, 2024 | | CodeCode Available | 1 |
| Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation | Dec 19, 2024 | Graph LearningMultimodal Recommendation | CodeCode Available | 1 |
| MIETT: Multi-Instance Encrypted Traffic Transformer for Encrypted Traffic Classification | Dec 19, 2024 | Contrastive LearningTraffic Classification | CodeCode Available | 1 |
| Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Dec 19, 2024 | Image SegmentationSegmentation | CodeCode Available | 1 |
| Eliciting Causal Abilities in Large Language Models for Reasoning Tasks | Dec 19, 2024 | Causal Inference | CodeCode Available | 1 |
| ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis | Dec 19, 2024 | Data AugmentationSynthetic Data Generation | CodeCode Available | 1 |
| Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition | Dec 19, 2024 | Action RecognitionEmotion Recognition | CodeCode Available | 1 |
| Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification | Dec 19, 2024 | Node ClassificationRepresentation Learning | CodeCode Available | 1 |
| FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis | Dec 19, 2024 | Data VisualizationFault Detection | CodeCode Available | 1 |
| ConfliBERT: A Language Model for Political Conflict | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs | Dec 19, 2024 | Combinatorial OptimizationDiversity | CodeCode Available | 1 |
| Rango: Adaptive Retrieval-Augmented Proving for Automated Software Verification | Dec 18, 2024 | Retrieval | CodeCode Available | 1 |
| MambaLCT: Boosting Tracking via Long-term Context State Space Model | Dec 18, 2024 | MambaObject Tracking | CodeCode Available | 1 |
| MedCoT: Medical Chain of Thought via Hierarchical Expert | Dec 18, 2024 | DiagnosticMedical Visual Question Answering | CodeCode Available | 1 |
| RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment | Dec 18, 2024 | BenchmarkingRAG | CodeCode Available | 1 |
| Generating Long-form Story Using Dynamic Hierarchical Outlining with Memory-Enhancement | Dec 18, 2024 | FormKnowledge Graphs | CodeCode Available | 1 |
| GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images | Dec 18, 2024 | Computational EfficiencyVideo Frame Interpolation | CodeCode Available | 1 |
| jinns: a JAX Library for Physics-Informed Neural Networks | Dec 18, 2024 | | CodeCode Available | 1 |
| Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production | Dec 18, 2024 | AttributeDisentanglement | CodeCode Available | 1 |
| Distribution Shifts at Scale: Out-of-distribution Detection in Earth Observation | Dec 18, 2024 | Earth ObservationOut-of-Distribution Detection | CodeCode Available | 1 |
| 3D Registration in 30 Years: A Survey | Dec 18, 2024 | Point Cloud RegistrationSurvey | CodeCode Available | 1 |
| Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models | Dec 18, 2024 | document understandingImage Captioning | CodeCode Available | 1 |
| Benchmarking and Improving Large Vision-Language Models for Fundamental Visual Graph Understanding and Reasoning | Dec 18, 2024 | BenchmarkingGraph Learning | CodeCode Available | 1 |
| SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation | Dec 18, 2024 | | CodeCode Available | 1 |
| ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals | Dec 18, 2024 | Quantization | CodeCode Available | 1 |
| ConDo: Continual Domain Expansion for Absolute Pose Regression | Dec 18, 2024 | Domain Adaptationregression | CodeCode Available | 1 |
| Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA | Dec 18, 2024 | | CodeCode Available | 1 |
| Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution | Dec 18, 2024 | Bayesian Optimization | CodeCode Available | 1 |
| Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation | Dec 18, 2024 | 3D Human Pose EstimationAutonomous Driving | CodeCode Available | 1 |