| AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation | Apr 2, 2024 | Blind Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| Test-Time Model Adaptation with Only Forward Passes | Apr 2, 2024 | modelTest-time Adaptation | CodeCode Available | 2 |
| Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model | Apr 2, 2024 | Video Generation | CodeCode Available | 2 |
| Generalizable, Fast, and Accurate DeepQSPR with fastprop | Apr 2, 2024 | Molecular Property PredictionProperty Prediction | CodeCode Available | 2 |
| FABLES: Evaluating faithfulness and content selection in book-length summarization | Apr 1, 2024 | Long-Context Understanding | CodeCode Available | 2 |
| Bridging Remote Sensors with Multisensor Geospatial Foundation Models | Apr 1, 2024 | Cloud RemovalDiversity | CodeCode Available | 2 |
| From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models | Apr 1, 2024 | Graph GenerationImage to text | CodeCode Available | 2 |
| HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior | Apr 1, 2024 | | CodeCode Available | 2 |
| Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation | Apr 1, 2024 | Action SegmentationSegmentation | CodeCode Available | 2 |
| Watanabe's expansion: A Solution for the convexity conundrum | Apr 1, 2024 | | CodeCode Available | 2 |
| The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023 | Apr 1, 2024 | MRI Reconstruction | CodeCode Available | 2 |
| ARAGOG: Advanced RAG Output Grading | Apr 1, 2024 | Document EmbeddingLanguage Modeling | CodeCode Available | 2 |
| LLM Attributor: Interactive Visual Attribution for LLM Generation | Apr 1, 2024 | ArticlesAttribute | CodeCode Available | 2 |
| Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects | Apr 1, 2024 | Articulated Object modelling | CodeCode Available | 2 |
| Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation | Apr 1, 2024 | Denoising | CodeCode Available | 2 |
| TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On | Apr 1, 2024 | Virtual Try-on | CodeCode Available | 2 |
| Mapping the Increasing Use of LLMs in Scientific Papers | Apr 1, 2024 | | CodeCode Available | 2 |
| Are large language models superhuman chemists? | Apr 1, 2024 | Benchmarking | CodeCode Available | 2 |
| OpenChemIE: An Information Extraction Toolkit For Chemistry Literature | Apr 1, 2024 | | CodeCode Available | 2 |
| FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching | Apr 1, 2024 | CPUImage Registration | CodeCode Available | 2 |
| PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware Layout Generation | Apr 1, 2024 | Layout DesignLayout Generation | CodeCode Available | 2 |
| Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward | Apr 1, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Getting it Right: Improving Spatial Consistency in Text-to-Image Models | Apr 1, 2024 | Spatial Reasoning | CodeCode Available | 2 |
| Stream of Search (SoS): Learning to Search in Language | Apr 1, 2024 | Language Modelling | CodeCode Available | 2 |
| Guide to k-mer approaches for genomics across the tree of life | Apr 1, 2024 | Diversity | CodeCode Available | 2 |
| T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation | Apr 1, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction | Apr 1, 2024 | DecoderOnline Vectorized HD Map Construction | CodeCode Available | 2 |
| FlexiDreamer: Single Image-to-3D Generation with FlexiCubes | Apr 1, 2024 | 3D GenerationImage to 3D | CodeCode Available | 2 |
| Measuring Style Similarity in Diffusion Models | Apr 1, 2024 | AttributeStyle Detection | CodeCode Available | 2 |
| Scalable 3D Registration via Truncated Entry-wise Absolute Residuals | Apr 1, 2024 | | CodeCode Available | 2 |
| NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields | Apr 1, 2024 | 3D Object DetectionNeRF | CodeCode Available | 2 |
| Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On | Apr 1, 2024 | DenoisingImage Generation | CodeCode Available | 2 |
| CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models | Mar 31, 2024 | DenoisingSpeech Synthesis | CodeCode Available | 2 |
| KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation | Mar 31, 2024 | 3D Human Pose EstimationMonocular 3D Human Pose Estimation | CodeCode Available | 2 |
| Against The Achilles' Heel: A Survey on Red Teaming for Generative Models | Mar 31, 2024 | Red TeamingSurvey | CodeCode Available | 2 |
| Reporting Eye-Tracking Data Quality: Towards a New Standard | Mar 31, 2024 | | CodeCode Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Mar 31, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys) | Mar 31, 2024 | Collaborative FilteringRecommendation Systems | CodeCode Available | 2 |
| Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction | Mar 31, 2024 | Motion GenerationObject | CodeCode Available | 2 |
| How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library | Mar 31, 2024 | Question Answering | CodeCode Available | 2 |
| Transformer based Pluralistic Image Completion with Reduced Information Loss | Mar 31, 2024 | DecoderImage Inpainting | CodeCode Available | 2 |
| Survey of Computerized Adaptive Testing: A Machine Learning Perspective | Mar 31, 2024 | cognitive diagnosisQuestion Selection | CodeCode Available | 2 |
| EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories | Mar 31, 2024 | Code Generation | CodeCode Available | 2 |
| Privacy Backdoors: Stealing Data with Corrupted Pretrained Models | Mar 30, 2024 | | CodeCode Available | 2 |
| InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning | Mar 30, 2024 | Continual Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion | Mar 30, 2024 | DiversityImage Generation | CodeCode Available | 2 |
| Quantformer: from attention to profit with a quantitative transformer trading strategy | Mar 30, 2024 | Sentiment AnalysisTransfer Learning | CodeCode Available | 2 |
| ST-LLM: Large Language Models Are Effective Temporal Learners | Mar 30, 2024 | MVBenchReading Comprehension | CodeCode Available | 2 |
| ProLLM: Protein Chain-of-Thoughts Enhanced LLM for Protein-Protein Interaction Prediction | Mar 30, 2024 | | CodeCode Available | 2 |
| SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects | Mar 29, 2024 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |