| LEDRO: LLM-Enhanced Design Space Reduction and Optimization for Analog Circuits | Nov 19, 2024 | Bayesian OptimizationReinforcement Learning (RL) | CodeCode Available | 1 |
| PR-ENDO: Physically Based Relightable Gaussian Splatting for Endoscopy | Nov 19, 2024 | Novel View Synthesis | CodeCode Available | 1 |
| Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing | Nov 19, 2024 | Question Selection | CodeCode Available | 1 |
| Translating Electrocardiograms to Cardiac Magnetic Resonance Imaging Useful for Cardiac Assessment and Disease Screening: A Multi-Center Study AI for ECG to CMR Translation Study | Nov 19, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models | Nov 19, 2024 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation with Devito | Nov 19, 2024 | Retrieval | CodeCode Available | 1 |
| Evaluating the Prompt Steerability of Large Language Models | Nov 19, 2024 | | CodeCode Available | 1 |
| Stylecodes: Encoding Stylistic Information For Image Generation | Nov 19, 2024 | Image Generation | CodeCode Available | 1 |
| SG-LRA: Self-Generating Automatic Scoliosis Cobb Angle Measurement with Low-Rank Approximation | Nov 19, 2024 | | CodeCode Available | 1 |
| Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussian Splatting | Nov 19, 2024 | Segmentation | CodeCode Available | 1 |
| A Survey of Medical Vision-and-Language Applications and Their Techniques | Nov 19, 2024 | Decision MakingDiagnostic | CodeCode Available | 1 |
| Signformer is all you need: Towards Edge AI for Sign Language | Nov 19, 2024 | AllGloss-free Sign Language Translation | CodeCode Available | 1 |
| UrbanDiT: A Foundation Model for Open-World Urban Spatio-Temporal Learning | Nov 19, 2024 | ImputationMulti-Task Learning | CodeCode Available | 1 |
| libcll: an Extendable Python Toolkit for Complementary-Label Learning | Nov 19, 2024 | Weakly-supervised Learning | CodeCode Available | 1 |
| ProSec: Fortifying Code LLMs with Proactive Security Alignment | Nov 19, 2024 | Code Generation | CodeCode Available | 1 |
| Harnessing Scale and Physics: A Multi-Graph Neural Operator Framework for PDEs on Arbitrary Geometries | Nov 18, 2024 | Management | CodeCode Available | 1 |
| CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational Dataset | Nov 18, 2024 | | CodeCode Available | 1 |
| The Sound of Water: Inferring Physical Properties from Pouring Liquids | Nov 18, 2024 | Physical Attribute Prediction | CodeCode Available | 1 |
| Introducing Milabench: Benchmarking Accelerators for AI | Nov 18, 2024 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| Towards Open-Vocabulary Audio-Visual Event Localization | Nov 18, 2024 | audio-visual event localization | CodeCode Available | 1 |
| Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection | Nov 18, 2024 | Specificity | CodeCode Available | 1 |
| TSPRank: Bridging Pairwise and Listwise Methods with a Bilinear Travelling Salesman Model | Nov 18, 2024 | Information RetrievalLearning-To-Rank | CodeCode Available | 1 |
| TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust Reconstruction | Nov 18, 2024 | 3D Reconstruction | CodeCode Available | 1 |
| CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization | Nov 18, 2024 | backdoor defenseText Generation | CodeCode Available | 1 |
| Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion | Nov 18, 2024 | Brain Tumor ClassificationDiagnostic | CodeCode Available | 1 |
| PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback | Nov 18, 2024 | HumanEvalmbpp | CodeCode Available | 1 |
| Continuous Speculative Decoding for Autoregressive Image Generation | Nov 18, 2024 | DenoisingImage Generation | CodeCode Available | 1 |
| Equivariant spatio-hemispherical networks for diffusion MRI deconvolution | Nov 18, 2024 | Diffusion MRI | CodeCode Available | 1 |
| Improved GUI Grounding via Iterative Narrowing | Nov 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generalizable Person Re-identification via Balancing Alignment and Uniformity | Nov 18, 2024 | Data AugmentationGeneralizable Person Re-identification | CodeCode Available | 1 |
| Unveiling Redundancy in Diffusion Transformers (DiTs): A Systematic Study | Nov 18, 2024 | Scheduling | CodeCode Available | 1 |
| Aligning Few-Step Diffusion Models with Dense Reward Difference Learning | Nov 18, 2024 | Denoising | CodeCode Available | 1 |
| Graph Neural Networks for Quantifying Compatibility Mechanisms in Traditional Chinese Medicine | Nov 18, 2024 | Drug DiscoveryFeature Engineering | CodeCode Available | 1 |
| HistoEncoder: a digital pathology foundation model for prostate cancer | Nov 18, 2024 | | CodeCode Available | 1 |
| TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly Detection | Nov 18, 2024 | Anomaly DetectionLarge Language Model | CodeCode Available | 1 |
| FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training | Nov 18, 2024 | Data AugmentationImage to text | CodeCode Available | 1 |
| Temporal and Spatial Reservoir Ensembling Techniques for Liquid State Machines | Nov 18, 2024 | | CodeCode Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 1 |
| ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | Nov 18, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection | Nov 17, 2024 | Action DetectionOpen Vocabulary Action Detection | CodeCode Available | 1 |
| Constrained Diffusion with Trust Sampling | Nov 17, 2024 | Motion Generation | CodeCode Available | 1 |
| VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? | Nov 17, 2024 | Multiple-choice | CodeCode Available | 1 |
| TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models | Nov 17, 2024 | MVBenchVideo-based Generative Performance Benchmarking | CodeCode Available | 1 |
| SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation | Nov 17, 2024 | Code GenerationDiversity | CodeCode Available | 1 |
| Multilingual Large Language Models: A Systematic Survey | Nov 17, 2024 | Cross-Lingual TransferSurvey | CodeCode Available | 1 |
| PickScan: Object discovery and reconstruction from handheld interactions | Nov 17, 2024 | ObjectObject Discovery | CodeCode Available | 1 |
| SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization | Nov 17, 2024 | In-Context Learning | CodeCode Available | 1 |
| BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation | Nov 17, 2024 | Action Recognitionbackdoor defense | CodeCode Available | 1 |
| AIGS: Generating Science from AI-Powered Automated Falsification | Nov 17, 2024 | scientific discovery | CodeCode Available | 1 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |