| GlocalCLIP: Object-agnostic Global-Local Prompt Learning for Zero-shot Anomaly Detection | Nov 9, 2024 | Anomaly DetectionContrastive Learning | CodeCode Available | 1 |
| LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Nov 9, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Training objective drives the consistency of representational similarity across datasets | Nov 8, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Inversion-based Latent Bayesian Optimization | Nov 8, 2024 | Bayesian OptimizationDecoder | CodeCode Available | 1 |
| SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query Benchmark | Nov 8, 2024 | In-Context Learning | CodeCode Available | 1 |
| LLMs as Method Actors: A Model for Prompt Engineering and Architecture | Nov 8, 2024 | Prompt Engineering | CodeCode Available | 1 |
| Aioli: A Unified Optimization Framework for Language Model Data Mixing | Nov 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Aligning Large Language Models and Geometric Deep Models for Protein Representation | Nov 8, 2024 | | CodeCode Available | 1 |
| From Transparent to Opaque: Rethinking Neural Implicit Surfaces with α-NeuS | Nov 8, 2024 | 3D Shape ReconstructionTransparent objects | CodeCode Available | 1 |
| MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization | Nov 8, 2024 | Quantization | CodeCode Available | 1 |
| HeartBERT: A Self-Supervised ECG Embedding Model for Efficient and Effective Medical Signal Analysis | Nov 8, 2024 | Heartbeat ClassificationSelf-Supervised Learning | CodeCode Available | 1 |
| BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential Equations | Nov 8, 2024 | Bayesian InferenceDecision Making | CodeCode Available | 1 |
| Learning the rules of peptide self-assembly through data mining with large language models | Nov 8, 2024 | Large Language ModelLiterature Mining | CodeCode Available | 1 |
| Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition | Nov 8, 2024 | Action ClassificationActivity Recognition | CodeCode Available | 1 |
| Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths | Nov 8, 2024 | Information RetrievalReranking | CodeCode Available | 1 |
| Tell What You Hear From What You See -- Video to Audio Generation Through Text | Nov 8, 2024 | Audio captioningAudio Generation | CodeCode Available | 1 |
| CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation | Nov 7, 2024 | Depth Completion | CodeCode Available | 1 |
| FineTuneBench: How well do commercial fine-tuning APIs infuse knowledge into LLMs? | Nov 7, 2024 | | CodeCode Available | 1 |
| NeuroFly: A framework for whole-brain single neuron reconstruction | Nov 7, 2024 | | CodeCode Available | 1 |
| Generating Highly Designable Proteins with Geometric Algebra Flow Matching | Nov 7, 2024 | Diversity | CodeCode Available | 1 |
| Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis | Nov 7, 2024 | Anomaly DetectionClassification | CodeCode Available | 1 |
| Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers | Nov 7, 2024 | Knowledge DistillationRetrieval | CodeCode Available | 1 |
| OneProt: Towards Multi-Modal Protein Foundation Models | Nov 7, 2024 | Drug DiscoveryRetrieval | CodeCode Available | 1 |
| Enabling LLM Knowledge Analysis via Extensive Materialization | Nov 7, 2024 | Knowledge Base ConstructionLarge Language Model | CodeCode Available | 1 |
| Image Understanding Makes for A Good Tokenizer for Image Generation | Nov 7, 2024 | Image Generation | CodeCode Available | 1 |
| Cross- and Intra-image Prototypical Learning for Multi-label Disease Diagnosis and Interpretation | Nov 7, 2024 | Interpretable Machine LearningMulti-Label Classification | CodeCode Available | 1 |
| IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Nov 7, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion | Nov 7, 2024 | Data AugmentationImitation Learning | CodeCode Available | 1 |
| Variational Low-Rank Adaptation Using IVON | Nov 7, 2024 | | CodeCode Available | 1 |
| The State and Fate of Summarization Datasets | Nov 7, 2024 | | CodeCode Available | 1 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Nov 7, 2024 | automatic-speech-translationSynthetic Data Generation | CodeCode Available | 1 |
| Distributed Attack-Resilient Platooning Against False Data Injection | Nov 7, 2024 | | CodeCode Available | 1 |
| ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset | Nov 7, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoMLHyperparameter Optimization | CodeCode Available | 1 |
| wav2sleep: A Unified Multi-Modal Approach to Sleep Stage Classification from Physiological Signals | Nov 7, 2024 | Transfer Learning | CodeCode Available | 1 |
| The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities | Nov 7, 2024 | | CodeCode Available | 1 |
| Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning | Nov 7, 2024 | Decision MakingFairness | CodeCode Available | 1 |
| DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Energy-based physics-informed neural network for frictionless contact problems under large deformation | Nov 6, 2024 | Computational EfficiencyContact mechanics | CodeCode Available | 1 |
| MEG: Medical Knowledge-Augmented Large Language Models for Question Answering | Nov 6, 2024 | Knowledge Graph EmbeddingsMultiple-choice | CodeCode Available | 1 |
| Learning Generalizable Policy for Obstacle-Aware Autonomous Drone Racing | Nov 6, 2024 | Deep Reinforcement LearningDrone navigation | CodeCode Available | 1 |
| Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination | Nov 6, 2024 | | CodeCode Available | 1 |
| Reconsidering the Performance of GAE in Link Prediction | Nov 6, 2024 | Computational EfficiencyLink Prediction | CodeCode Available | 1 |
| PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing | Nov 6, 2024 | DenoisingHuman Animation | CodeCode Available | 1 |
| Beyond Model Adaptation at Test Time: A Survey | Nov 6, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 1 |
| Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models | Nov 6, 2024 | Hate Speech DetectionNavigate | CodeCode Available | 1 |
| The Recurrent Sticky Hierarchical Dirichlet Process Hidden Markov Model | Nov 6, 2024 | | CodeCode Available | 1 |
| Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences | Nov 6, 2024 | Drug DiscoveryIn-Context Learning | CodeCode Available | 1 |
| Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models | Nov 6, 2024 | | CodeCode Available | 1 |
| Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | Nov 6, 2024 | Diversity | CodeCode Available | 1 |