SOTAVerified

Position

Papers

Showing 151200 of 3684 papers

TitleStatusHype
Which One? Leveraging Context Between Objects and Multiple Views for Language GroundingCode1
MultiSPANS: A Multi-range Spatial-Temporal Transformer Network for Traffic Forecast via Structural Entropy OptimizationCode1
Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and AudioCode1
Towards A Holistic Landscape of Situated Theory of Mind in Large Language ModelsCode1
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each BenchmarkCode1
CLEX: Continuous Length Extrapolation for Large Language ModelsCode1
Semi-Supervised End-to-End Learning for Integrated Sensing and CommunicationsCode1
Generative Modeling with Phase Stochastic BridgesCode1
Fast, Expressive SE(n) Equivariant Networks through Weight-Sharing in Position-Orientation SpaceCode1
CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window ExtendingCode1
Mutation-based Fault Localization of Deep Neural NetworksCode1
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped PositionsCode1
Mask-Attention-Free Transformer for 3D Instance SegmentationCode1
A lightweight 3D dense facial landmark estimation model from position map dataCode1
Position-Enhanced Visual Instruction Tuning for Multimodal Large Language ModelsCode1
Relighting Neural Radiance Fields with Shadow and Highlight HintsCode1
Instruction Position Matters in Sequence Generation with Large Language ModelsCode1
DALNet: A Rail Detection Network Based on Dynamic Anchor LineCode1
Spatial LibriSpeech: An Augmented Dataset for Spatial Audio LearningCode1
DeSCo: Towards Generalizable and Scalable Deep Subgraph CountingCode1
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual TrackingCode1
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object DetectionCode1
Point Anywhere: Directed Object Estimation from Omnidirectional ImagesCode1
Advancing Beyond Identification: Multi-bit Watermark for Large Language ModelsCode1
Differentiable short-time Fourier transform with respect to the hop lengthCode1
Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for Occluded Facial Expression RecognitionCode1
DSSE: a drone swarm search environmentCode1
2-D SSM: A General Spatial Layer for Visual TransformersCode1
Everybody Compose: Deep Beats To MusicCode1
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent ParticlesCode1
ColdNAS: Search to Modulate for User Cold-Start RecommendationCode1
3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPWCode1
Collect-and-Distribute Transformer for 3D Point Cloud AnalysisCode1
The Impact of Positional Encoding on Length Generalization in TransformersCode1
Large Language Models are not Fair EvaluatorsCode1
Improving Position Encoding of Transformers for Multivariate Time Series ClassificationCode1
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector QuantizationCode1
Toeplitz Neural Network for Sequence ModelingCode1
A Vision Transformer Approach for Efficient Near-Field Irregular SAR Super-ResolutionCode1
Exploiting Inductive Bias in Transformer for Point Cloud Classification and SegmentationCode1
Optimal Robust Network Design: Formulations and Algorithms for Maximizing Algebraic ConnectivityCode1
Towards Flexible Multi-modal Document ModelsCode1
Diffusion Action SegmentationCode1
A Closer Look at Parameter-Efficient Tuning in Diffusion ModelsCode1
Searching for long faint astronomical high energy transients: a data driven approachCode1
Position-Guided Point Cloud Panoptic Segmentation TransformerCode1
Influencer Backdoor Attack on Semantic SegmentationCode1
CAPE: Camera View Position Embedding for Multi-View 3D Object DetectionCode1
Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction ModuleCode1
Deep Momentum Multi-Marginal Schrödinger BridgeCode1
Show:102550
← PrevPage 4 of 74Next →

No leaderboard results yet.