SOTAVerified

Position

Papers

Showing 51100 of 3684 papers

TitleStatusHype
PCP-MAE: Learning to Predict Centers for Point Masked AutoencodersCode2
Point Transformer V2: Grouped Vector Attention and Partition-based PoolingCode2
Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View SynthesisCode2
Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading StrategiesCode2
LongEmbed: Extending Embedding Models for Long Context RetrievalCode2
Lost in the Middle: How Language Models Use Long ContextsCode2
GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian SplatsCode2
An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock ForecastingCode2
DeepInteraction: 3D Object Detection via Modality InteractionCode2
Position: Foundation Agents as the Paradigm Shift for Decision MakingCode2
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric LearningCode2
LayoutDM: Discrete Diffusion Model for Controllable Layout GenerationCode2
MPNet: Masked and Permuted Pre-training for Language UnderstandingCode2
FLAT: Chinese NER Using Flat-Lattice TransformerCode2
Extending LLMs' Context Window with 100 SamplesCode2
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers FasterCode2
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length GeneralizationCode2
Detection Transformer with Stable MatchingCode2
FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable LocalizationCode2
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality LocalizationCode2
GLACE: Global Local Accelerated Coordinate EncodingCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
A Length-Extrapolatable TransformerCode2
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical FlowCode2
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic SegmentationCode2
Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional TrainingCode2
Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous DrivingCode2
Attention-Propagation Network for Egocentric Heatmap to 3D Pose LiftingCode1
Deep Domain Confusion: Maximizing for Domain InvarianceCode1
DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness FunctionCode1
DeepBall: Deep Neural-Network Ball DetectorCode1
3D Feature Tracking via Event CameraCode1
Deep Deformable 3D Caricatures with Learned Shape ControlCode1
CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window ExtendingCode1
A Case for Rejection in Low Resource ML DeploymentCode1
DALNet: A Rail Detection Network Based on Dynamic Anchor LineCode1
Asynchronous Trajectory Matching-Based Multimodal Maritime Data Fusion for Vessel Traffic Surveillance in Inland WaterwaysCode1
A Transformer-based Approach for Source Code SummarizationCode1
Audio-Conditioned U-Net for Position Estimation in Full Sheet ImagesCode1
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent ParticlesCode1
Deep Momentum Multi-Marginal Schrödinger BridgeCode1
A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound SimulationCode1
Assigning personality/identity to a chatting machine for coherent conversation generationCode1
ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance SegmentationCode1
ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from VideosCode1
CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale AttentionCode1
Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality SettingsCode1
Arithmetic Transformers Can Length-Generalize in Both Operand Length and CountCode1
Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative ModelsCode1
Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing LearningCode1
Show:102550
← PrevPage 2 of 74Next →

No leaderboard results yet.