SOTAVerified

Position

Papers

Showing 150 of 3684 papers

TitleStatusHype
Moonshine: Speech Recognition for Live Transcription and Voice CommandsCode9
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
YaRN: Efficient Context Window Extension of Large Language ModelsCode6
Extending Context Window of Large Language Models via Positional InterpolationCode6
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language ModelsCode5
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt CompressionCode5
Cosmos World Foundation Model Platform for Physical AICode5
Desiderata for next generation of ML model servingCode4
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy PredictionCode4
KeyPoint Relative Position Encoding for Face RecognitionCode4
Programming Is Hard -- Or at Least It Used to Be: Educational Opportunities And Challenges of AI Code GenerationCode4
MIGC++: Advanced Multi-Instance Generation Controller for Image SynthesisCode4
PETR: Position Embedding Transformation for Multi-View 3D Object DetectionCode3
PETRv2: A Unified Framework for 3D Perception from Multi-Camera ImagesCode3
Transformers Can Do Arithmetic with the Right EmbeddingsCode3
ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series TransformerCode3
Position: Graph Foundation Models are Already HereCode3
VideoRoPE: What Makes for Good Video Rotary Position Embedding?Code3
Scaling Diffusion Transformers to 16 Billion ParametersCode3
VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image AnalysisCode3
Relation DETR: Exploring Explicit Position Relation Prior for Object DetectionCode3
RoFormer: Enhanced Transformer with Rotary Position EmbeddingCode3
Rotary Position Embedding for Vision TransformerCode3
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context TrainingCode3
Point Transformer V2: Grouped Vector Attention and Partition-based PoolingCode2
PCP-MAE: Learning to Predict Centers for Point Masked AutoencodersCode2
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic SegmentationCode2
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise TrainingCode2
Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional TrainingCode2
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano PerformanceCode2
MPNet: Masked and Permuted Pre-training for Language UnderstandingCode2
OPEN: Object-wise Position Embedding for Multi-view 3D Object DetectionCode2
Mega: Moving Average Equipped Gated AttentionCode2
Lost in the Middle: How Language Models Use Long ContextsCode2
Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading StrategiesCode2
PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud RegistrationCode2
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest TransformerCode2
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical FlowCode2
GLACE: Global Local Accelerated Coordinate EncodingCode2
An Approach for Air Drawing Using Background Subtraction and Contour ExtractionCode2
FLAT: Chinese NER Using Flat-Lattice TransformerCode2
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality LocalizationCode2
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length GeneralizationCode2
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers FasterCode2
An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock ForecastingCode2
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric LearningCode2
Extending LLMs' Context Window with 100 SamplesCode2
FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable LocalizationCode2
LayoutDM: Discrete Diffusion Model for Controllable Layout GenerationCode2
GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian SplatsCode2
Show:102550
← PrevPage 1 of 74Next →

No leaderboard results yet.