SOTAVerified

8k

Papers

Showing 125 of 202 papers

TitleStatusHype
NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context?Code9
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language ModelsCode5
Learning to (Learn at Test Time): RNNs with Expressive Hidden StatesCode5
StarCoder: may the source be with you!Code5
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context MultitasksCode5
KBLaM: Knowledge Base augmented Language ModelCode5
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid FrameworkCode4
LettuceDetect: A Hallucination Detection Framework for RAG ApplicationsCode4
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers UpCode3
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter ModelCode3
CAMixerSR: Only Details Need More "Attention"Code3
LongRoPE: Extending LLM Context Window Beyond 2 Million TokensCode3
Self-Supervised Visual Preference AlignmentCode2
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and ThoroughlyCode2
SoccerTrack: A Dataset and Tracking Algorithm for Soccer With Fish-Eye and Drone VideosCode2
Odd-One-Out: Anomaly Detection by Comparing with NeighborsCode2
AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three WeeksCode2
Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging casesCode2
Spacetime Gaussian Feature Splatting for Real-Time Dynamic View SynthesisCode2
LongEmbed: Extending Embedding Models for Long Context RetrievalCode2
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language ModelsCode2
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language ModelCode2
Hyena Hierarchy: Towards Larger Convolutional Language ModelsCode2
GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?Code2
Hungry Hungry Hippos: Towards Language Modeling with State Space ModelsCode2
Show:102550
← PrevPage 1 of 9Next →

No leaderboard results yet.