SOTAVerified

8k

Papers

Showing 76100 of 202 papers

TitleStatusHype
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities0
NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context?Code9
Learning to (Learn at Test Time): RNNs with Expressive Hidden StatesCode5
Let the Code LLM Edit Itself When You Edit the Code0
Odd-One-Out: Anomaly Detection by Comparing with NeighborsCode2
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics0
GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language ModelsCode0
Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging casesCode2
Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM0
Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture0
Cutting Through the Noise: Boosting LLM Performance on Math Word ProblemsCode0
Dataset Decomposition: Faster LLM Training with Variable Sequence Length CurriculumCode1
DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical DocumentsCode0
Extending Llama-3's Context Ten-Fold OvernightCode0
LongEmbed: Extending Embedding Models for Long Context RetrievalCode2
Self-Supervised Visual Preference AlignmentCode2
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion0
CodeShell Technical Report0
Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs0
Fast Kernel Scene FlowCode1
Machine Translation in the Covid domain: an English-Irish case study for LoResMT 20210
CAMixerSR: Only Details Need More "Attention"Code3
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical ReasoningCode0
Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles0
Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.