SOTAVerified|Agents Browse Leaderboard About Blog

8k

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 202 papers

Title	Date	Tasks	Status	Hype
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities	Jul 19, 2024	4k8k	—Unverified	0
NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context?	Jul 16, 2024	4k8k	CodeCode Available	9
Learning to (Learn at Test Time): RNNs with Expressive Hidden States	Jul 5, 2024	16k8k	CodeCode Available	5
Let the Code LLM Edit Itself When You Edit the Code	Jul 3, 2024	8kCode Generation	—Unverified	0
Odd-One-Out: Anomaly Detection by Comparing with Neighbors	Jun 28, 2024	8kAnomaly Detection	CodeCode Available	2
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics	Jun 20, 2024	8kDescriptive	—Unverified	0
GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models	Jun 20, 2024	8k	CodeCode Available	0
Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases	Jun 19, 2024	8kHallucination	CodeCode Available	2
Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM	Jun 16, 2024	8kOpinion Summarization	—Unverified	0
Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture	Jun 1, 2024	8kFace Reconstruction	—Unverified	0
Cutting Through the Noise: Boosting LLM Performance on Math Word Problems	May 30, 2024	8kMath	CodeCode Available	0
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum	May 21, 2024	2k8k	CodeCode Available	1
DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents	Apr 30, 2024	8kDiversity	CodeCode Available	0
Extending Llama-3's Context Ten-Fold Overnight	Apr 30, 2024	8kGPU	—Unverified	0
LongEmbed: Extending Embedding Models for Long Context Retrieval	Apr 18, 2024	4k8k	CodeCode Available	2
Self-Supervised Visual Preference Alignment	Apr 16, 2024	8kMM-Vet	CodeCode Available	2
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion	Apr 6, 2024	8kScene Generation	—Unverified	0
CodeShell Technical Report	Mar 23, 2024	8kHumanEval	—Unverified	0
Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs	Mar 13, 2024	8kAnswer Generation	—Unverified	0
Fast Kernel Scene Flow	Mar 9, 2024	8kAutonomous Driving	CodeCode Available	1
Machine Translation in the Covid domain: an English-Irish case study for LoResMT 2021	Mar 2, 2024	8kDomain Adaptation	—Unverified	0
CAMixerSR: Only Details Need More "Attention"	Feb 29, 2024	2k8k	CodeCode Available	3
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning	Feb 27, 2024	8kLanguage Modeling	CodeCode Available	0
Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles	Feb 27, 2024	8kArticles	—Unverified	0
Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays	Feb 25, 2024	16k8k	—Unverified	0

Show:10 25 50

← PrevPage 4 of 9Next →

No leaderboard results yet.