SOTAVerified

16k

Papers

Showing 5175 of 146 papers

TitleStatusHype
SMYRF: Efficient Attention using Asymmetric ClusteringCode1
MorphoCluster: Efficient Annotation of Plankton images by ClusteringCode1
Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the SocietyCode1
Classifying the classifier: dissecting the weight space of neural networksCode1
Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue DatasetCode1
Visual Semantic Role LabelingCode1
UniCode^2: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation0
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention RecyclingCode0
How Far Are We from Optimal Reasoning Efficiency?Code0
FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and ItalianCode0
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long SequencesCode0
PSC: Extending Context Window of Large Language Models via Phase Shift CalibrationCode0
Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLMCode0
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning0
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing ApplicationsCode0
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims0
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One SecondCode0
Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation0
EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts0
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification0
Parallel Sequence Modeling via Generalized Spatial Propagation Network0
Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning0
SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs0
MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences0
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese NovelsCode0
Show:102550
← PrevPage 3 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Suprime21'"1Unverified