SOTAVerified

16k

Papers

Showing 2650 of 146 papers

TitleStatusHype
Neural Fourier Modelling: A Highly Compact Approach to Time-Series AnalysisCode1
LongGenBench: Benchmarking Long-Form Generation in Long Context LLMsCode1
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of ImagesCode1
LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone SensorsCode1
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free AttentionCode1
Hydragen: High-Throughput LLM Inference with Shared PrefixesCode1
Analyzing the Effectiveness of Large Language Models on Text-to-SQL SynthesisCode1
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic PapersCode1
Scaling Laws of RoPE-based ExtrapolationCode1
Home Electricity Data Generator (HEDGE): An open-access tool for the generation of electric vehicle, residential demand, and PV generation profilesCode1
Detecting and Preventing Hallucinations in Large Vision Language ModelsCode1
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon TasksCode1
Faster Causal Attention Over Large Sequences Through Sparse Flash AttentionCode1
In-Context Learning with Many Demonstration ExamplesCode1
An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing ConditionsCode1
CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin LesionsCode1
There’s a Time and Place for Reasoning Beyond the ImageCode1
Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality ReductionCode1
There is a Time and Place for Reasoning Beyond the ImageCode1
MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at ScaleCode1
Complex Temporal Question Answering on Knowledge GraphsCode1
DeepDarts: Modeling Keypoints as Objects for Automatic Scorekeeping in Darts using a Single CameraCode1
BNLP: Natural language processing toolkit for Bengali languageCode1
Long Range Arena: A Benchmark for Efficient TransformersCode1
COUGH: A Challenge Dataset and Models for COVID-19 FAQ RetrievalCode1
Show:102550
← PrevPage 2 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Suprime21'"1Unverified