SOTAVerified

16k

Papers

Showing 51-75 of 146 papers

Title | Status | Hype
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval | Code | 1
Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis | Code | 1
DeepDarts: Modeling Keypoints as Objects for Automatic Scorekeeping in Darts using a Single Camera | Code | 1
An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions | Code | 1
Denial-of-Service Poisoning Attacks against Large Language Models | Code | 1
SMYRF: Efficient Attention using Asymmetric Clustering | Code | 1
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second | Code | 0
Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | Code | 0
An Empirical Study of Mamba-based Language Models | Code | 0
Author Profiling for Abuse Detection | Code | 0
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer | Code | 0
Calpric: Inclusive and Fine-grain Labeling of Privacy Policies with Crowdsourcing and Active Learning | Code | 0
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels | Code | 0
Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understanding | Code | 0
Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms | Code | 0
Deep Learning for Hate Speech Detection in Tweets | Code | 0
Detecting Offensive Language in Tweets Using Deep Learning | Code | 0
Extending Context Window of Large Language Models from a Distributional Perspective | Code | 0
FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian | Code | 0
FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Code | 0
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge | Code | 0
Hadiths Classification Using a Novel Author-Based Hadith Classification Dataset (ABCD) | Code | 0
How Far Are We from Optimal Reasoning Efficiency? | Code | 0
ImageNet Training in Minutes | Code | 0
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Suprime | 21'" | 1 | | Unverified