SOTAVerified

16k

Papers

Showing 51–100 of 146 papers

Title | Status | Hype
DeepDarts: Modeling Keypoints as Objects for Automatic Scorekeeping in Darts using a Single Camera | Code | 1
The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks | Code | 1
Long Range Arena: A Benchmark for Efficient Transformers | Code | 1
MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale | Code | 1
Denial-of-Service Poisoning Attacks against Large Language Models | Code | 1
Detecting and Preventing Hallucinations in Large Vision Language Models | Code | 1
Robustness Evaluation of Entity Disambiguation Using Prior Probes: the Case of Entity Overshadowing | | 0
10 Years of the PCG workshop: Past and Future Trends | | 0
Acquiring Annotated Data with Cross-lingual Explicitation for Implicit Discourse Relation Classification | | 0
A Multi-Task Network for Joint Specular Highlight Detection and Removal | | 0
An AI-Assisted Skincare Routine Recommendation System in XR | | 0
Author Profiling for Hate Speech Detection | | 0
Beyond Accuracy: Statistical Measures and Benchmark for Evaluation of Representation from Self-Supervised Learning | | 0
Bimanual Dexterity for Complex Tasks | | 0
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | | 0
AI-assisted Code Authoring at Scale: Fine-tuning, deploying, and mixed methods evaluation | | 0
COLING 2022 Shared Task: LED Finteuning and Recursive Summary Generation for Automatic Summarization of Chapters from Novels | | 0
Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction | | 0
Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension | | 0
Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning | | 0
Detours for Navigating Instructional Videos | | 0
Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays | | 0
End-to-end argumentation knowledge graph construction | | 0
EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts | | 0
Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | | 0
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning | | 0
Fast and Full-Resolution Light Field Deblurring using a Deep Neural Network | | 0
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models | | 0
Human Evaluation of English--Irish Transformer-Based NMT | | 0
Improved prompting and process for writing user personas with LLMs, using qualitative interviews: Capturing behaviour and personality traits of users | | 0
Improved two-stage hate speech classification for twitter based on Deep Neural Networks | | 0
Inferring Pluggable Types with Machine Learning | | 0
Introducing RezoJDM16k: a French KnowledgeGraph DataSet for Link Prediction | | 0
Joint Summarization of Large-scale Collections of Web Images and Videos for Storyline Reconstruction | | 0
Large Batch Training of Convolutional Networks with Layer-wise Adaptive Rate Scaling | | 0
Leveraging Summary Guidance on Medical Report Summarization | | 0
Long Context Alignment with Short Instructions and Synthesized Positions | | 0
LongIns: A Challenging Long-context Instruction-based Exam for LLMs | | 0
Long Range Arena : A Benchmark for Efficient Transformers | | 0
Multilingual Visual Sentiment Concept Matching | | 0
MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences | | 0
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims | | 0
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU | | 0
Parallel Sequence Modeling via Generalized Spatial Propagation Network | | 0
Piecing It All Together: Verifying Multi-Hop Multimodal Claims | | 0
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models | | 0
Retrieval meets Long Context Large Language Models | | 0
Robustness Evaluation of Entity Disambiguation Using Prior Probes:the Case of Entity Overshadowing | | 0
0/1 Deep Neural Networks via Block Coordinate Descent | | 0
Scaling Distributed Training with Adaptive Summation | | 0

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Suprime21'"1Unverified