SOTAVerified

Form

Papers

Showing 5175 of 1618 papers

TitleStatusHype
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form PlanningCode1
Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and DetectorCode1
LongGenBench: Benchmarking Long-Form Generation in Long Context LLMsCode1
HERMES: temporal-coHERent long-forM understanding with Episodes and SemanticsCode1
Grounded Multi-Hop VideoQA in Long-Form Egocentric VideosCode1
LiteEFG: An Efficient Python Library for Solving Extensive-form GamesCode1
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation ModelsCode1
Closed-Form Test Functions for Biophysical Sequence Optimization AlgorithmsCode1
Sonnet or Not, Bot? Poetry Evaluation for Large Models and DatasetsCode1
Suri: Multi-constraint Instruction Following for Long-form Text GenerationCode1
VERISCORE: Evaluating the factuality of verifiable claims in long-form text generationCode1
Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QACode1
Encoding and Controlling Global Semantics for Long-form Video Question AnsweringCode1
Toward Conversational Agents with Context and Time Sensitive Long-term MemoryCode1
OLAPH: Improving Factuality in Biomedical Long-form Question AnsweringCode1
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language ModelsCode1
SVD-AE: Simple Autoencoders for Collaborative FilteringCode1
Learning Long-form Video Prior via Generative Pre-TrainingCode1
LOGO: A Long-Form Video Dataset for Group Action Quality AssessmentCode1
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systemsCode1
Linguistic Calibration of Long-Form GenerationsCode1
An Analysis of Linear Time Series Forecasting ModelsCode1
Do Deep Neural Network Solutions Form a Star Domain?Code1
PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language ModelsCode1
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and FeedbackCode1
Show:102550
← PrevPage 3 of 65Next →

No leaderboard results yet.