SOTAVerified

2k

Papers

Showing 101150 of 288 papers

TitleStatusHype
Test-Time Training Done Right0
PIIvot: A Lightweight NLP Anonymization Framework for Question-Anchored Tutoring Dialogues0
Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning0
UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning0
ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative AnnotationCode0
Calibrating Translation Decoding with Quality Estimation on LLMsCode0
aiXamine: Simplified LLM Safety and Security0
Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis0
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading0
On Linear Representations and Pretraining Data Frequency in Language Models0
Seedream 3.0 Technical Report0
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration0
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers0
Nonparametric MLE for Gaussian Location Mixtures: Certified Computation and Generic Behavior0
REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities0
Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation0
Stackelberg Game Preference Optimization for Data-Efficient Alignment of Language Models0
Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks0
Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements0
Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning0
Improved Regret in Stochastic Decision-Theoretic Online Learning under Differential Privacy0
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains0
TimeLogic: A Temporal Logic Benchmark for Video QA0
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation0
Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language0
Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces0
Multimodal Preference Data Synthetic Alignment with Reward ModelCode0
AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models0
Block-Based Multi-Scale Image Rescaling0
Do Large Language Models Show Biases in Causal Learning?0
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects0
Lightweight Multiplane Images Network for Real-Time Stereoscopic Conversion from Planar Video0
Phenome-wide causal proteomics enhance systemic lupus erythematosus flare prediction: A study in Asian populations0
Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-ResolutionCode0
Fox-1 Technical Report0
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing0
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks0
Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents0
Integrating Artificial Intelligence Models and Synthetic Image Data for Enhanced Asset Inspection and Defect Identification0
I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow0
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning0
The Nature of NLP: Analyzing Contributions in NLP PapersCode0
Study of Subjective and Objective Quality in Super-Resolution Enhanced Broadcast Images on a Novel SR-IQA Dataset0
Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents0
PecSched: Preemptive and Efficient Cluster Scheduling for LLM Inference0
Clustering with Non-adaptive Subset Queries0
How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes0
TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic FacesCode0
Enhancing Underwater Imaging with 4-D Light Fields: Dataset and MethodCode0
LogParser-LLM: Advancing Efficient Log Parsing with Large Language Models0
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.