SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 10701 to 10750 of 661,570 papers

Title | Status | Hype
TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images | Code | 2
Free-form language-based robotic reasoning and grasping | Code | 2
Assessment of Reinforcement Learning for Macro Placement | Code | 2
SAD: Segment Any RGBD | Code | 2
Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising | Code | 2
VOS: Learning What You Don't Know by Virtual Outlier Synthesis | Code | 2
A Survey of Reasoning with Foundation Models | Code | 2
CompletionFormer: Depth Completion with Convolutions and Vision Transformers | Code | 2
Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios? | Code | 2
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments | Code | 2
Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation | Code | 2
A Physics-informed Diffusion Model for High-fidelity Flow Field Reconstruction | Code | 2
A-Bench: Are LMMs Masters at Evaluating AI-generated Images? | Code | 2
Inserting Anybody in Diffusion Models via Celeb Basis | Code | 2
Sparse4D v3: Advancing End-to-End 3D Detection and Tracking | Code | 2
GPU Performance Portability needs Autotuning | Code | 2
Bicubic++: Slim, Slimmer, Slimmest -- Designing an Industry-Grade Super-Resolution Network | Code | 2
LimSim: A Long-term Interactive Multi-scenario Traffic Simulator | Code | 2
LEDNet: Joint Low-light Enhancement and Deblurring in the Dark | Code | 2
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Code | 2
Securing AI Agents with Information-Flow Control | Code | 2
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | Code | 2
Automated Bioinformatics Analysis via AutoBA | Code | 2
All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Code | 2
DataDream: Few-shot Guided Dataset Generation | Code | 2
UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems | Code | 2
Partial-to-Partial Shape Matching with Geometric Consistency | Code | 2
Learning to Reason for Long-Form Story Generation | Code | 2
Post-Training Sparse Attention with Double Sparsity | Code | 2
Group Robust Preference Optimization in Reward-free RLHF | Code | 2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Code | 2
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions | Code | 2
Neighborhood Attention Transformer | Code | 2
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs | Code | 2
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Code | 2
Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review | Code | 2
mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval | Code | 2
GRID: A Platform for General Robot Intelligence Development | Code | 2
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling | Code | 2
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Code | 2
Deep Portrait Quality Assessment. A NTIRE 2024 Challenge Survey | Code | 2
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization | Code | 2
LiDAR Snowfall Simulation for Robust 3D Object Detection | Code | 2
Accelerated Hierarchical Density Clustering | Code | 2
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Code | 2
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes | Code | 2
Compressing Large Language Models using Low Rank and Low Precision Decomposition | Code | 2
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction | Code | 2
Towards Robust Multimodal Sentiment Analysis with Incomplete Data | Code | 2
Repo2Run: Automated Building Executable Environment for Code Repository at Scale | Code | 2
Page 215 of 13232