SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11201-11250 of 177340 papers

| Title | Status | Hype |
| --- | --- | --- |
| Flaming-hot Initiation with Regular Execution Sampling for Large Language Models | Code | 2 |
| aMUSEd: An Open MUSE Reproduction | Code | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Code | 2 |
| Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Code | 2 |
| Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution | Code | 2 |
| Generative Modeling for Mathematical Discovery | Code | 2 |
| CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents | Code | 2 |
| Physics-based Deep Learning | Code | 2 |
| LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Code | 2 |
| NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks | Code | 2 |
| Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Code | 2 |
| Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation | Code | 2 |
| MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Code | 2 |
| Text-to-3D using Gaussian Splatting | Code | 2 |
| UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing | Code | 2 |
| Segment Anything with Multiple Modalities | Code | 2 |
| JudgeBench: A Benchmark for Evaluating LLM-based Judges | Code | 2 |
| Global Features are All You Need for Image Retrieval and Reranking | Code | 2 |
| Self-supervised Anomaly Detection Pretraining Enhances Long-tail ECG Diagnosis | Code | 2 |
| BianQue: Balancing the Questioning and Suggestion Ability of Health LLMs with Multi-turn Health Conversations Polished by ChatGPT | Code | 2 |
| In-Context Retrieval-Augmented Language Models | Code | 2 |
| MIST: A Simple and Scalable End-To-End 3D Medical Imaging Segmentation Framework | Code | 2 |
| Self-Harmonized Chain of Thought | Code | 2 |
| Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Code | 2 |
| CRS-Diff: Controllable Remote Sensing Image Generation with Diffusion Model | Code | 2 |
| MFA-KWS: Effective Keyword Spotting with Multi-head Frame-asynchronous Decoding | Code | 2 |
| TEXTure: Text-Guided Texturing of 3D Shapes | Code | 2 |
| OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Code | 2 |
| LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos | Code | 2 |
| IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Code | 2 |
| MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks | Code | 2 |
| MMA-Diffusion: MultiModal Attack on Diffusion Models | Code | 2 |
| Dual-Camera Smooth Zoom on Mobile Phones | Code | 2 |
| Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge | Code | 2 |
| CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra | Code | 2 |
| Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment | Code | 2 |
| Harmonizing Visual Text Comprehension and Generation | Code | 2 |
| TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder | Code | 2 |
| LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models | Code | 2 |
| Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation | Code | 2 |
| Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models | Code | 2 |
| Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts | Code | 2 |
| TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Code | 2 |
| OpenChemIE: An Information Extraction Toolkit For Chemistry Literature | Code | 2 |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Code | 2 |
| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Code | 2 |
| Optimizing tiny colorless feedback delay networks | Code | 2 |
| Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation | Code | 2 |
| Detector-Free Structure from Motion | Code | 2 |
| UMERegRobust -- Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration | Code | 2 |