SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1100111050 of 661570 papers

TitleStatusHype
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMsCode2
AnglE-optimized Text EmbeddingsCode2
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone ControlCode2
Detect Everything with Few ExamplesCode2
ICASSP 2023 Acoustic Echo Cancellation ChallengeCode2
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"Code2
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task AgentsCode2
Wasserstein Quantum Monte Carlo: A Novel Approach for Solving the Quantum Many-Body Schrödinger EquationCode2
TART: A plug-and-play Transformer module for task-agnostic reasoningCode2
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an AgentCode2
Random-Access Infinite Context Length for TransformersCode2
Geometric Transformer with Interatomic Positional EncodingCode2
RRHF: Rank Responses to Align Language Models with Human FeedbackCode2
BanditPAM++: Faster k-medoids ClusteringCode2
Parsel🐍: Algorithmic Reasoning with Language Models by Composing DecompositionsCode2
Blockwise Parallel Transformers for Large Context ModelsCode2
On the Planning Abilities of Large Language Models - A Critical InvestigationCode2
One Fits All: Power General Time Series Analysis by Pretrained LMCode2
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer TrainingCode2
Monitor-Guided Decoding of Code LMs with Static Analysis of Repository ContextCode2
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language ModelsCode2
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language ModelsCode2
PromptIR: Prompting for All-in-One Image RestorationCode2
Achieving Cross Modal Generalization with Multimodal Unified RepresentationCode2
RMT: Retentive Networks Meet Vision TransformersCode2
EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise OptimizationCode2
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language ModelsCode2
StructChart: On the Schema, Metric, and Augmentation for Visual Chart UnderstandingCode2
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal ServicesCode2
Text2Reward: Reward Shaping with Language Models for Reinforcement LearningCode2
DreamLLM: Synergistic Multimodal Comprehension and CreationCode2
You Only Look at Screens: Multimodal Chain-of-Action AgentsCode2
GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak PromptsCode2
Rethinking Imitation-based Planner for Autonomous DrivingCode2
Forgedit: Text Guided Image Editing via Learning and ForgettingCode2
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise TrainingCode2
PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban ScenesCode2
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured SparsityCode2
PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental SegmentationCode2
DriveDreamer: Towards Real-world-driven World Models for Autonomous DrivingCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
vSHARP: variable Splitting Half-quadratic Admm algorithm for Reconstruction of inverse-ProblemsCode2
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering SupervisionCode2
RaTrack: Moving Object Detection and Tracking with 4D Radar Point CloudCode2
Grasp-Anything: Large-scale Grasp Dataset from Foundation ModelsCode2
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier TransformCode2
OWL: A Large Language Model for IT OperationsCode2
RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose EstimationCode2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative DecodingCode2
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)Code2
Show:102550
← PrevPage 221 of 13232Next →