SOTAVerified

World Knowledge

Papers

Showing 101-150 of 818 papers

Title | Status | Hype
LLM4Tag: Automatic Tagging System for Information Retrieval via Large Language Models | - | 0
Large Language Models and Mathematical Reasoning Failures | - | 0
IterQR: An Iterative Framework for LLM-based Query Rewrite in e-Commercial Search System | - | 0
RoseRAG: Robust Retrieval-augmented Generation with Small-scale LLMs via Margin-aware Preference Optimization | - | 0
Unleashing the Power of Large Language Model for Denoising Recommendation | - | 0
MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation | - | 0
Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries | Code | 0
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | - | 0
LAST SToP For Modeling Asynchronous Time Series | - | 0
Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning | - | 0
Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | - | 0
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Code | 3
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Code | 1
A Collection of Question Answering Datasets for Norwegian | - | 0
LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | - | 0
Distilling Multi-modal Large Language Models for Autonomous Driving | - | 0
Dynamic Knowledge Integration for Enhanced Vision-Language Reasoning | - | 0
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them | - | 0
A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | - | 0
VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models | Code | 1
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | - | 0
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap | Code | 3
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering | - | 0
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver | Code | 2
Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble | - | 0
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | - | 0
An Automatic Graph Construction Framework based on Large Language Models for Recommendation | Code | 1
Knowledge Editing through Chain-of-Thought | Code | 1
Interweaving Memories of a Siamese Large Language Model | Code | 0
Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models | - | 0
Logical Consistency of Large Language Models in Fact-checking | - | 0
Fietje: An open, efficient LLM for Dutch | Code | 2
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | - | 0
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark | Code | 2
Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models | Code | 1
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge | Code | 0
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | - | 0
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction | - | 0
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs | Code | 0
GaGA: Towards Interactive Global Geolocation Assistant | - | 0
AltFS: Agency-light Feature Selection with Large Language Models in Deep Recommender Systems | - | 0
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs | Code | 1
Balancing Efficiency and Effectiveness: An LLM-Infused Approach for Optimized CTR Prediction | - | 0
Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | - | 0
World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | - | 0
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Code | 1
Retrieval-Augmented Machine Translation with Unstructured Knowledge | Code | 1
A surprisal oracle for when every layer counts | Code | 0
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model | - | 0
Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model | - | 0
Page 3 of 17

No leaderboard results yet.