SOTAVerified

Language Modeling

Papers

Showing 27012750 of 14182 papers

TitleStatusHype
LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale RetrievalCode1
Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation ApproachCode1
Few-shot Reranking for Multi-hop QA via Language Model PromptingCode1
Cross-Thought for Sentence Encoder Pre-trainingCode1
Cross-model Control: Improving Multiple Large Language Models in One-time TrainingCode1
ArcGPT: A Large Language Model Tailored for Real-world Archival ApplicationsCode1
Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term ConversationsCode1
RARR: Researching and Revising What Language Models Say, Using Language ModelsCode1
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEditCode1
Parameter-Efficient Conversational Recommender System as a Language Processing TaskCode1
Cascade Speculative Drafting for Even Faster LLM InferenceCode1
FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular DataCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
Caution for the Environment: Multimodal Agents are Susceptible to Environmental DistractionsCode1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingCode1
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text TranslationCode1
FedJudge: Federated Legal Large Language ModelCode1
Parsing as PretrainingCode1
LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online ContentCode1
Cascaded Head-colliding AttentionCode1
Legilimens: Practical and Unified Content Moderation for Large Language Model ServicesCode1
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data AugmentationCode1
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language ModelCode1
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text PretrainingCode1
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language ModelsCode1
Pay Less Attention with Lightweight and Dynamic ConvolutionsCode1
LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge AugmentationCode1
Fill in the BLANC: Human-free quality estimation of document summariesCode1
AudioBERT: Audio Knowledge Augmented Language ModelCode1
PeptideBERT: A Language Model based on Transformers for Peptide Property PredictionCode1
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced DistillationCode1
FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis AssistantCode1
Permutation Equivariant Models for Compositional Generalization in LanguageCode1
PermuteFormer: Efficient Relative Position Encoding for Long SequencesCode1
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language ModelingCode1
Filtering Noisy Parallel Corpus using Transformers with Proxy Task LearningCode1
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech RecognitionCode1
Fine-grained Audible Video DescriptionCode1
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language ModelCode1
CrowdCLIP: Unsupervised Crowd Counting via Vision-Language ModelCode1
Finding Universal Grammatical Relations in Multilingual BERTCode1
Configurable Safety Tuning of Language Models with Synthetic Preference DataCode1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingCode1
Learning to engineer protein flexibilityCode1
Learning To Retrieve Prompts for In-Context LearningCode1
ConfliBERT: A Language Model for Political ConflictCode1
Length Generalization of Causal Transformers without Position EncodingCode1
Conformal Alignment: Knowing When to Trust Foundation Models with GuaranteesCode1
Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak DecoderCode1
L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep LearningCode1
Show:102550
← PrevPage 55 of 284Next →

No leaderboard results yet.