Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language Models | Jun 3, 2024 | Image Captioning Language Modelling | Code | Code Available | 2
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost | Jun 3, 2024 | Hallucination Language Modeling | - | Unverified | 0
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPU Language Modeling | Code | Code Available | 2
The Geometry of Categorical and Hierarchical Concepts in Large Language Models | Jun 3, 2024 | Language Modelling Large Language Model | Code | Code Available | 2
Understanding Token Probability Encoding in Output Embeddings | Jun 3, 2024 | Causal Language Modeling Language Modeling | - | Unverified | 0
MultiMax: Sparse and Multi-Modal Attention Learning | Jun 3, 2024 | image-classification Image Classification | Code | Code Available | 1
HBTP: Heuristic Behavior Tree Planning with Large Language Model Reasoning | Jun 3, 2024 | Language Modeling Language Modelling | Code | Code Available | 0
Towards a copilot in BIM authoring tool using a large language model-based agent for intelligent human-machine interaction | Jun 2, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Harnessing Business and Media Insights with Large Language Models | Jun 2, 2024 | Data Visualization Language Modeling | - | Unverified | 0
Inverse Constitutional AI: Compressing Preferences into Principles | Jun 2, 2024 | Chatbot Language Modelling | Code | Code Available | 1
Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions | Jun 2, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Aligning Language Models with Demonstrated Feedback | Jun 2, 2024 | Articles Avg | Code | Code Available | 2
LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models | Jun 2, 2024 | Continual Pretraining Information Retrieval | - | Unverified | 0
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models | Jun 2, 2024 | Language Modelling Text Generation | - | Unverified | 0
Large Language Model Confidence Estimation via Black-Box Access | Jun 1, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning | Jun 1, 2024 | Automatic Speech Recognition Automatic Speech Recognition (ASR) | - | Unverified | 0
HonestLLM: Toward an Honest and Helpful Large Language Model | Jun 1, 2024 | Language Modeling Language Modelling | Code | Code Available | 1
Controlling Large Language Model Agents with Entropic Activation Steering | Jun 1, 2024 | Decision Making In-Context Learning | - | Unverified | 0
KGLink: A column type annotation method that combines knowledge graph and pre-trained language model | Jun 1, 2024 | Column Type Annotation Deep Learning | Code | Code Available | 0
On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots | Jun 1, 2024 | Language Modeling Language Modelling | - | Unverified | 0
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model | Jun 1, 2024 | Action Recognition Activity Recognition | - | Unverified | 0
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation | Jun 1, 2024 | feature selection Language Modeling | Code | Code Available | 1
RAG Does Not Work for Enterprises | May 31, 2024 | Language Modeling Language Modelling | - | Unverified | 0
LOLAMEME: Logic, Language, Memory, Mechanistic Framework | May 31, 2024 | Language Modeling Language Modelling | - | Unverified | 0
DYNA: Disease-Specific Language Model for Variant Pathogenicity | May 31, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Query2CAD: Generating CAD models using natural language queries | May 31, 2024 | Language Modeling Language Modelling | Code | Code Available | 2
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking | May 31, 2024 | In-Context Learning Information Retrieval | Code | Code Available | 0
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment | May 31, 2024 | Language Modeling Language Modelling | Code | Code Available | 0
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond | May 31, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis | May 31, 2024 | Density Estimation Imputation | - | Unverified | 0
ABodyBuilder3: Improved and scalable antibody structure predictions | May 31, 2024 | Language Modeling Language Modelling | Code | Code Available | 2
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language Modeling Language Modelling | Code | Code Available | 3
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality | May 31, 2024 | Language Modeling Language Modelling | Code | Code Available | 11
FineRadScore: A Radiology Report Line-by-Line Evaluation Technique Generating Corrections with Severity Scores | May 31, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF | May 31, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | May 31, 2024 | Diversity Image Generation | - | Unverified | 0
You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | May 31, 2024 | image-classification Image Classification | - | Unverified | 0
Evaluating Large Language Model Biases in Persona-Steered Generation | May 30, 2024 | Language Modeling Language Modelling | Code | Code Available | 0
CycleFormer: TSP Solver Based on Language Modeling | May 30, 2024 | Decoder Language Modeling | Code | Code Available | 1
Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions | May 30, 2024 | Language Modelling Large Language Model | Code | Code Available | 0
SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought | May 30, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Towards Ontology-Enhanced Representation Learning for Large Language Models | May 30, 2024 | Contrastive Learning Language Modeling | Code | Code Available | 0
Who Writes the Review, Human or AI? | May 30, 2024 | Language Modeling Language Modelling | - | Unverified | 0
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads | May 30, 2024 | In-Context Learning Language Modeling | Code | Code Available | 0
Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training | May 30, 2024 | Image-text Retrieval Language Modeling | - | Unverified | 0
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | May 30, 2024 | Decision Making Hierarchical Reinforcement Learning | - | Unverified | 0
GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question Answering Knowledge Graphs | Code | Code Available | 3
Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation | May 30, 2024 | Diversity Drug Design | Code | Code Available | 3
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling Language Modelling | Code | Code Available | 1
Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model | May 30, 2024 | Diversity Language Modeling | Code | Code Available | 1