| SSLRec: A Self-Supervised Learning Framework for Recommendation | Aug 10, 2023 | Collaborative Filtering, Data Augmentation | Code Available | 2 |
| LLM As DBA | Aug 10, 2023 | | Code Available | 2 |
| Follow Anything: Open-set detection, tracking, and following in real-time | Aug 10, 2023 | | Code Available | 2 |
| Flexible Isosurface Extraction for Gradient-Based Mesh Optimization | Aug 10, 2023 | | Code Available | 2 |
| PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences | Aug 10, 2023 | Deep Learning, valid | Code Available | 2 |
| YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection | Aug 10, 2023 | Object, object-detection | Code Available | 2 |
| Fuzz4All: Universal Fuzzing with Large Language Models | Aug 9, 2023 | | Code Available | 2 |
| PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning | Aug 8, 2023 | Representation Learning | Code Available | 2 |
| Cumulative Reasoning with Large Language Models | Aug 8, 2023 | Decision Making, Logical Reasoning | Code Available | 2 |
| LATR: 3D Lane Detection from Monocular Images with Transformer | Aug 8, 2023 | 3D Lane Detection, Autonomous Driving | Code Available | 2 |
| FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Aug 8, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| 3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment | Aug 8, 2023 | 3D Question Answering (3D-QA), Dense Captioning | Code Available | 2 |
| SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Aug 8, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Shepherd: A Critic for Language Model Generation | Aug 8, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions | Aug 8, 2023 | Caption Generation, Image Captioning | Code Available | 2 |
| AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Aug 8, 2023 | Language Model Evaluation, Language Modeling | Code Available | 2 |
| ConDistFL: Conditional Distillation for Federated Learning from Partially Annotated Data | Aug 8, 2023 | Federated Learning, Knowledge Distillation | Code Available | 2 |
| PokerKit: A Comprehensive Python Library for Fine-Grained Multi-Variant Poker Game Simulations | Aug 8, 2023 | | Code Available | 2 |
| UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition | Aug 7, 2023 | named-entity-recognition, Named Entity Recognition | Code Available | 2 |
| TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models | Aug 7, 2023 | Hallucination, Object Hallucination | Code Available | 2 |
| Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue | Aug 7, 2023 | Instruction Following, Language Modeling | Code Available | 2 |
| SynJax: Structured Probability Distributions for JAX | Aug 7, 2023 | | Code Available | 2 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RL, reinforcement-learning | Code Available | 2 |
| Dual Aggregation Transformer for Image Super-Resolution | Aug 7, 2023 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model | Aug 7, 2023 | Denoising, Image Denoising | Code Available | 2 |
| Spanish Pre-trained BERT Model and Evaluation Data | Aug 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies | Aug 6, 2023 | Hallucination | Code Available | 2 |
| Early Detection and Localization of Pancreatic Cancer by Label-Free Tumor Synthesis | Aug 6, 2023 | Specificity | Code Available | 2 |
| EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education | Aug 5, 2023 | Chatbot, Language Modeling | Code Available | 2 |
| PowerSimulationsDynamics.jl -- An Open Source Modeling Package for Modern Power Systems with Inverter-Based Resources | Aug 5, 2023 | | Code Available | 2 |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Aug 4, 2023 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data | Aug 4, 2023 | Question Answering, Visual Question Answering | Code Available | 2 |
| FB-BEV: BEV Representation from Forward-Backward View Transformations | Aug 4, 2023 | | Code Available | 2 |
| MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities | Aug 4, 2023 | Math, MM-Vet | Code Available | 2 |
| UniSim: A Neural Closed-Loop Sensor Simulator | Aug 3, 2023 | | Code Available | 2 |
| Scaling Relationship on Learning Mathematical Reasoning with Large Language Models | Aug 3, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 2 |
| ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints | Aug 3, 2023 | Image Generation, Language Modelling | Code Available | 2 |
| The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World | Aug 3, 2023 | All, Question Answering | Code Available | 2 |
| DETR Doesn't Need Multi-Scale or Locality Design | Aug 3, 2023 | Decoder | Code Available | 2 |
| From Sparse to Soft Mixtures of Experts | Aug 2, 2023 | | Code Available | 2 |
| Flows: Building Blocks of Reasoning and Collaborating AI | Aug 2, 2023 | Prompt Engineering | Code Available | 2 |
| Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking | Aug 1, 2023 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 2 |
| AnyLoc: Towards Universal Visual Place Recognition | Aug 1, 2023 | Image Retrieval, Visual Place Recognition | Code Available | 2 |
| DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving | Aug 1, 2023 | Autonomous Driving, Bench2Drive | Code Available | 2 |
| FLatten Transformer: Vision Transformer using Focused Linear Attention | Aug 1, 2023 | Diversity | Code Available | 2 |
| UniVTG: Towards Unified Video-Language Temporal Grounding | Jul 31, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| MovieChat: From Dense Token to Sparse Memory for Long Video Understanding | Jul 31, 2023 | Multiple-choice, Question Answering | Code Available | 2 |
| LP-MusicCaps: LLM-Based Pseudo Music Captioning | Jul 31, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Jul 31, 2023 | All, Downbeat Tracking | Code Available | 2 |
| VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design | Jul 31, 2023 | Computational Efficiency, text-to-speech | Code Available | 2 |