| Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite | Sep 15, 2023 | Question Answering | CodeCode Available | 2 |
| Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context | Sep 15, 2023 | | CodeCode Available | 2 |
| Optimization of Rank Losses for Image Retrieval | Sep 15, 2023 | Image RetrievalRetrieval | CodeCode Available | 2 |
| PromptASR for contextualized ASR with controllable style | Sep 14, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 2 |
| FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec | Sep 14, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 2 |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Sep 14, 2023 | HallucinationIn-Context Learning | CodeCode Available | 2 |
| VerilogEval: Evaluating Large Language Models for Verilog Code Generation | Sep 14, 2023 | BenchmarkingCode Generation | CodeCode Available | 2 |
| Generative Image Dynamics | Sep 14, 2023 | | CodeCode Available | 2 |
| Unified Human-Scene Interaction via Prompted Chain-of-Contacts | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting | Sep 13, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| PILOT: A Pre-Trained Model-Based Continual Learning Toolbox | Sep 13, 2023 | class-incremental learningClass Incremental Learning | CodeCode Available | 2 |
| SafetyBench: Evaluating the Safety of Large Language Models | Sep 13, 2023 | Multiple-choice | CodeCode Available | 2 |
| CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics | Sep 13, 2023 | | CodeCode Available | 2 |
| BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models | Sep 12, 2023 | DiagnosticNatural Language Understanding | CodeCode Available | 2 |
| Commands as AI Conversations | Sep 12, 2023 | | CodeCode Available | 2 |
| Temporal Action Localization with Enhanced Instant Discriminability | Sep 11, 2023 | Action DetectionAction Localization | CodeCode Available | 2 |
| Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning | Sep 11, 2023 | Mixture-of-Expertsparameter-efficient fine-tuning | CodeCode Available | 2 |
| ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation | Sep 11, 2023 | Autonomous DrivingDomain Generalization | CodeCode Available | 2 |
| MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning | Sep 11, 2023 | MathMathematical Reasoning | CodeCode Available | 2 |
| Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications | Sep 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase | Sep 11, 2023 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| A physics-informed and attention-based graph learning approach for regional electric vehicle charging demand prediction | Sep 11, 2023 | Graph LearningMeta-Learning | CodeCode Available | 2 |
| Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation | Sep 10, 2023 | Talking Head Generation | CodeCode Available | 2 |
| VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching | Sep 10, 2023 | text-to-speechText to Speech | CodeCode Available | 2 |
| Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization | Sep 9, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Sep 7, 2023 | Keypoint Detection | CodeCode Available | 2 |
| A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation | Sep 7, 2023 | Organ SegmentationSegmentation | CodeCode Available | 2 |
| XGen-7B Technical Report | Sep 7, 2023 | 2k8k | CodeCode Available | 2 |
| DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models | Sep 7, 2023 | TruthfulQA | CodeCode Available | 2 |
| PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips | Sep 7, 2023 | BenchmarkingKnowledge Graphs | CodeCode Available | 2 |
| Natural and Robust Walking using Reinforcement Learning without Demonstrations in High-Dimensional Musculoskeletal Models | Sep 6, 2023 | Reinforcement Learning (RL) | CodeCode Available | 2 |
| GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models | Sep 6, 2023 | | CodeCode Available | 2 |
| BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network | Sep 6, 2023 | Generative Adversarial NetworkSpeech Synthesis | CodeCode Available | 2 |
| Automated Bioinformatics Analysis via AutoBA | Sep 6, 2023 | AI AgentLanguage Modeling | CodeCode Available | 2 |
| GPT Can Solve Mathematical Problems Without a Calculator | Sep 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra | Sep 6, 2023 | CoLAGaussian Processes | CodeCode Available | 2 |
| Dynamic Brain Transformer with Multi-level Attention for Functional Brain Network Analysis | Sep 5, 2023 | | CodeCode Available | 2 |
| GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction | Sep 5, 2023 | 3D Reconstructionglobal-optimization | CodeCode Available | 2 |
| Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning | Sep 5, 2023 | DecoderImage Generation | CodeCode Available | 2 |
| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Sep 4, 2023 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Relay Diffusion: Unifying diffusion process across resolutions for image synthesis | Sep 4, 2023 | Image Generation | CodeCode Available | 2 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Sep 4, 2023 | Benchmarkingcounterfactual | CodeCode Available | 2 |
| NLLB-CLIP -- train performant multilingual image retrieval model on a budget | Sep 4, 2023 | Image RetrievalRetrieval | CodeCode Available | 2 |
| Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images | Sep 4, 2023 | Change DetectionInteractive Segmentation | CodeCode Available | 2 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Sep 3, 2023 | BenchmarkingImage Reconstruction | CodeCode Available | 2 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Sep 3, 2023 | Scene Text Recognition | CodeCode Available | 2 |
| RevColV2: Exploring Disentangled Representations in Masked Image Modeling | Sep 2, 2023 | Decoderimage-classification | CodeCode Available | 2 |
| CityDreamer: Compositional Generative Model of Unbounded 3D Cities | Sep 1, 2023 | modelScene Generation | CodeCode Available | 2 |
| OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation | Sep 1, 2023 | 3D Open-Vocabulary Instance Segmentation3D Open-Vocabulary Object Detection | CodeCode Available | 2 |
| Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following | Sep 1, 2023 | 3D Generation3D Question Answering (3D-QA) | CodeCode Available | 2 |