| Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in Keras | Jul 25, 2023 | Image SegmentationSegmentation | CodeCode Available | 4 |
| Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image | Jul 20, 2023 | Depth EstimationImage Reconstruction | CodeCode Available | 4 |
| How is ChatGPT's behavior changing over time? | Jul 18, 2023 | Code GenerationLanguage Modelling | CodeCode Available | 4 |
| AnyDoor: Zero-shot Object-level Image Customization | Jul 18, 2023 | ObjectVirtual Try-on | CodeCode Available | 4 |
| CoTracker: It is Better to Track Together | Jul 14, 2023 | GPUmotion prediction | CodeCode Available | 4 |
| CLAIMED -- the open source framework for building coarse-grained operators for accelerated discovery in science | Jul 12, 2023 | | CodeCode Available | 4 |
| Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration | Jul 11, 2023 | HallucinationLogic Grid Puzzle | CodeCode Available | 4 |
| Semantic-SAM: Segment and Recognize Anything at Any Granularity | Jul 10, 2023 | Image SegmentationSegmentation | CodeCode Available | 4 |
| Simulation-free Schrödinger bridges via score and flow matching | Jul 7, 2023 | | CodeCode Available | 4 |
| FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy | Jul 1, 2023 | Federated LearningPersonalized Federated Learning | CodeCode Available | 4 |
| End-to-end Autonomous Driving: Challenges and Frontiers | Jun 29, 2023 | Autonomous Drivingmotion prediction | CodeCode Available | 4 |
| The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot | Jun 29, 2023 | Image SegmentationSemantic Segmentation | CodeCode Available | 4 |
| ManimML: Communicating Machine Learning Architectures with Animation | Jun 29, 2023 | | CodeCode Available | 4 |
| RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark | Jun 29, 2023 | Combinatorial OptimizationComputational Efficiency | CodeCode Available | 4 |
| Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks | Jun 24, 2023 | PhilosophyTransfer Learning | CodeCode Available | 4 |
| LightGlue: Local Feature Matching at Light Speed | Jun 23, 2023 | 3D ReconstructionCamera Pose Estimation | CodeCode Available | 4 |
| FFCV: Accelerating Training by Removing Data Bottlenecks | Jun 21, 2023 | CPUGPU | CodeCode Available | 4 |
| SSL4EO-L: Datasets and Foundation Models for Landsat Imagery | Jun 15, 2023 | Cloud DetectionEarth Observation | CodeCode Available | 4 |
| Motion Capture Dataset for Practical Use of AI-based Motion Editing and Stylization | Jun 15, 2023 | Motion Style TransferStyle Transfer | CodeCode Available | 4 |
| INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation | Jun 13, 2023 | Patch MatchingTranslation | CodeCode Available | 4 |
| Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard | Jun 13, 2023 | Information RetrievalRepresentation Learning | CodeCode Available | 4 |
| Benchmarking Neural Network Training Algorithms | Jun 12, 2023 | Benchmarking | CodeCode Available | 4 |
| MIMIC-IT: Multi-Modal In-Context Instruction Tuning | Jun 8, 2023 | In-Context LearningVisual Question Answering | CodeCode Available | 4 |
| Tracking Everything Everywhere All at Once | Jun 8, 2023 | AllMotion Estimation | CodeCode Available | 4 |
| How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources | Jun 7, 2023 | Instruction Following | CodeCode Available | 4 |
| Recognize Anything: A Strong Image Tagging Model | Jun 6, 2023 | modelSemantic Parsing | CodeCode Available | 4 |
| Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Neuralangelo: High-Fidelity Neural Surface Reconstruction | Jun 5, 2023 | Neural RenderingSurface Reconstruction | CodeCode Available | 4 |
| Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Jun 1, 2023 | Audio SynthesisComputational Efficiency | CodeCode Available | 4 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | Jun 1, 2023 | Image ClassificationInstruction Following | CodeCode Available | 4 |
| StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners | Jun 1, 2023 | Contrastive Learning | CodeCode Available | 4 |
| TorchRL: A data-driven decision-making library for PyTorch | Jun 1, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 4 |
| A Survey on Large Language Models for Recommendation | May 31, 2023 | Recommendation Systems | CodeCode Available | 4 |
| Let's Verify Step by Step | May 31, 2023 | Active LearningMath | CodeCode Available | 4 |
| AlignScore: Evaluating Factual Consistency with a Unified Alignment Function | May 26, 2023 | Fact VerificationInformation Retrieval | CodeCode Available | 4 |
| Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance | May 26, 2023 | | CodeCode Available | 4 |
| Reasoning with Language Model is Planning with World Model | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Vision + Language Applications: A Survey | May 24, 2023 | Image GenerationSurvey | CodeCode Available | 4 |
| ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers | May 24, 2023 | Image Matting | CodeCode Available | 4 |
| Enhancing Chat Language Models by Scaling High-quality Instructional Conversations | May 23, 2023 | Diversity | CodeCode Available | 4 |
| VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks | May 18, 2023 | DecoderLanguage Modeling | CodeCode Available | 4 |
| TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models | May 18, 2023 | Natural Language InferenceSynthetic Data Generation | CodeCode Available | 4 |
| LIMA: Less Is More for Alignment | May 18, 2023 | Language Modellingreinforcement-learning | CodeCode Available | 4 |
| Deep Multi-Frame Filtering for Hearing Aids | May 14, 2023 | Speech Enhancement | CodeCode Available | 4 |
| DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement | May 14, 2023 | CPUSpeech Enhancement | CodeCode Available | 4 |
| The Whole Is Greater than the Sum of Its Parts: Improving Music Source Separation by Bridging Network | May 13, 2023 | Music Source Separation | CodeCode Available | 4 |
| Segment and Track Anything | May 11, 2023 | Autonomous Drivingmultimodal interaction | CodeCode Available | 4 |
| Data quality dimensions for fair AI | May 11, 2023 | ClassificationFairness | CodeCode Available | 4 |
| Hierarchically Coherent Multivariate Mixture Networks | May 11, 2023 | Computational EfficiencyTime Series | CodeCode Available | 4 |