| Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics | Jun 20, 2024 | 8kDescriptive | —Unverified | 0 |
| GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models | Jun 20, 2024 | 8k | CodeCode Available | 0 |
| Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Jun 19, 2024 | 8kHallucination | CodeCode Available | 2 |
| Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM | Jun 16, 2024 | 8kOpinion Summarization | —Unverified | 0 |
| Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture | Jun 1, 2024 | 8kFace Reconstruction | —Unverified | 0 |
| Cutting Through the Noise: Boosting LLM Performance on Math Word Problems | May 30, 2024 | 8kMath | CodeCode Available | 0 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | May 21, 2024 | 2k8k | CodeCode Available | 1 |
| DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents | Apr 30, 2024 | 8kDiversity | CodeCode Available | 0 |
| Extending Llama-3's Context Ten-Fold Overnight | Apr 30, 2024 | 8kGPU | CodeCode Available | 0 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 |