| SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation | Sep 29, 2023 | 3D Human Pose Estimation, 3D Human Reconstruction | Code Available | 3 |
| G4SATBench: Benchmarking and Advancing SAT Solving with Graph Neural Networks | Sep 29, 2023 | Benchmarking | Code Available | 1 |
| FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding | Sep 28, 2023 | Benchmarking, Image Retrieval | Code Available | 1 |
| LagrangeBench: A Lagrangian Fluid Mechanics Benchmarking Suite | Sep 28, 2023 | Benchmarking | Code Available | 1 |
| Revisiting Neural Program Smoothing for Fuzzing | Sep 28, 2023 | Benchmarking, CPU | Code Available | 1 |
| Language Models as a Service: Overview of a New Paradigm and its Challenges | Sep 28, 2023 | Benchmarking | Unverified | 0 |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Sep 28, 2023 | Articles, Benchmarking | Code Available | 2 |
| GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond | Sep 28, 2023 | Benchmarking | Code Available | 2 |
| The Trickle-down Impact of Reward (In-)consistency on RLHF | Sep 28, 2023 | Benchmarking | Code Available | 1 |
| OceanBench: The Sea Surface Height Edition | Sep 27, 2023 | Benchmarking, Sensor Fusion | Code Available | 1 |