| Automatic High-quality Verilog Assertion Generation through Subtask-Focused Fine-Tuned LLMs and Iterative Prompting | Nov 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| "All that Glitters": Approaches to Evaluations with Unreliable Model and Human Annotations | Nov 23, 2024 | AllFairness | CodeCode Available | 0 |
| ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Nov 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The BS-meter: A ChatGPT-Trained Instrument to Detect Sloppy Language-Games | Nov 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Nov 21, 2024 | Code GenerationHumanEval | CodeCode Available | 1 |
| Memory Backdoor Attacks on Neural Networks | Nov 21, 2024 | Backdoor AttackFederated Learning | —Unverified | 0 |
| SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Framework for Evaluating LLMs Under Task Indeterminacy | Nov 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge | Nov 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |