LLM 8

HydraLoRA, An Asymmetric LoRA Architecture for Efficient Fine-Tuning 2025/02/15
GQA, Training Generalized Multi Query Transformer Models from Multi Head Checkpoints 2024/09/30
Foundational Autoraters, Taming Large Language Models for Better Automatic Evaluation 2024/08/01
Chain-of-verification Reduces Hallucination in Large Language Models 2024/07/17
DoRA, Weight-Decomposed Low-Rank Adaptation 2024/07/04
MART, Improving LLM Safety with Multi-round Automatic Red-Teaming 2024/06/17
Generative Agents, Interactive Simulacra of Human Behavior 2024/06/11
Iterative Reasoning Preference Optimization 2024/06/10