아카이브
- 30 / 09 GQA, Training Generalized Multi Query Transformer Models from Multi Head Checkpoints
- 01 / 08 Foundational Autoraters, Taming Large Language Models for Better Automatic Evaluation
- 17 / 07 Chain-of-verification Reduces Hallucination in Large Language Models
- 04 / 07 DoRA, Weight-Decomposed Low-Rank Adaptation
- 17 / 06 MART, Improving LLM Safety with Multi-round Automatic Red-Teaming
- 11 / 06 Generative Agents, Interactive Simulacra of Human Behavior
- 10 / 06 Iterative Reasoning Preference Optimization