<KD, Reasoning> [NAT] Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data (2023.12)

2024. 1. 3. 16:10· Paper Review

<NLP> [Transformer] Attention Is All You Need (2017.06) (1)	2024.01.10
<sLLM> TinyLlama: An Open-Source Small Language Model (2024.01) (0)	2024.01.08
<LLM> SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling (2023.12) (0)	2024.01.01
<DB, Agent> [FunSearch] Mathematical discoveries from program search with largelanguage models (2023.12) (1)	2023.12.28
<LoRA, MoE> LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment (2023.12) (1)	2023.12.26

Abstract