<Safety> Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples (2025.10)

2025. 10. 19. 15:21· Paper Review

최신 Text Embedding 모델 기술 트렌드 및 아키텍처 분석 정리 (feat. NotebookLM) (0)	2025.12.21
<Agent> Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory (2025.04) (2)	2025.06.22
<LLM, Diffusion> Large Language Diffusion Models (2024.02) (3)	2025.03.16
<Uncertainty> I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token (2024.12) (NeurIPS 2024) (4)	2024.12.27
<Structured Outputs> Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models (2024.10) (2)	2024.10.20

대규모 언어 모델(LLM) 포이즈닝 공격의 확장성: 공격 성공은 절대적인 포이즌 샘플 수에 좌우된다