[LG]《What Features in... 爱可可-爱生活 2024-11-10 19:53:10 [LG]《What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks》N M Kirch, S Field, S Casper [Cambridge ERA & MIT CSAIL] (2024) 机器学习人工智能论文