RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction
Haonan Bian, Zhiyuan Yao, Sen Hu +7 more
As Large Language Models (LLMs) evolve from static dialogue interfaces to autonomous general agents, effective memory is paramount to ensuring long-term consistency. However, existing benchmarks primarily focus on casual conversation or task-oriented dialogue, failing to capture **"long-term project...