Latest Reinforcement Learning Research Papers

Research on learning through interaction, reward optimization, policy learning, and decision-making AI systems.

64 Papers

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Qingyu Ren, Qianyu He, Jingwen Chang +6 more

Instruction-following is critical for large language models, but real-world instructions often contain logical structures such as sequential dependencies and conditional branching. Existing methods typically construct datasets with parallel constraints and optimize average rewards, ignoring logical ...

instruction-following, logical structures, sequential dependencies, conditional branching, LSRInstruct, +7 more
Jan 10, 2026