Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation Paper • 2602.16990 • Published 13 days ago • 11
Ebisu: Benchmarking Large Language Models in Japanese Finance Paper • 2602.01479 • Published 30 days ago • 17