A woman from Connecticut just became the biggest winner in the history of "Wheel of Fortune." Stamford resident Christina Derevjanik won $1,035,155 on the Ryan Seacrest-hosted game show that aired on ...
Abstract: Recently, researchers in the field of math word problem (MWP) solving have reported performance metrics for various large language models (LLMs) on benchmark datasets, with some models ...
GSM8K-V is a purely visual multi-image mathematical reasoning benchmark that systematically maps each GSM8K math word problem into its visual counterpart to enable a clean, within-item comparison ...
Abstract: In this study, we investigated the effects of self-reflection in large language models (LLMs) on problem-solving performance. We instructed nine popular LLMs to answer a series of ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果