arXiv (Cornell University)
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
March 2024 • Xiaokang Zhang, Jing Zhang, Zeyao Ma, Li Yang, Bohan Zhang, Guanlin Li, Zijun Yao, Kang Xu, Jinchang Zhou, Daniel Zhang-Li, Jifan Yu, Shu Zhao, Juanz…
We introduce TableLLM, a robust large language model (LLM) with 8 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios. We propose a distant supervision method for training, which comprises a reasoning process extension strategy, aiding in training LLMs to understand reasoning patterns more effectively as well as a cross-way validation strategy, ensuring the quality of the aut…