LLM 微调

使用 AutoTrain，您可以轻松地使用自己的数据微调大型语言模型 (LLM)！

AutoTrain 支持以下类型的 LLM 微调

数据准备

LLM 微调接受 CSV 格式的数据。

对于 SFT/通用训练器，数据应采用以下格式

text
human: hello \n bot: hi nice to meet you
human: how are you \n bot: I am fine
human: What is your name? \n bot: My name is Mary
human: Which is the best programming language? \n bot: Python

对于 SFT/通用训练，您的数据集必须包含一个 text 列

对于奖励训练器，数据应采用以下格式

text	rejected_text
human: hello \n bot: hi nice to meet you	human: hello \n bot: leave me alone
human: how are you \n bot: I am fine	human: how are you \n bot: I am not fine
human: What is your name? \n bot: My name is Mary	human: What is your name? \n bot: Whats it to you?
human: Which is the best programming language? \n bot: Python	human: Which is the best programming language? \n bot: Javascript

对于奖励训练器，您的数据集必须包含一个 text 列（即选定的文本）和一个 rejected_text 列。

对于 DPO/ORPO 训练器，数据应采用以下格式

对于 DPO/ORPO 训练器，您的数据集必须包含一个prompt 列，一个text 列（又称选定文本）和一个rejected_text 列。

对于所有任务，您可以使用 CSV 和 JSONL 文件！