OpenThoughts-Agent is a large-scale research project that develops tools and data recipes for training small agent models. It supports large-scale data generation (Datagen), supervised fine-tuning (SFT), and reinforcement learning (RL), and runs on high-performance computing (HPC) clusters through a modular HPC launcher. The project provides customizable data generation scripts and training configurations, and integrates with Llama Factory, vLLM, and Harbor. Containerized tools are deployed via Harbor, and remote container providers are supported so that the same workflows can run across different HPC environments.
conda + mamba).
pip install .
The project has many dependencies; it is recommended to use a package manager like uv within the virtual environment.
pip install ".[datagen]"
pip install ".[datagen,datagen-swesmith]"
# If the base datagen extra is already installed, you can instead run:
# pip install ".[datagen-swesmith]"
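Since uv is recommended for managing the dependencies, the same installs can be sketched with uv's pip-compatible interface. This is a minimal sketch, not part of the project's tooling: `run` and `install_with_uv` are hypothetical helper names, and it assumes uv is installed and the virtual environment is already active. Setting `DRY_RUN=1` only prints the command, so the sketch can be checked without touching the environment.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of OpenThoughts-Agent).

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Install the project from the repository root, optionally with extras,
# e.g. "datagen" or "datagen,datagen-swesmith".
install_with_uv() {
  local extras="${1:-}"
  if [ -n "$extras" ]; then
    # Quoting keeps shells such as zsh from treating the brackets as a glob.
    run uv pip install ".[${extras}]"
  else
    run uv pip install .
  fi
}
```

For example, `DRY_RUN=1 install_with_uv datagen` prints `uv pip install .[datagen]`, while `install_with_uv datagen,datagen-swesmith` performs the install for both extras.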
git submodule update --init --recursive sft/llamafactory
[sft] extension):
cd sft/llamafactory
pip install -e ".[train,liger-kernel,deepspeed]"  # Select extras as needed
cd -
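Before installing from sft/llamafactory, it can help to confirm that the submodule was actually checked out. The sketch below interprets the first character of a `git submodule status` line, which git documents as `-` for an uninitialized submodule, `+` for a checked-out commit that differs from the one recorded in the superproject, and `U` for merge conflicts. `submodule_state` is a hypothetical helper name, not something the project provides.

```shell
# Hypothetical helper: classify one line of `git submodule status` output.
submodule_state() {
  case "$1" in
    "") echo "unknown" ;;          # no output, e.g. wrong path or not a repo
    -*) echo "uninitialized" ;;    # run `git submodule update --init` first
    +*) echo "commit-mismatch" ;;  # checked-out commit differs from the index
    U*) echo "merge-conflict" ;;
    *)  echo "ok" ;;               # leading space: submodule is in sync
  esac
}

# Usage from the repository root (run manually):
#   submodule_state "$(git submodule status sft/llamafactory)"
```

If the helper reports `uninitialized`, rerunning the `git submodule update --init --recursive sft/llamafactory` step should populate the directory.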
Training configuration files compatible with OT‑Agent are located under sft/lf_configs/**. For parameter details and dependency instructions, see sft/llamafactory/README.md.
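To see which training configurations are available, the directory can be listed from the repository root. This sketch assumes the configurations are YAML files (an assumption; only the directory is named here), and `list_lf_configs` is a hypothetical helper name.

```shell
# Hypothetical helper: list config files under a directory
# (default: sft/lf_configs, relative to the repository root).
# Assumption: configs use a .yaml or .yml extension.
list_lf_configs() {
  local root="${1:-sft/lf_configs}"
  find "$root" -type f \( -name '*.yaml' -o -name '*.yml' \) | sort
}
```

Calling `list_lf_configs` with no argument searches sft/lf_configs recursively; passing another directory as the first argument searches that directory instead.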
More Links: