Last active 1728786012

knox revised this gist 1728786011.

1 file changed, 10 insertions

LLaMA-Factory-2.md (file created)

@@ -0,0 +1,10 @@
+| Approach               | Full-tuning        | Freeze-tuning      | LoRA               | QLoRA              |
+| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ |
+| Pre-Training           | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| Supervised Fine-Tuning | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| Reward Modeling        | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| PPO Training           | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| DPO Training           | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| KTO Training           | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| ORPO Training          | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| SimPO Training         | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |