other2h ago

Does anyone have enough compute to make a distillation dataset out of GLM5.2?

rr/LocalLLaMAscore 0.49

A reddit user is asking if anyone has sufficient compute to create a distillation dataset from GLM-5.2, which could be used to train smaller models like Qwen-3.5. The proposed dataset would contain 700k-1M examples. This would benefit the community by enabling better training of smaller models.

Key takeaways

GLM-5.2 proposed as source for distillation dataset.
700k-1M examples suggested for dataset size.
Smaller models like Qwen-3.5 could benefit from dataset.

#distillation #open-weights #local-llm

Read the original

other2h ago

Does anyone have enough compute to make a distillation dataset out of GLM5.2?

A reddit user is asking if anyone has sufficient compute to create a distillation dataset from GLM-5.2, which could be used to train smaller models like Qwen-3.5. The proposed dataset would contain 700k-1M examples. This would benefit the community by enabling better training of smaller models.

Key takeaways

GLM-5.2 proposed as source for distillation dataset.
700k-1M examples suggested for dataset size.
Smaller models like Qwen-3.5 could benefit from dataset.

#distillation #open-weights #local-llm

Read at r/LocalLLaMA