Back to feed
other2h ago
Does anyone have enough compute to make a distillation dataset out of GLM5.2?
A reddit user is asking if anyone has sufficient compute to create a distillation dataset from GLM-5.2, which could be used to train smaller models like Qwen-3.5. The proposed dataset would contain 700k-1M examples. This would benefit the community by enabling better training of smaller models.
Key takeaways
- GLM-5.2 proposed as source for distillation dataset.
- 700k-1M examples suggested for dataset size.
- Smaller models like Qwen-3.5 could benefit from dataset.
other2h ago
Does anyone have enough compute to make a distillation dataset out of GLM5.2?
A reddit user is asking if anyone has sufficient compute to create a distillation dataset from GLM-5.2, which could be used to train smaller models like Qwen-3.5. The proposed dataset would contain 700k-1M examples. This would benefit the community by enabling better training of smaller models.
Key takeaways
- GLM-5.2 proposed as source for distillation dataset.
- 700k-1M examples suggested for dataset size.
- Smaller models like Qwen-3.5 could benefit from dataset.