VentureBeat AI•
AI-redeneringmodellen trainen met minder rekenkracht
Back to overview
Researchers at JD.com have developed RLSD, a new training method that enables enterprises to build custom AI reasoning models with significantly fewer computational resources. The technique combines reinforcement learning with self-distillation to overcome limitations of existing approaches, which either require expensive large models or provide insufficient feedback during training. Early experiments show RLSD-trained models outperform those using traditional distillation and reinforcement learning methods, potentially lowering both technical and financial barriers for organizations
Read full article
1 views