Explore other topics:deepseek r1 model training methodologyollama deepseek open webuideepseek v3 vram requirement중국 deepseekdeepseek knowledge distillation