Quanxing Technology's all-in-one training and inference machine uses only 16 graphics cards, expands single-machine GPU memory to 8 TB, and completes full-parameter training of the DeepSeek R1 671B model. The training hardware costs less than 1.5 million yuan, more than 95% below the industry average, marking a major breakthrough. Quanxing Technology and Inspur Cloud are the first to release a DeepSeek 671B all-in-one machine.