How is this on the front page? It's completely incoherent.<p>For anyone actually interested in interesting techniques for multi-GPU DNN training, <a href="http://arxiv.org/pdf/1404.5997v2.pdf" rel="nofollow">http://arxiv.org/pdf/1404.5997v2.pdf</a> and the references therein are probably a good start.
The exposition is not very clear. What exactly do you mean when you say "No edges will be communicated over the network, only half of the nodes."? I'm puzzled, because a few sentences later you claim "The only network IO that would be required would be sending each edge value to its respective node in Q." So the edge values <i>are</i> communicated after all?<p>From what I've understood, you're suggesting that for every node in a layer, you colocate its incident edges on the same machine?
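To make my reading concrete, here is a minimal sketch of what I <i>think</i> is being proposed (all names here are my own, not from the post): nodes in the layer Q are partitioned across machines, edges are stored on the machine owning their destination node, and only the edge <i>values</i> cross the network:

```python
# Hypothetical sketch of my reading of the scheme: nodes in layer Q are
# partitioned across machines; each edge value is routed to the machine
# owning its destination node. Edge values DO cross the network; the edge
# structure itself stays local to the owning machine.

def owner(node_id: int, num_machines: int) -> int:
    """Assign each node in Q to a machine (simple modulo partition)."""
    return node_id % num_machines

def route_edge_values(edges, num_machines):
    """Group edge values by the machine owning each destination node.

    `edges` is a list of (src, dst, value) triples; the result maps each
    machine id to the (dst, value) pairs it must receive over the network.
    """
    outbox = {m: [] for m in range(num_machines)}
    for src, dst, value in edges:
        outbox[owner(dst, num_machines)].append((dst, value))
    return outbox

# Toy example: 4 edges into layer Q, 2 machines.
edges = [(0, 0, 0.5), (1, 1, -0.2), (2, 2, 1.0), (3, 3, 0.1)]
print(route_edge_values(edges, 2))
```

If this is the intended design, then the network IO scales with the number of edges crossing machine boundaries, not with the number of nodes, which is exactly what makes the quoted claims seem contradictory to me.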