Good article.<p>I have spotted similar bugs too which become evident very quickly when working with smaller resolutions and likewise with a smaller number of grayscales.<p>> <i>We choose a H×W rectangular grid of points, from which we will draw samples.</i><p>An additional thing to keep in mind is how a camera capturing the image would be operating. It's not sampling in the true theoretical sense of picking points from a continuous signal. The pixels are of finite size, which makes the grid (2) as closer to the reality. There are additional complications for color images where the red, green and blue channels are integrating over different regions within the pixel area (see [A] for example). This makes the real grid as different from even (2) for different color channels. However, the math suggested by the author should not change still.<p>> <i>It seems the mess is unique in the deep learning world.</i><p>The title, "Where Are Pixels? -- a Deep Learning Perspective", looks unjustified. What's presented is not a deep learning perspective. It applies generically.<p>[A] <a href="https://en.wikipedia.org/wiki/Bayer_filter" rel="nofollow">https://en.wikipedia.org/wiki/Bayer_filter</a>