I don't understand "using ONE TCP connection, it can send multiple request/responses in parallel!".<p>Say a browser sends 2 requests in parallel using the same socket.<p>one request asks for a png image, one request asks for a css file. The server returns 32 bytes of data back. How does the client know what that 32 bytes are? are they part of the png image or the css file?