This paper shows measured results of using "popular image recognition services" .. that include Azure, AWS, Google, IBM and other current commercial offerings.. (the implication right away is that the tested services are using some DeepLearning system on the server side). The paper specifically says "from the point of view of a software developer".. and spends quite a bit of effort to question the assumptions of a user of these services, and identify potential pitfalls from a mis-match of user assumptions, including consistency over time, consistency between services themselves, and employing a machine that produces deterministic outcomes versus probabilistic ones. The paper looks at the behavior of a Vision-as-Service use from a Software Quality Assurance (SQA) point of view - is the result -of commercial services on the web- reliable over time. Liability within safety-critical environments is questioned.<p>The comments here (so far) address "does DeepLearning image analysis work" .. which is a broader question than what is being addressed in the paper.. Importantly, other kinds of image analysis methods, including other ML approaches, are not being compared..<p>The authors seem to be raising a bit of an alarm about services like these, reflected in the paper title (weakly):<p>[RH1] Computer vision services do not respond with consistent outputs between services, given the same input image.<p>[RH2] The responses from computer vision services are non-deterministic and evolving, and the same service can change its top-most response over time given the same input image.<p>[RH3] Computer vision services do not effectively communicate this evolution and instability, introducing risk into engineering these systems<p>To a non-specialist, this seems like detailed description of a useful real-world investigation, like a lab. The authors' skepticism is healthy, and the paper overall looks good. On the negative side, the discussion of labels in Computer Vision seems to be insufficiently distinguishing between fundamental problems in taxonomy and classification, problems with data grouping in general, and then specifically problems associated with this kind of DeepLearning image identification.