So today I learnt that Amazon.com is acutally hosted on Google Cloud Platform uses Google Domains as well as 1&1 Hosting, alongside Adobe TagManager and Google Analytics. Who could have known? \s<p>Something is really off there, what do you do to get these results? <a href="https://sitestacks.com/amazon.com" rel="nofollow">https://sitestacks.com/amazon.com</a>
Interesting, but the results seem kind of misleading. For example, <a href="https://sitestacks.com/linestarve.com" rel="nofollow">https://sitestacks.com/linestarve.com</a> says I'm using "NameCheap Web Hosting" -- they're my registrar, but I run my own HTTP server for that domain. Meanwhile <a href="https://sitestacks.com/analytics.bitmash.io" rel="nofollow">https://sitestacks.com/analytics.bitmash.io</a> reports that it's built with Piwik (true, but I would expect that to appear on a page <i>using</i> Piwik, which it doesn't), but for some reason leaves out all mention of webserver, DNS hosting, and so on.
Had a quick go and was pretty disappointed. Most of this is just having a look at the third party js loaded. Other things are wild guesses based on DNS, which has nothing to do with the website but are either out of date or reside in unassociated parts of the business. The tech stack detection was was almost entirely wrong :(
In order for me to actually use your service it would need to be rolled into a browser extension. This needs to be at your fingertips when you need it, now I need to copy-paste the url into a new tab. I use Wappalyzer [1] a couple of times a day and love the browser integration.<p>[1] <a href="https://wappalyzer.com/" rel="nofollow">https://wappalyzer.com/</a>
These tools never work well, usually a weak combination of scraping the html, JS tags and checking DNS entries. Siftery tried something similar with a bunch of VC raised and is equally bad.<p>The only decent one is <a href="http://builtwith.com" rel="nofollow">http://builtwith.com</a> which also happens to be a great 1-man business.
doesn't <a href="http://www.builtwith.com/" rel="nofollow">http://www.builtwith.com/</a> do this already and have so much data too ?
Dupe from 20 days ago<p><a href="https://news.ycombinator.com/item?id=15249136" rel="nofollow">https://news.ycombinator.com/item?id=15249136</a>
SiteStacks can find the technology used at any domain, including a set of roughly 700,000 that we’re regularly checking.<p>What makes the dataset unique is the combination of programmatic data (code breadcrumbs, network requests, DNS, some NLP, etc.), but augmented by data validated by users directly.<p>The user validated data is only available on Siftery (e.g. for sitestacks.com/uber.com you have to follow the link through to siftery.com/company/uber to see the full set), but all the programmatic methods are improved by user-validated data (e.g. if a method yields too many false positive, we bump it out).<p>We think this approach helps create the most accurate dataset of its kind. We’ve done some internal benchmarking and feel really good about it.<p>We’re looking for feedback on how this can be better, and open to partnering with others who want to make use of this data for good.
I should also mention we have new browser extensions for chrome - [1] and firefox - [2]<p>[1] <a href="https://chrome.google.com/webstore/detail/sitestacks-instant-tech-l/nnknmohbeolbkgeggkiaifelfkdlnfak" rel="nofollow">https://chrome.google.com/webstore/detail/sitestacks-instant...</a><p>[2] <a href="https://addons.mozilla.org/en-US/firefox/addon/sitestacks/reviews/" rel="nofollow">https://addons.mozilla.org/en-US/firefox/addon/sitestacks/re...</a>
Also see Whatruns which was on hn last month : <a href="https://news.ycombinator.com/item?id=15098028" rel="nofollow">https://news.ycombinator.com/item?id=15098028</a>
> We’re looking for feedback on how this can be better, and open to partnering with others who want to make use of this data for good.<p>Responding to the feedback and data discrepancies mentioned here would be a good start. The HN community here is testing this for you for free and providing you with valuable feedback, and asking you questions that you need to answer, if you want to make your product useful.<p>I don't see you (OP) responding to anyone. The 2 posts from you are both promoting the site.
<a href="https://sitestacks.com/products/g-suite-formerly-google-apps-for-work" rel="nofollow">https://sitestacks.com/products/g-suite-formerly-google-apps...</a><p>Doesn't seem like it is accurate though. Seems like this is more of crowdsourced data than automatically figuring out things.
Ran it on <a href="https://canpicker.com/" rel="nofollow">https://canpicker.com/</a>. It's kind of cool and accurate but I was expecting it to pick up stuff like react and maybe finer grain details like individual libraries.
I tried it with my website.
It spotted google analytics (correct)
and HTML5 (correct again)<p>I was curious to see if it was going to work out that I am running it against Google App Engine
but it did not figure that