A Google bot scrapes pricing info by adding items to carts

362 points by psim1 almost 5 years ago

28 comments

whoisjuan almost 5 years ago
This bot is simply trying to get the final price (with tax and shipping), which is ridiculous because e-commerce storefronts should show that in the first place without making you go through the whole checkout process.

I have always found that kind of shady, but it's probably known to increase conversions.

What I find interesting is that this is an open attack vector for e-commerce sites. Multiple bots can hit a website, add items, and start the checkout process. This creates an unprecedented influx of cart behavior data that ruins any possible use of the data coming from legit customers. Maybe cleaning the data wouldn't be that hard, but someone who knows what they are doing can really make it hard (separate IPs, emails and cart behavior).

I doubt Shopify or Magento have anything to prevent this.
soganess almost 5 years ago
For people saying this is to calculate the final price with shipping and tax: it's not (or at least not entirely). It is for this new sales conversion dark pattern where prices aren't listed until you add to cart.

eBay sellers are particularly bad offenders: https://www.ebay.com/itm/Open-Box-Certified-Samsung-Galaxy-13-3-4K-Ultra-HD-Touch-Screen-Chromeboo/203028862820?epid=21037915306&hash=item2f45769764:g:IM4AAOSwq4Nesuii
vmception almost 5 years ago
That sparked a funny idea in my head: what if we tricked product managers industry-wide into following KPIs and A/B tests that resulted in a better user experience for consumers, instead of experiences that coincidentally slightly upticked "engagement"?

Because it seems like this mystery shopper is already doing that.
advisedwang almost 5 years ago
http://archive.is/YRkQe
yongjik almost 5 years ago
robots.txt, man. If you don't want search engines to visit a certain part of your site, use robots.txt!

I once heard a tale of an angry site owner calling Google (back when Google itself was novel): Google had deleted his whole website! It turned out he had a "DELETE" button on each page, which generated a plain GET request. So Googlebot visited the site, followed links to every page, and then of course followed every link that generated GET requests, because those are supposed to be safe.

Don't be like that site owner.
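A minimal sketch of the robots.txt idea, using Python's standard `urllib.robotparser`. The rules, the `/cart` and `/checkout` paths, and the `shop.example` host are all hypothetical; a real store would use its own paths.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an example store; the paths are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cart
Disallow: /checkout
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if a crawler that honors robots.txt may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in ("https://shop.example/products/42",
                "https://shop.example/cart/add?id=42"):
        print(url, is_allowed("Googlebot", url))
```

robots.txt is advisory, so this only helps with crawlers that choose to honor it. The other half of the anecdote still applies: state-changing actions such as delete or add-to-cart are better served over POST than over a plain GET link.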
justinwp almost 5 years ago
Protip: You will often get a discount coupon if you go through most of the checkout process (you need to provide an email) and then wait a couple of days. Many stores automate abandoned checkout promotions.
danimal88 almost 5 years ago
It's just price data collection. In particular, MAP (minimum advertised price) policies can be skirted by not publishing a final price but having a price below MAP in the cart, which is a common tactic among online sellers. By pretending to walk through the cart, all sorts of data about pricing, taxes, etc. can be learned. It's not entirely uncommon to see different prices at different times, for different user agents, for different locations, etc. I used to work for a company that built huge price collection systems, and I built many of them...
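The MAP tactic described above can be sketched as a comparison between the advertised price and the price revealed in the cart. This is only an illustration: the `Observation` record, its field names, and the sample figures are hypothetical and not taken from any real price-collection system.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional


@dataclass(frozen=True)
class Observation:
    """One hypothetical price observation for a product."""
    sku: str
    map_price: Decimal               # minimum advertised price
    listed_price: Optional[Decimal]  # None when the page hides the price
    cart_price: Decimal              # price shown once the item is in the cart


def skirts_map(obs: Observation) -> bool:
    """True when the advertised price complies with MAP (or is hidden)
    while the in-cart price is below MAP."""
    advertised_ok = obs.listed_price is None or obs.listed_price >= obs.map_price
    return advertised_ok and obs.cart_price < obs.map_price


if __name__ == "__main__":
    hidden = Observation("A-1", Decimal("100.00"), None, Decimal("89.99"))
    honest = Observation("B-2", Decimal("100.00"), Decimal("100.00"), Decimal("100.00"))
    print(skirts_map(hidden), skirts_map(honest))
```

A collector would only be able to fill in `cart_price` by walking through the add-to-cart flow, which is the behavior the thread is discussing.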
Alupis almost 5 years ago
The real problem with this is on the merchant side of things.

This bot generates thousands of "Abandoned Carts" on one of our sites... thousands...

We send cart reminders to Abandoned Carts after a few days, sometimes with a coupon offer to complete checkout.

This bot is responsible for thousands of bounced emails each week, which impacts our metrics with Mandrill, among other things.

Maybe we shouldn't care, but it's sloppy and ruins all sorts of stats we keep track of regarding cart abandonment rates, recapture rates and more.
rkagerer almost 5 years ago
Are there legal implications to Google bots transacting with websites under false pretenses?

I mean, their normal web crawler identifies itself as such. Here, I feel like they're committing (very) minor fraud by putting in fake shopper information and actively hiding their identity. Not a big deal if it were just some Joe Schmoe somewhere, but at their scale might it border on harassment? The robot equivalent of a prank call?
vmateixeira almost 5 years ago
Genuine question: is this not considered a DoS attack?

Let's imagine I have my online stock linked to limited physical items/assets, e.g. tickets for a show, which get reserved for a period of time when added to a cart. This would prevent genuine clients from buying them.
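A minimal sketch of the scenario above, assuming a store that holds one unit per cart for a fixed time. The `TicketPool` class, its fields, and the numbers are hypothetical; they only show how abandoned carts can temporarily exhaust limited stock.

```python
from dataclasses import dataclass, field


@dataclass
class TicketPool:
    """Hypothetical inventory where adding to cart holds one unit for `hold_seconds`."""
    total: int
    hold_seconds: float
    holds: dict = field(default_factory=dict)  # cart_id -> expiry timestamp

    def available(self, now: float) -> int:
        # Drop expired holds, then count what is left to sell.
        self.holds = {c: t for c, t in self.holds.items() if t > now}
        return self.total - len(self.holds)

    def add_to_cart(self, cart_id: str, now: float) -> bool:
        if self.available(now) <= 0:
            return False  # every unit is held, possibly by abandoned carts
        self.holds[cart_id] = now + self.hold_seconds
        return True


if __name__ == "__main__":
    pool = TicketPool(total=2, hold_seconds=600)
    print(pool.add_to_cart("bot-1", now=0))       # bot takes a hold
    print(pool.add_to_cart("bot-2", now=1))       # bot takes the last unit
    print(pool.add_to_cart("customer", now=2))    # real buyer is blocked
    print(pool.add_to_cart("customer", now=700))  # holds have expired by now
```

Whether a given store is exposed depends on whether it reserves stock at add-to-cart time at all; many only reserve at payment.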
tacon almost 5 years ago
Would it be too much for Google to program the bot to get the final price, and then delete all the items from the cart? Seems rather rude, even for Google.
leoh almost 5 years ago
Such a bot could be used to damage ad tracking.
doe88 almost 5 years ago
I wouldn't fault them for that. I've observed that some sites are most likely gaming the system by detecting Google bots and serving them artificially low prices so that they appear in index summaries; then, when you access the product, its real price is always higher than the one reported in the index.
madmax108 almost 5 years ago
I used to work at a company that provided APIs used for search/personalization/autosuggest for a whole bunch of huge e-commerce companies. Since the entire integration with the customer site was API-based, we worked off of tracking pixels, API requests and cookies to determine shopping behaviour. A lot of this went into determining things like ranking (if someone searches "Tshirt", what shows up on the first page and in what order, etc.).

Since we were only running search and not payment processing, the tracking pixel/API for "Add to Cart" was a big thing for us. The whole product ran on revenue share, so we were paid per X ATCs.

It would be interesting to see whether any of the customers were affected by bots doing ATC and how it was handled if they were.
aaron695 almost 5 years ago
Digital shopping cart abandonment/inventory exhaustion/hoarder bots are an interesting type of DDoS.

There's a popular moment of people using it atm: https://heavy.com/news/2020/06/shopping-card-abandonment-tiktok/
amelius almost 5 years ago
It would be cool if Google could manage to become a storefront for the entire web, thereby eliminating Amazon.
caser almost 5 years ago
This feels like a great way to get data on how all these different e-commerce companies approach remarketing.
Keyframe almost 5 years ago
I think I've seen most of Google's technologies dissected and/or explained in detail over the years. Lots of their own papers too. If you look into how and what they're doing regarding data collection, including scraping, there's nothing.
baybal2 almost 5 years ago
Funny, one quick gig I did in my college years was to write shopping bot protection against "guaranteed lowest price" scrapers like tigerdirect or RFD.

Back then, the goal was exactly the opposite.
Youden almost 5 years ago
When and why did news cease being news and start being short stories and opinion? This entire article could have been cut down to the last few paragraphs and nothing of value would have been lost.

Look at The New York Times in 1921 [0]. Generally the stories are factual and to the point. The entire front page seems to be pure news. There's very little storytelling here; at most there are a few timelines of events.

Look at The New York Times today [1]. There's a bunch of factual and useful coronavirus information, but ~15% of the page is dedicated to "Opinion", the second article appears to be pure speculation, the third article is a bunch of storytime fluff around a little bit of news, and the front page has a mix of actual news and opinion pieces being passed off as news.

When did this happen? Why? Did people lose interest in actual news? Is there less actual news to report?

Perhaps this is regional? Take for example the story about the San Quentin prison. NYTimes [2] has the same drawn-out nonsense as this Google story, while Al Jazeera [3] adds a lot of background but sticks to factual reporting.

[0]: https://archive.org/details/NYTimes_jul16_31_1921

[1]: http://archive.is/oiiXU

[2]: https://www.nytimes.com/2020/06/30/us/san-quentin-prison-coronavirus.html

[3]: https://www.aljazeera.com/news/2020/07/san-quentin-prison-sees-600-coronavirus-cases-5-days-200701192059040.html
ycombonator almost 5 years ago
Google product Growth hack: Fake it Until you make it
hbarka almost 5 years ago
Didn’t some #tiktokteens do the same with some guy’s web store?
s1k3s almost 5 years ago
Is this supposed to intrigue me? Good bot
tudorw almost 5 years ago
Nice, I think it has my CC details )
ardy42 almost 5 years ago
> When The Wall Street Journal contacted Google in June, a spokesman at the internet giant, after a few days of digging, provided an update: The mystery shopper is a bot of its own creation.

> The purpose: making sure the all-in price for the product, including tax and shipping, matches the listing on its Google Shopping platform or in advertisements. It wasn’t to cause angst to merchants due to thousands of abandoned carts.

> “We use automated systems to ensure consumers are getting accurate pricing information from our merchants,” a company spokesman said. “This sometimes leads to merchants seeing abandoned carts as a result of our system testing whether the price displayed matches the price at checkout.”

You'd think they could have better identified themselves in the accounts they were creating, rather than creating this mysterious "John Smith" persona. Maybe "GoogleBot PriceVerifier" would have been a better choice.

Edit: removed my inaccurate confusion about something, and fixed quotes that I'd copied from a plagiarized version of the article.
Animats almost 5 years ago
Now even the WSJ has clickbait titles. Should have been "Google price-checking system annoying merchants".
abofh almost 5 years ago
Google.

Saved you a click.
bravoetch almost 5 years ago
TL;DR: it's a Google bot