TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

What the SATs Taught Us about Finding the Perfect Fit

113 pointsby elsherbiniover 7 years ago

8 comments

lsiebertover 7 years ago
One of the great joys of my bachelor&#x27;s degree in psychology was being invited to take a graduate level course on Item Response Theory (with Professor Jack Vevea, now at UC Merced). I wouldn&#x27;t have fallen in love with programming and become a software developer if I hadn&#x27;t taken it. 1<p>The Rasch Model is a specifically simplified case of item response theory, but I&#x27;d argue that it may not be the best one for stitch fix. That&#x27;s not to say that it can&#x27;t be useful, but rather that the simplifications and assumptions of the Rasch model may lead to information that does not reflect the customer&#x27;s measurements as well as a more sophisticated model could. Of course it very well may be good enough, but it serves as a somewhat useful exploration of the<p>The Rasch model is an attempt to differentiate two associated sets of information, the latent trait of the test taker&#x2F;question answerer (in this case, their measurements) and the difficulty of the question (in this case, whether the item is too big, too small or just right). Basically the Rasch model treats the level of a latent trait of an individual as a function of the difficulty of a test question and what they answered.<p>But the model purposely ignores the question of the discrimination of the question, that is, how good is the question at differentiating between those who&#x27;s latent trait differs, and just assumes that the discrimination (the slope of the line reflecting the model of the question&#x27;s difficulty) is not relevant. Other models see this as relevant.<p>For example, if StitchFix offers a belt with a number of different holes, some people may feel the belt is too small if they are forced to use the last hole, some the second last hole. A question about such a belt that just asked if it was too large, too small, or just right might have low discrimination in terms of identifying an individuals underlying size. Likewise someone who has bigger thighs but a relatively slim torso might have different answers about a pair of slim fit pants of size x which are too small for their thighs, and a belt of size x. Thus questions about pants may have a higher discrimination then questions about a belt.<p>Item Response Theory outside of the Rasch also has a third factor to consider on a per question and per individual basis, which is basically the propensity to guess. Basically, how likely is someone to think carefully about the question as opposed to just putting down a random answer , and likewise are some questions more likely to have people answer blithely instead of earnestly.<p>The other thing to consider is that in most IRT tests, the latent trait is assessed at a single time for multiple questions. But weight&#x2F;fit&#x2F;measurements are here being assessed item by item, as they are tried, and the underlying fit may be changing if a person is gaining weight or bulk, retaining water, or recovering from thanksgiving dinner. While it&#x27;s unlikely that someone&#x27;s weight or size would change radically in a brief period, a model that weighed items that were tested more recently might better reflect the individual&#x27;s measurements.<p>Of course it&#x27;s been years and years since I took the class, so any screwup in this comment should reflect on me, and not my professor.<p>1 I was writing a function in R to speed up an IRT model fitting a curve in a way that let me do it in seconds instead of hours (It&#x27;s been a while but I think it was identifying the point of the curve where the slope is maximized), in any case it was a time consuming computation if you check every possibility linearly to 6 decimals for hundreds of test takers, but I figured that there weren&#x27;t local maximums and optimized with something like a binary search (but by decimal place), before I had ever heard of binary searches, and getting that sort of efficiency jump was deeply satisfying.
评论 #15927053 未加载
评论 #15927045 未加载
wencover 7 years ago
Truly good sizing is a much more complex problem than can be solved through recommendation systems. You can get incrementally better at it, but nothing beats actually trying something on at the store. Problems that cannot be solved through recommendations alone include:<p>1) Body types that aren&#x27;t average (all of our bodies are irregular, but some of us are more irregular than others). Clothes sizing are based on a average model of the human body.<p>2) Fabric interaction with body. Softer fabrics drape in different ways from more rigid fabrics. Also how loose or muscular your flesh is can affect fit if you&#x27;re in the market for really slim fits (European style).<p>3) Non-representative snapshots: your measurements in the morning will differ from other times of day, and it will differ over the course of weeks and months, even if your diet is stable.<p>4) Shrinkage&#x2F;expansion: depending on material, there is some shrinkage or expansion after the first wash&#x2F;wear. Although this is mostly a known quantity and good clothiers account for this.<p>Really good bespoke tailors understand these principles, and make allowances for them as they build your clothing. Also they know to put elastic fabrics in the right places so the suit will still fit even after a large meal.<p>I think the one way we can get closer to a good fit while being remote is to have pop-up stations&#x2F;kiosks where we can get multiple 3D scans of our body on separate occasions (kinda like how most bespoke tailors require at least 3 fittings). That still doesn&#x27;t account for fabric-body interactions, but it gets us a lot closer than recommendation systems.<p>p.s. the other problem is cultural. Most Americans don&#x27;t know or care quite as much about fit (because it&#x27;s not as prized in the culture) as their European counterparts, so their data is going to be skewed slightly towards the left end of the competence curve.
评论 #15923967 未加载
评论 #15923994 未加载
sneguover 7 years ago
I find it surprising they do all this math since the model they seem to use is &quot;Send everybody tent-like shirts that would fit a horse.&quot; After repeated requests for more fitted styles, I kept getting tents (although eventually smaller size tents).<p>I understand why they do this, because this style is likely to fit more people. But if the technique they describe in this post actually works, perhaps they could be more adventurous?
pjc50over 7 years ago
Related: <a href="http:&#x2F;&#x2F;sizes.darkgreener.com&#x2F;" rel="nofollow">http:&#x2F;&#x2F;sizes.darkgreener.com&#x2F;</a> recording the discrepancy between label size and actual fit for a lot of UK high street shopping.<p>The variance in label sizes is bad enough - and worse for women&#x27;s clothing than mens - but when you get into &quot;large&#x2F;medium&#x2F;small&quot; it&#x27;s just a lottery. Especially if you&#x27;re a westerner ordering direct from China.
评论 #15927372 未加载
评论 #15928601 未加载
jatsignover 7 years ago
Stitchfix has done a good job learning my size, but I wish I actually could &quot;shop&quot; at stichfix. When you get a box, there&#x27;s a foldout paper that shows you examples of what you could wear these new clothes with...but I don&#x27;t own any of those thing.<p>I wish they had some sort of follow up experience that would let me &quot;complete&quot; the outfit.<p>They talk a bit about why they don&#x27;t have any sort of online shopping here:<p><a href="http:&#x2F;&#x2F;multithreaded.stitchfix.com&#x2F;blog&#x2F;2015&#x2F;07&#x2F;07&#x2F;personalizing-beyond-the-point-of-no-return&#x2F;" rel="nofollow">http:&#x2F;&#x2F;multithreaded.stitchfix.com&#x2F;blog&#x2F;2015&#x2F;07&#x2F;07&#x2F;personali...</a>
robterrinover 7 years ago
They used Stan! <a href="http:&#x2F;&#x2F;mc-stan.org&#x2F;" rel="nofollow">http:&#x2F;&#x2F;mc-stan.org&#x2F;</a><p>Gelman and the rest of the crew at Columbia are doing great work. Check out <a href="https:&#x2F;&#x2F;www.generable.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.generable.com&#x2F;</a> too.
评论 #15925654 未加载
not_that_noobover 7 years ago
Nice! Using IRT for clothes sizing is indeed innovative.<p>One factor that is difficult to account for is users not telling you the truth. Users may feel embarrassed to say something is too small, as that may have negative connotations for some. It’s difficult of course to control for that, but I wonder how honest people might be in their feedback.
perseusprime11over 7 years ago
The simple model didn&#x27;t work for me. They sent me shirts and pants that did not fit.
评论 #15923328 未加载
评论 #15923343 未加载