On the proper use of this data:<p>Perhaps the easiest way to think of it is that the phrases are predictors for the race/sex, not the other way around. For example, you shouldn't expect every white male you meet to like Van Halen. However if someone says to you "I have a friend who's a big Van Halen fan", you're pretty safe in assuming that the friend is a white male.<p>Likewise, it might be that only 10% of blacks like soul food. But if almost no other demographics like it, it will still show up high on their list. So "is black" does not strongly imply "loves soul food", but "loves soul food" does strongly imply "is black".<p>In other words, <a href="http://en.wikipedia.org/wiki/Bayes_theorem" rel="nofollow">http://en.wikipedia.org/wiki/Bayes_theorem</a>