TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

The "overlearning the game" problem

362 pointsby sendosover 13 years ago

20 comments

wistyover 13 years ago
It's related to Goodhart's Law:<p><i>Any observed statistical regularity will tend to collapse once pressure is placed upon it for control purposes.</i><p>This is often the result of attempting to overoptimize a system. You can optimize a race car to a huge degree, because you know <i>exactly</i> what you want it to do.<p>You can't optimize a schooling system, because you <i>don't</i> know exactly what you want it to do. A little noise is a good thing, because the you want a little wiggle room for teachers to sidestep the dictums of education czars, and students to sidestep the dictums of teachers.<p>The Greeks solved this quite a few years ago, with sortition. Under sortition (injecting noise into elections - <a href="http://en.wikipedia.org/wiki/Sortition" rel="nofollow">http://en.wikipedia.org/wiki/Sortition</a>), Bush and Gore would have been forced to pay "paper, scissors, rock" for the presidency. Under the US's more pure democracy, they would have been tempted to make all kind of Faustian bargains with sordid players to nail down the last 0.01% of votes.<p>Randomization means that the last percent is just not worth chasing, so players in a competition won't be tempted to bend the rules for a tiny advantage.<p>The same process could be used for tests. If you allocate places in desirable courses (say medicine) randomly to anyone above a certain score, the top students won't bother drilling as hard just to get the top score.<p>Stocks are the same - quants wouldn't sweat timing as much if their placement in order books was randomized. It would be more efficient to pay attention to fundamental value than momentary fluctuations if they weren't guaranteed to make large profits on the momentary fluctuations. Some would still work on timing, but not as many.<p>Patents are just bad policy badly implemented at the moment, not over-optimized.
评论 #2936235 未加载
评论 #2937393 未加载
评论 #2936501 未加载
评论 #2936307 未加载
评论 #2937421 未加载
评论 #2936169 未加载
评论 #2937547 未加载
评论 #2939204 未加载
评论 #2940600 未加载
评论 #2936675 未加载
评论 #2939857 未加载
drewcrawfordover 13 years ago
If you pay a man by the hour, he'll work a lot of hours. If you pay him by the brick, he'll lay a lot of bricks.<p>These "games" are basically the equivalent of counting lines of code or checkins. We're measuring poor proxies instead of the things we're actually interested in. The solution isn't an arms race to build bigger and better proxies, the solution is to measure real things instead of artificial ones.<p>Here's just one example of what I mean by "measure real things". Electing representatives every X years to decide the laws of the land was once upon a time the fairest and best way to have the voices of the masses heard. Today it is feasible to directly poll everybody about every issue, so we no longer need the proxy. If you say everyone cannot be educated about every issue, fine, I can "follow" PG's votes on wall street reform and grellas's votes on IP tort reform and Schneier's votes on TSA etc just by copying their votes on those issues into my ballot, a permission which I can revoke at any time or on a vote-by-vote basis, as easy as unfollowing them on VoteTwitter. This is better than the proxy of professional politicians deciding every issue with fixed terms.
评论 #2936937 未加载
评论 #2936255 未加载
评论 #2936186 未加载
评论 #2936480 未加载
评论 #2936927 未加载
评论 #2937443 未加载
评论 #2936573 未加载
评论 #2936894 未加载
评论 #2940102 未加载
评论 #2940613 未加载
评论 #2942349 未加载
jacques_chesterover 13 years ago
Systems theorists say that "structure predicts behaviour". It's a bit trite, but also deep. Here's an example.<p>The US political system pretty much guarantees bad budgeting. Members of Congress are elected fairly independently. There is no party discipline, so each member will operate independently to maximise pork. This encourages horse-trading within and across party lines; nobody can be forced to give up something for a general good.<p>There's more: there's no incentive to balance the budget. The Executive's separation means that Congress does not need to concern itself with proper administration; it only doles out the cash. It has every incentive to ... maximise pork.<p>The dynamic behaviour of American government arises from the static structure. The drafters of the constitution drew on their knowledge of history and current affairs to try and avoid certain pitfalls. The US Constitution was state-of-the-art when it was written. It's less so now.<p>Countries where the Executive is formed out of the Legislative -- the Westminster system -- tend to have much stronger party discipline, because that discipline is required to pass budgets, enact legislation and to form the Executive. This tends to almost eliminate horse-trading, except between parties and independents. It's not perfect -- whole parties can engage in pork too -- but when policy emerges that benefits the many at the cost of a few, countries with party discipline will find it easier to adopt than those without.<p>Australia, which has the amongst the toughest party discipline in the democratic world, is also a reform leader. And I think a lot of that is explained by our constitutional arrangements.
loup-vaillantover 13 years ago
I think "overlearning" is not a good term (edit: it is quite accurate, though), because it tend to suggest the worst solution of all: that we <i>refrain</i> from learning. I prefer "lost purposes"[1].<p>The primary purpose of a game is generally to Have Fun. This purpose is lost if you have "solved" the game. The stated purpose of patents is to foster innovation. However it doesn't work[2]. The purpose of schools is learning. However its methods are flawed [3,4]. And so on.<p>The trick is to know your goals, and then find out means to best achieve them. The author said:<p>&#62; <i>But, in real life, we need to keep "playing the game": we need to have elections, and protections for inventors, and laws that govern society, and a market where companies can raise money.</i><p>But the <i>actual</i> goals are different: We don't need election, we need a working democratic system (which may, or may not, mandate elections). We don't need protections for inventors, we need innovations. We don't need laws, we need a fair and working society. We don't need a market where companies can raise money, we need a working economy.<p>Well, I could attempt recurse further up until pure morality, but that would be intractable. But at least you get the idea. If something looks broken, think about its ultimate purpose before you try to fix it.<p>[1]: <a href="http://lesswrong.com/lw/le/lost_purposes/" rel="nofollow">http://lesswrong.com/lw/le/lost_purposes/</a><p>[2]: <a href="http://www.dklevine.com/general/intellectual/against.htm" rel="nofollow">http://www.dklevine.com/general/intellectual/against.htm</a><p>[3]: <a href="http://vimeo.com/5513063" rel="nofollow">http://vimeo.com/5513063</a> (Dr Tae)<p>[4]: <a href="http://www.khanacademy.org/" rel="nofollow">http://www.khanacademy.org/</a>
评论 #2936431 未加载
评论 #2938104 未加载
Joakalover 13 years ago
Hey USA, your first past the post system actually does tend towards dual party governments [0]. Compare it to preferential system where voters can pick several parties or vote for one party for the preferences listed.<p>For example, look at this: <a href="http://www.abc.net.au/elections/federal/2010/guide/deni.htm" rel="nofollow">http://www.abc.net.au/elections/federal/2010/guide/deni.htm</a> With USA's system, Australia Labor Party would have got re-elected.<p>There's still improvements to be made for democracy that doesn't need to involve such fancy technology yet [1].<p>[0] <a href="http://en.wikipedia.org/wiki/Duvergers_law" rel="nofollow">http://en.wikipedia.org/wiki/Duvergers_law</a><p>[1] <a href="http://techpp.com/2011/06/08/homomorphic-encryption/" rel="nofollow">http://techpp.com/2011/06/08/homomorphic-encryption/</a>
shubberover 13 years ago
I'm reminded of Mechanism Design: a Nobel prize winning theory of economics that starts with the supposition that agents in any system will exploit its rules to maximize their personal gain. The corollary, which I've been quite taken by, is that where we can influence the rules of the game, we should design them such that exploitation serves a social good.
erikbover 13 years ago
Many people see this from a macroeconomic or political point of view. So I think adding a more philosophical point of view might also be of value. Of course, because it is philosophy, I can only present my own point of view. There is no right or wrong.<p>For me, as a Zen student, the solution is to just accept that that it is, what it is. No system (shape, for Zen students) can be perfect. It is created to solve a problem and with time starts to fail badly at doing anything about the problem. Then another system is created by someone else and the cycle restarts. You might think for example that the stock market or democracy is a thing that doesn't change, because it exists longer then you live. But in the end it will change. We had different political systems before and we will develop different systems in the future.<p>So in the end the system is one of the most important things we have, because it gives us something to base our decisions and actions on and goals to strive for. But also the system is nothing, just an illusion we create for us, maybe based on how we understand the illusions other people created for themself.<p>The thing that is interesting to me personally is that every system itself is instable and will change or die in the end. But the life cycle of a system, what it does for us and doesn't, that all will always stay the same and even though we try to change that, we will never succeed. So while it changes a lot in one way it is totally unchangable in another. But that just as a side note.
groanerover 13 years ago
The cynic inside me would extend this problem to the entire economy or even human society itself. Any economic transaction is supposed to benefit both parties: the buyer offers an amount he is willing to pay for what he wants, the seller provides his goods or services at a price he is willing to accept. As PG might say, wealth is created in tandem with creating stuff that people want.<p>The problem is that some elements in society have become extremely effective at creating a perversion of "stuff people want" for their own benefit. They exploit loopholes in the system, whether by preying upon the poorly-informed, shifting costs onto externalities that we can't price properly, engaging in corruption, or producing items of questionable actual value but very attractive perceived value. They take advantage of our desire for easy answers to our problems with minimal effort expended.<p>Case in point: Ponzi schemes, coal power plants, Halliburton, the tobacco industry, Zynga, and even to an extent, religion<p>We've tried to control this with laws, education, and social norms, but ultimately it seems the invisible hand reigns supreme.
drhouse_mdover 13 years ago
This article seems to be addressing 'the spirit' of a document or system.<p>Perhaps another way to look at the problem is to imagine creating an A.I. that you want to succeed at whatever system you present. In most cases, an A.I. will take the literal interpretation of the system and become a test-taker, an electable 'gotcha-game' politician or even an entity that finds it can maximize game theory to its own ends by complicating the rules of an existing system to the point of absurdity once it becomes powerful enough to modify and create rules.<p>So then how do we create systems resistant to beings that take everything literally? I suppose the only way is to reward certain outcomes as opposed to rewarding the direct product of the system itself.<p>Examples: After an election, have we elected someone who has met with a high degree of favorability in the electorate by the end of his term?<p>After having students become proficient test-takers, do they then become excellent doctors, lawyers, engineers, etc.. ?<p>In a game invented to be fun or fair, once overlearned, do they produce fun or fairness?<p>If not, then a new game needs to be created or the existing one might need to be extensively modified to produced the desired product. This is where the internet shines, where everyone is welcome to take an existing system and modify it to something better. The problem with politics, law, stock market, etc... is that they have become the only method adopted in real society (there is only one game in town).<p>If reality were allowed to adopt, incorporate or evolve from systems/games from virtual reality (i.e. internet) there might be some productive change. But first we would need to see the first step taking place, that being even flawed virtual systems are allowed to manifest in significant proportion within real life society.<p>Don't hate the game, hate the player. Every game that profits a winner will have its cheaters.
DanielBMarkhamover 13 years ago
We run into this situation in programming quite a bit. The answer is refactoring: taking the principles and patterns that work, streamlining them, ditching the cruft that works for a few edge cases but mostly gets in the way, simplifying the underlying metaphor of the system, and re-asssembling.<p>The only way to do this with much larger systems, such as systems of governance, is revolution or exploring new lands. Personally I'm a bit concerned about revolution -- the assumption with refactoring is that the people refactoring understand what the "good" parts are and what the "bad" parts are. They also need to be able to generalize and simplify in order to keep the system cognitively approachable. In my experience, it's very easy to be angry-tear-down-the-system guy, very difficult to actually refactor. As the author points out, it's not that these systems are entirely useless. The hell of the thing is that the reasons for creating these systems are still very valid.<p>I remain convinced that programmers (hackers) have a lot to add to the discussion when talking about complex, brittle systems. After all, we spend a lot of time both working with them and fixing them. To me, programming and systems architecture is applied philosophy. Very cool stuff.
评论 #2936939 未加载
kragenover 13 years ago
Less Wrong talks about this a lot; their term for one variant of it is "superstimuli".<p>What's the difference between your "overlearning" and "hacking"? They sound like the same thing.<p>This phenomenon is the reason for the Wikipedia rule, "Ignore All Rules".
评论 #2936111 未加载
评论 #2938794 未加载
Jun8over 13 years ago
Very interesting post!<p>However, I got hooked on thinking about the childhood game that he mentions briefly in the introduction. Asking more and more questions about an object and wanting all of them to be consistent is a good description of mathematics. According to Godel, both teams are <i>bound</i> to lose, because you cannot create a system of descriptions/properties about a system that are self-consistent (as their number increases), there will always be questions to a team whose answer will be inconsistent with the previous set of questions. The game is then to see which team can push the inevitable further.<p>IDEABOLT: It would be interesting to develop a program that plays this game. Each answer could be stored as an RDF statement in database.
aconbereover 13 years ago
There was a book written in the 80's that covered the overarching philosophical discussion on this kind of thinking about games. The author broke games into two categories: finite and infinite.<p>Finite games like the one invented by the children described by the OP have end states, they have winners, and the goal within them are always framed from that perspective. While infinite games have no winners, have no end, and the goals are often framed around ensuring that the game never ends.<p>It's easy to think of finite games in our lives, they are everywhere in our society, and the OP does a good job of pointing out some of the less obvious cases. Infinite games are less obvious, the one that I found most illuminating was "language", a game where people actively collude to extend the game to the end of time, and has no winner.<p>Anyay, it's an interesting read, but a little bit mumbo-jumbo by the middle of the book. I would recommend it.<p><a href="http://www.amazon.com/Finite-Infinite-Games-Vision-Possibility/dp/0345341848/ref=sr_1_1?ie=UTF8&#38;qid=1314621221&#38;sr=8-1" rel="nofollow">http://www.amazon.com/Finite-Infinite-Games-Vision-Possibili...</a>
resatoriover 13 years ago
I think it is time we recognize that it is not the systems that shape our world but the individuals.<p>There is no optimal system, be it political, economic or whatever.<p>You can always find loopholes.<p>I think it is time we take responsibility for what we do - then there is no need for a better system.<p>The patent system is not responsible for people attacking each other - its the people.
One_adm12over 13 years ago
Great post. I face this everyday in my job at an "old media" company. There are policies, rules and constraints in place which served some purpose at some point in time, and haven't been challenged for years. Unlearning the rules of established games is almost as important as evolving them. Everything should be questioned and challenged to find the root of the point of the rule/law/restriction and if it doesn't make sense any longer, throw it out.
thewisedudeover 13 years ago
Doesn't "Exploiting the System" capture the same idea? I think basically what people are doing here is understanding the core weakness that is intrinsic to the system(which probably is not perfect) and using(abusing) it to their advantage.
donaqover 13 years ago
Regarding politics, it is interesting to note that the people have not, for their part, also overlearned the game of electing the best man for the job. I wonder why this asymmetry exists?
useflyerover 13 years ago
Have you noticed that yesterday every comment was concise, and now that applications are open, every comment is elaborate, long-winded, and footnoted?
stretchwithmeover 13 years ago
Seems most of the things mentioned are an inevitable result of centralizing and expanding government power. Someone is being gamed alright.
Greedy_Foolsover 13 years ago
People are predators and that's what predators do, never stop searching for what they need. Predatory behavior is the problem, and there is little will in successful predators to outlaw there own behavior.