TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Human-Oriented Automatic Theorem Proving

42 点作者 rck超过 2 年前

5 条评论

pfdietz超过 2 年前
What I would want to see is a project for automatically converting the math literature to machine verifiable proofs. It would have to do NL to formal notation translation, as well as filling in the gaps that are left in any math paper.<p>This is a difficult problem, but if solved it would enable every trick used in every published proof to be internalized into the theorem proving tooling.
评论 #34172576 未加载
hackandthink超过 2 年前
I wondered what this should be about.<p>After all Interactive Theorem Proving is well established and there are great success stories (e.g. Lean Mathlib).<p>Who needs automatic theorem proving that mimics interactive theorem proving?<p>After reading the blog posts on the site:<p>That endeavour is really about understanding what mathematicians do and replace them with an AI.
YeGoblynQueenne超过 2 年前
I&#x27;m familiar with a different kind of automated theorem proving than the one the article is about, resolution-based theorem proving. I am still very uncertain about the differences between resolution-based theorem proving and the kind of theorem proving in proof assistants, so there is much I don&#x27;t understand from the &quot;About&quot; page and others in that site (and from previous readings about proof assistants).<p>For example, this bit from <a href="https:&#x2F;&#x2F;wtgowers.github.io&#x2F;human-style-atp&#x2F;motivatedproofs.html" rel="nofollow">https:&#x2F;&#x2F;wtgowers.github.io&#x2F;human-style-atp&#x2F;motivatedproofs.h...</a>:<p>&gt;&gt; One potential approach to making precise the notion of what I shall call here a rabbit-free proof is to define a set of &quot;move types&quot; -- that is, methods of transforming a problem -- and defining a motivated proof to be one that can be generated using these move types. The idea is then to design the move types in such a way as to make it impossible to generate rabbits.<p>I think I misunderstand what are &quot;move types&quot; (is that the same as &quot;proof steps&quot; in other contexts? I don&#x27;t understand those, either).<p>If I think about &quot;transformations of a problem&quot;, with my mind stuck on resolution, all I can think of is transforming an initial formula to a new formula during a proof, by the application of one or more inference rules. But, in resolution theorem-proving, this kind of transformation can be achieved by successive applications of a single inference rule, the resolution rule. Briefly: given two clauses p ∨ q and ¬p ∨ r, resolution eliminates the contradiction p ∧ ¬p and derives the new clause q ∨ r, then continues until only the empty clause remains at which point the only logical consequence of the original theorem is a contradiction, allowing a proof by refutation. SLD-resolution, restricted to Horn goals and definite clauses is sound and refutation-complete and semi-decidable which is pretty much the best that can be done within a formal system, but the important thing is that a single inference rule, resolution, is doing all this work so the proof is straightforward to automate: even a dumb computer can just spam resolution steps until there are no more contradictions to eliminate. So I don&#x27;t understand why choosing &quot;move types&quot; is an open problem, and why resolution can&#x27;t be used to solve it. This is probably a result of my confusion and misunderstanding of what the other kind of automated theorem proving is all about in the first place.<p>I&#x27;m not trying to be smug. I really don&#x27;t know what this is all about. Can anyone please explain? I&#x27;m also probably missing a lot of the necessary background on this.
评论 #34171901 未加载
zozbot234超过 2 年前
Personally, I still stand by my comment on the earlier thread - the problem of defining a &quot;motivated&quot; proof must boil down to a proof in <i>some</i> well-defined logic (which could of course be a logic <i>fragment</i>) where finding proofs is inherently easy. It looks like this project hasn&#x27;t been engaging all that much with that well-known line of research, seemingly choosing instead to focus on a variety of heuristic questions. I&#x27;m not sure what to make of that.
评论 #34173291 未加载
评论 #34171744 未加载
评论 #34171531 未加载
zozbot234超过 2 年前
Previous discussion: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=31190641" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=31190641</a>