I'm suspicious about the truth of this claim. I don't think bidirectional typechecking is the problem in itself: the problem is trying to do type inference when you have some extremely flexible operator overloading, subtyping, and literal syntax features. It's these "expressive" features which have made type inference in Swift far more computationally complex, not type inference itself.

And certainly not *bidirectional type inference*; this post's definition of the concept isn't even right (bidirectional typing refers to distinguishing the typing judgements used to infer types from those used to check types, not to moving bidirectionally between parent and child nodes). I don't know whether the mistake comes from the post author or Chris Lattner, whether the word "bidirectional" is even relevant to Swift's typing, or whether Swift has a formal description of its type system at all, let alone a bidirectional one.

EDIT: watching the video the Chris Lattner quote comes from, it appears the mistake about the word "bidirectional" is his. Bidirectional type systems are an improvement over ordinary type systems in exactly the direction he desires: they distinguish which direction typing judgements can be used in (to infer or to check types), whereas ordinary formal descriptions of type systems don't make this distinction, causing the problems he describes. "Bottom up" type checking is just a specific pattern of bidirectional type checking.

Regardless, the problem with Swift is that a literal can have any of an unbounded number of types implementing a certain protocol, all of which have supertypes and subtypes, and the set of possibilities grows combinatorially because of these language features.

cf: https://arxiv.org/abs/1908.05839
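To make that last point concrete, here's a tiny sketch (Celsius is a made-up type): every literal in Swift is polymorphic over a protocol, so each one fans out the search.

    // An integer literal has no inherent type; it can become any type in
    // scope that adopts ExpressibleByIntegerLiteral, and there can be
    // arbitrarily many of those.
    struct Celsius: ExpressibleByIntegerLiteral {
        let degrees: Int
        init(integerLiteral value: Int) { self.degrees = value }
    }

    let a: Int = 42       // the literal becomes an Int
    let b: Double = 42    // the same literal becomes a Double
    let c: Celsius = 42   // ...or any user-defined conformer

    // In an unannotated expression like `let x = 42 + y`, the solver must
    // consider every conformer in scope (times every overload of `+`)
    // before it can commit to one.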
One of the interesting tradeoffs in programming languages is compile speed vs everything else.

If you've ever worked on a project with a 40-minute build (me), you can appreciate a language like Go that puts compilation speed ahead of everything else. Lately I've been blown away by the "uv" package manager for Python, which not only seems to be the first correct one but is also so fast I can be left wondering if it really did anything.

On the other hand, there's a less popular argument that the focus on speed is a reason why we can't have nice things, and that for people working on smaller systems, languages should be focused on other affordances, so we have things like

https://www.rebol.com/

One area I've thought about a lot is the design of parsers: for instance, there is a drumbeat you hear about Lisp being "homoiconic," but if you had composable parsers and your language exposed its own parser, and if every parser also worked as an unparser, you could do magical metaprogramming with ease similar to Lisp. Python almost went there with PEG but stopped short of it being a real revolution because of... speed.

As for the kind of problem he's worried about (algorithms that don't scale), one answer is compilation units and careful caching.
I'm a big fan of the idea of Swift as a cross-platform general-purpose language, but it just feels bad without Xcode. The VS Code extension is just okay, and all of the tutorials/documentation assume you are using Xcode.

A lot of the issues Swift is currently facing are the same ones C# had, but C# had the benefit of Mono and Xamarin, and in general more time. Plus you have things like JetBrains Rider to fill in for Visual Studio. Maybe in a few years Swift will get there, but I'm just wary because Apple really doesn't have any incentive to support it.

Funnily enough, the biggest proponent of cross-platform Swift has been Miguel de Icaza, GNOME creator and cofounder of Mono, the cross-platform C# implementation that predates .NET Core. His SwiftGodot project even got a shout-out from Apple recently.
They never will, since it's also one of Swift's greatest strengths. What they may, eventually, do is dedicate the resources to minimize the negative aspects of the system while documenting clear ways to mitigate the biggest issues. Unfortunately Apple's dev tools org is chronically under-resourced, which means improvements to the inference system and its diagnostics come and go as engineers are allowed to work on it. Occasionally it will improve, only to then regress as more features are added to the language, and then the cycle continues.
It's really hard for me to read past Lattner's quote: "beautiful minimal syntax" vs "really bad compile times" and "awful error messages".

I know it's not helpful to judge in hindsight, lots of smart people, etc.

But why on earth would you make this decision for a language aimed at app developers? How is this not a design failure?

If I read this article correctly, it would have been an unacceptable decision to make users write setThreatLevel(ThreatLevel.midnight) in order to get great compile times and error messages.

Can someone shed some light on this to make it appear less stupid? Because I'm sure there must be something less stupid going on.
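For anyone who hasn't written Swift, the sugar in question is the implicit member expression, which only works because the expected type flows top-down into the argument (a sketch with a hypothetical API):

    enum ThreatLevel { case low, high, midnight }

    func setThreatLevel(_ level: ThreatLevel) { /* ... */ }

    setThreatLevel(ThreatLevel.midnight)  // fully explicit: checkable bottom-up
    setThreatLevel(.midnight)             // minimal syntax: the type of `.midnight`
                                          // has to flow down from the parameter type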
It looks to me as if there's a solution to this problem based on the precompilation of sparse matrices. I'll explain. If you have a function (or operator) call of the form fn(a, b), and you know that fn might accept 19 types (say) in the "a" place and 57 types in the "b" place, then in effect you have a large 2D matrix of the a types and the b types. (For functions taking a larger number of arguments you have a matrix with larger dimensionality.) The compiler's problem is to find the matrix cell (indeed the first cell by some ordering) that is non-empty. If all the cells are empty, then you have a compiler error. If at least one cell is non-empty (the function is implemented for this type combination), then you ask "downward" whether the given argument values can conform to the acceptable types. I know that there's complexity in this "downward" search, but I'm guessing that the bulk of the time is spent on searching this large matrix. If so, then it's worth noting that there are good ways of making this kind of sparse matrix search very fast, in almost constant time.
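A minimal sketch of that sparse-lookup idea, with made-up TypeID/OverloadIndex names, and eliding all the real complexity (subtyping, literal conformances, the "downward" check):

    // Index a function's overloads by argument-type pair, so "is fn
    // implemented for (A, B)?" becomes one hash lookup instead of a scan
    // over the full 19 x 57 matrix.
    struct TypeID: Hashable { let name: String }  // stand-in for a canonical type key

    struct OverloadIndex {
        struct Pair: Hashable { let a: TypeID; let b: TypeID }

        // Sparse representation: only the non-empty cells are stored.
        private var cells: [Pair: Int] = [:]  // value = index of the winning overload

        mutating func register(a: TypeID, b: TypeID, overload: Int) {
            let key = Pair(a: a, b: b)
            if cells[key] == nil { cells[key] = overload }  // keep the first by ordering
        }

        // Near constant time, instead of enumerating combinations.
        func lookup(a: TypeID, b: TypeID) -> Int? {
            cells[Pair(a: a, b: b)]
        }
    }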
HM works great for me. Let's try it elsewhere instead of blaming the algorithm!

    {-# LANGUAGE OverloadedStrings #-}          -- let string literals turn into any type defining IsString
    {-# LANGUAGE GeneralizedNewtypeDeriving #-} -- simplify/automate deriving IsString

    import Data.String (IsString)

    main = do
      -- Each of these expressions might be a String or one of the 30 Foo types below
      let address  = "127.0.0.1"
      let username = "steve"
      let password = "1234"
      let channel  = "11"
      let url = "http://" <> username
                <> ":"     <> password
                <> "@"     <> address
                <> "/api/" <> channel
                <> "/picture"
      print url

    newtype Foo01 = Foo01 String deriving (IsString, Show, Semigroup)
    newtype Foo02 = Foo02 String deriving (IsString, Show, Semigroup)
    -- ... eliding 27 other type definitions for the comment
    newtype Foo30 = Foo30 String deriving (IsString, Show, Semigroup)

Do we think I've captured the combinatorics well enough?

The *url* expression is 9 adjoining expressions, where each expression (and pair of expressions, and triplet of expressions...) could be 1 of at least 31 types.

    $ ghc --version
    The Glorious Glasgow Haskell Compilation System, version 9.0.2

    $ time ghc -fforce-recomp foo.hs
    [1 of 1] Compiling Main             ( foo.hs, foo.o )
    Linking foo ...

    real    0m0.544s
    user    0m0.418s
    sys     0m0.118s

Feels more sluggish than usual, but bad combinatorics shouldn't make it just *slightly* slower. I tried compiling the simplest possible program and that took `real 0m0.332s`, so who knows what's going on with my setup...
Our CI posts a list of shame with the 10 worst offending expressions on every PR as part of the build and test results.

So far it's working quite nicely. Every now and then you take a look and notice that your modules are now at the top, so you quickly fix them, passing the honour to the next victim.
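For Swift specifically, the compiler can emit the raw data for such a list; a sketch of wiring its long-expression warnings into a SwiftPM target (target name is made up, and these are unsupported -Xfrontend flags):

    // In Package.swift: warn on any expression or function body that takes
    // the type checker longer than 100 ms, so CI can scrape the warnings.
    .target(
        name: "MyApp",
        swiftSettings: [
            .unsafeFlags([
                "-Xfrontend", "-warn-long-expression-type-checking=100",
                "-Xfrontend", "-warn-long-function-bodies=100",
            ])
        ]
    )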
Does anyone know why, anecdotally, it seems like the slowness of type inference is more of a pain point in Swift than in OCaml, ReScript, PureScript, Haskell, etc.?
One could argue that anything that makes the development process itself more efficient, as opposed to the compilation, is worth it, since programmers themselves ain't getting any faster anytime soon. But timing out after more than 40 seconds on a state-of-the-art CPU because of a handful of lines is just ridiculous.
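The standard workaround when that timeout hits is to hand the solver smaller problems, e.g. splitting one literal-heavy expression into explicitly annotated steps (a sketch with made-up names):

    let username = "steve", password = "1234"
    let address = "127.0.0.1", channel = "11"

    // One big expression: the solver has to consider every combination of
    // literal types and `+` overloads at once.
    // let url = "http://" + username + ":" + password + "@" + address + "/api/" + channel

    // Split and annotated: each line type-checks independently as String,
    // so the combinatorics never multiply across the whole expression.
    let credentials: String = username + ":" + password
    let host: String = credentials + "@" + address
    let url: String = "http://" + host + "/api/" + channel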
The times here seem unreasonably bad even with the bad algorithm. Something else has got to be going on. Maybe some kind of hidden factorial complexity when it tries every combination?
The combinatorial explosion is intractable, but since it only seems to come up in really obscure corner cases, I wonder if the typical inference scenarios could be handled by having the compiler cache the AST across invocations, so that inference only needs to be performed on the invalidated parts of the AST as code is being typed, instead of waiting for the user to invoke the compiler.
This article doesn't even mention the new type checker and constraint solver.

The compiler is open-source, and discussed on open forums. Readers would love some summary/investigation into slow-down causes and prospects for fixes.
The math type inference example makes the usual "what if Swift could replace Python" pitch a non-starter. As someone who has to deal with this on a frequent basis, it is pretty sad.

(I maintain s4nnc and a fork of PythonKit.)
I have a feeling it's going to be nearly impossible to replace it without breaking a lot of existing code, since the syntax will have to be a lot more explicit.
This is not a fixable flaw.
Solving these constraints efficiently could definitely get you a Turing Award; it's basically the SAT problem.

And without this type system, Swift is just Objective-C in a prettier syntax, so Apple has to bite the bullet and bear with it.
There are fast type inference algorithms available today, such as MLstruct. [0]

[0] https://github.com/hkust-taco/mlstruct
Wait, in Swift it's illegal to multiply an Int by a Double?? So you would have to explicitly convert index to a Double? I definitely didn't expect that!
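Yes: Swift has no implicit numeric conversions at all, so a typed Int never silently widens to Double; the conversion is spelled as an initializer call (index is a made-up name):

    let index: Int = 3
    let scale = 2.5

    // let bad = index * scale        // error: binary operator '*' cannot be
    //                                // applied to operands of type 'Int' and 'Double'
    let good = Double(index) * scale  // 7.5: the conversion is explicit

    // Literals are the exception: an integer literal has no fixed type yet,
    // so the solver is free to type it as Double here.
    let alsoFine = 3 * scale          // 7.5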
> The issue is caused by using the + operator with the channel Int and a String literal. Thanks to the standard library's 17 overloads of + and 9 types adopting the ExpressibleByStringLiteral Protocol, the swift compiler can't rule out that there might be a combination of types and operators that make the expression valid, so it has to try them all. Just considering that the five string literals could be one of the possible nine types results in 59,049 combinations, but I suspect that's a lower bound, since it doesn't consider the many overloads of +.

This really seems like a design flaw. If there are 59,049 candidate interpretations of a string concatenation, surely either

- one of them should be expressive enough to allow concatenation with an integer, which we can do after all in some other languages,

- or the type system should have some way to express that no type reachable by concatenating subtypes of String can ever get concatenated to an integer.

Is this unreasonable? Probably there's some theorem about why I'm wrong.
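For what it's worth, Swift's idiomatic answer here is string interpolation, which converts the Int for you instead of searching the `+` overload set (a sketch, assuming the article's channel: Int):

    let channel = 11

    // let path = "/api/" + channel + "/picture"  // rejected: no overload of `+`
    //                                            // joins String and Int

    let path = "/api/\(channel)/picture"          // "/api/11/picture"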