Trailing slashes on URLs: Contentious or settled?

95 点作者 josephscott超过 3 年前

17 条评论

brabel超过 3 年前

This is one of those "made up problems" as it only exists because you insist in doing magic, i.e. mapping /foo to either /foo/index.html or /foo.html.I am guilty myself, because we need to do things that match expectation of users, but this problem simply disappears if you stop doing magic.So, the real question is, what "magic" do you prefer?My implementation was, when given a trailing slash, check if the recource is a directory (as this is what the trailing slash signifies) and use the convention of loading the index.html file at that directory if it is. If it's not a directory, then it's a 404.If there's no trailing slash, try to load the file with the equivalent extension to the Accept content-type header. So, if the client wants JSON, you try /foo.json for the path /foo. If client requests HTML, try /foo.html.

评论 #30099863 未加载

评论 #30099357 未加载

评论 #30099942 未加载

neilwilson超过 3 年前

Very much required if you want to use relative URL calculations.Stumbled across this in Go yesterdayCompare a relativeUrl ref from '<a href="http://localhost/1.0" rel="nofollow">http://localhost/1.0</a>' vs. '<a href="http://localhost/1.0/" rel="nofollow">http://localhost/1.0/</a>'e.g. <a href="https://play.golang.com/p/AsUSx6bRWn6" rel="nofollow">https://play.golang.com/p/AsUSx6bRWn6</a>

评论 #30097585 未加载

评论 #30099321 未加载

评论 #30097637 未加载

评论 #30097728 未加载

Flimm超过 3 年前

Trailing slashes are terrible in right-to-left contexts. The trailing slash suddenly teleports to the beginning of the URL visually. Take this example:<pre><code> <div dir="rtl"> http://example.com/example/ </div> </code></pre> This displays as if it were the string:<pre><code> /http://example.com/example </code></pre> It's awful. Much better if everything was configured to leave off the trailing slash.

评论 #30088158 未加载

评论 #30094912 未加载

floatingatoll超过 3 年前

Machines will occasionally add trailing slashes, and they will rarely remove them.Users will occasionally add trailing slashes, and will occasionally remove trailing slashes.So as long as $PATH and $PATH/ map to the same outcome, and redirection from the ‘wrong’ one to the ‘right’ one to the other uses a 307/308 to allow non-GET methods to redirect, then everything will always work out okay.Varying from that recipe in any regard is a source of pain and trauma in every complicated ingress scenario I’ve worked with for twenty years. (Dispatch methods, regular expressions, exact string matches, all of them.)

评论 #30099583 未加载

harshreality超过 3 年前

The common redirects that most webservers do are:/path to /path/ *if and only if /path is a directory in the filesystem*/path/ to /path/index.htm[l] (usually the default of some indexing module, often configurable so you or some other module can add index.php etc.)Redirects can be internal or external. Nginx, for instance, does step 1 via an external redirect and step 2 via an internal redirect, so the final url displayed in the browser when the input was /path would end up as /path/ but not /path/index.html even if that's what's being served. You can, however, combine both steps into one and make it an internal redirect by doing something like "try_files $uri/index.html ..."It's not standard to rewrite:/path to /path.htmlBut almost any webserver can be configured to do so, and various websites and web apps may have reasons for doing that. Then it's merely a matter of understanding the webserver's internal configuration rule order to determine whether it'll redirect /path preferentially to /path/index.html or to /path.htmlThere's no universal right or wrong. Every one of those paths is different, and webservers can choose which to rewrite to which other.

chrismorgan超过 3 年前

> SEO: if your content exists at two (or more!) distinct URL endpoints, it is a SEO no-no. […] You need redirects.Or you could just only serve it from one of the two URLs and let the other 404. (Yeah, static file servers don’t tend to be fond of working this way, but it’s not an unreasonable option if you’re forming a matrix of fundamental possibilities. And indeed, in the analysis, it’s what half the servers do at least some of the time; and it’s what all standard static file serving software that I know does out of the box.)> Vercel, Render, and Azure Static Web Apps: slashless /resource returns content [from resource/index.html] but without redirects, resulting in multiple endpoints for the same content.This is obviously Wrong with a capital W because of how it breaks relative URLs in the file. Surely it should just be considered a bug and fixed (probably to redirect to /resource/)?> Almost everyone agrees that /resource should return content from resource.htmlThis is a biased view because you’re only considering mildly-opinionated static file servers and their configuration. If you’re serving it yourself with things like nginx or Apache httpd, you won’t get this out of the box and must opt into it. (No idea about others like Caddy.)> (When both resource.html and resource/index.html are present and /resource/ is requested.) Netlify redirects to /resource instead.I say of this too that it is obviously a bug. Netlify isn’t just taking an opinionated stance, it’s doing what is fairly unequivocally the Wrong Thing.

评论 #30096120 未加载

评论 #30098690 未加载

评论 #30096087 未加载

newsbinator超过 3 年前

To me, conceptually, trailing slashes are as valuable as trailing commas in English. They're separators, and on the end there's nothing left to separate.

评论 #30081347 未加载

评论 #30094644 未加载

zaphar超过 3 年前

If you automatically serve an index.html when the url is /resource/ I presume you would also serve the same page at /resource/index.html which in practice means that you are again in same content at two different URLs land. I lean more toward the principle of be permissive in what you accept principle here and would serve the same content for: /resource /resource/ and /resource/index.html if presented with the url without doing a redirect. But in all my links or documentation standardize on just one of those. which in practice means that for most crawlers you'll only have one effective URL for the content, while still providing an experience that isn't annoying for users if they happen to type in a trailing slash for the browser.

评论 #30094299 未加载

评论 #30094184 未加载

thraxil超过 3 年前

I've used Django for a long time. Django defaults to adding trailing slashes to make relative URLs easier to implement correctly. I've always found that sensible and useful. I've recently been putting some APIs behind GCP's API Gateway and discovered that their OpenAPI implementation strips trailing slashes: <a href="https://cloud.google.com/endpoints/docs/openapi/openapi-limitations#url_path_templating" rel="nofollow">https://cloud.google.com/endpoints/docs/openapi/openapi-limi...</a>So... I guess no more trailing slashes for me.

jillesvangurp超过 3 年前

Trailing slashes should be optional and the server response should be the same regardless of their presence. At least, that's how I expect my applications to behave. It's the principle of the least amount of surprise. You might be pleasantly surprised that it works if you add the slash as opposed to be mildly annoyed when it gives you a 404. Requiring the slash would be very surprising. The least surprising is that they both work the same way. I see no technical reason for them to behave differently.Some web-servers do this and others require fiddling with to get them to behave that way. But ultimately. the slash is just a separator and not semantically relevant. It's like many languages now allowing trailing commas in lists. It's convenient. The comma does not add an extra element.One place where this comes up in practice is with specifying base URIs in e.g. configuration files. Somewhere else, this base URI is consumed to construct a full URI using a suffix that may or may not have a leading slash.If your base URL is <a href="http://foo(/)" rel="nofollow">http://foo(/)</a> and you want to append (/)bar, you might end up with <a href="http://foobar" rel="nofollow">http://foobar</a>, <a href="http://foo//bar" rel="nofollow">http://foo//bar</a>, <a href="http://foo/bar" rel="nofollow">http://foo/bar</a> depending on what people do on both sides. Or those three with a trailing slash. There is no right answer.The only sane behavior that follows the principle of the least amount of surprise is to make sure that base uri and suffix will be separated by exactly one slash and assume nothing about the presence of leading or trailing slashes. That way, nothing will break or behave unexpectedly if people add or omit a trailing or leading slash.

krobelus超过 3 年前

The lack of canonicalization has caused us tons of trouble on an Angular app. Check out <a href="https://symflower.com/en/company/blog/2021/path-independent-angular/" rel="nofollow">https://symflower.com/en/company/blog/2021/path-independent-...</a> - you'll be laughing (or crying)

archi42超过 3 年前

> The NearlyFreeSpeech.NET member site you are attempting to reach appears to be temporarily unavailable.<a href="https://archive.is/XvE2X" rel="nofollow">https://archive.is/XvE2X</a>

rascul超过 3 年前

/path is different from /path/. They can serve the same content depending on web server config, though. For an example of how they can be different, check <a href="https://news.ycombinator.com/newest" rel="nofollow">https://news.ycombinator.com/newest</a> vs <a href="https://news.ycombinator.com/newest/" rel="nofollow">https://news.ycombinator.com/newest/</a>.

teddyh超过 3 年前

You know what’s better than the “resource” URL leading to resource/index.html? Having the “resource” URL lead to plain resource.html. Then you can still have a directory named “resource” and any relative links in resource.html to “resource/thing” (to reach resource/thing.html) will feel natural, as “thing” is indeed in a subdirectory.(Apache supports this using “MultiviewsMatch”.)

account42超过 3 年前

I like to use /resource/ traling slashes only if the page is a mere index of pages in that directory. If the page has its own content I use /resource without a trailing slash, even if there is a matching directory, because I think it looks prettier - but that is of course purely a matter of taste.

mjul超过 3 年前

When in doubt, check the spec.In this case it is in RFC 2396 "Uniform Resource Identifiers (URI): Generic Syntax" [1].In Section 3 you find this. The forward stroke ("slash") is a separator:<pre><code> URI that are hierarchical in nature use the slash "/" character for separating hierarchical components. </code></pre> [1] <a href="http://www.ietf.org/rfc/rfc2396.txt" rel="nofollow">http://www.ietf.org/rfc/rfc2396.txt</a>

评论 #30099851 未加载

_zooted超过 3 年前

> SEO: if your content exists at two (or more!) distinct URL endpoints, it is a SEO no-no. SE-no-no. SEO-apolo-graphql-anton-ohno (I apologize for nothing). Ahem. You need redirects.Probably not.

评论 #30097002 未加载