Use links not keys to represent relationships in APIs

342 pointsby saregoabout 6 years ago

39 comments

twblalockabout 6 years ago

Here is the problem with links in a nutshell:> The server is now free to change the format of new URLs at any time without affecting clients (of course, the server must continue to honor all previously-issued URLs).If you have to honor all previously-issued URLs then you aren't changing your format -- you are supporting two formats from now on, the old one and the new one.You can of course tell your users that you will deprecate the old format, but unless you are as powerful as Google your users may prevent you from enforcing a deadline for deprecation.If the URLs in your API responses are FQDNs rather than relative paths, all of this gets significantly harder to deal with.Even if you figured all of that out, links are not idiomatic if your users consume your api via an RPC or GraphQL.

评论 #19893049 未加载

评论 #19891015 未加载

评论 #19890797 未加载

评论 #19890963 未加载

评论 #19890998 未加载

评论 #19891490 未加载

评论 #19890973 未加载

评论 #19893367 未加载

评论 #19891373 未加载

评论 #19892447 未加载

评论 #19900563 未加载

评论 #19892981 未加载

评论 #19890689 未加载

评论 #19929807 未加载

评论 #19892423 未加载

raquoabout 6 years ago

I never had any uncertainty regarding where I can use a given entity id in a well designed API. Fix the naming and organization of your API if that's a problem for your users.Conversely, I often need to log or store API-provided entity ids on my side, and having to parse it out of a URL or store irrelevant URL bytes in my own database would be really annoying.You're not going to avoid the need to compile entity URLs on the client side either, unless you only make requests to entities returned by the API, which would be a weird constraint to design client code around.I really don't see the point to any of this.

评论 #19890216 未加载

评论 #19900147 未加载

评论 #19890611 未加载

WanderingWavesabout 6 years ago

I think this takes an overly-simplitisic view of APIs. Going by the primary example in the article, by representing a pet's owner as a link instead of an id, they're basically discounting the idea that there may be separate endpoints that take in an owner id. For example, if there was an endpoint that let you get the invoices by customer, you would still need to understand the templates for that endpoint.More fundamentally, I think it's trying to solve a smaller problem in the face of a much bigger one, you still need to know what the response of any given endpoint is going to be. Just because they've passed me a link, doesn't me I don't need documentation on what endpoint that link points to. I still need to know that the owner is a link to the people endpoint so I can properly parse that result. That in turn requires just as much documentation (IMO) to describe the relationships as it would to properly document your URI templates.Obviously, the primary reason to use links over ids is to give the developers of the API more control over changing things like routes and Ids and whatnot, but I feel like it is a bit disingenuous to make it out to be a much better user experience or something, since it really isn't.

评论 #19890820 未加载

评论 #19890891 未加载

评论 #19903346 未加载

评论 #19890422 未加载

k_bxabout 6 years ago

There's literally not a single upside of this shown in the article.> The server is now free to change the format of new URLs at any time without affecting clients (of course, the server must continue to honour all previously-issued URLs).No more than it was previously.> The URL passed out to the client by the server will have to include the primary key of the entity in a database plus some routing information, but because the client just echoes the URL back to the server and the client is never required to parse the URL, clients do not have to know the format of the URL.Instead, you now have to require new kind of knowledge, one of the keys which must be present in schema and their meaning. E.g. knowing that "pets" key is present and leads to a relationship of a particular kind, with all the implicit logic added and documented. And what if you want to get pet's owners with some additional parameter, like only getting ones which are exclusively yours? Would you need to edit that "pets" url adding "&exclusively_owned=true"?

评论 #19903138 未加载

sbr464about 6 years ago

I don’t mind using a link, but I’d prefer to have both the exact id and the link to avoid having to parse a link in an unreliable way to get the actual id.It’s interesting how GraphQL changes the base point of the article concerning documentation/ease of api use.In our GraphQL resolvers we typically add two fields, thing_id which resolves to the id string, and thing which resolves an object that you can drill into as desired.I was starting to see a lot of GraphQL APIs only add “thing”, which meant you had to do a lot of queries like below just to get the id.<pre><code> thing { id }</code></pre>

评论 #19890469 未加载

评论 #19890319 未加载

评论 #19903421 未加载

geezerjayabout 6 years ago

Why did the author of this blog post decided to pass web links in resources and completely ignored standard practices such as RFC 8288 which employs the Link HTTP header?<a href="https://tools.ietf.org/html/rfc8288" rel="nofollow">https://tools.ietf.org/html/rfc8288</a>Additionally, compact URIs (CURIES) are also widely used in this context.<a href="https://www.w3.org/TR/2010/NOTE-curie-20101216/" rel="nofollow">https://www.w3.org/TR/2010/NOTE-curie-20101216/</a>I feel that the author tried to reinvent HATEOAS but skipped a cursory bibliographical review and jumped right into reinventing the wheel, and one which has already been reinvented multiple times (HAL, JSON-LD, etc...)

评论 #19890394 未加载

评论 #19890344 未加载

评论 #19890240 未加载

评论 #19892441 未加载

westurnerabout 6 years ago

A thing may be identified by a URI (/person/123) for which there are zero or more URL routes (/person/123, /v1/person/123). Each additional route complicates caching; redirects are cheap for the server but slower for clients.JSONLD does define a standard way to indicate that a value is a link: @id (which can be specified in a/an @context) <a href="https://www.w3.org/TR/json-ld11/" rel="nofollow">https://www.w3.org/TR/json-ld11/</a>One additional downside to storing URIs instead of bare references is that it's more complicated to validate a URI template than a simple regex like \d+ or [abcdef\=\d+]+

评论 #19890710 未加载

Illniyarabout 6 years ago

The idea of using uris instead of keys is not a new one (as has been mentioned by other commenters). Every few years the idea gets a resurgence of people who say that REST apis should be HATEOS and that we are doing it wrong.It seems obvious that the cost-value for this is simply not there, if it was good enough, you'd see developers requesting it and many more vendors implementing it. So far I haven't seen any recent changes that might skew the cost-value towards the uri's favor, only the opposite (cue GraphQl).Using uris have little benefits, but it does have the following problems:As a user of the api:* You need to keep an arbitrary length key in your database if you save references. It can cause some issues with certain setups (less so these days though).* If you keep the entire URI as identity, then you can't use multiple endpoints. For instance lots of companies have an endpoint for production and one for reports - using URI for one endpoint in another is quite awkward.* Working with queries is troublesome, especially with get request. Consider searching for all transaction of a specific account, where the account's identifier is `<a href="https://api.google.com/v1/account/123`" rel="nofollow">https://api.google.com/v1/account/123`</a>* Upgrading to a new version of the api (one with a different url like v1/v2) now not only requires you to change your code to work with the new version, but also migrate all previous ids you kept in your database, which is a much different and more error-prone issue then simply changing code.

评论 #19904588 未加载

kartanabout 6 years ago

The article knowledge has been lost to time. In "relational databases" you always name the foreign key as the relationship between tables.From "A Practical Guide to Relational Database Design" from the year 2000. "Each relationship line should carry a pair of descriptive elements, which define the nature of the association between entities. A name is a single word or descriptive phrase; it should always contain a verb such as: owns, owned by, holds, administered by, etc. Examples from our simple model are: A PART is sold on an ORDER LINE. An ORDER LINE is placed for a PART."But, this has been lost because the practicality is that it is hard to know what is the element. As other comments points.Probably the best is both worlds: PersonId_Owner. PersonId_Veterinary. Or something similar.It seems that such a discussion should have been solved decades ago. And here we are. :)

rhackerabout 6 years ago

I feel like the author has never used graphql. We're also just finally graduating past REST to something more meaningful. This advice feels 15 years late and now totally wrong. An API shouldn't be tied to a protocol like http, it should be able to move on to other things.Ahh I was correct:> I have never used GraphQL, so I can't endorse it, but you may want to evaluate it as an alternative to designing and implementing your own API query capability.You really shouldn't write this giant article without having tried that.

chvidabout 6 years ago

Hypermedia As The Engine Of Application State (HATEOAS)<a href="https://en.wikipedia.org/wiki/HATEOAS" rel="nofollow">https://en.wikipedia.org/wiki/HATEOAS</a>The idea has been around for a while; I personally don't think it is a good idea.There is even a content type (or two) for it: application/hal+json and application/hal+xml.<a href="http://stateless.co/hal_specification.html" rel="nofollow">http://stateless.co/hal_specification.html</a>

评论 #19890454 未加载

评论 #19890645 未加载

sisciaabout 6 years ago

I just don't understand why mixing two different concepts at two different abstraction levels only for some, apparent, simplicity.On one level we got ID unique identifier of a resource, on another level we have URL, how to get a specific resources.They are just different things that shouldn't be mixed.What if tomorrow I want to get the same resource via graphql? Or in a message bus?

评论 #19890387 未加载

评论 #19890374 未加载

EugeneOZabout 6 years ago

What is the source of knowledge for the client about fields, where they can read link to the entity?For human it's obvious that dog has an owner, so field "owner" should be used, but for code - you need to write it, "document it". So if you're going to "document" every field containing link to external resource, you'll end up with even more code, than just "documenting" API endpoints.Also, pretty often you need multiple IDs of entities to send POST/PUT request - just to create a relation.POST /adoption, owner_id=5, dog_id=7.How should it look with links? Will it be issue for the server to parse them? And it's just simple case with 1 to 1 relation, sometimes you need to add sets of objects to another entity.It's a really bad advice and after reading this I'm not sure I should trust other articles from that source.

评论 #19893835 未加载

kabesabout 6 years ago

The document also forgets that API's are not read-only. So let's say you have users and usergroups and you can request a usergroup with its list of users and you can add users to usergroups.If you use links for read, you should also use them for writes, otherwise it's quite inconsistent. So now you need to add a lot of parsing everywhere to extract the id's out of the urls, just for the sake of being more dogmatic

评论 #19905303 未加载

评论 #19893602 未加载

whackabout 6 years ago

I've literally spent 2 years working on a project that did exactly what this article is recommending. There were some places which needed the relative-url as an identifier, and other places which needed the "database id" as the identifier. We constantly had to extract the id from the URL, or convert the id into a URL, and keep a mental map of which format each input was using, and which format was needed for each output. It was a mess. I would personally not recommended this at all.

评论 #19903591 未加载

评论 #19895790 未加载

wvenableabout 6 years ago

The caveats section of this article is longer than the content -- it makes a better case for not using links as keys.

anbopabout 6 years ago

Basically, advocating for dynamic typing rather than static typing, across an API boundary. You’ll save code constructing API requests but need to create a lot of application logic to handle an owner link and pet link separately, since they have different semantics.

gridlockdabout 6 years ago

No.What's the point? None of this is useful to me, all of this is extra complexity. Why would I want to expose every addressable entity through URLs and HTTP? That's not what IDs are for.I'm aware that this fits into the whole REST idea. I still don't care.

austincheneyabout 6 years ago

I am surprised the article didn't mention RDF. In every data facet of RDF the data is uniquely identified by URI. In the case of RDF the URI is merely a unique identifier that can resolve to a HTTP resource, but doesn't have to.

vasilakisfilabout 6 years ago

The fact that JSON is just a format standard and doesn't have specified components (like links etc) but instead we have to built those on top has cost us a lot in APIs. Btw, according to RFC 8288 Web Linking (and before that 5988), a link consists of 3 parts + 1 optional part:"In this specification, a link is a typed connection between two resources and is comprised of:<pre><code> o a link context, o a link relation type (Section 2.1), o a link target, and o optionally, target attributes (Section 2.2). A link can be viewed as a statement of the form "link context has a link relation type resource at link target, which has target attributes". For example, "https://www.example.com/" has a "canonical" resource at "https://example.com", which has a "type" of "text/html".</code></pre> "That's why you need a standardized link component that is globally accepted/understood that takes into account all parts of the linking, instead of having various ways depending on the API/JSON-based Media Type to communicate that something is a link.

评论 #19890898 未加载

vbezhenarabout 6 years ago

I used links but I'm gonna rewrite this code to simply pass IDs. The reason is simple: I need additional configuration for my server to know its hostname and I don't want to do that. May be my server even have few different hostnames for different clients? So I must parse client request and extract Hostname? But it's served via reverse-proxy, so I must do some complex configurations to pass this information. So many issues. But client knows perfectly well which server he's talking to, so he can just append server-base and id. Yes, client must know about its structure, but it's nonsense that client can somehow learn something. I'll code that anyway.May be it makes sense when you're writing an API and some different person writes a client and she's so shy that she don't want to even ask you. Yeah, she can inspect answer and find out that this seems like a link to query further. I never was in that situation, I was always building all software myself, so for me this does not make sense.

theptipabout 6 years ago

I'm not sure about the verdict on URL versioning. I've used header versioning extensively and while flexible, it also carries some big downsides, mainly that it's confusing for new developers, and makes it real hard to casually explore the API in a browser (bad DX). I'm also not sure you do want to encourage mixing v1 and v2 API representations; I have certainly seen cases where it makes progressive upgrade easier, but it can also bring inconsistencies, so having a default new integrator path of "start at v2/login and use whatever links you get" is appealing.I do like the idea from Stripe of having Accept header versioning, but pinning every new client to default to the newest GA version. Gets around most of the DX concerns I raised, but it's a bit more machinery to wire up.

perfunctoryabout 6 years ago

One advantage I see is that now you can do<pre><code> /pets?owner=/people/98765 or /pets?owner=/org/98765 </code></pre> which makes your api more polymorphic.Having said that I don't think URL is the right term to describe this. It's more like a <type, id> tuple.

miguelmotaabout 6 years ago

I wasn't fully convinced by this article. Language specific API wrapper clients can abstract all these complexities. Having links for IDs felt very unnatural but I guess that's because I have never came across an API that uses links like the article suggested

asavinovabout 6 years ago

Conceptual and data modeling aspects of this problem are discussed in [1]. It compares links with joins (and foreign keys) by proposing a solution (concept-oriented model) which does not use joins at all but rather relies on links only.Essentially, a foreign key is viewed as a relational workaround for representing links with some significant drawbacks and the question is why not to use links directly without relational wrapping.[1] Joins vs. Links or Relational Join Considered Harmful: <a href="https://www.researchgate.net/publication/301764816_Joins_vs_Links_or_Relational_Join_Considered_Harmful" rel="nofollow">https://www.researchgate.net/publication/301764816_Joins_vs_...</a>

评论 #19891170 未加载

gigatexalabout 6 years ago

Sometimes I wish REStful/REST the whole idea was a lot more opinionated. Sure you can have opinionated frameworks but nothing is stopping you from using a patch like I would a delete... (not the best example but you get the gist).

abetlenabout 6 years ago

I think the issue brought up in this blog post pales in comparison to the two biggest problems faced when working with REST APIs: querying for nested data, and the limitations of CRUD interfaces to model complex behavior.

评论 #19892035 未加载

schnableabout 6 years ago

How do you write the article and never mention REST, hypermedia or HATEOS once?

评论 #19905439 未加载

i386about 6 years ago

Why on earth would you blow out your request size for the sake of purity? Calling GET /pets is going to return a lot of instances of pet with very similar URLs.

currriuoslyabout 6 years ago

This is what Django REST framework had done right for years.

评论 #19890417 未加载

hit8runabout 6 years ago

JSONAPI Specification also makes use of URLs in links to resources:<a href="https://jsonapi.org" rel="nofollow">https://jsonapi.org</a>

评论 #19895831 未加载

vbstevenabout 6 years ago

What really convinced me about HATEOAS and links was the first time I used a HAL browser and started clicking around to discover an API using only its entry point and navigating from there.From that point on I try to use it as much as possible. A typical API response for my projects looks like this:<pre><code> { "id": "3ccf0f1b-dd3f-48d9-911a-ddf479078c37", "name": "Quantus Tasks", "description": "Quantus Tasks Desktop Application", "license_key_type": "alphanumeric_32", "created_at": "2019-05-12T10:45:42.089406Z", "updated_at": "2019-05-12T10:45:42.089406Z", "_links": { "self": { "href": "http://localhost:8000/v1/applications/3ccf0f1b-dd3f-48d9-911a-ddf479078c37" }, "licenses": { "href": "http://localhost:8000/v1/applications/3ccf0f1b-dd3f-48d9-911a-ddf479078c37/licenses" }, "templates": { "href": "http://localhost:8000/v1/applications/3ccf0f1b-dd3f-48d9-911a-ddf479078c37/templates" }, "apikeys": { "href": "http://localhost:8000/v1/applications/3ccf0f1b-dd3f-48d9-911a-ddf479078c37/apikeys" } } } </code></pre> It still has the ID field in there for cases where the client needs to store the id itself but it should not be used to template URI's for related resources, the links are there for that.

评论 #19891404 未加载

sam0x17about 6 years ago

I think I am missing the core concept here. This still uses IDs, only now you have to grep them out of a URL construct instead of just getting them directly?I don't get the intent at all here, but I have a suspicion whatever problem this tries to solve is better solved by UUIDs or by doing nothing out of the ordinary.

评论 #19892057 未加载

ragerinoabout 6 years ago

Reminds me of HATEOAS see here: <a href="https://en.wikipedia.org/wiki/HATEOAS" rel="nofollow">https://en.wikipedia.org/wiki/HATEOAS</a>Also RDF endpoints usually use resolvable URI's to connect concepts and objects with each other.

polskibusabout 6 years ago

Is the main reason for this is that crawlers could figure out and index more by themselves?

tveitaabout 6 years ago

No one thinks twice about using links for images. You wouldn't make an API that specified images as "image id 2345, which the client can find at /images/{id}"

评论 #19892998 未加载

coding123about 6 years ago

This is all assuming REST is still a good idea.

carmate383about 6 years ago

Why on earth would one trade off a short, static unique identifier for a potentially long, dynamic "link" that essentially binds all data to some crappy API that will be outdated in a few years? Is it really _that_ hard to use keys?

the_arunabout 6 years ago

In micro services world this makes perfect sense. In the legacy land - this is slightly tricky as dependent application (where our foreign key points to) may or may not be in services world. But I get the idea.

评论 #19895878 未加载