> Schemas Are Awesome

There's no reason you can't implement schemas over JSON. In fact, you typically do so implicitly: your code already expects certain things to be present in the data structures deserialized from JSON.

> Backward Compatibility For Free

JSON is unversioned, so you can add and remove fields as you wish.

> Less Boilerplate Code

How much boilerplate is there in parsing JSON? In Python, it's:

    structure = json.loads(json_string)

Now, if you want to implement all kinds of type checking and field checking on the front end, you're welcome to, but letting "get attribute" exceptions bubble up to signal a bad data structure has always appealed to me more. Most of the time I'm writing in Python/Ruby/JavaScript precisely to avoid rigid data structures and boilerplate in the first place.

[EDIT] And for languages where type safety is in place, the JSON libraries frequently let you pre-define the data structure the JSON will be parsed into, giving type safety and a well-defined schema for very little additional overhead.

> Validations and Extensibility

Same as the previous comment about type checking, etc.

> Easy Language Interoperability

Even easier: JSON! And you don't have to learn yet another DSL and compile it down into lots of boilerplate.

I'm not trying to say you shouldn't use Protocol Buffers if they're a good fit for your software, but this list is a bit anemic on real reasons to use them, particularly for dynamically typed languages.
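To make the [EDIT] point concrete, here's a minimal sketch of pre-defining the target structure in Python with a dataclass (the `User` class and its fields are hypothetical, purely for illustration):

    import json
    from dataclasses import dataclass

    @dataclass
    class User:
        # Hypothetical fields, for illustration only.
        id: int
        name: str

    def parse_user(json_string: str) -> User:
        data = json.loads(json_string)
        # A missing key raises KeyError here, which plays the same role as the
        # "let the exception bubble up" strategy described above.
        return User(id=data["id"], name=data["name"])

    print(parse_user('{"id": 1, "name": "Ada"}'))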
Or, another alternative is Cap'n Proto [1] from the primary author of Protocol Buffers v2. It smooths some of the bumps of Protocol Buffers.

[1]: http://kentonv.github.io/capnproto/
Easy language interoperability as a reason to choose Protobuf over JSON? Mainstream languages support both JSON and Protobuf equally well, and the rest tend to support JSON more often than Protobuf.

Free backwards compatibility? No. Numbered fields are a good thing, but they only help in the narrow situation where your "breaking change" consists of adding a new, optional piece of data (a situation JSON handles just as well). New required fields? A new representation of old data? You'll need to write code to handle those cases anyway.

As for the other points, they are a matter of libraries (things the Protobuf gems support and the JSON gems don't) rather than protocol: the OCaml JSON parser I use certainly delivers benefits #1 (schemas), #3 (less boilerplate) and #4 (validation) from the article.

There is, of course, the matter of bandwidth. I personally believe there are few cases where it's worth sacrificing human readability for, especially for HTTP-based APIs, and especially for APIs accessed from a browser.

If reducing the payload size is what you want, I'd recommend gzipped msgpack as an alternative to JSON: encoding JSON as msgpack is trivial by design.
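For reference, a minimal sketch of the gzipped-msgpack route in Python, assuming the third-party msgpack package is installed:

    import gzip
    import json

    import msgpack  # third-party: pip install msgpack

    payload = {"scores": list(range(1000)), "user": "ada"}

    as_json = json.dumps(payload).encode("utf-8")
    as_packed = gzip.compress(msgpack.packb(payload))

    # Savings show up on non-trivial payloads; tiny messages can be dominated
    # by the gzip header overhead.
    print("json:", len(as_json), "gzipped msgpack:", len(as_packed))

    # Round-trips back to the same structure.
    assert msgpack.unpackb(gzip.decompress(as_packed)) == payload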
Reasons not to use Protocol Buffers (in C++ at least):

    1) Doesn't support Visual Studio 2013.
    2) Doesn't support Mac OS X Mavericks.
    3) No "nice" C++11 support (i.e. move constructors).

(These can be at least partly solved by running off svn head, but that doesn't seem like a good idea for a product one wants to be stable.)

With JSON I can be sure there will be many libraries that work on whatever system I use.
We at Spotify used Protocol Buffers extensively and are actually moving away from them; we now consider them 'legacy'. The advantages of Protobufs don't make up for their disadvantages compared to plain JSON.
With JSON you get universal support, simple parsing, developer- and debug-friendly output, much easier mocking, etc.
I think the main advantages are:

    - Network bandwidth/latency: smaller RPCs consume less space and are
      received and responded to faster.
    - Memory usage: less data is read and processed while encoding or
      decoding protobuf.
    - Time: I haven't actually benchmarked this one, but I assume CPU time
      spent encoding/decoding will be smaller, since you don't need to go
      from ASCII to binary.

Which means: all performance improvements. They come, as usual, at the cost of simplicity and ease of debugging.
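A rough way to check those claims on your own payloads; a sketch in Python assuming a hypothetical person.proto already compiled with protoc into a person_pb2 module:

    import json
    import timeit

    import person_pb2  # hypothetical: protoc --python_out=. person.proto

    record = {"name": "Ada Lovelace", "id": 42, "email": "ada@example.com"}
    msg = person_pb2.Person(name=record["name"], id=record["id"], email=record["email"])

    json_bytes = json.dumps(record).encode("utf-8")
    pb_bytes = msg.SerializeToString()

    print("json bytes:    ", len(json_bytes))
    print("protobuf bytes:", len(pb_bytes))

    # CPU-time comparison for the encode path (decode can be timed the same way).
    print("json encode:", timeit.timeit(lambda: json.dumps(record), number=100_000))
    print("pb encode:  ", timeit.timeit(msg.SerializeToString, number=100_000))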
> When Is JSON A Better Fit?

> Data from the service is directly consumed by a web browser

This seems like a key issue to me: you need to know beforehand that this will never be the case, or else you have to make your application polyglot afterwards. That's a risky bet for any business data service.

Maybe it's fine if it's strictly an infrastructure-glue-type internal service. But even then, maybe someone will come along wanting to monitor the thing from a browser.
Not so sure about the "backwards compatibility" part.

From the Protocol Buffers Python docs: https://developers.google.com/protocol-buffers/docs/pythontutorial

"Required Is Forever. You should be very careful about marking fields as required. If at some point you wish to stop writing or sending a required field, it will be problematic to change the field to an optional field – old readers will consider messages without this field to be incomplete and may reject or drop them unintentionally. You should consider writing application-specific custom validation routines for your buffers instead. Some engineers at Google have come to the conclusion that using required does more harm than good; they prefer to use only optional and repeated. However, this view is not universal."

So basically I'm in trouble if I decide to get rid of fields that are no longer necessary but were, for whatever reason, defined as "required" in the past. This can easily lead to bloated protobuf definitions full of legacy fields.

I'll stick with JSON, thanks.
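For contrast, retiring a field on the JSON side is a one-liner for readers; a minimal sketch (the field names are made up):

    import json

    # Old producers sent {"id": 1, "legacy_flag": true}; new producers omit legacy_flag.
    incoming = json.loads('{"id": 1}')

    # Readers supply a default instead of rejecting the whole message.
    legacy_flag = incoming.get("legacy_flag", False)
    print(legacy_flag)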
This is a fine reason to use Protocol Buffers instead of JSON:

    [ 4738723874747487387273747838727347383827238734.00 ]

Parsing that consistently across languages is a nightmare.
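To illustrate in Python: the standard library parser silently coerces that literal to a double, and you have to opt into decimal.Decimal to keep every digit:

    import json
    from decimal import Decimal

    raw = "[ 4738723874747487387273747838727347383827238734.00 ]"

    print(json.loads(raw))                       # parsed as a float; precision is lost
    print(json.loads(raw, parse_float=Decimal))  # kept exact as a Decimal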
Definitely the right direction for performance. My company ended up going with python-gevent and ZeroMQ to implement an asynchronous API server with persistent TCP connections. Our application servers are able to make remote calls over a persistent TCP connection without any noticeable overhead. You could still use JSON, and we tried it, but since we're all Python anyway we decided to just pickle the objects, which is way faster. We looked at Protocol Buffers but found them a bit cumbersome. The setup has been stable for two years and completely solved our scaling problems.
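A minimal sketch of that pattern with pyzmq, whose send_pyobj/recv_pyobj helpers pickle objects for you (the endpoint and request shape are made up):

    import threading

    import zmq  # third-party: pip install pyzmq

    ctx = zmq.Context()

    def server():
        sock = ctx.socket(zmq.REP)
        sock.bind("tcp://127.0.0.1:5555")
        request = sock.recv_pyobj()          # unpickles the incoming object
        sock.send_pyobj({"echo": request})   # pickles the reply

    threading.Thread(target=server, daemon=True).start()

    client = ctx.socket(zmq.REQ)
    client.connect("tcp://127.0.0.1:5555")
    client.send_pyobj({"method": "get_user", "args": [42]})
    print(client.recv_pyobj())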
> There do remain times when JSON is a better fit than something like Protocol Buffers, including situations where:

> * You need or want data to be human readable

When things "don't work", don't you always want this feature? Over a long lifetime, it could really reduce your debugging costs. Perhaps Protocol Buffers has a "human readable mode"; if not, using it seems like a risk.
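For what it's worth, the official Python bindings do include a text format aimed at debugging; a sketch, again assuming a hypothetical generated person_pb2 module:

    from google.protobuf import text_format

    import person_pb2  # hypothetical generated module, as above

    msg = person_pb2.Person(name="Ada Lovelace", id=42)

    # Human-readable dump, handy for logs and debugging...
    text = text_format.MessageToString(msg)
    print(text)

    # ...and it parses back into an equivalent message.
    parsed = text_format.Parse(text, person_pb2.Person())
    assert parsed == msg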
...in Ruby. For some applications.

With Node, I'd have to see a very good argument for why I should give up all of JSON's great features for the vast majority of services. Unless the data itself needs to be binary, I see no reason why I shouldn't use the easy, standard, well-supported, nice-to-work-with JSON.
Anyone tried ProtoBuf.js ("Protocol Buffers for JavaScript") on the client side?

https://github.com/dcodeIO/ProtoBuf.js
What a huge step backwards. We had decades of binary protocols. They sucked. Then everyone moved to non-binary protocols and it was much better. Let's not do it all over again.
I'd rather use Avro. The binary encoding is denser, and there's an officially supported JSON encoding (the text encoding for Protobufs is mostly intended for debugging).
If you convert an object from language X to JSON and validate it against a schema before deserializing, isn't that essentially the same thing? And with JSON you also get human-readable data, which is great when debugging issues. I'm not seeing the advantage of Protocol Buffers. It would be great if you could compare payload sizes and see whether there are significant savings from that perspective.
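A minimal sketch of that validate-before-use step, assuming the third-party jsonschema package:

    import json

    import jsonschema  # third-party: pip install jsonschema

    schema = {
        "type": "object",
        "properties": {"id": {"type": "integer"}, "name": {"type": "string"}},
        "required": ["id", "name"],
    }

    data = json.loads('{"id": 1, "name": "Ada"}')

    # Raises jsonschema.ValidationError if the payload doesn't match the schema.
    jsonschema.validate(instance=data, schema=schema)
    print("valid:", data)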