TechEcho

6 comments

wuliwong almost 11 years ago

They have overstated the naivety of the "naive" approach.

The article states:

'directly fetch all the photos that the user followed from a single, monolithic data store, sort them by creation time and then only display the latest 10'

That isn't how the query would work. It implies that the query would return all the results, when clearly it would return only 10. I haven't read all the details, and I'm sure Instagram does something better than this basic SQL now, but it is silly in a technical article to overstate the problem in such an obviously incorrect way.
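
To make the point about the result size concrete, here is a minimal sketch of the kind of "basic SQL" being described, run against an in-memory SQLite database. The table names, column names, and helper functions are all hypothetical and are not taken from the article or from Instagram's schema. The query sorts by creation time and uses `LIMIT`, so only ten rows are returned to the application, although the database may still have to do considerable work to find them.

```python
import sqlite3


def build_demo_db():
    """Create a throwaway in-memory database with a hypothetical schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE follows (follower_id INTEGER, followee_id INTEGER);
        CREATE TABLE photos (
            id INTEGER PRIMARY KEY,
            author_id INTEGER,
            created_at INTEGER
        );
        CREATE INDEX photos_author_time ON photos (author_id, created_at);
        """
    )
    # User 1 follows authors 2 and 3, but not author 4.
    conn.executemany("INSERT INTO follows VALUES (?, ?)", [(1, 2), (1, 3)])
    # 50 photos spread evenly over authors 2, 3 and 4.
    conn.executemany(
        "INSERT INTO photos (id, author_id, created_at) VALUES (?, ?, ?)",
        [(i, 2 + i % 3, 1000 + i) for i in range(50)],
    )
    return conn


def latest_feed(conn, user_id, limit=10):
    """Return the newest `limit` photos from accounts that `user_id` follows."""
    return conn.execute(
        """
        SELECT p.id, p.author_id, p.created_at
        FROM photos AS p
        JOIN follows AS f ON f.followee_id = p.author_id
        WHERE f.follower_id = ?
        ORDER BY p.created_at DESC
        LIMIT ?
        """,
        (user_id, limit),
    ).fetchall()


if __name__ == "__main__":
    for row in latest_feed(build_demo_db(), user_id=1):
        print(row)
```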

carbocation almost 11 years ago

In short, the feeds are heavily denormalized and are constructed when new photos are added by people you follow, rather than at request time.

They seem to favor spending disk space in order to save on processing time and memory.
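
The write-time construction summarized above can be sketched roughly as follows. This is only an in-memory illustration under stated assumptions: the `FeedStore` class, its method names, and the `FEED_LENGTH` cap are hypothetical, and plain dictionaries stand in for whatever store and task queue the real system uses. Each new photo is pushed into every follower's stored feed when it is published, so reading a feed is a simple lookup.

```python
from collections import defaultdict, deque

FEED_LENGTH = 10  # assumed cap on stored entries per feed


class FeedStore:
    """Toy fan-out-on-write feed store (hypothetical, in-memory only)."""

    def __init__(self):
        # author_id -> set of follower ids
        self.followers = defaultdict(set)
        # user_id -> newest-first photo ids, trimmed to FEED_LENGTH
        self.feeds = defaultdict(lambda: deque(maxlen=FEED_LENGTH))

    def follow(self, follower_id, author_id):
        self.followers[author_id].add(follower_id)

    def publish(self, author_id, photo_id):
        # Work happens at write time: one insert per follower.
        # When a feed is full, appendleft drops the oldest entry on the right.
        for follower_id in self.followers[author_id]:
            self.feeds[follower_id].appendleft(photo_id)

    def read_feed(self, user_id):
        # Request time: no joins or sorting, just return the stored list.
        return list(self.feeds[user_id])


if __name__ == "__main__":
    store = FeedStore()
    store.follow(follower_id=1, author_id=2)
    for photo_id in range(15):
        store.publish(author_id=2, photo_id=photo_id)
    print(store.read_feed(1))
```

The trade-off is the one the comment names: every follower gets their own copy of the feed entry, which costs storage, in exchange for cheap reads.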

megaman821 almost 11 years ago

It seems that these problems would be better solved at the database level. It is just that the open source databases lack good implementations of materialized views and data replication to slaves. Instead you have this more fragile solution involving Postgres, RabbitMQ, and Redis to accomplish nearly the same thing.

acoyfellow almost 11 years ago

This post is great, but it's from April 2013. I wonder how their architecture has changed since then?

lobster_johnson almost 11 years ago

Were they only using two RabbitMQ brokers? I have a habit of running one per machine, which allows every app to simply connect to localhost; but on virtualized clusters, RabbitMQ gets a network partition almost every other day, and having just two would obviously cut down on the number of partitions.

alttab almost 11 years ago

If they are using EC2 hosts, I wonder why they don't use AWS SQS/SNS for their queue systems and Dynamo for their materialized views.

How Instagram Feeds Work: Celery and RabbitMQ
