科技回声

8 comments

h2odragon, about 1 year ago

https://old.reddit.com/r/sysadmin/comments/1cqn3qa/whats_the_biggest_cpu_core_physical_server_i_can/l3t45a7/

> The whole thing is about 10k APIs that all share the same cluster of 10 databases on the backend, which was never designed to scale like this. This company did 500 million revenue 2010 and now 15 billion this year, all running on this fking sql back end. They have a team of 500 devs writing for these apps, the complexity is unbelievable. No one knows how to untangle it and scale out to micro services.

rbanffy, about 1 year ago

It's hard to give significant advice with this little information: how much time the CPUs spend waiting for memory, how many cache misses are happening, how many core execution units are doing something at any given time, etc.

HPE has single-image machines that can have up to 16 4th gen Xeons, which gives a top limit of 960 cores. IBM has POWER10 boxes that go up to 240 cores, but those are POWER10 cores that can do, IIRC, up to 8 threads per core (increasing cache misses, but reducing unused execution units).
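For scale, the figures above can be worked through as a back-of-the-envelope sketch. The 60 cores per socket is an assumption inferred from 960 / 16, and the 8 threads per core is the commenter's "IIRC" figure, so treat the thread count as approximate.

```python
# Back-of-the-envelope core and thread counts from the figures quoted above.
XEON_SOCKETS = 16
XEON_CORES_PER_SOCKET = 60      # assumption: inferred from 960 cores / 16 sockets
POWER10_CORES = 240
POWER10_THREADS_PER_CORE = 8    # the commenter's "IIRC" figure, not verified here

xeon_cores = XEON_SOCKETS * XEON_CORES_PER_SOCKET
power10_threads = POWER10_CORES * POWER10_THREADS_PER_CORE

print(f"16-socket Xeon box: {xeon_cores} cores")
print(f"POWER10 box: {POWER10_CORES} cores, up to {power10_threads} hardware threads")
```

Raw core or thread counts say nothing about memory stalls or cache behaviour, which is the commenter's point about needing more information first.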

upon_drumhead, about 1 year ago

This is fake, just rage bait. Besides the numbers in the post just not making any sense, the OP states that the company is in healthcare[1] but then says he's a 43 year old director[2], which still tracks, but then he says he's spent 20 years in "big law"[3], and then that he's an IT director in fintech[4]. He says he's changed jobs twice in the last two years[5]. I gave up looking after just the first page of his post history.

[1] https://old.reddit.com/r/sysadmin/comments/1cqn3qa/whats_the_biggest_cpu_core_physical_server_i_can/l3t4ftv/
[2] https://old.reddit.com/r/ITManagers/comments/1cqa0cp/genai_is_reigniting_my_work_enthusiasm_fire/l3s7pkj/
[3] https://old.reddit.com/r/sysadmin/comments/1cotpdb/how_is_working_for_a_law_firm/l3iwzf3/
[4] https://old.reddit.com/r/Ameristralia/comments/1cnyxsh/what_type_of_jobs_do_americans_living_in/l3cn3d8/
[5] https://old.reddit.com/r/Intune/comments/ncj7oa/ios_sso_extension_authenticate_an_ipad_to_kerberos/l2zlwrf/

sgt101, about 1 year ago

I ran a stressed app some years ago. We only had a wee little backend because our revenue was very low, but we wanted to do stuff like sleep inside and eat, and so were motivated to cut costs to make a profit.

What I did was make a table of all the queries that were being run on my backend, and I ordered them by the number of times they were called and the cost of calling them (I honestly can't remember the measure I used for that, but it was like cputime*memory or similar). I then did two things for the top queries:

1) Optimised them where I could.
2) Looked for where they were being used and tried to stop it.

(2) was very successful.
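A minimal sketch of that kind of query table, under loud assumptions: the `QueryStat` fields, the sample queries, and the numbers are all hypothetical, and the score (calls times CPU time times memory) is only one guess at the "cputime*memory or similar" measure mentioned above.

```python
from dataclasses import dataclass


@dataclass
class QueryStat:
    """One row of the query table (hypothetical fields)."""
    sql: str
    calls: int            # how many times the query ran in the sample window
    cpu_seconds: float    # average CPU time per call
    memory_mb: float      # average memory used per call

    @property
    def total_cost(self) -> float:
        # Assumed measure: per-call cputime * memory, scaled by call count.
        return self.calls * self.cpu_seconds * self.memory_mb


def rank_queries(stats, top_n=10):
    """Return the top_n queries, most expensive overall first."""
    return sorted(stats, key=lambda s: s.total_cost, reverse=True)[:top_n]


# Made-up sample data for illustration only.
stats = [
    QueryStat("SELECT * FROM orders WHERE customer_id = ?", 50_000, 0.02, 4.0),
    QueryStat("SELECT COUNT(*) FROM audit_log", 200, 3.0, 50.0),
    QueryStat("SELECT name FROM users WHERE id = ?", 900_000, 0.001, 0.5),
]

ranked = rank_queries(stats)
for stat in ranked:
    print(f"{stat.total_cost:>10.1f}  {stat.calls:>8}  {stat.sql}")
```

In this made-up data the rarely called but heavy `audit_log` count ends up on top, which is the sort of query where asking "does anything really need this?" can pay off more than tuning it.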

ktpsns, about 1 year ago

I find it hard to believe that you would not start buying your own hardware at this scale, particularly when the hyperscalers (at first glance) don't have anything that matches the needs.

__turbobrew__, about 1 year ago

This seems like a good use case for Spanner? The pain would be in migrating the backend to GKE, but if you are hitting the limits of what Azure can do, you are going to have to migrate at some point.

tristor, about 1 year ago

Having been through similar situations in my past life, I can confidently say that they don't need more CPU cores, they need to start really looking at their architecture holistically and identifying the critical path that can be rewritten in priority order for performance. At this point, throwing more hardware at the problem is the wrong thing to do /even/ if it temporarily kicks the can down the road. They have a fundamental system design issue that needs to be addressed, likely piecemeal and prioritized. The first step should be adding more performance instrumentation.
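As a rough illustration of that first step, here is a minimal in-process timing sketch. Everything in it is hypothetical: the `timed` decorator, the `timings` store, and `lookup_customer` are stand-ins, and a real system would more likely export measurements to a metrics or tracing backend than keep them in a dict.

```python
import time
from collections import defaultdict
from functools import wraps

# Hypothetical in-memory store: operation name -> list of durations in seconds.
timings = defaultdict(list)


def timed(name):
    """Record the wall-clock duration of every call under the given name."""
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            try:
                return func(*args, **kwargs)
            finally:
                # Recorded even if the wrapped call raises.
                timings[name].append(time.perf_counter() - start)
        return wrapper
    return decorator


def slowest(recorded):
    """Return (name, total_seconds, call_count) rows, largest total first."""
    rows = [(name, sum(d), len(d)) for name, d in recorded.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


@timed("lookup_customer")
def lookup_customer(customer_id):
    time.sleep(0.01)  # stand-in for a database round trip
    return {"id": customer_id}


for cid in range(3):
    lookup_customer(cid)

for name, total, count in slowest(timings):
    print(f"{name}: {count} calls, {total:.3f}s total")
```

Sorting by total time rather than per-call time is a deliberate choice here: it surfaces the paths that dominate overall load, which is what a priority-ordered rewrite would need to know.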

coolkil, about 1 year ago

Shame it is running MS SQL. With anything like Postgres, Oracle or Db2, it might have been a candidate for running on an IBM LinuxONE, which might even be a valid contender at the cost it is currently running at.

Biggest CPU for the Bad System
