When optimizing to scale to multiple cores

"There is no silver bullet in software development. All we can do is choose and balance;"

In the previous article ("Five Directions of Program Optimization"), we discussed five directions for optimizing a single-threaded program. Once a single core has been pushed to its limit, it is time to go multi-task.

The idea seems clear enough: break a single task into multiple tasks, let multiple CPUs work on them at the same time in parallel, and efficiency naturally improves.

However, it may not be that simple.

Granularity of task decomposition

First, we need to determine whether the task can be decomposed at all. Parsing a large number of files, for example, splits naturally into independent subtasks. A time-consuming serial computation, however, where each step depends on the previous result, is hard to split; work of that shape may need to be decomposed at a higher level instead.
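A minimal sketch of the two cases, in Python (the article names no language; `parse` and the sample documents are hypothetical stand-ins):

```python
# Decomposable: many independent files can be parsed concurrently.
# Non-decomposable: a serial chain where each step needs the previous result.
from concurrent.futures import ThreadPoolExecutor

def parse(text: str) -> int:
    # stand-in "parse": count the words in one document
    return len(text.split())

documents = ["a b c", "d e", "f g h i"]  # stand-ins for file contents

# Case 1: each document is an independent subtask, so a pool can fan out.
with ThreadPoolExecutor(max_workers=4) as pool:
    word_counts = list(pool.map(parse, documents))
print(word_counts)  # [3, 2, 4]

# Case 2: step n depends on step n-1's result, so this must run serially.
state = 0
for n in range(5):
    state = state * 2 + n
print(state)  # 26
```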

Data race

Programming is computation plus data. The computation now runs in parallel, but the data is still accessed from a single shared source, and access to shared resources creates contention.

If this contention is not controlled, the same data may be computed repeatedly (in read-heavy scenarios), or dirty data may be produced (in write-back scenarios).
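The write-back hazard can be sketched in Python with a hypothetical shared counter: without a lock, two threads could both read the same value, each add one, and write back the same result, losing an update. Guarding the read-modify-write serializes it:

```python
# Four threads increment one shared counter. The increment is a
# read-modify-write; the Lock makes that sequence atomic so no
# update from one thread overwrites another's.
import threading

counter = 0
lock = threading.Lock()

def safe_increment(n: int) -> None:
    global counter
    for _ in range(n):
        with lock:          # serialize read-modify-write
            counter += 1

threads = [threading.Thread(target=safe_increment, args=(10_000,))
           for _ in range(4)]
for t in threads: t.start()
for t in threads: t.join()

print(counter)  # 40000: no lost updates
```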

Introducing locks

To make data access orderly, locks need to be introduced to prevent dirty data.

Controlling the granularity of those locks is a topic that deserves careful thought.

For example, in read-heavy, write-light scenarios, a read-write lock can be significantly more efficient than a single lock that treats every access the same.
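Python's standard library has no reader-writer lock, so here is a minimal sketch of one (the classic first-readers variant; under sustained reads, writers can starve):

```python
# Minimal reader-writer lock: many readers may hold it at once;
# a writer waits until the last reader leaves.
import threading

class RWLock:
    def __init__(self):
        self._readers = 0
        self._mutex = threading.Lock()    # guards the reader count
        self._writer = threading.Lock()   # held for the whole read or write phase

    def acquire_read(self):
        with self._mutex:
            self._readers += 1
            if self._readers == 1:
                self._writer.acquire()    # first reader blocks writers

    def release_read(self):
        with self._mutex:
            self._readers -= 1
            if self._readers == 0:
                self._writer.release()    # last reader lets writers in

    def acquire_write(self):
        self._writer.acquire()

    def release_write(self):
        self._writer.release()

# usage: a writer gets exclusive access, readers then share access
rw = RWLock()
shared = {"hits": 0}
rw.acquire_write()
shared["hits"] += 1
rw.release_write()
rw.acquire_read()
value = shared["hits"]
rw.release_read()
print(value)  # 1
```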

Among the products we deal with daily, databases are masters of locking. When updating data, whether the engine locks a row, a page, or an entire table, the performance of the different granularities varies significantly.

Thundering Herd

Consider a thread pool in which multiple threads wait for the same lock; whichever thread obtains it starts working.

When the lock is released, waking up every waiting thread at once causes a thundering herd: all of them wake, one wins the lock, and the rest go back to sleep, having wasted a round of scheduling.

Solution:

Use a single thread to perform the blocking operation (for example, a single thread calling accept on the listening socket), so that only one thread is waiting for the lock at any time.

For more details, see "Client-Server Programming Method": a thread pool is created in advance, and each thread calls accept on the listening socket.

Data replication

Let each thread use its own data. With nothing shared, resource contention disappears, and so do the locks that guard it: duplicate the data into multiple copies and let each thread access only its own copy.

But this introduces a new problem: if each thread writes its copy back, how do we ensure the copies stay consistent?

After all, the copies all represent the same logical piece of data.

Once data consistency is a concern, synchronizing the multiple copies becomes a problem of its own.
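A minimal sketch of per-thread copies in Python (the counting workload is a hypothetical stand-in): each thread writes only its private slot, and the copies are reconciled once, at the end, in a single reduction.

```python
# Each thread accumulates into its own private slot, so there is no
# contention while the threads run; the consistency question is
# answered by merging the copies exactly once at the end.
import threading

N_THREADS = 4
local_totals = [0] * N_THREADS   # one private slot per thread

def count(idx: int) -> None:
    total = 0
    for _ in range(1000):
        total += 1               # touches only thread-private state
    local_totals[idx] = total

threads = [threading.Thread(target=count, args=(i,)) for i in range(N_THREADS)]
for t in threads: t.start()
for t in threads: t.join()

grand_total = sum(local_totals)  # the only point where the copies meet
print(grand_total)  # 4000
```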

Data Sharding

Let's change tack, then, and use data sharding instead of replication. Sharding is a natural next thought: since the computation was divided into many small tasks, the data can be divided the same way.

The data is split into shards, each storing different content with nothing in common.

This way there is no contention on data access, and because the shards hold different data, there is no copy-consistency problem to synchronize.
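A minimal sharded map, as a sketch of the idea in Python (`ShardedDict` is hypothetical): keys hash to one of N independent shards, each with its own lock, so threads touching different shards never contend.

```python
# Sharded map: each key is hashed to one shard; the lock covers one
# shard rather than the whole map, so operations on different shards
# proceed in parallel.
import threading

class ShardedDict:
    def __init__(self, n_shards: int = 4):
        self._shards = [{} for _ in range(n_shards)]
        self._locks = [threading.Lock() for _ in range(n_shards)]

    def _index(self, key) -> int:
        return hash(key) % len(self._shards)

    def put(self, key, value) -> None:
        i = self._index(key)
        with self._locks[i]:          # lock one shard, not the map
            self._shards[i][key] = value

    def get(self, key):
        i = self._index(key)
        with self._locks[i]:
            return self._shards[i][key]

d = ShardedDict()
d.put("a", 1)
d.put("b", 2)
print(d.get("a"), d.get("b"))  # 1 2
```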

However, sharding is far from being as good as it looks.

Sharding means each thread sees only a fragment of the data instead of the entire set, so it can process only that specific part. Threads thus lose their interchangeability: certain tasks can only run on the specific thread that owns the data.

And if a task needs to access all of the data, things get even more complicated.

It turns out that sharding pushes the problem up a level: the thread layer, and ultimately the business logic, must now be shard-aware.

That may be more complicated still.

So: if you want more speed by using multiple cores, you have to face more problems.

At the architecture level, the problems we face when scaling from a single machine out to a multi-machine cluster are much the same.

Reference: Reading Notes on "Large-Scale Website Technology Architecture" [2] - Architecture Patterns

There is no silver bullet in software development; all we can do is choose and balance.
