I am currently working on an iOS IM SDK, and the beta version is out. For details, please visit http://netease.im. During the internal trial phase, there were constant requests from other departments or partners asking about various technical details, so I wrote an article to record and introduce all aspects of the next IM APP, including technology selection (including communication methods, network connection methods, protocol selection) and common problems. Communication method selection There are only two options for IM communication: direct device connection (P2P) and transfer through a server. P2P P2P is more common in chat tools within the local area network. Typical applications include: pigeon messaging, Maze (you know), etc. This type of software generally does two things after it is started: Perform UDP broadcast: send your own information and receive information from other terminals in the same LAN Open TCP listening: Wait for other end to connect For detailed process, please refer to the source code of Flying Pigeon. However, this method has various limitations and inconveniences: on the one hand, it is only suitable for online point-to-point message transmission, and does not support offline, group and other services. On the other hand, due to the existence of NAT, it is much more difficult to connect machines in different LANs, and connections cannot be established under certain network types (symmetric NAT). Server Transfer Almost all Internet IM products use server-transfer to transmit messages. Compared with the P2P method, it has the following advantages:
Of course, it also has its own problems: the server architecture is complex and the concurrency requirements are high. Network connection method There are two main ways to connect to the IM network:
The latter is common in WEB IM systems (of course, many WEB IMs are now implemented based on WebSocket). Its advantages are simple implementation and easy development. The problem is that the traffic is large, the server load is large, the timeliness of messages cannot be guaranteed well, and it does not support large-scale users. It is more suitable for small IM systems, such as the client system of a small website. TCP long connection can better support large number of users, but the problem is that the implementation of client and server is relatively complicated. Of course, there are some variants, such as using MQTT for server notification/message sending and HTTP short connection for uploading instructions and messages. This method can ensure the timeliness of downlink messages/instructions, but the problem of slow uplink is still serious in weak network. Early exchanges were based on this method. Protocol Selection The general principle of IM protocol selection is: easy to expand, convenient to cover various business logics, and relatively save traffic. The latter requirement is especially important for mobile IM. Common protocols are:
The advantages of the XMPP protocol are: the protocol is open source, highly scalable, and implemented in various languages on each end (including the server), making it easy for developers to access. However, there are also many disadvantages: XML has weak expressiveness, too much redundant information, large traffic, and a lot of pitfalls in actual use. The SIP protocol is mostly used in VOIP-related modules. It is a text protocol. Since I have not actually used it, I will not comment on it, but from the fact that it is a text protocol, it can almost be concluded that its traffic will not be small. The advantages of MQTT are simple protocol and low traffic, but it is not a protocol designed specifically for IM and is mostly used for push. Almost all mainstream IM apps on the market use private protocols. A well-designed private protocol generally has the following advantages: high efficiency, traffic saving (usually using binary protocol), high security, and difficult to crack. The disadvantage is that there are no existing samples to refer to in the early stage of development, which places high demands on designers. A good protocol needs to meet the following conditions: high efficiency, simplicity, good readability, traffic saving, easy to expand, and able to match the current team's technology stack. Based on the above principles, we can conclude that: If the team is small and the team's technology accumulation in IM is not enough, you can consider using XMPP or MQTT+HTTP short connection implementation. Otherwise, you can consider designing and implementing a private protocol yourself. Design of private protocol Serialization selection The biggest characteristics of mobile Internet compared to wired networks are: low bandwidth, high latency, high packet loss rate, poor stability, and high traffic costs. Therefore, binary protocols are generally used for serialization of private protocols instead of text protocols. Common binary serialization libraries include protobuf and MessagePack. Of course, you can also implement your own binary protocol serialization and deserialization process, such as TeamTalk of Mogujie. However, the former two are far better than TeamTalk in terms of scalability and readability (TeamTalk does not even support Variant, and an int occupies 4 bytes when transmitted), so in most cases it is not recommended to implement the serialization and deserialization process of binary protocols yourself. Protocol format design Application layer protocols based on TCP are generally divided into a packet header and a packet body (such as HTTP), and the IM protocol is no exception. The packet header is generally used to indicate the common part of each request/feedback, such as packet length, request type, return code, etc. The packet header is filled with information corresponding to different requests/feedbacks. A simplest packet header can be defined as
Taking the heartbeat packet as an example, assuming that the current serial is 1 and the command of the heartbeat packet is 10, when using MessagePack for serialization: length=4, serial=1, command=10, code=0, each field occupies one byte, the package body is empty, and only 4 bytes are required. Of course, this is the simplest example. When facing real business logic, more information will need to be stuffed into the package body. This requires developers to summarize the common parts based on their own business logic, such as the protocol version number added for compatibility, the module ID added for load balancing, etc. Other issues The above is a rough selection process of an IM system: communication method, connection method, protocol selection, and protocol design. However, there are still a lot of problems to be dealt with in the actual development process. Protocol encryption In order to ensure that the protocol is not easily cracked, almost all mainstream IMs on the market will encrypt the protocol for transmission. The common process is similar to HTTPS encryption: after the connection is established, the client and the server negotiate, and finally the client obtains a current Session key, and subsequent data transmission is encrypted and decrypted using this key. Generally, stream encryption, such as RC4, is used for efficiency considerations. In the early negotiation process, it is recommended to use asymmetric encryption such as AES to increase the difficulty of cracking. Quick Connect (Login) For iOS APP, since there is no real background, each time the APP is started, it basically needs to reconnect and log in once (except for switching within a short period of time), so how to quickly reconnect and log in is very important. Common optimization ideas are as follows:
Keep connected Generally, the way for APP to maintain connection is to use the heartbeat at the application layer, and perform reconnection operations through the timeout of the heartbeat packet and other conditions (network switching). So the question is: why use the application layer heartbeat and how to design the application layer heartbeat. As we all know, the TCP protocol has a KEEPALIVE setting option. After setting it to KEEPALIVE, the client will send a heartbeat packet to the server every N seconds (the default is 7200s). However, in actual operation, we often use application layer heartbeats. The reasons are as follows:
In order to save traffic and power, mobile terminals usually make some small optimizations on heartbeat packets in actual operation.
Message reachable In mobile networks, packet loss and network reconnection are very common. In order to ensure the reachability of messages, a message receipt and resend mechanism is generally required. For example, Yixin.com, each message will be resent up to 3 times, with a timeout of 15 seconds. At the same time, the current connection status will be checked before sending. If the current connection is not established correctly, the cached message will be checked periodically (every 2 seconds, 15 times). Therefore, in the worst case, a message will have a retry time of more than 2 minutes to ensure the reachability of the message. Because of the existence of retransmission, the receiving end may occasionally receive two duplicate messages. In this case, the receiving end needs to deduplicate. The general practice is that each message has its own unique message id (usually uuid). File upload optimization IM messages (including SNS modules) contain a large number of file upload requirements, so how to optimize file upload has become a relatively large topic. Common optimization ideas include the following:
|
<<: iOS application architecture discusses the organization and calling scheme of view layer
>>: Improving JavaScript performance becomes the top priority for the Edge team
Author| Shang Huaijun Rendering on a computer or ...
Today’s content is a bit too much, the full text ...
A brand is the position that a product or service...
The ranking of a website is determined by many fa...
A student who has just switched to information fl...
The beauty sign-holding team includes the Russian...
The configuration of the server is closely relate...
In Q3 2021, the gaming, short video, e-commerce l...
Just as KOL corresponds to the herd effect in psy...
[[121743]] On the morning of October 27, the WeCh...
Source code introduction: A button with the same ...
When I was talking with a friend last week, I cam...
Many people think that SEO is actually very simpl...
Here I would like to share with you a review of a...
Let’s take a look at some of the features of WeCh...