The author of this article leads everyone to have a preliminary understanding of the basic recommendation engine and influencing factors of information flow, and summarizes the value of data operation through an actual case in information flow feeds data operation for everyone to learn and refer to. 1. Introduction to Information FlowInformation feeds have become almost ubiquitous, running through our 24-hour Internet life. When you are commuting on the subway, you can check Toutiao to get the latest news. The information flow has neatly arranged the hot articles in a queue for you to read. When you want to have a good meal, the information flow of Dianping has recommended many restaurants in the city. When you can't sleep at night and want to buy something to reward yourself after a hard day's work, the dazzling array of recommended products on Taobao are so accurate that you can't stop browsing... Although the information flow format has been widely used, its earliest application was in the information content scenario, starting with the News Feed feature released by Facebook in 2006. The platform aggregates content after sorting through established algorithms and rules, allowing users to consume content smoothly and efficiently on a single page. Users no longer need to frequently jump between portals and blog sites as they did in the prehistoric era of mobile interactive networks; platforms also keep users within their own jurisdiction more efficiently by providing aggregated content display platforms. The English word for information flow is “Feed”, which is really a very clever word. Feed means "feed" in English, which vividly depicts the scene in the information flow where users are "fed" content by the platform in a certain order. Users have limited time to consume content. How can a platform feed users with their favorite content within a limited time, so that they can consume more content on the platform (thereby bringing higher potential commercial value to the platform)? This is the "recommendation ranking" problem that all feed scene operators have been studying for years. 2. The basis of information flow: recommendation engineThe core of the recommendation engine is "how to recommend the right items to the right users", so the establishment of the connection between "items" and "users" is the most core proposition in the recommendation algorithm. The entire recommendation process can basically be summarized as the process of "recall" → "sort" → "adjust weight" → "output results". A simple metaphor will be used to help everyone understand the process. Everyone must have participated in military training during their school days. The final parade review at the end of the training is the highlight of the entire training process. So how do we arrange the queues reasonably?
Recommendation algorithms are a very deep and technical subject, but because this book is mainly aimed at operators, the author tries to summarize the main factors that affect the ranking of information flows from a more explicit level:
3. Problem: How to cold start information feeds?Having said that, I would like to share with you my previous experience in operating a tool product. Everyone may be familiar with the dilemma of most tool products: users stay for a long time and have poor stickiness, which leads to limited efficiency and methods of monetization. There are many competing products on the market. If we cannot quickly prove the value of our product through data indicators, the entire product will face the risk of being cancelled. Therefore, how to increase the time users spend on the site has become a very important issue within our team. Our tool product has the function of WiFi connection. Previously, after users successfully connected to WiFi, the landing page they were redirected to was a "Connection Successful" page. Apart from that, there was no other connection. However, at this time, the user is at an emotional high point of completing the operation, and is in a WiFi scenario that is not sensitive to traffic. We wondered if we could provide users with some content consumption value by taking over the content of information feeds, while also creating a commercial monetization scenario? But we are a tool product team and have no experience in content operation. How can we create an information feed from 0 to 1? After analyzing the current situation of our team, we decided to quickly start from the following aspects: First, where does the information content come from? Some of our sister products have ready-made information content, but we need to develop the specific recommendation algorithms ourselves; although our algorithm team has no experience in content recommendation, their experience in recommendation in software distribution also has similarities that can be learned from and reused. A good cook cannot cook without rice. We have both the "rice" and the "good cook", but whether it is "fried rice" or "rice soup" that our users think is the most delicious, we need to try more before we can come to a conclusion. There are so many factors for recommendation sorting, but for us, due to the attributes of the tool product, not many of them can be used. According to our situation, we decided to conduct the following three groups of A/B test experiments:
Based on the settings of the three groups of experiments, we selected three groups of random test user groups to implement the strategy, and set the "average information consumption time" as the key evaluation indicator. It took three long days to wait for the experimental results to show up. During these three days, our team was also betting on which strategy would perform best. Readers, guess which strategy will perform best? 4. Analysis: Find the deeper causes of the problemThe bets within the team basically all focused on the view that the strategies of the first two groups would be better. The view of colleagues who think that user portraits are better is straightforward: users will be more interested in content that is more relevant to them. Colleagues who think that popularity sorting will have a better effect are also right. Content that more people click on is often curious and fresh, which will naturally attract more people to read. But after our operations staff collected and sorted the experimental data, they were a little surprised: the least popular option three, which was based on time sorting, actually had better "average information consumption time" than the first two options. The team was a little discouraged for a while, and there were also doubts about the technical capabilities of colleagues in the algorithm team. As operators, we need to go one step further through data analysis at this time to see: Are the data indicators showing the whole truth? In order to analyze this problem, we first broke it down. Experimental data indicators:
The experimental design:
After disassembly and analysis, we found that the poor data indicators of the first two groups of solutions are not necessarily the whole truth. First of all, we found that there are certain problems in the setting of the indicator "average information consumption time". After all, our product is a tool product, and most users leave after connecting to WiFi. Information Feeds are destined to be a function only for some relatively idle users. Therefore, the "average information consumption time" of users between the experimental groups is very discrete, and the existence of individual extreme value users in Plan 3 has raised the overall average time data. To solve this problem, we can make certain adjustments to extreme values during calculations and add the data indicator of "average information click-through rate" to more objectively evaluate the effectiveness of each solution. Secondly, through analysis, it was found that due to data collection reasons, Plan 1 and Plan 2 did not fully achieve the effects of their respective strategies. For example, in solution one, “sorting based on user portrait”, many users in the experimental group had incomplete installation list data due to Android permission restrictions; the geographic location identification of some users’ IPs was not accurate enough. The test found that some users in Guangzhou were recommended local news in Beijing, which naturally affected the effectiveness of the strategy. For example, in Plan 2, since some "clickbait" content has a high click-through rate, the first screen of the experimental group of users is full of "clickbait" content. The content quality is very low, and users jump out of the screen quickly after clicking, resulting in poor experimental results of the strategy. 5. The Importance of Data Operations ThinkingIf we do not further analyze the data indicators and only look at the experimental results, we may directly think that "time sorting" is the best solution for our users and we should develop in this direction in the future. There is no need for the so-called optimization of the model algorithm. But only through analysis can we see the full picture of the facts more clearly and continuously propose optimization plans for iteration. What is reflected here is the importance of problem-breaking thinking and the importance of logical problem-analysis thinking. I hope that through this book, I can share these thinking frameworks with you, the readers, and become a better operator. Write at the backIn the future, we will share more articles on data operations, Internet products (or some personal artistic hobbies) on the platform. Everyone is welcome to communicate! Author: Huang Yiyuan Source: Huang Yiyuan |
<<: 【Guxia Wuji】Jianghu Wuji Riding White Horse Tactics Document
>>: 10 Trend Predictions for Influencers and Influencers in 2020
In early July, FileBlaze, headquartered in San Fr...
How much does it cost to purchase a customized mi...
The following content is the editor’s exclusive e...
From the perspective of sociology and psychology,...
Today's case study is about the Zebra English...
Compared with To C operations , To B operations a...
I believe that every operations student is famili...
The new graduates have a good understanding of th...
Any project that wants to make money must grasp t...
Product promotion is a difficult problem. It look...
Recently, there have been several pieces of good n...
Learn Bazi from scratch, learn Bazi Jiugongge fro...
The 7 creative rules for information flow adverti...
If you don’t advertise, you’re waiting for death;...
For the promotion and operation of Tik Tok, is it...