How can we sort out data clearly and compile indicators that actually guide the business?

Business data is complex and voluminous. How can we define the core indicators that matter from massive amounts of data to guide user growth and conversion? How can we use data to identify the core path by which users experience a product? How should we design a product's onboarding guide so that it improves the user experience and leads more users to the core of the product, turning them into users with "high conversion potential"? When operating users, how can we use data to profile existing users clearly, find the core concerns of users in each industry, and run refined operations that increase repurchases?

These are questions many operators ask when faced with massive amounts of data. We all know that data is powerful, and that cleaned data can point to a clear path forward. As the saying goes, an operator who can't read data is not a good product manager.

I work in user growth product operations, analyzing qualitatively and quantitatively through data and user interviews and then producing strategies to guide growth. Today I will talk about several hard-core skills for using data to improve operational capability and formulate operational strategies.

Data analysis goes through several stages: tracking data, obtaining data, analyzing data, and producing feasible operational strategies. Each stage is difficult. The following may be real scenarios when operations staff ask for data:
This situation is very common and understandable: operations looks at things from the business perspective, while development looks at them from the data perspective. The field does not record "whether the user is active" in the sense you mean. At this point I will certainly think: I need a data set that clearly tells me what industry a user is in, which functions they use, what their business model is, and what state they are in! This raises a question: how can we sort out the data clearly and compile indicators that actually guide the business? How do we define user portraits through data?
Indicator definition: scenario-based definitions make it easier to identify the indicators to extract

Before asking the data or development team to extract data, first think about what kind of portrait you want to end up with. You can make bold assumptions here, for example
This makes things very clear. Generally, I divide the data into two types and then refine the relevant indicators for each type. Each type can be subdivided further into detailed indicators. For example, basic user data can be refined this way, and so can the other indicator types. Choose indicators according to the product's attributes and what you need to understand.

Data extraction - dimensionality reduction of multidimensional data

Once the indicators are defined, we find that some of them span multiple dimensions and cannot be compared or analyzed directly. For example, a user successfully creates products of a certain type, and each product has a different sales volume and sales amount. How do we summarize that user's usage of the product function as a whole? Here we need to reduce the dimensionality of the data, for instance by taking a weighted average, or by using the mode or the median as the representative value, so that the comparison no longer has to be made across many dimensions.

Data analysis - discovering what the "most important indicator" is

A user record has many associated data fields. What is the core difference between a paying user and a non-paying user? What is the key to getting users to pay? What do users care about? Answering this requires analysis to see which independent variables are related to the dependent variable (user payment).

Here I recommend an algorithm: the CHAID decision tree. This type of decision tree is designed to find the core variables that affect the final result. In other words, among so many functions, user behaviors, and attributes, which attributes and which behaviors make a user more likely to convert?

How is the decision tree algorithm calculated?
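Before walking through the calculation, the dimensionality-reduction step described above can be made concrete. The following is a minimal sketch, not the author's implementation: the product records, field names, and the `weighted_average` helper are hypothetical and only illustrate collapsing several per-product values into one value per user.

```python
from statistics import median, mode


def weighted_average(values, weights):
    """Average of values, each weighted by the matching entry in weights."""
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(v * w for v, w in zip(values, weights)) / total


# Hypothetical data for one user: (product_type, unit_price, units_sold).
products = [("course", 10.0, 50), ("course", 20.0, 30), ("live", 100.0, 20)]

types = [t for t, _, _ in products]
prices = [p for _, p, _ in products]
units = [u for _, _, u in products]

# One row per user instead of one row per product.
summary = {
    "typical_product_type": mode(types),                 # most frequent category
    "median_units_sold": median(units),                  # robust to one best-seller
    "weighted_avg_price": weighted_average(prices, units),  # price weighted by volume
}
print(summary)
```

Which representative to use depends on the indicator: the mode suits categorical fields, the median resists outliers, and the weighted average keeps high-volume products from being drowned out by many small ones.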
Suppose we want to understand what makes users pay. Whether a user pays is then the dependent variable under examination, and it is also the value the decision tree must predict from the other variables.

Split the whole data set into a training set and a validation set, for example 80% and 20%. The training set is used to train the model so that it finds the characteristic factors in the data; the validation set is used for verification and prediction, to judge whether the model and the selected variables are effective and how good the fit is.

Take two values of an independent variable and run a chi-square test against the dependent variable. If the test shows that the difference between the two values is not significant, merge them. Keep reducing the number of values this way until all remaining values of the independent variable differ significantly.

For example, suppose our data has 130 independent variables and we do not know which of them are related to whether users pay: whether a user's weekly active count is related to payment, whether trying a certain feature is related to payment, and so on. In this case we can use the decision tree's chi-square test to judge, from its significance, whether each independent variable is related to the dependent variable.

Compare the independent variables to find the most significant one, then split the sample by its final (merged) values to form the branches of the tree. CHAID creates one child node per merged value, so a split generally has two or more child nodes.

Finally, all decision points related to whether users pay are displayed. For example, if a user creates more than three live broadcasts, the probability of payment is as high as 80%.
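The merging step described above (test two values of an independent variable against the payment outcome and combine them when the difference is not significant) can be sketched in plain Python. This is a simplified illustration under stated assumptions, not a full CHAID implementation: the counts are made up, the variable is treated as nominal so any two values may merge, the significance level `alpha=0.05` is an assumed default, and the Bonferroni adjustment that CHAID normally applies is omitted. The helper names `chi2_2x2` and `merge_categories` are hypothetical.

```python
import math
from itertools import combinations


def chi2_2x2(a, b, c, d):
    """Pearson chi-square statistic and p-value for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    margins = (a + b) * (c + d) * (a + c) * (b + d)
    if margins == 0:
        return 0.0, 1.0  # an empty row or column carries no evidence of a difference
    stat = n * (a * d - b * c) ** 2 / margins
    # With 1 degree of freedom the chi-square tail probability is erfc(sqrt(x / 2)).
    return stat, math.erfc(math.sqrt(stat / 2))


def merge_categories(counts, alpha=0.05):
    """Merge the values of one independent variable until every pair differs significantly.

    counts maps each value to (paying_users, non_paying_users).
    """
    groups = {(name,): tuple(pair) for name, pair in counts.items()}
    while len(groups) > 1:
        # Find the pair of groups whose payment rates are least distinguishable.
        g1, g2 = max(
            combinations(groups, 2),
            key=lambda pair: chi2_2x2(*groups[pair[0]], *groups[pair[1]])[1],
        )
        _, p_value = chi2_2x2(*groups[g1], *groups[g2])
        if p_value <= alpha:
            break  # every remaining pair is significantly different
        merged = (groups[g1][0] + groups[g2][0], groups[g1][1] + groups[g2][1])
        del groups[g1], groups[g2]
        groups[g1 + g2] = merged
    return groups


# Hypothetical counts: weekly active days -> (paying users, non-paying users).
weekly_active = {"0-1": (5, 95), "2-3": (6, 94), "4-5": (30, 70), "6-7": (33, 67)}
groups = merge_categories(weekly_active)
for values, (paid, unpaid) in groups.items():
    print(values, f"payment rate = {paid / (paid + unpaid):.1%}")
```

Running the same procedure on every independent variable and comparing the resulting p-values is how the most significant variable would be chosen for the next split; the merged groups that survive become that split's child nodes.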
The decision tree helps us eliminate irrelevant or insignificantly correlated independent variables and tells us what will lead to user conversion and payment.

Author: LunaDeng