WOT2016 Tian Chao: What can big data bring to the information platform?

WOT2016 Tian Chao: What can big data bring to the information platform?

[Original article from 51CTO.com] The WOT2016 Big Data Summit will be held at the Beijing JW Marriott Hotel from November 25 to 26, 2016. Dozens of front-line experts in the big data field and data technology pioneers will gather on site to engage in in-depth exchanges and discussions on cutting-edge technical topics such as machine learning, real-time computing, system architecture, and NoSQL technology practices, while sharing the latest practices and hottest industry applications in the big data field.

51CTO reporter conducted an exclusive interview with Tian Chao, R&D director of Yidian Zixun's big data platform, who will be speaking at the conference. Let us get a sneak peek and find out what Tian Chao thinks about Yidian Zixun's large-scale real-time click feedback platform.

Tian Chao is currently the technical director of the big data center at Yidian Zixun, responsible for infrastructure and big data platform related work. He graduated with a master's degree from the Institute of Computing Technology of the Chinese Academy of Sciences. He has worked as an engineer at Yahoo Beijing R&D Center, CTO of Synchronous Disk, and senior technical manager of AutoNavi Software. He is currently the technical director of the big data platform of Yidian Zixun.

Big data technology refers to the ability to process massive amounts of data and data applications built on such processing capabilities. Since the widespread popularity of Hadoop, the industry has had the ability to build large-scale data storage and computing. As technology continues to develop, the demand for upper-level applications to have the ability to process massive amounts of data in real time is increasing, which has led to the emergence of various real-time computing frameworks and systems such as Storm. Today's technologies, including Spark and Google Dataflow, hope to more organically unify offline computing with online computing.

Real-time data processing capabilities are an essential component for a modern Internet company. Online machine learning, real-time user portrait systems, real-time data warehouses, real-time statistical analysis systems and other businesses of various companies all require the ability to calculate large-scale feedback data in real time. The real-time computing parts of these systems have certain commonalities and certain special parts. At the beginning of the design, Yidian Zixun's real-time feedback platform abstracted the common computing models and data structures of the real-time computing parts of the above systems. When designing the system, it referred to Google's Mesa system, and designed it into a scalable platform that can support the real-time computing tasks of the above systems within Yidian Zixun.

Many information platforms only serve readers, but Yidian News can do the opposite, serving readers while also providing information to authors. The system analyzes based on user behavior and explores user needs for interests and how those needs are met. These data and in-depth data mining provide a global God's perspective for Yidian News' content ecosystem construction, allowing Yidian News to observe group performance and content trends from a higher perspective. Yidian News also has a system called Yidian Insight, which is currently in an invitation test. The system maps knowledge of user interests to different fields and displays this knowledge in various data visualization methods.

Search engines emphasize user search, which is equivalent to users leading the content; recommendation means that users are completely passive and do not express themselves. First, users are given common content, and then based on their click behavior, their preferences are guessed, and then the content is recommended to them. Search engines and recommendation engines are different systems with similar structures. The core goal of Yidian Zixun's interest engine design is to organically integrate search technology and recommendation technology. In the interest engine, the underlying data of users' search and recommendation behaviors are completely connected, making full use of users' active expressions and passive behavior signals, constantly learning and mining users' interests based on artificial intelligence technology, and distributing content based on user interests.

Tian Chao believes that the continuous development of technologies from big data to artificial intelligence is actually a natural process of the industry's ability to process and utilize data. In the early days, most technologies in the industry were used to process result data, with data volumes at the GB level, and databases used for storage. The ability to obtain, store, and calculate data was at an early stage. With the continuous development of a series of infrastructures such as Hadoop, big data technology has also continued to develop. Technical personnel not only process business result data, but also conduct more in-depth processing of logs describing user behavior to assist business calculations. In this era, the amount of data has grown to the PB level, and various distributed file systems are used for storage. At this stage, various offline computing, streaming computing, and graph computing models have also developed with the development of big data applications. Today, after having better computing models and more massive data, the use of data has also deepened, and the combination of artificial intelligence and deep learning technology with big data can also construct more intelligent applications.

[51CTO original article, please indicate the original author and source as 51CTO.com when reprinting on partner sites]

The high-end technology summit [WOT2016 "Big Data Technology Summit"] hosted by 51CTO will be grandly opened at the Beijing Yuecai JW Marriott Hotel from November 25th to 26th. More than 40 heavyweight guests in the industry will gather to analyze the practical combination of big data technology and industry applications. The organizer will invite more lecturers to the "WOT Lecturer Interview Room" to deeply analyze the technical dry goods.

More interviews from WOT2016
  • WOT2016 Xiang Lei: Building your own visual big data query platform
  • WOT2016 Wang An: See the sparks between finance and big data
  • 【WOT Lecturer】 Director Shao Guoan of the National Information Center: Security Requirements for Big Data
  • WOT lecturer Liu Zhe: Listen to AdMaster's Lambda architecture practice
  • WOT lecturer Zhao Qiang: Redis high performance cache and persistence

<<:  Android unit testing - verify the correct posture of function parameters and return values

>>:  How to use Android image resources to create a more sophisticated APP

Recommend

Advertising with high conversion rates all have these characteristics!

Five years ago, a trend emerged - traditional ent...

If you accidentally swallow gum, don’t worry, it will be fine in a week!

When I was a child, the two most common lies I he...

Ping An Health Product Analysis

At present, there are more than 3,000 certified m...

What is the situation with online defense in many universities? Why do this?

It’s settled! Graduates from many universities co...

What does bidding mainly do and what do bidding promoters need to do every day?

Many companies are recruiting SEM bidders, and ma...

【World First Aid Day】Protect lives, “rescue” around you

Author: Wang Changyuan, Chief Physician, Xuanwu H...

Artificial Intelligence, in a Jar

By Rich Heimann The "brain in a jar" is...

The four basic ways to attract users: mining, support, output and retention

Background: It has been four years since Changba ...

Tesla adjusts prices again: Model series all increase by around RMB 10,000

Although Tesla has built a factory in China, the ...

How do merchants choose the WeChat mini program platform?

As mini programs continue to develop, many busine...

Community operation: How to build a high-quality and active community?

Communities have become a standard feature of Int...

Why is MediaTek, which was so powerful last year, losing ground now?

MediaTek's performance last year was impressi...