According to foreign media reports, Yahoo recently released a new "Yahoo News Feed" dataset, billed as the largest machine learning dataset ever released to the public. Yahoo said the dataset is intended mainly for the academic research community, so that researchers no longer have to worry about lacking access to large-scale data.
The public dataset reportedly contains 110 billion events and totals 13.5 TB uncompressed. It includes anonymized user news interaction data collected from 20 million users in the early months of last year. The Yahoo News Feed dataset covers users' interactions with multiple Yahoo sections, such as Yahoo Movies, Yahoo News, and Yahoo Finance. Yahoo has also added some demographic data to the dataset, such as gender, age, and geographic location. "Our goal is to promote independent research in large-scale machine learning and recommendation systems, and to help create a level playing field between industry and academic research," Yahoo said in a statement.