Yahoo releases massive machine learning dataset for researchers

Yahoo releases massive machine learning dataset for researchers

According to foreign media reports, Yahoo recently launched a new "Yahoo News Recommendation" dataset, which is known as the largest machine learning dataset ever released to the public. Yahoo said that this dataset is mainly launched for academic research communities, so that they no longer need to worry about not being able to obtain large-scale datasets in their research.

[[162026]]

It is reported that the public data set includes 110 billion events, and its total capacity in an uncompressed state is 13.5TB.

Researchers can find data such as anonymous user news interaction data in the dataset, which was collected from 20 million users in the early months of last year.

The Yahoo News Feed dataset contains data on users’ interactions with multiple Yahoo sections, such as Yahoo Movies, Yahoo News, and Yahoo Finance.

In addition, Yahoo has added some demographic data, such as gender, age and geographic location, to the dataset. "Our goal is to promote independent research in large-scale machine learning and recommendation systems, and to help create a level playing field between industry and academic research," Yahoo said in a statement.

<<:  Google reorganizes secretive R&D department Google X: new logo unveiled

>>:  A glimpse of the leopard: product technology direction from CES 2016

Recommend

Introduction to Huawei AppGallery Paid Display Service

Paid display service introduction HUAWEI PPS (Pai...

The Marketing Dilemma of 618

There is a promotional festival called 618 every ...

How does Baidu bidding charge? Is there any contact information?

Baidu’s paid promotion is charged based on clicks...

Swift theme color top solution

1. Conventional theme color usage points Before a...

iOS development: Inducing users to comment on your app

"Since my own app has few downloads and comm...