Anonymous Microsoft Web Data
Abstract
Log of anonymous users of www.microsoft.com; predict areas of the web site a user visited based on data on other areas the user visited.
Purpose
Additional Information We created the data by sampling and processing the www.microsoft.com logs. The data records the use of www.microsoft.com by 38000 anonymous, randomly-selected users. For each user, the data lists all the areas of the web site (Vroots) that user visited in a one week timeframe. Users are identified only by a sequential number, for example, User #14988, User #14989, etc. The file contains no personally identifiable information. The 294 Vroots are identified by their title (e.g. "NetShow for PowerPoint") and URL (e.g. "/stream"). The data comes from one week in February, 1998.
Papers Citing this Dataset
8 papers found
RECOME: a New Density-Based Clustering Algorithm Using Relative KNN Kernel Density
By Yangli-ao Geng, Qingyong Li, Rong Zheng, Fuzhen Zhuangz, Ruisi He.
Learning Arithmetic Circuits
By Daniel Lowd, Pedro Domingos.
A Growing Evolutionary Algorithm and Its Application for Data Mining
By Ning Hou, Zhanmin Wang.