observations.nips

nips(path)

Load the NIPS conference papers 1987-2015 data set (Perrone, Jenkins, Spano, & Teh, 2016). It is in the form of a 11,463 x 5,812 matrix of word counts, containing 11,463 words and 5,811 NIPS conference papers (the first column contains the list of words). Each column contains the number of times each word appears in the corresponding document. The names of the columns give information about each document and its timestamp in the following format: year_paperID.

Args:

  • path: str. Path to directory which either stores file or otherwise file will be downloaded and extracted there. Filename is NIPS_1987-2015.csv.

Returns:

Tuple of np.darray x_train and dictionary metadata of column headers (documents) and row headers (words).

Perrone, V., Jenkins, P. A., Spano, D., & Teh, Y. W. (2016). Poisson random fields for dynamic feature models. arXiv Preprint arXiv:1611.07460.