The books dataset

About the dataset

The examples on this website use a dataset containing metadata about 11,121 books. It is based on a dataset published on Kaggle by the user ‘soumik’ under a CC0: Public Domain license.

The dataset includes the following information for each book:

  • id, a unique identifier
  • title, the title
  • authors, names of the authors, separated by slashes (/)
  • average_rating, the average rating
  • isbn, the International Standard Book Number (ISBN)
  • language_code, a code indicating the language in which the book was written
  • num_pages, the number of pages
  • publication_date, the publication date
  • publisher, the publisher
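
As a minimal sketch of how these fields might be read outside Kibana, the following Python snippet parses one row and splits the authors field on slashes. The column order and the sample row are assumptions made for illustration; the values are made up and not taken from the dataset, and parse_book is a hypothetical helper.

```python
import csv
import io

# Illustrative data only: the header follows the field list above,
# and the row values are invented, not taken from the dataset.
SAMPLE_CSV = (
    "id,title,authors,average_rating,isbn,language_code,"
    "num_pages,publication_date,publisher\n"
    "1,An Example Title,First Author/Second Author,4.25,0000000000,"
    "eng,320,2001-01-01,Example Press\n"
)


def parse_book(row):
    """Convert one CSV row (a dict of strings) into typed Python values."""
    return {
        **row,
        # Authors are stored in a single field, separated by slashes.
        "authors": [name.strip() for name in row["authors"].split("/")],
        "average_rating": float(row["average_rating"]),
        "num_pages": int(row["num_pages"]),
    }


books = [parse_book(row) for row in csv.DictReader(io.StringIO(SAMPLE_CSV))]
print(books[0]["authors"])
```

To try this against the real file, replace the io.StringIO wrapper with an open handle on the unzipped books.csv, after checking that its header matches the field names listed above.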

Load the dataset

The dataset is available as a comma-separated values (CSV) file. Use the file upload feature of Kibana to load the file into Elasticsearch:

  1. Download the file and save it to disk.
  2. Unzip the books.csv.zip file.
  3. Open Kibana in a web browser and navigate to the Kibana Home.
  4. Select Upload a file.
  5. Choose Select or drag and drop a file, and select the unzipped books.csv file.
  6. Select Import.
  7. Enter books for the index name and select Import.
  8. Wait a few moments for Kibana to import the data.

To validate that the import was successful:

  1. From the Kibana main menu, select Discover.
  2. If it is not already selected, choose the books data view in the top left.
  3. Change the time filter in the top right to display data for the last 150 years.
  4. Confirm that Discover lists 11,121 books.
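
The document count can also be checked without Discover by calling the Elasticsearch count API on the books index. The sketch below is one possible way to do that from Python, not the method this website prescribes. It assumes Elasticsearch is reachable at http://localhost:9200 without authentication, which may not match your setup; count_url, parse_count, and fetch_count are hypothetical helper names.

```python
import json
import urllib.request


def count_url(base_url, index):
    """Build the URL of the count API endpoint for an index."""
    return f"{base_url.rstrip('/')}/{index}/_count"


def parse_count(body):
    """Extract the document count from a count API JSON response."""
    return json.loads(body)["count"]


def fetch_count(base_url, index):
    """Query a running cluster (assumed: no authentication required)."""
    with urllib.request.urlopen(count_url(base_url, index)) as response:
        return parse_count(response.read().decode("utf-8"))


# Example call, assuming a local unsecured cluster:
#   fetch_count("http://localhost:9200", "books")
```

If the import succeeded, the returned count should match the number of books shown in Discover.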