Interface for querying and data mining for the IMDb dataset

Martin Butler, Stefan Robila

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

This paper describes the design and implementation of a tool to extract the IMDb dataset files and import them into a database. This approach differs from other published tools or research in that the previous work used relational databases. This tool uses document oriented data structures, and allows others to augment the code to change structures based on their needs. The project development required the use of technologies currently in demand for web developers and software engineers, which allows other developers to fork a copy of the work and utilize in their own work. In addition, it provided the project team an opportunity to develop additional marketable skills. Finally, a web interface to perform queries against the import data to validate the import process was also developed.

Original languageEnglish
Title of host publication2016 IEEE Long Island Systems, Applications and Technology Conference, LISAT 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781467384902
DOIs
StatePublished - 16 Jun 2016
EventIEEE Long Island Systems, Applications and Technology Conference, LISAT 2016 - Farmingdale, United States
Duration: 29 Apr 2016 → …

Publication series

Name2016 IEEE Long Island Systems, Applications and Technology Conference, LISAT 2016

Other

OtherIEEE Long Island Systems, Applications and Technology Conference, LISAT 2016
Country/TerritoryUnited States
CityFarmingdale
Period29/04/16 → …

Keywords

  • IMDb Database
  • Large Data Set Processing
  • Unstructured Databases

Fingerprint

Dive into the research topics of 'Interface for querying and data mining for the IMDb dataset'. Together they form a unique fingerprint.

Cite this