TY - GEN
T1 - Interface for querying and data mining for the IMDb dataset
AU - Butler, Martin
AU - Robila, Stefan
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/6/16
Y1 - 2016/6/16
N2 - This paper describes the design and implementation of a tool to extract the IMDb dataset files and import them into a database. This approach differs from other published tools or research in that the previous work used relational databases. This tool uses document oriented data structures, and allows others to augment the code to change structures based on their needs. The project development required the use of technologies currently in demand for web developers and software engineers, which allows other developers to fork a copy of the work and utilize in their own work. In addition, it provided the project team an opportunity to develop additional marketable skills. Finally, a web interface to perform queries against the import data to validate the import process was also developed.
AB - This paper describes the design and implementation of a tool to extract the IMDb dataset files and import them into a database. This approach differs from other published tools or research in that the previous work used relational databases. This tool uses document oriented data structures, and allows others to augment the code to change structures based on their needs. The project development required the use of technologies currently in demand for web developers and software engineers, which allows other developers to fork a copy of the work and utilize in their own work. In addition, it provided the project team an opportunity to develop additional marketable skills. Finally, a web interface to perform queries against the import data to validate the import process was also developed.
KW - IMDb Database
KW - Large Data Set Processing
KW - Unstructured Databases
UR - http://www.scopus.com/inward/record.url?scp=84978505032&partnerID=8YFLogxK
U2 - 10.1109/LISAT.2016.7494103
DO - 10.1109/LISAT.2016.7494103
M3 - Conference contribution
AN - SCOPUS:84978505032
T3 - 2016 IEEE Long Island Systems, Applications and Technology Conference, LISAT 2016
BT - 2016 IEEE Long Island Systems, Applications and Technology Conference, LISAT 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - IEEE Long Island Systems, Applications and Technology Conference, LISAT 2016
Y2 - 29 April 2016
ER -