Cloud based predictive analytics

Text classification, recommender systems and decision support

Klavdiya Hammond, Aparna Varde

Research output: Contribution to conference > Paper > Research > peer-review

8 Citations (Scopus)

Abstract

This paper presents a detailed study of technologies based on Hadoop and MapReduce available over the cloud for large-scale data mining and predictive analytics. Although some studies may have shown that cloud technologies relying on the MapReduce framework do not perform as well as parallel database management systems, e.g., with ad hoc queries and interactive applications, MapReduce has still been widely used by many organizations for big data storage and analytics. A number of MapReduce based tools are broadly available over the cloud. In this work we explore the Apache Hive data warehousing solution and particularly its Mahout data mining libraries for predictive analytics. We present results in the context of text classification, recommender systems and decision support. We develop prototype tools in these areas and discuss our outcomes from the study useful to researchers and other professionals in cloud computing and application domains. To the best of our knowledge, ours is among the first few in-depth studies on Mahout with application prototypes available for use.

Original language: English
Pages: 607-612
Number of pages: 6
DOI: https://doi.org/10.1109/ICDMW.2013.95
State: Published - 1 Jan 2013
Event: 2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013 - Dallas, TX, United States
Duration: 7 Dec 2013 - 10 Dec 2013

Other

Other: 2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013
Country: United States
City: Dallas, TX
Period: 7/12/13 - 10/12/13

Fingerprint

Recommender systems
Data mining
Data warehouses
Cloud computing
Predictive analytics

Keywords

  • Cloud computing
  • Data mining
  • Mahout
  • Predictive analytics

Cite this

Hammond, K., & Varde, A. (2013). Cloud based predictive analytics: Text classification, recommender systems and decision support. 607-612. Paper presented at 2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013, Dallas, TX, United States. https://doi.org/10.1109/ICDMW.2013.95
@conference{7852dc5383e449edaf88710e349a3b9e,
title = "Cloud based predictive analytics: Text classification, recommender systems and decision support",
keywords = "Cloud computing, Data mining, Mahout, Predictive analytics",
author = "Klavdiya Hammond and Aparna Varde",
year = "2013",
month = "1",
day = "1",
doi = "10.1109/ICDMW.2013.95",
language = "English",
pages = "607--612",
note = "Conference date: 07-12-2013 through 10-12-2013",

}


TY - CONF

T1 - Cloud based predictive analytics

T2 - Text classification, recommender systems and decision support

AU - Hammond, Klavdiya

AU - Varde, Aparna

PY - 2013/1/1

Y1 - 2013/1/1

KW - Cloud computing

KW - Data mining

KW - Mahout

KW - Predictive analytics

UR - http://www.scopus.com/inward/record.url?scp=84898036609&partnerID=8YFLogxK

U2 - 10.1109/ICDMW.2013.95

DO - 10.1109/ICDMW.2013.95

M3 - Paper

SP - 607

EP - 612

ER -
