ICOM6044 - Data science for business
Instructor Professor Alan Montgomery ( alm3@andrew.cmu.edu)
Teaching assistant Mr. Jason Ma
Syllabus Data mining focuses on identifying patterns using past transactions to discover relationships. By its very nature electronic commerce is able to generate large amounts of information and data mining methods are quite helpful for managers in turning this information into knowledge which in turns can be used to make better decisions. These quantitative methods have the potential to dramatically change decision making in many areas of business. For example ideas like interactive marketing, customer relationship management, and database marketing are pushing companies to utilize the information they collect about their customers in order to make better marketing decisions.

This module focuses on how data mining techniques can be applied to solve managerial problems in marketing and electronic commerce. The emphasis is on understanding and applying existing techniques using computer software tools. The set of data mining techniques and marketing problems that can be studied is immense; therefore our strategy will be to focus on popular techniques like decision trees, logistic regression, linear regression, and text processing methods. Each of these techniques is applied to a specific case study in which students will be asked to solve a business problem using the specified approach. The objective is for students to be able to generalize their experience in these settings to other problems using the same technique.
Objectives
  • Understand the data mining process
  • Introduce and understand a set of data mining techniques
  • Illustrates these techniques to specific case studies for solving business and e-commerce problems
Learning outcomes Upon completion of this module you will understand the role of data mining in addressing many problems that arise in electronic commerce. Our focus is on learning a core set of principles that underlies many data mining techniques, and consider their applications to specific problems like: analyzing clickstream data from web traffic, market segmentation, predicting customer profitability and retention, and analyzing text data from keyword search and social media. We consider a compatible set of techniques to analyze these problems, such as cluster analysis, linear regression, regression trees, and logistic regression. The focus of this course is on the application and interpretation of the techniques, as opposed to implementing the algorithms.
Prior knowledge expected It is expected that students understand that this module makes extensive use of statistics and mathematics.
Topics covered
  1. Understanding Data and the Data Mining Process
  2. Exploring and Visualizing Data
  3. Data Based Decision Making
  4. Market Segmentation of Customers
  5. Predictive Modeling
  6. Identifying Profitable Customers
  7. Overfitting and Evaluating Models
  8. Data Mining Techniques for Prediction
  9. Working with Unstructured Datasets
  10. Business Strategy for Employing Data Science
Teaching format 10 three-hour lectures with prescribed readings to provide important background information for the lectures. Students are asked to complete a considerable amount of computer and analysis work.
Assessment
  • In-course assessment (65%)
    •  Class Participation (10%)
    • Three case assignments (55%)
  • Final examination (35%)
    Date: 11 August 2015 (Tue)
    Time: 7:30pm - 9:30pm
    Venue: TBA
    Note: The textbook for this module is permitted during the examination along with a one-page sheet (A4) of notes.  This note sheet may be typed or hand-written on both sides, but the type must be legible (7 point or larger)
Course materials Prescribed textbook: Foster Provost and Tom Fawcett, Data Science for Business: What you need to know about data mining and data-analytic thinking, O'Reilly Media; 1 edition (August 16, 2013)
Enrolment No add or drop after 30 June 2015.
Class quota Class size will be maintained at 45.
 
- Back -