dm:mains.santanna.dm4crm.2014
Differenze
Queste sono le differenze tra la revisione selezionata e la versione attuale della pagina.
Entrambe le parti precedenti la revisioneRevisione precedenteProssima revisione | Revisione precedente | ||
dm:mains.santanna.dm4crm.2014 [10/05/2015 alle 20:17 (10 anni fa)] – Dino Pedreschi | dm:mains.santanna.dm4crm.2014 [10/05/2015 alle 20:18 (10 anni fa)] (versione attuale) – [Data Mining for Customer Relationship Management 2015] Dino Pedreschi | ||
---|---|---|---|
Linea 1: | Linea 1: | ||
====== Data Mining for Customer Relationship Management 2014 ====== | ====== Data Mining for Customer Relationship Management 2014 ====== | ||
+ | |||
+ | * **Fosca Giannotti** ISTI-CNR, Knowledge Discovery and Data Mining Lab [[[email protected]]] | ||
+ | |||
+ | * **Dino Pedreschi** Università di Pisa, Knowledge Discovery and Data Mining Lab [[[email protected]]] | ||
+ | |||
+ | * Assistente: **Anna Monreale**, Università di PIsa, Knowledge Discovery and Data Mining Lab [[[email protected]]] | ||
+ | |||
+ | ===== News ===== | ||
+ | |||
+ | * **Instruction for report delivery: send all reports by email (in pdf) to all three instructors (Dino, Fosca, Anna) within June 30, 2014. Specify [MAINS] in the subject of the email.** | ||
+ | * **Before Wednesday 4 June 2013: install KNIME (http:// | ||
+ | * **Before Tuesday 10 June 2013: install Cytoscape (http:// | ||
+ | ====== Goals ====== | ||
+ | |||
+ | Organizations and business are overwhelmed by the flood of data continuously collected into their data warehouses and arriving from external sources – the Web above all. Traditional exploratory techniques may fail to make sense of the data, due to its inherent complexity and size. Data mining and knowledge discovery techniques emerged as an alternative approach, aimed at revealing patterns, rules and models hidden in the data, and at supporting the analytical user to develop descriptive and predictive models for a number of business problems. This short course focusses on the main applications scenarios of data mining to challenging problems in the broad CRM domain - Customer Relationship Management. | ||
+ | |||
+ | ====== Syllabus ====== | ||
+ | |||
+ | * Clustering models for customer segmentation. Discussion of real cases. Hands-on project: segmentation of a base of anonymized customers from the retail industry. Clustering models for competitive intelligence. | ||
+ | * Patterns and association rule mining for market basket analysis. Hands-on project: mining association rules from sales data of the retail industry. | ||
+ | * Prediction models for promotion performance and churn analysis. Discussion of real cases. Hands-on project: churn prediction from a base of anonymized customers from the retail industry. | ||
+ | * Analysis of human mobility patterns by mobility data mining from big data. Mining official data for understanding of human behavior. | ||
+ | * Social network analysis for undestanding diffusion phenomena. Viral marketing. | ||
+ | * Application of data mining to geo-marketing. Analysis of innovators. Predictive models for fraud detection. | ||
+ | |||
+ | |||
+ | ====== Textbooks ====== | ||
+ | |||
+ | * Slides (see Calendar). | ||
+ | |||
+ | * Gordon S. Linoff e Michael J. Berry. //Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management.// | ||
+ | |||
+ | * Pang-Ning Tan, Michael Steinbach, Vipin Kumar. // | ||
+ | * [[http:// | ||
+ | |||
+ | |||
+ | |||
+ | ====== Reading about the "data analyst" | ||
+ | |||
+ | * Data, data everywhere. The Economist, Feb. 2010 {{: | ||
+ | * Data scientist: The hot new gig in tech, CNN & Fortune, Sept. 2011 [[http:// | ||
+ | * Welcome to the yotta world. The Economist, Sept. 2011 {{: | ||
+ | ====== Calendar ====== | ||
+ | |||
+ | ^ ^ Date ^ Topic ^ Learning material ^ | ||
+ | |0. | 11.04.2014 - 09: | ||
+ | |1. | 03.06.2014 - 09: | ||
+ | |2. | 04.06.2014 - 09: | ||
+ | |3. | 05.06.2014 - 09: | ||
+ | |4. | 09.06.2014 - 09: | ||
+ | |5. | 10.06.2014 - 09: | ||
+ | |6. | 11.06.2014 - 09: | ||
+ | |7. | ||
+ | ===== Datasets ===== | ||
+ | |||
+ | |||
+ | 0. Iris dataset. {{: | ||
+ | |||
+ | 1. Shuttle dataset.{{: | ||
+ | |||
+ | |||
+ | ===== Exercises ===== | ||
+ | |||
+ | **1. Market Basket Analysis. ** Problem: given a database of transactions of customers of a supermarket, | ||
+ | |||
+ | **Guidelines for the report:** The report has to illustrate the input dataset, the adopted frequent pattern algorithm and the association rule analysis discussing your findings related to the most interesting rules by using the different measure introduced in the course. | ||
+ | |||
+ | **2. Customer segmentation with k-means.** Problem: given the dataset of RFM (Recency, Frequency and Monetary value) measurements of a set of customers of a supermarket, | ||
+ | * Recency: no. of days since last purchase | ||
+ | * Frequency: no. of visits (shopping in the supermarket) in the observation period | ||
+ | * Monetary value: total amount spent in purchases during the observation period. | ||
+ | |||
+ | **Guidelines for the report:** | ||
+ | |||
+ | * **Data Understanding**: | ||
+ | |||
+ | * **Clustering Analysis by K-means**: Identification of the best value of k and Characterization of the obtained clusters by using both analysis of the k centroids and comparison of the statistics of variables within the clusters and that in the whole dataset. | ||
+ | |||
+ | |||
+ | **3. Churn analysis with decision trees. ** Problem: given a dataset of measurements over a set of customers of an e-commenrce site, find a high-quality classifier, using decision trees, which predicts whether each customers will place only one or more orders to the shop. The explanation of the available variables is {{: | ||
+ | |||
+ | **Guidelines for the report:** The report has to illustrate the input dataset, the adopted classification methodology and the decision tree validation and interpretation. Describe the process adopted to select the proposed tree, together with its quality evaluation. | ||
+ | |||
+ | |||
+ | |||
+ | **Deadline**: | ||
+ | ====== Exams ====== | ||
+ | |||
+ | The exam of the CRM module consists in the evaluation of the reports of the proposed exercises. | ||
+ | |||
+ | |||
+ | ====== Previous editions ====== | ||
+ | |||
+ | * [[MAINS.SANTANNA.DM4CRM.2013]] | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | |||
dm/mains.santanna.dm4crm.2014.1431289052.txt.gz · Ultima modifica: 10/05/2015 alle 20:17 (10 anni fa) da Dino Pedreschi