LABORATORY OF DATA SCIENCE (2021/2022)

Instructors:

News

  • [01-12-2021]: Third and final part of the project is up
  • [16-11-2021]: Instructions for the SSAS project from today's lecture. To avoid conflicts during deployment/processing, follow these steps once the solution is open: (1) rename the project as <your account>_foodmart; (2) from the project properties select 'Deployment', then rename the database as <your account>_foodmart; (3) click the “Show All Files” button just above “Solution Explorer”, right-click the .database file that appears and choose “View Code”, then change the ID from ruggieri_foodmart to <your account>_foodmart and save the file; (4) change the credentials of the connection to the database on SQL Server. Alternatively, you may import the project from the SSAS server and rename it as <your account>_foodmart (step 4 is still necessary).
  • [15/10/2021] Instructions for installing Data Tools for Visual Studio 2019 are in the software section of the wiki. Please follow them closely, step by step.
  • [15/10/2021] IMPORTANT The first part of the project is available. Checkpoint: 15 November.
  • [02/10/2021] The lecture of Monday 4th October will be canceled.
  • [08/09/2021] The first lecture will be on 16 Sept.
  • [16/09/2021] IMPORTANT Please fill in the document at the following link with your information, so that we can give you access to the teaching database and the mailing list: https://docs.google.com/spreadsheets/d/1yYzHXmykhbfwy7G9uB_Z1fGcW_Vtvjugy4Yvlj-aM2Y/edit?usp=sharing
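
Step (3) of the 16-11-2021 SSAS instructions above amounts to replacing one identifier in the .database file. A minimal Python sketch of that replacement follows, assuming the file is plain text that contains the old ID literally and is UTF-8 encoded. The function names, the backup step, and the account name jdoe are hypothetical and not part of the course material; editing through Visual Studio as described remains the reference procedure.

```python
from pathlib import Path


def rename_database_id(text: str, old_id: str, new_id: str) -> str:
    """Return text with every literal occurrence of old_id replaced by new_id."""
    if old_id not in text:
        # Fail loudly instead of silently writing back an unchanged file.
        raise ValueError(f"{old_id!r} not found; nothing to rename")
    return text.replace(old_id, new_id)


def rename_in_file(path: Path, old_id: str, new_id: str) -> None:
    """Rewrite the file in place, keeping a .bak copy of the original."""
    original = path.read_text(encoding="utf-8")  # assumption: UTF-8 file
    updated = rename_database_id(original, old_id, new_id)
    path.with_name(path.name + ".bak").write_text(original, encoding="utf-8")
    path.write_text(updated, encoding="utf-8")


if __name__ == "__main__":
    # Hypothetical fragment; the real .database file has more content.
    sample = "<ID>ruggieri_foodmart</ID><Name>ruggieri_foodmart</Name>"
    print(rename_database_id(sample, "ruggieri_foodmart", "jdoe_foodmart"))
```

Because the sketch replaces every occurrence, it would also rename the database name if that appears in the same file, which matches step (2) but should be checked before deploying.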

Hours and Rooms

Classes

Lessons will be held online on the Teams platform.

Day of Week Hour Room
Monday 11:00 - 12:45 Teams
Thursday 09:00 - 10:45 Teams

Link to Teams module: https://teams.microsoft.com/l/team/19%3amm3HFMqMSvpUrGY2sMYlpzxQ-atdxhfXreRUHhvrODs1%40thread.tacv2/conversations?groupId=c196ac40-93a2-4436-adfe-a81af3d06eef&tenantId=c7456b31-a220-47f5-be52-473828670aa1

Learning Material

Slides & Recordings of the classes

  • The slides used in the course will be added to the calendar after each class.
  • A recording of each lecture will be available on Teams.

Past Exams

Software

F.A.Q.

Class calendar - (2021-2022)

Day Topic Slides Data/Software References Video Lectures Teacher
13.09 11:00-12:45 Lecture canceled
1. 16.09 09:00-10:45 Introduction. File data access. 2021-lds.01.introduction.pdf 2020-lds.02.bi_architectures.pptx.pdf 2020-lds.03.file_data_access.pptx.pdf - BI technology: An Overview of Business Intelligence Technology - File access: File System Interface Video1 Video2 Monreale
2. 20.09 11:00-12:45 Representation formats: CSV, FLV, ARFF, XML. Python Recap 2020-lds.04.python.pptx.pdf - File Formats: Introduction to data technologies(Chps. 5, 6), Weka ARFF Format, XRFF Format - Python reference: Free python book with exercises Video1 Video2 Pellungrini
3. 23.09 11:00-12:45 File Access in Python lds.05.fileaccess-python2021.pdf census.csv.zip Collection of files Partial Solutions to Python Exercises Video1 Video2 Pellungrini
4. 27.09 9:00-10:45 File Access in Python Practice lds.05.fileaccess-python2021.pdf census.csv.zip Collection of files Partial Solutions to Python Exercises csv to Arff conversion solution Video Pellungrini
5. 30.09 9:00-10:45 Python Exercises ex-customers.pdf ex-customers_solution.zip data-customers.zip lds.file.format.zip Video1 Video2 Video3 Pellungrini
04.10 11:00-12:45 Lecture canceled
6. 07.10 9:00-10:45 RDBMS access protocols: ODBC, OLE DB, JDBC. ODBC Programming. lds.06.relational_data_access-2021.pdf Monreale
7. 11.10 11:00-12:45 RDBMS access protocols: ODBC, OLE DB, JDBC. ODBC Programming. lds.06.relational_data_access-2021.pdf 2021-code-db-samples.zip Monreale
8. 14.10 9:00-10:45 Stratified sampling lds.07.sqlserver.pdf stratifiedsampling.zip Video Pellungrini
9. 18.10 12:00-12:45 ETL Introduction lds.08.etlandssis.pdf Video Monreale
10. 21.10 9:00-10:45 SSIS: toCSV, FromCSV 2021-lds-etl-project.zip Video Monreale
11. 25.10 11:00-12:45 SSIS exercises: Pipeline, Update exercisefact_table.pdf Video Monreale
12. 28.10 9:00-10:45 SSIS exercises: Stratified Subsampling ex-midterm.pdf Monreale
13. 04.11 9:00-10:45 Project Support & Discussion Monreale
14. 08.11 11:00-12:45 SSIS: Surrogate keys + Slowly changing dimensions Monreale
15. 11.11 9:00-10:45 SSIS: Slowly changing dimensions + Datawarehousing and OLAP recap. lds.09.dwandolap.pdf 2021-lds-etl-project_full.zip Monreale
16. 15.11 11:00-12:45 OLAP with SQL Server Analysis Services (SSAS): data source views, dimensions, hierarchies. Data cubes, Parent-child hierarchies. lds.09.ssas-21.pdf foodmart_monreale_full-cube.zip 1) SSAS (olap): documentation; 2) S. Harinath et al. Professional Microsoft SQL Server Analysis Services 2012 with MDX and DAX, Wrox publisher, 2012. Chps. 4-6. Monreale
17. 18.11 11:00-12:45 Cube deployment. Measure setup, Calculated Members, Excel power pivot integration. ROLAP, MOLAP, HOLAP definition and setup. Cache management. foodmartexplorative.xlsx foodmart_monreale_complete.zip Monreale
18. 22.11 11:00-12:45 Introduction to MDX 2021-mdxquery-demo-partial.mdx.zip MDX: 1) documentation and a useful guide on ordering; 2) S. Harinath et al. Professional Microsoft SQL Server Analysis Services 2012 with MDX and DAX, Wrox publisher, 2012. Chp. 3. Since the video of this lecture has some issues, I'm linking the video from last year. It is not identical but very similar. There are two videos because the lectures of the two years could not be completely aligned. Video1 Video2 Monreale
19. 25.11 09:00-11:00 Practice on MDX Exercises to be done before the next lecture: 1) Queries you have already answered with Excel in Lecture N.17 2) This list of queries: mdx-ex.pdf 2021-mdxquery-demo.mdx.zip Video 1 Video 2 Monreale
20. 29.11 11:00-12:45 Practice on MDX lbi.09.mdxpractice.mdx.zip Video 1 Pellungrini
21. 02.12 09:00-10:45 Practice on MDX mdx_exercises_2021.txt.zip mdxquerytop.mdx.zip
22. 09.12 09:00-10:45 Practice on MDX + PowerBI
23. 13.12 11:00-13:00 ROOM C Lecture by Microstrategy
24. 16.12 09:00-10:45 Lecture only ONLINE dedicated to Project discussion with groups

Exams

PROJECT

The project consists of a set of assignments corresponding to a BI process: data integration, construction of an OLAP cube, querying of an OLAP cube, and reporting.

The project must be carried out by a team of 2 students (at most 3, after obtaining authorization from the teachers).

Project to be delivered by 31 December 2021

Project to be delivered during the exam sessions

Students who did not deliver the above project by 31 December 2021 must request a new project from the teachers by email. The assigned project will require about 2 weeks of work and, after delivery, will be discussed during the oral exam. For those students, the oral exam will also cover some practical parts that could not be included in the project. Please write to both teachers!

Exam sessions

Session Date Time Room Notes Marks

Past Editions

mds/lbi/lds_2021-2022.txt · Last modified: 04/11/2022 at 12:17 (2 years ago) by Salvatore Ruggieri
