Welcome to MSMK7025: Algorithms, Big Data and Online Marketplaces!

Class A: Wed & Sat 2:00pm - 5:00pm (ATC B2)
Class B: Wed & Sat 6:30pm - 9:30pm (ATC B2)
Class Days: Dec 8, 11, 15, 18, 22; Jan 5, 8, 12, 15, 19.
Final Exam: Jan 26 (Wednesday).

Instructor: Xi Li

  • You can download the software for the course here: R, R Studio and Tableau Public
  • Note: When installing R/Rstudio, make sure your path does not contain any non-english characters. (安装路径必须为纯英文,否则运行会出错)
  • No textbooks for the course.

Grading:
10%: Group Paper Presentation (~12 minutes, in class). Details Spreadsheet Class A Class B
30%: Two Group Data Projects

  • For details, please see the slides for the data workshops
  • Deadline for Project I: Dec 29, 5pm (Class A) 9:30pm (Class B)
  • Deadline for Project II: Jan 22, 5pm (Class A) 9:30pm (Class B)

10%: In-class Participation
50%: Final Exam (Multiple Choice Only; Open-book Open-notes)
Sample questions for the final exam will be released in the last session (i.e., Jan 19).

QR code for live comments
QR code for live comments

Lecture 1: Introduction

Slides (No Answer Keys) Slides (With Answer Keys)

Lecture 2: Data analysis: An R approach

This class introduces some basic R operations. The lecture is designed for students with zero background in computer programming/R. We will explore how to use R to conduct simple data analysis.
Note: Please get R installed on your laptop and bring it with you.
Slides
Teaching materials: A beginner’s guide to R
Troubleshooting for package installation: Here

Lecture 3: Beyond Linear Regression (Rescheduled)

Note: Please get R installed on your laptop and bring it with you.
Slides (No Anwer Keys) Slides (With Anwer Keys)

Lecture 4: Data Workshop I (Rescheduled)

Note: Please get R installed on your laptop and bring it with you.
Slides
Dataset: CSV
Sample Data Analysis with R: Sample

Lecture 5: Data visualization: A Tableau approach (Rescheduled)

This class introduces Tableau Public, a power data visualization software. It is simple and you will have fun using it.
Note: Please get Tableau Public installed on your laptop and bring it with you.
Slides
Data files: Superstore Data, Movie Data Tableau Public Gallery
Example of Word Cloud: Word Cloud Tagul for generating Word Cloud: Tagul
Dituhui for creating a map for China

Lecture 6: Data Scraping

Note: Please get R and Chrome browser installed on your laptop and bring it with you.
Slides
Teaching materials: Scraper (HKU), Scaper (Marketing Science Journal), Scraper (MIT), Scraper (Photo)

Lecture 7: Causality

Slides
Optional reading: a discussion of some famous instruments (in simplified Chinese) here
Optional reading: the 2021 Nobel Prize in Economics here for a simplified Chinese version, and here for an English version.

Lecture 8: Text Analysis

Note: Please get R installed on your laptop and bring it with you.
Slides
Codes: text analysis, topic models
A Chinese sentiment lexicon here
Demonstration of sentiment analysis.
Demonstration of LDA. Data files: document and stopword

Lecture 9: Data Workshop II

Note: Please get R installed on your laptop and bring it with you.
Slides
Teaching materials: Quadratic Regression, Interactions and Fixed Effects
Dataset: XLSX, CSV

Note: In the previous dataset, some values were mistakenly treated as strings. The CSV file is now updated, and you can continue using it. We thank the students for pointing this out and apologize for the inconvenience.

Sample Data Analysis with R: Sample

Lecture X: Recommender Systems and Final Review (Online Class)

Note: Due to the university policy, the class will be delivered online exclusively. Please join the class through ZOOM. We will also conduct a brief review of the class with sample questions for final.
ZOOM Link: Class A, Class B
Slides
Final Review
Thanks for taking the course! Finally, the intructor would invite you to conduct an anonymous survey to gather your feedback: Link