Schedule

Note: The instructor reserves the right to make changes to this schedule throughout the semester.


Improved Class Structure & Learning Suggestions

Hope every student gets most out of it

  • Focus on in-class coding practices: Think-Pair-Share
  • Spend 9 hours/week outside of class on reading, reviewing, assignments, student hours, Spinelli tutoring hours, & group work

Lec 39: Mon 12/9

Announcements

  • Your final assignments are due at 2pm Wednesday 12/18. No extensions will be granted for the following final assignments
    • MP3 via GitHub
    • Don't forget to fill out the peer evaluation Google Form
    • Optional final exam via Moodle
  • Finals week (until our final assignments are due):
    • My student hours: Wed (Dec 11) 3:45pm-4:45pm, Thur (Dec 12) 9am-10am, Fri (Dec 13) 3:45pm-4:45pm, in McConnell 214
    • If you are not on campus and would like to meet during my regular student hours and appointments, Slack me and I will send you a Zoom link
  • Kind reminder: If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm

Today’s Topics/Activities

In-class exercise

  • Work on MP3

Lec 38: Fri 12/6

Announcements

  • Quiz3 is due at 9pm this Friday (12/6). Please note that extensions will not be granted for quizzes
  • The deadline for revising Lab8 and gaining full credits is today at 9pm
  • Thank you Linh/Zorina for helping with this course!
  • Kind reminder: If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm

Today’s Topics/Activities

In-class exercise

  • Work on MP3

Lec 37: Wed 12/4

Announcements

  • Lab9 is due at 9pm today
  • Quiz3 is due at 9pm this Friday (12/6). Please note that extensions will not be granted for quizzes
  • 15-minutes reflection exercise: Time to fill out course evaluations
  • Kind reminder: If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm

Today’s Topics/Activities

In-class exercise

  • Work on MP3

Lec 36: Mon 12/2

Announcements

  • Quiz3 is due at 9pm this Friday (12/6). Please note that extensions will not be granted for quizzes
  • Your group has been posted in the #in-class-discussions channel. Please ask all questions about MP3 in #questions. Until MP3 is due, you will sit next to your partner(s) in class
  • Kind reminder: If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm

Today’s topics/activities

Chalk talk

  • Thanks for your constructive feedback in the google survey I sent out. If you have further suggestions on this course and my teaching of it, please feel free to let me know via Slack, email, or student appointments
  • Brief summary: Mapping Learning Reflections

In-class exercise

  • Discuss MP3 background reading
  • Work on MP3

Lec 35: Mon 11/25

Announcements

Today’s topics/activities

Chalk talk

Post-class exercise


Lec 34: Fri 11/22

Announcements

  • The deadline for revising Lab6 and gaining full credits is today at 9pm
  • Your grades of Lab8 and Quiz2 have been posted on your personal gradebook
  • The deadline for revising Lab8 and gaining full credits is Friday 12/6 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame
  • Mapping Learning Reflections and Course Assessment Feedback by next Saturday 11/30 at 9pm
  • Dietary Requirements Survey by next Monday 11/25
  • Kind reminder: If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm

Today’s topics/activities

Lab9: Maps

  • Click this link to accept the Lab9 assignment in our Github classroom

Lec 33: Wed 11/20

Announcements

  • Kind reminder: If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm

Today’s topics/activities

Lecture slides


Lec 32: Mon 11/18

Announcements

  • Your grades of MP2 have been posted on your personal gradebook
  • Comments on your code of your mini-projects are often sent to your email associated with your GitHub account. Please read them so you can improve your code for future assignments
  • Dietary Requirements Survey by next Monday 11/25
  • Participate in the Integarting Disability Inclusion Components into Statistics and Data Science Pedagogy post-course survey:
    • If you participated in the pre-course survey, please fill out the post-course survey by Wednesday 12/18 at 2pm
    • You will gain 2 extra credits for completing both the pre-course survey and the post-course survey
    • I'd greatly appreciate if you could complete the post-course survey so I can gain an understanding of how the disability inclusion datasets have helped you learn data science and how these datasets can be improved

Today’s topics/activities

Required pre-class exercise

  • Read MDSR 17.5-17.7
  • Read Lecture slides to understand the three approaches creating choropleth maps

Chalk talk

  • Federal Information Processing Standard (FIPS) codes for counties
  • Example: Looking up the database:
    • 25XXX = Massachusetts counties
    • 25015 = Hampshire County, Massachusetts
    • From Lec32In-classExercises-ChoroplethMaps.qmd -> Section 4 -> Look at contents of mass_pop_orig -> GEOID variable:
    > mass_pop_orig
    Simple feature collection with 14 features and 7 fields
    Geometry type: MULTIPOLYGON
    Dimension:     XY
    Bounding box:  xmin: -73.50814 ymin: 41.23796 xmax: -69.92839 ymax: 42.88659
    Geodetic CRS:  NAD83
    First 10 features:
       GEOID                             NAME   variable estimate moe
    1  25017  Middlesex County, Massachusetts B01003_001  1600842  NA
    2  25005    Bristol County, Massachusetts B01003_001   561037  NA
    3  25025    Suffolk County, Massachusetts B01003_001   796605  NA
    4  25001 Barnstable County, Massachusetts B01003_001   213496  NA
    5  25027  Worcester County, Massachusetts B01003_001   824772  NA
    6  25011   Franklin County, Massachusetts B01003_001    70577  NA
    7  25013    Hampden County, Massachusetts B01003_001   467871  NA
    8  25015  Hampshire County, Massachusetts B01003_001   161032  NA
    9  25021    Norfolk County, Massachusetts B01003_001   700437  NA
    10 25009      Essex County, Massachusetts B01003_001   783676  NA

In-class exercise

Post-class exercise


Lec 31: Fri 11/15

Announcements

  • The deadline for revising Lab5 and gaining full credits is today at 9pm
  • Your grades of Lab7 have been posted on your personal gradebook
  • The deadline for revising Lab7 and gaining full credits is Friday 11/29 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame
  • Quiz3 is assigned today

Today’s topics/activities

Required pre-class exercise

  • Read MDSR 17.3.3-17.4
  • Follow the code in the Lecture slides
  • Take some notes of your questions

Chalk talk

Post-class exercise


Lec 30: Wed 11/13

Announcements

  • Lab8 is due at 9pm today
  • Quiz2 is due at 9pm today. Please note that extensions will not be granted for quizzes

Today’s topics/activities

  • Continue with static maps

Lec 29: Mon 11/11

Announcements

  • Quiz2 is due at 9pm this Wednesday (11/13). Please note that extensions will not be granted for quizzes

Today’s topics/activities

Required pre-class exercise

  • Read MDSR 17.1-17.3.2

Chalk talk

  • Compare the following London underground maps. As stated in this article, the transit map sacrifices accuracy for clarity.
    • The map of stations as they truly exist:
    • The transit map inside the stations and trains. All lines are either straight or at 45 degrees and futhermore the geographic space is distorted

Lecture slides

Post-class exercise


Lec 28: Fri 11/8

Announcements

  • Today's student hours have been rescheduled to 2:00 PM - 3:00 PM over Zoom
  • MP2 is due at 9pm today
  • Both group members: Fill out the peer evaluation Google Form by 9pm today
  • The deadline for revising Lab4 and gaining full credits is today at 9pm
  • Your grades of Lab6 have been posted on your personal gradebook
  • The deadline for revising Lab6 and gaining full credits is Friday 11/22 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame

Today’s topics/activities

In-class exercise

  • Work on MP2

Lec 27: Wed 11/6

Announcements

  • Lab7 is due at 9pm today
  • MP2 is due at 9pm this Friday (11/8)
  • Both group members: Fill out the peer evaluation Google Form by 9pm this Friday (11/8)
  • Quiz2 is due at 9pm next Wednesday (11/13). Please note that extensions will not be granted for quizzes

Today’s topics/activities

In-class exercise

  • Work on MP2

Lec 26: Mon 11/4

Announcements

  • This Friday's student hours will be rescheduled to 2:00 PM - 3:00 PM

Today’s topics/activities

In-class exercise

  • Discuss MP2 background reading
  • Work on MP2

Lec 25: Fri 11/1

Announcements

  • Complete data wrangling learning reflections by today
  • Your grades of Lab5 and Quiz1 have been posted on your personal gradebook
  • The deadline for revising Lab5 and gaining full credits is Friday 11/15 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame

Today’s topics/activities

Lab8: Programming in R

  • Click this link to accept the Lab8 assignment in our Github classroom

Lec 24: Wed 10/30

Announcements

  • Lab6 is due at 9pm today

Today’s topics/activities

Required pre-class exercise

  • Read MDSR 7.1-7.5
  • Follow the code in the Lecture slides
  • Take some notes of your questions

Lecture slides

In-class exercise

Optional post-class exercise


Lec 23: Mon 10/28

Announcements

  • Mini-Project 2 (MP2) assigned today:
    • I have posted the groups of two in the #in-class-discussions channel. Please ask all of your questions related to MP2 in the #questions channel. Until MP2 is due, you will sit next to your partner in class
  • Complete data wrangling learning reflections by 11/1 this Friday

Today’s topics/activities

Chalk talk

Lab7: Advanced Git

  • This lab is a group assignment. You'll work with your MP2 partner to learn a new approach to collaborating in GitHub
  • Click this link to accept the Lab7 assignment in our Github classroom, please strictly enter Lab7-Section(S1 or S2)-Group(your MP2 group number) (if you find that your group has been created, join your group)

Lec 22: Fri 10/25

Announcements

  • Your grades of Lab4 have been posted on your personal gradebook
  • Comments on your code of your labs are often sent to your email associated with your GitHub account. Please read them via GitHub so you can improve your code for future assignments
  • The deadline for revising Lab4 and gaining full credits is Friday 11/8 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame

Today’s topics/activities

Lab6: Tidy Data

  • Click this link to accept the Lab6 assignment in our Github classroom
  • Readings for Exercise 10:
    1. Don't count on it: How the HUD Point-in-Time count underestimates the homelessness crisis in America (access Perusall via our course Moodle page)

Lec 21: Wed 10/23

Announcements

  • Lab5 is due at 9pm today
  • Quiz2 is assigned today

Readings

  • Deep neural networks are more accurate than humans at detecting sexual orientation from facial images (access Perusall via our course Moodle page)
  • What does it mean to ‘solve’ the problem of discrimination in hiring? Social, technical and legal perspectives from the UK on automated hiring systems (access Perusall via our course Moodle page)

Today’s topics/activities

Chalk talk

In-class exercise

  • Discuss data ethics

Lec 20: Mon 10/21

Announcements

  • Quiz1 is due by 9pm today. Please note that extensions will not be granted for quizzes
  • The deadline for revising Lab3 and gaining full credits is today at 9pm. Lab3 resubmission will not be accepted after this deadline

Today’s topics/activities

Required pre-class exercise

  • Read MDSR Appendix C - Algorithmic thinking
  • Follow the code in the Lecture slides
  • Take some notes of your questions

In-class exercise

Optional post-class exercise


Lec 19: Fri 10/18

Announcements

  • Quiz1 is due by 9pm next Monday (10/21). Please note that extensions will not be granted for quizzes

Today’s topics/activities

Lab5: Joining Tables

  • Click this link to accept the Lab5 assignment in our Github classroom

Lec 18: Wed 10/16

Announcements

  • Lab4 is due at 9pm today
  • Quiz1 is due by 9pm next Monday (10/21). Please note that extensions will not be granted for quizzes
  • Your grade of MP1 has been posted on your personal gradebook
  • Detailed comments on your code of your mini-projects are often sent to your email associated with your GitHub account. Please read them via GitHub so you can improve your code for future assignments

Today’s topics/activities

Post readings (Please take some notes while reading)

  • Deep neural networks are more accurate than humans at detecting sexual orientation from facial images (access Perusall via our course Moodle page)
  • What does it mean to ‘solve’ the problem of discrimination in hiring? Social, technical and legal perspectives from the UK on automated hiring systems (access Perusall via our course Moodle page)

Lec 17: Fri 10/11

Announcements

  • The deadline for revising Lab2 and gaining full credits is today at 9pm. Lab2 resubmission will not be accepted after this deadline

Today’s topics/activities

Lab4: Wrangling Single Table

  • Click this link to accept the Lab4 assignment in our Github classroom

Lec 16: Wed 10/9

Today’s topics/activities

Required pre-class exercise

Lecture slides

In-class exercise

Optional post-class exercise


Lec 15: Mon 10/7

Announcements

  • Your grade of Lab3 has been posted on your personal gradebook
  • Comments on your code of your labs are often sent to your email associated with your GitHub account. Please read them via GitHub so you can improve your code for future assignments
  • The deadline for revising Lab3 and gaining full credits is Monday 10/21 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame

Today’s topics/activities

Required pre-class exercise

In-class exercise

Optional post-class exercise


Lec 14: Fri 10/4

Announcements

  • No class today

Lec 13: Wed 10/2

Today’s topics/activities

Required pre-class exercise

Chalk talk

  • Thanks so much for your constructive feedback and thoughtful self-reflections on your learning in the Course Structure Feedback google survey I sent out. Improved Class Structure & Learning Suggestions
  • Six functions to wrangle single table
  • What is pseudocode?

In-class exercise

Optional post-class exercise


Lec 12: Mon 9/30

Announcements

  • MP1 is due at 9pm today
  • Both group members: Fill out the peer evaluation Google Form by 9pm today
  • The deadline for revising Lab1 and gaining full credits is today at 9pm. Lab1 resubmission will not be accepted after this deadline
  • Quiz1 is assigned today

Today’s topics/activities

In-class exercise

  • Work on MP1

Lec 11: Fri 9/27

Announcements

  • Your grade of Lab2 has been posted on your personal gradebook
  • Comments on your code of your labs are often sent to your email associated with your GitHub account. Please read them via GitHub so you can improve your code for future assignments
  • The deadline for revising Lab2 and gaining full credits is Friday 10/11 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame

Today’s topics/activities

Chalk talk

  • How to view your comments via GitHub?

In-class exercise

  • Work on MP1

Lec 10: Wed 9/25

Announcements

Today’s topics/activities

In-class exercise

  • Discuss MP1 background reading
  • Work on MP1

Lec 09: Mon 9/23

Announcements

  • Please read MP1 background reading and take some notes before this Wednesday's class
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Chalk talk

  • Trend lines via a geom_smooth() layer. Two types:
    • Linear regression
    • LOWESS: Locally Weighted Scatterplot Smoothing
  • Totally not relevant to this course, but if you're interested, you could take a look at gganimate about how to create animated plots in R. Just for fun!
  • Authoring Markdown Documents
  • Data fundamentals

Lec 08: Fri 9/20

Announcements

  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Lab3: Recap 5NG

  • Click this link to accept the Lab3 assignment in our Github classroom
  • Readings for Step 11: Ethical Conisderations in Lab 3
    1. Algorithmic bias detection and mitigation: Best practices and policies to reduce consumer harms
    2. Battling algorithmic bias: How do we ensure algorithms treat us fairly? (access Perusall via our course Moodle page)
    3. The dark side of customer analytics (access Perusall via our course Moodle page)
    4. Electronic monitoring and privacy issues in business-marketing: The ethics of the doubleclick experience (access Perusall via our course Moodle page)
  • In this course, we use the APA references style. For example, if you will cite this lab's readings, the references should look like:
    1. Lee, N. T., Resnick, P., & Barton, G. (2019, May 22). Algorithmic bias detection and mitigation: Best practices and policies to reduce consumer harms. Brookings. https://www.brookings.edu/research/algorithmic-bias-detection-and-mitigation-best-practices-and-policies-to-reduce-consumer-harms/#footnote-7
    2. Kirkpatrick, K. (2016). Battling algorithmic bias: How do we ensure algorithms treat us fairly? Communications of the ACM, 59(10), 16-17. https://dl.acm.org/doi/10.1145/2983270
    3. Davenport, T. H., & Harris, J. G. (2007, May). The dark side of customer analytics. Harvard Business Review Case Study and Commentary.
    4. Charters, D. (2002). Electronic monitoring and privacy issues in business-marketing: The ethics of the doubleclick experience. Journal of Business Ethics, 35(4), 243-254.

Lec 07: Wed 9/18

Announcements

  • Complete Class Structure Feedback and ggplot2 Learning Reflections by next Wednesday 9/25
  • Mini-Project 1 (MP1) assigned today:
    • You will be assigned groups for MP1 and MP2. You can choose your groups for MP3
    • I have posted the groups of two/three in the #in-class-discussions channel. Please ask all of your questions related to MP1 in the #questions channel. Until #mp1 is due, you will sit next to your partner in class
  • Discussion on managing group dynamics:
    • When issues arise, let me know as early as possible
    • Don’t forget you’ll be filling out peer evaluation Google Form
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Chalk talk

  • Discuss MP1
  • Descriptive statistics
  • Interpret boxplots

Lecture slides

In-class exercise

  • Boxplots! Powerful, but tricky!
  • Say we want to study the distribution of the following 12 values which are pre-sorted:

    1, 3, 5, 6, 7, 8, 9, 12, 13, 14, 15, 30

    They have the following summary statistics.

    Min. 1st Quartile Median = 2nd Quartile 3rd Quartile Max. IQR
    1 5.5 8.5 13.5 30 8 = 13.5 - 5.5

    Let’s compare the points and the corresponding boxplot side-by-side with the values on the \(y\)-axis matching:

  • Create boxplots using ggplot2

Lec 06: Mon 9/16

Announcements

  • Your personal gradebook was sent to you by Slackbot during the weekend
  • Your grades of the syllabus quiz and Lab1 have been posted on your personal gradebook
  • Comments on your code of your labs are sent to you on Slack and sent to your email associated with your GitHub account. Please read them so you can improve your code for future assignments
  • The deadline for revising Lab1 and gaining full credits is Monday 9/30 at 9pm. Labs resubmission will not be accepted outside of the two weeks' time frame
  • The correct answers to the common errors in the syllabus quiz:
    • Student hours: McConnell 214 / WF 3:45 PM - 4:45 PM, TH 9:00 AM - 10:00 AM
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Chalk talk

  • How to view your personal gradebook?
    1. Please check the name on the gradebook is you
    2. Please save this link so you can track your grades throughout this semester
    3. You can only view this gradebook
  • Type of variables
  • Color theory
    1. color vs fill aesthetics in ggplot2: `fill` = gets used for polygons, and `color` = gets used for points and lines
    2. brewer vs distiller: `scale_*_brewer` = gets used for categorical variables, and `scale_*_distiller` = gets used for numerical variables
    3. Selecting an appropriate color palette from colorbrewer2.org
    4. Also see Figure 2.11 in MDSR
    5. Also define colors in terms of hex codes
    6. scale_color_manual vs scale_fill_manual
    7. scale_color_viridis() vs scale_fill_viridis()

In-class exercise


Lec 05: Fri 9/13

Announcements

  • Please read ModernDive 2.5-2.6, 2.8, and MDSR 2.2.2 before Monday's class
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Lab2: Overplotting + Graphics

  • Click this link to accept the Lab2 assignment in our Github classroom

Lec 04: Wed 9/11

Announcements

  • If you haven't been added to the course Slack channels, please let me know
  • For a truly unique perspective on Data Visualization: Mona Chalabi. See video below:
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Chalk talk


Lec 03: Mon 9/9

Announcements

  • Please read ModernDive 2.4 before Wednesday's class
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Chalk talk


Lec 02: Fri 9/6

Announcements

  • Please read MDSR 2 before Monday's class
  • Kind reminder: If you agree to participate, please fill out the pre-course survey by September 25, Wednesday

Today’s topics/activities

Chalk talk

  • Create a project environment for SDS192

Lab1: GitHub


Lec 01: Wed 9/4

Announcements

  • Welcome!
  • Before this Friday (9/6)'s class:
    • Record your name by visiting this course and click on the Name Recorder link if you haven't done so and then learn how to pronounce each other's names via our course Moodle page
    • Complete syllabus quiz via our course Moodle page
  • Participate in the Integrating Disability Inclusion Components into Statistics and Data Science Pedagogy study (IRB approved):
    • If you agree to participate, please fill out the pre-course survey by September 25, Wednesday
    • You will gain 2 extra credits for completing both the pre-course survey and the post-course survey
    • If you are younger than 18 years old, please DM me on Slack and you will automatically gain 2 extra credits
    • Your responses will be confidential. Data from this study without identifiers to individuals may be used in research paper, presentation, etc.

Today’s topics/activities