Now organized into chapters which can be independently run.
"New" relative to the data in the archive directory. All from Jeff Sackmann’s GitHub. Sackmann’s data is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. According to its ShareAlike term, data provided here falls under the same license. The title of the data directory cannot be changed without making corresponding changes in the Jupyter notebooks under the Code directory.
Even though the language breakdown describes the project as almost all Jupyter notebook, the majority of analysis is done in R and stored in the various .Rmd files. This issue is outlined here: github-linguist/linguist#5208. Python is only used for preparing the data for analysis and extracting aggregate stats. The real language breakdown would be in the ballpark of 70% R and 30% Python.