Data Management With R (latest version)
Intro
This course covers many different topics related to the processing and management of data. For a quick overview of many of the topics see the “Topics” section in the Appendix.
*** ignore the material below ***
Just in case, you decided to read this anyway … I’ll explain. The following “TODO” items are meant as a reminder for me, YR, as I am working on this book. It is not intended for you, the student. If you are not me (i.e. YR), you should ignore what it says from here down. I intend to remove this section before the semester starts, but if I don’t just ignore it.
Note that some of the info below appears in other places in the text. If the material below appears in other parts of this book, you should read it where it appears.
TODO - add these topics to the text
binary vs text (ascii, unicode)
structured data vs unstructured data
structured textual data
- html, xml, css
- md (different flavors
- JSON
- YAML
- TOML
099900-toDo-v001.qmd
The following is the contents of the 099900-toDo-v001.qmd file (also linked as the “Additional topics” entry in the menu at the left).
I put this here as a reminder to me, YR, to add this material to the main body of this book.
55. Additional topics
The following are additional topics that I plan to move to the main portion of the text. This page is a temporary place to put the info as it’s being developed.
Importing Data
https://rafalab.dfci.harvard.edu/dsbook-part-1/R/importing-data.html
Excel
Read excel file with formulas in cells into R
https://stackoverflow.com/questions/32014692/read-excel-file-with-formulas-in-cells-into-r
XLConnect
https://www.rdocumentation.org/packages/XLConnect/versions/1.0.8/topics/getCellFormula-methods https://cran.r-project.org/web/packages/XLConnect/vignettes/XLConnect.pdf https://financetrain.com/using-xlconnect-in-r-programming
Using AI to code
Using Gihub Copilot with RStudio
https://www.youtube.com/watch?v=t7NrkAeosog&list=PLSjaBj3ul8GpuMUaSGME2_NomnIsYWIDj
APIs
How to work with APIs uing R (httr2 package tutorial) https://www.youtube.com/watch?v=8tilyqp4bZY
Trends in API world https://www.forbes.com/sites/forbestechcouncil/2024/02/07/four-trends-shaping-apis-in-2024/?sh=14a432ea3369
Code for graphs and charts - e.g. mermaid
Info about mermaid: https://mermaid.js.org/
Using in Quarto: https://quarto.org/docs/authoring/diagrams.html
Online books
Introduction to Data Science Data Wrangling and Visualization with R
https://rafalab.dfci.harvard.edu/dsbook-part-1/
Advanced Data Science Statistics and Prediction Algorithms Through Case Studies
https://rafalab.dfci.harvard.edu/dsbook-part-2/
Online exercises and other resources
https://docs.dodona.be/en/ https://dodona.be/en/activities/?tab=all&programming_language=R