Data Management With R (latest version)

Author

Y. Rosenthal

Intro

This course covers many different topics related to the processing and management of data. For a quick overview of many of the topics see the “Topics” section in the Appendix.

*** ignore the material below ***

Just in case, you decided to read this anyway … I’ll explain. The following “TODO” items are meant as a reminder for me, YR, as I am working on this book. It is not intended for you, the student. If you are not me (i.e. YR), you should ignore what it says from here down. I intend to remove this section before the semester starts, but if I don’t just ignore it.

Note that some of the info below appears in other places in the text. If the material below appears in other parts of this book, you should read it where it appears.

TODO - add these topics to the text

  • binary vs text (ascii, unicode)

  • structured data vs unstructured data

  • structured textual data

    • html, xml, css
    • md (different flavors
    • JSON
    • YAML
    • TOML

099900-toDo-v001.qmd

The following is the contents of the 099900-toDo-v001.qmd file (also linked as the “Additional topics” entry in the menu at the left).

I put this here as a reminder to me, YR, to add this material to the main body of this book.

55. Additional topics

The following are additional topics that I plan to move to the main portion of the text. This page is a temporary place to put the info as it’s being developed.

Importing Data

https://rafalab.dfci.harvard.edu/dsbook-part-1/R/importing-data.html

Excel

Read excel file with formulas in cells into R
https://stackoverflow.com/questions/32014692/read-excel-file-with-formulas-in-cells-into-r

XLConnect
https://www.rdocumentation.org/packages/XLConnect/versions/1.0.8/topics/getCellFormula-methods https://cran.r-project.org/web/packages/XLConnect/vignettes/XLConnect.pdf https://financetrain.com/using-xlconnect-in-r-programming

Using AI to code

Using Gihub Copilot with RStudio
https://www.youtube.com/watch?v=t7NrkAeosog&list=PLSjaBj3ul8GpuMUaSGME2_NomnIsYWIDj

APIs

How to work with APIs uing R (httr2 package tutorial) https://www.youtube.com/watch?v=8tilyqp4bZY

Trends in API world https://www.forbes.com/sites/forbestechcouncil/2024/02/07/four-trends-shaping-apis-in-2024/?sh=14a432ea3369

Code for graphs and charts - e.g. mermaid

Info about mermaid: https://mermaid.js.org/

Using in Quarto: https://quarto.org/docs/authoring/diagrams.html

Online books

Introduction to Data Science Data Wrangling and Visualization with R
https://rafalab.dfci.harvard.edu/dsbook-part-1/

Advanced Data Science Statistics and Prediction Algorithms Through Case Studies
https://rafalab.dfci.harvard.edu/dsbook-part-2/

Online exercises and other resources

https://docs.dodona.be/en/ https://dodona.be/en/activities/?tab=all&programming_language=R

https://financetrain.com/using-xlconnect-in-r-programming