if(!require(readr)){install.packages("readr");require(readr);}
if(!require(sqldf)){install.packages("sqldf");require(sqldf);}
27 27. Get the data - “books database” and other tables
As we learn about databases and SQL we will need to refer to various collections of data (i.e. “tables” of data in SQL terminology).
This section explains how to import data into R for use with SQL and also provides the various data files that we will be using in the upcoming SQL lessons.
27.1 Different ways to import data for use with SQL
SQL is a language that is used with Relational Database Management SOftware (RDBMS).
27.2 Prepare to import the data
In order to read the data into R and to access the data using SQL, we will need to load the R packages “readr” and “sqldf”. We do so with the following commands:
27.3 import the data: grades
Download the CSV file: grades.csv
This a .csv file that contains information about students and their grades. You can read the contents of the file into R with the read.csv function as shown below.
= read.csv("grades.csv") grades
Once you’ve done so, you can display the data:
sqldf("select * from grades")
id student year honors test1 test2 test3 major
1 s0100 joe fr FALSE 70 82 NA IDS
2 s0200 sue fr FALSE 80 80 79 IDS
3 s0300 anne fr TRUE 90 95 92 IDS
4 s0400 frank so TRUE 100 95 91 ACC
5 s0500 bob so FALSE 30 NA NA IDS
6 s0600 samantha so TRUE 100 70 97 IDS
7 s0700 larry ju FALSE 32 80 NA FIN
8 s0800 bill se FALSE 84 90 92 ACC
9 s0900 lucricia fr TRUE 80 100 88 IDS
10 s1000 margaret se FALSE 59 47 NA IDS
11 s1100 dalia se FALSE 85 92 93 FIN
12 s1200 zvi ju TRUE 90 98 92 ACC
13 s1300 mike ju TRUE 90 86 79 IDS
14 s1400 david se TRUE 90 87 87 IDS
15 s1500 dana so FALSE 100 93 91 FIN
27.4 import the data: customers, orders, movies, north_american_cities, etc
Download the RData file: sqlbolt_tables-v007.RData
A .RData file contains a copy of R’s environment variables. You can save you environment variables to an RData file with the save()
function. (see ?save). Once you have an RData file, you can load those variables into your R environment with the load()
function (see ?load).
This RData file contains the following different dataframe variables. When using the sqldf() function, each dataframe variable can used as though it were a Relational Database table.
- customers
- orders
- movies
- north_american_cities
- as well as some others (boxOffice, buildings, employees)
Use the load function as shown below to load the data into R.
load("sqlbolt_tables-v007.RData")
Below are the contents of these tables:
sqldf("select * from grades")
id student year honors test1 test2 test3 major
1 s0100 joe fr FALSE 70 82 NA IDS
2 s0200 sue fr FALSE 80 80 79 IDS
3 s0300 anne fr TRUE 90 95 92 IDS
4 s0400 frank so TRUE 100 95 91 ACC
5 s0500 bob so FALSE 30 NA NA IDS
6 s0600 samantha so TRUE 100 70 97 IDS
7 s0700 larry ju FALSE 32 80 NA FIN
8 s0800 bill se FALSE 84 90 92 ACC
9 s0900 lucricia fr TRUE 80 100 88 IDS
10 s1000 margaret se FALSE 59 47 NA IDS
11 s1100 dalia se FALSE 85 92 93 FIN
12 s1200 zvi ju TRUE 90 98 92 ACC
13 s1300 mike ju TRUE 90 86 79 IDS
14 s1400 david se TRUE 90 87 87 IDS
15 s1500 dana so FALSE 100 93 91 FIN
sqldf("select * from movies")
id title director year length_minutes
1 1 Toy Story John Lasseter 1995 81
2 2 A Bug's Life John Lasseter 1998 95
3 3 Toy Story 2 John Lasseter 1999 93
4 4 Monsters, Inc. Pete Docter 2001 92
5 5 Finding Nemo Andrew Stanton 2003 107
6 6 The Incredibles Brad Bird 2004 116
7 7 Cars John Lasseter 2006 117
8 8 Ratatouille Brad Bird 2007 115
9 9 WALL-E Andrew Stanton 2008 104
10 10 Up Pete Docter 2009 101
11 11 Toy Story 3 Lee Unkrich 2010 103
12 12 Cars 2 John Lasseter 2011 120
13 13 Brave Brenda Chapman 2012 102
14 14 Monsters University Dan Scanlon 2013 110
15 87 WALL-G Brenda Chapman 2042 92
sqldf("select * from orders")
OrderID CustomerID EmployeeID OrderDate ShipperID
1 10248 90 5 7/4/1996 3
2 10249 81 6 7/5/1996 1
3 10250 34 4 7/8/1996 2
4 10251 84 3 7/8/1996 1
5 10252 76 4 7/9/1996 2
6 10253 34 3 7/10/1996 2
7 10254 14 5 7/11/1996 2
8 10255 68 9 7/12/1996 3
9 10256 88 3 7/15/1996 2
10 10257 35 4 7/16/1996 3
11 10258 20 1 7/17/1996 1
12 10259 13 4 7/18/1996 3
13 10260 55 4 7/19/1996 1
14 10261 61 4 7/19/1996 2
15 10262 65 8 7/22/1996 3
16 10263 20 9 7/23/1996 3
17 10264 24 6 7/24/1996 3
18 10265 7 2 7/25/1996 1
19 10266 87 3 7/26/1996 3
20 10267 25 4 7/29/1996 1
21 10268 33 8 7/30/1996 3
22 10269 89 5 7/31/1996 1
23 10270 87 1 8/1/1996 1
24 10271 75 6 8/1/1996 2
25 10272 65 6 8/2/1996 2
26 10273 63 3 8/5/1996 3
27 10274 85 6 8/6/1996 1
28 10275 49 1 8/7/1996 1
29 10276 80 8 8/8/1996 3
30 10277 52 2 8/9/1996 3
31 10278 5 8 8/12/1996 2
32 10279 44 8 8/13/1996 2
33 10280 5 2 8/14/1996 1
34 10281 69 4 8/14/1996 1
35 10282 69 4 8/15/1996 1
36 10283 46 3 8/16/1996 3
37 10284 44 4 8/19/1996 1
38 10285 63 1 8/20/1996 2
39 10286 63 8 8/21/1996 3
40 10287 67 8 8/22/1996 3
41 10288 66 4 8/23/1996 1
42 10289 11 7 8/26/1996 3
43 10290 15 8 8/27/1996 1
44 10291 61 6 8/27/1996 2
45 10292 81 1 8/28/1996 2
46 10293 80 1 8/29/1996 3
47 10294 65 4 8/30/1996 2
48 10295 85 2 9/2/1996 2
49 10296 46 6 9/3/1996 1
50 10297 7 5 9/4/1996 2
51 10298 37 6 9/5/1996 2
52 10299 67 4 9/6/1996 2
53 10300 49 2 9/9/1996 2
54 10301 86 8 9/9/1996 2
55 10302 76 4 9/10/1996 2
56 10303 30 7 9/11/1996 2
57 10304 80 1 9/12/1996 2
58 10305 55 8 9/13/1996 3
59 10306 69 1 9/16/1996 3
60 10307 48 2 9/17/1996 2
61 10308 2 7 9/18/1996 3
62 10309 37 3 9/19/1996 1
63 10310 77 8 9/20/1996 2
64 10311 18 1 9/20/1996 3
65 10312 86 2 9/23/1996 2
66 10313 63 2 9/24/1996 2
67 10314 65 1 9/25/1996 2
68 10315 38 4 9/26/1996 2
69 10316 65 1 9/27/1996 3
70 10317 48 6 9/30/1996 1
71 10318 38 8 10/1/1996 2
72 10319 80 7 10/2/1996 3
73 10320 87 5 10/3/1996 3
74 10321 38 3 10/3/1996 2
75 10322 58 7 10/4/1996 3
76 10323 39 4 10/7/1996 1
77 10324 71 9 10/8/1996 1
78 10325 39 1 10/9/1996 3
79 10326 8 4 10/10/1996 2
80 10327 24 2 10/11/1996 1
81 10328 28 4 10/14/1996 3
82 10329 75 4 10/15/1996 2
83 10330 46 3 10/16/1996 1
84 10331 9 9 10/16/1996 1
85 10332 51 3 10/17/1996 2
86 10333 87 5 10/18/1996 3
87 10334 84 8 10/21/1996 2
88 10335 37 7 10/22/1996 2
89 10336 60 7 10/23/1996 2
90 10337 25 4 10/24/1996 3
91 10338 55 4 10/25/1996 3
92 10339 51 2 10/28/1996 2
93 10340 9 1 10/29/1996 3
94 10341 73 7 10/29/1996 3
95 10342 25 4 10/30/1996 2
96 10343 44 4 10/31/1996 1
97 10344 89 4 11/1/1996 2
98 10345 63 2 11/4/1996 2
99 10346 65 3 11/5/1996 3
100 10347 21 4 11/6/1996 3
101 10348 86 4 11/7/1996 2
102 10349 75 7 11/8/1996 1
103 10350 41 6 11/11/1996 2
104 10351 20 1 11/11/1996 1
105 10352 28 3 11/12/1996 3
106 10353 59 7 11/13/1996 3
107 10354 58 8 11/14/1996 3
108 10355 4 6 11/15/1996 1
109 10356 86 6 11/18/1996 2
110 10357 46 1 11/19/1996 3
111 10358 41 5 11/20/1996 1
112 10359 72 5 11/21/1996 3
113 10360 7 4 11/22/1996 3
114 10361 63 1 11/22/1996 2
115 10362 9 3 11/25/1996 1
116 10363 17 4 11/26/1996 3
117 10364 19 1 11/26/1996 1
118 10365 3 3 11/27/1996 2
119 10366 29 8 11/28/1996 2
120 10367 83 7 11/28/1996 3
121 10368 20 2 11/29/1996 2
122 10369 75 8 12/2/1996 2
123 10370 14 6 12/3/1996 2
124 10371 41 1 12/3/1996 1
125 10372 62 5 12/4/1996 2
126 10373 37 4 12/5/1996 3
127 10374 91 1 12/5/1996 3
128 10375 36 3 12/6/1996 2
129 10376 51 1 12/9/1996 2
130 10377 72 1 12/9/1996 3
131 10378 24 5 12/10/1996 3
132 10379 61 2 12/11/1996 1
133 10380 37 8 12/12/1996 3
134 10381 46 3 12/12/1996 3
135 10382 20 4 12/13/1996 1
136 10383 4 8 12/16/1996 3
137 10384 5 3 12/16/1996 3
138 10385 75 1 12/17/1996 2
139 10386 21 9 12/18/1996 3
140 10387 70 1 12/18/1996 2
141 10388 72 2 12/19/1996 1
142 10389 10 4 12/20/1996 2
143 10390 20 6 12/23/1996 1
144 10391 17 3 12/23/1996 3
145 10392 59 2 12/24/1996 3
146 10393 71 1 12/25/1996 3
147 10394 36 1 12/25/1996 3
148 10395 35 6 12/26/1996 1
149 10396 25 1 12/27/1996 3
150 10397 60 5 12/27/1996 1
151 10398 71 2 12/30/1996 3
152 10399 83 8 12/31/1996 3
153 10400 19 1 1/1/1997 3
154 10401 65 1 1/1/1997 1
155 10402 20 8 1/2/1997 2
156 10403 20 4 1/3/1997 3
157 10404 49 2 1/3/1997 1
158 10405 47 1 1/6/1997 1
159 10406 62 7 1/7/1997 1
160 10407 56 2 1/7/1997 2
161 10408 23 8 1/8/1997 1
162 10409 54 3 1/9/1997 1
163 10410 10 3 1/10/1997 3
164 10411 10 9 1/10/1997 3
165 10412 87 8 1/13/1997 2
166 10413 41 3 1/14/1997 2
167 10414 21 2 1/14/1997 3
168 10415 36 3 1/15/1997 1
169 10416 87 8 1/16/1997 3
170 10417 73 4 1/16/1997 3
171 10418 63 4 1/17/1997 1
172 10419 68 4 1/20/1997 2
173 10420 88 3 1/21/1997 1
174 10421 61 8 1/21/1997 1
175 10422 27 2 1/22/1997 1
176 10423 31 6 1/23/1997 3
177 10424 51 7 1/23/1997 2
178 10425 41 6 1/24/1997 2
179 10426 29 4 1/27/1997 1
180 10427 59 4 1/27/1997 2
181 10428 66 7 1/28/1997 1
182 10429 37 3 1/29/1997 2
183 10430 20 4 1/30/1997 1
184 10431 10 4 1/30/1997 2
185 10432 75 3 1/31/1997 2
186 10433 60 3 2/3/1997 3
187 10434 24 3 2/3/1997 2
188 10435 16 8 2/4/1997 2
189 10436 7 3 2/5/1997 2
190 10437 87 8 2/5/1997 1
191 10438 79 3 2/6/1997 2
192 10439 51 6 2/7/1997 3
193 10440 71 4 2/10/1997 2
194 10441 55 3 2/10/1997 2
195 10442 20 3 2/11/1997 2
196 10443 66 8 2/12/1997 1
sqldf("select * from customers")
CustomerID CustomerName ContactName Address City PostalCode Country
1 1 Alfreds Futterkiste Maria Anders Obere Str. 57 Berlin 12209 Germany
2 2 Ana Trujillo Emparedados y helados Ana Trujillo Avda. de la Constituci\xf3n 2222 M\xe9xico D.F. 5021 Mexico
3 3 Antonio Moreno Taquer\xeda Antonio Moreno Mataderos 2312 M\xe9xico D.F. 5023 Mexico
4 4 Around the Horn Thomas Hardy 120 Hanover Sq. London WA1 1DP UK
5 5 Berglunds snabbk\xf6p Christina Berglund Berguvsv\xe4gen 8 Lule\xe5 S-958 22 Sweden
6 6 Blauer See Delikatessen Hanna Moos Forsterstr. 57 Mannheim 68306 Germany
7 7 Blondel p\xe8re et fils Fr\xe9d\xe9rique Citeaux 24, place Kl\xe9ber Strasbourg 67000 France
8 8 B\xf3lido Comidas preparadas Mart\xedn Sommer C/ Araquil, 67 Madrid 28023 Spain
9 9 Bon app' Laurence Lebihans 12, rue des Bouchers Marseille 13008 France
10 10 Bottom-Dollar Marketse Elizabeth Lincoln 23 Tsawassen Blvd. Tsawassen T2F 8M4 Canada
11 11 B's Beverages Victoria Ashworth Fauntleroy Circus London EC2 5NT UK
12 12 Cactus Comidas para llevar Patricio Simpson Cerrito 333 Buenos Aires 1010 Argentina
13 13 Centro comercial Moctezuma Francisco Chang Sierras de Granada 9993 M\xe9xico D.F. 5022 Mexico
14 14 Chop-suey Chinese Yang Wang Hauptstr. 29 Bern 3012 Switzerland
15 15 Com\xe9rcio Mineiro Pedro Afonso Av. dos Lus\xedadas, 23 S\xe3o Paulo 05432-043 Brazil
16 16 Consolidated Holdings Elizabeth Brown Berkeley Gardens 12 Brewery London WX1 6LT UK
17 17 Drachenblut Delikatessend Sven Ottlieb Walserweg 21 Aachen 52066 Germany
18 18 Du monde entier Janine Labrune 67, rue des Cinquante Otages Nantes 44000 France
19 19 Eastern Connection Ann Devon 35 King George London WX3 6FW UK
20 20 Ernst Handel Roland Mendel Kirchgasse 6 Graz 8010 Austria
21 21 Familia Arquibaldo Aria Cruz Rua Or\xf3s, 92 S\xe3o Paulo 05442-030 Brazil
22 22 FISSA Fabrica Inter. Salchichas S.A. Diego Roel C/ Moralzarzal, 86 Madrid 28034 Spain
23 23 Folies gourmandes Martine Ranc\xe9 184, chauss\xe9e de Tournai Lille 59000 France
24 24 Folk och f\xe4 HB Maria Larsson \xc5kergatan 24 Br\xe4cke S-844 67 Sweden
25 25 Frankenversand Peter Franken Berliner Platz 43 M\xfcnchen 80805 Germany
26 26 France restauration Carine Schmitt 54, rue Royale Nantes 44000 France
27 27 Franchi S.p.A. Paolo Accorti Via Monte Bianco 34 Torino 10100 Italy
28 28 Furia Bacalhau e Frutos do Mar Lino Rodriguez Jardim das rosas n. 32 Lisboa 1675 Portugal
29 29 Galer\xeda del gastr\xf3nomo Eduardo Saavedra Rambla de Catalu\xf1a, 23 Barcelona 8022 Spain
30 30 Godos Cocina T\xedpica Jos\xe9 Pedro Freyre C/ Romero, 33 Sevilla 41101 Spain
31 31 Gourmet Lanchonetes Andr\xe9 Fonseca Av. Brasil, 442 Campinas 04876-786 Brazil
32 32 Great Lakes Food Market Howard Snyder 2732 Baker Blvd. Eugene 97403 USA
33 33 GROSELLA-Restaurante Manuel Pereira 5\xaa Ave. Los Palos Grandes Caracas 1081 Venezuela
34 34 Hanari Carnes Mario Pontes Rua do Pa\xe7o, 67 Rio de Janeiro 05454-876 Brazil
35 35 HILARI\xd3N-Abastos Carlos Hern\xe1ndez Carrera 22 con Ave. Carlos Soublette #8-35 San Crist\xf3bal 5022 Venezuela
36 36 Hungry Coyote Import Store Yoshi Latimer City Center Plaza 516 Main St. Elgin 97827 USA
37 37 Hungry Owl All-Night Grocers Patricia McKenna 8 Johnstown Road Cork Ireland
38 38 Island Trading Helen Bennett Garden House Crowther Way Cowes PO31 7PJ UK
39 39 K\xf6niglich Essen Philip Cramer Maubelstr. 90 Brandenburg 14776 Germany
40 40 La corne d'abondance Daniel Tonini 67, avenue de l'Europe Versailles 78000 France
41 41 La maison d'Asie Annette Roulet 1 rue Alsace-Lorraine Toulouse 31000 France
42 42 Laughing Bacchus Wine Cellars Yoshi Tannamuri 1900 Oak St. Vancouver V3F 2K1 Canada
43 43 Lazy K Kountry Store John Steel 12 Orchestra Terrace Walla Walla 99362 USA
44 44 Lehmanns Marktstand Renate Messner Magazinweg 7 Frankfurt a.M. 60528 Germany
45 45 Let's Stop N Shop Jaime Yorres 87 Polk St. Suite 5 San Francisco 94117 USA
46 46 LILA-Supermercado Carlos Gonz\xe1lez Carrera 52 con Ave. Bol\xedvar #65-98 Llano Largo Barquisimeto 3508 Venezuela
47 47 LINO-Delicateses Felipe Izquierdo Ave. 5 de Mayo Porlamar I. de Margarita 4980 Venezuela
48 48 Lonesome Pine Restaurant Fran Wilson 89 Chiaroscuro Rd. Portland 97219 USA
49 49 Magazzini Alimentari Riuniti Giovanni Rovelli Via Ludovico il Moro 22 Bergamo 24100 Italy
50 50 Maison Dewey Catherine Dewey Rue Joseph-Bens 532 Bruxelles B-1180 Belgium
51 51 M\xe8re Paillarde Jean Fresni\xe8re 43 rue St. Laurent Montr\xe9al H1J 1C3 Canada
52 52 Morgenstern Gesundkost Alexander Feuer Heerstr. 22 Leipzig 4179 Germany
53 53 North/South Simon Crowther South House 300 Queensbridge London SW7 1RZ UK
54 54 Oc\xe9ano Atl\xe1ntico Ltda. Yvonne Moncada Ing. Gustavo Moncada 8585 Piso 20-A Buenos Aires 1010 Argentina
55 55 Old World Delicatessen Rene Phillips 2743 Bering St. Anchorage 99508 USA
56 56 Ottilies K\xe4seladen Henriette Pfalzheim Mehrheimerstr. 369 K\xf6ln 50739 Germany
57 57 Paris sp\xe9cialit\xe9s Marie Bertrand 265, boulevard Charonne Paris 75012 France
58 58 Pericles Comidas cl\xe1sicas Guillermo Fern\xe1ndez Calle Dr. Jorge Cash 321 M\xe9xico D.F. 5033 Mexico
59 59 Piccolo und mehr Georg Pipps Geislweg 14 Salzburg 5020 Austria
60 60 Princesa Isabel Vinhoss Isabel de Castro Estrada da sa\xfade n. 58 Lisboa 1756 Portugal
61 61 Que Del\xedcia Bernardo Batista Rua da Panificadora, 12 Rio de Janeiro 02389-673 Brazil
62 62 Queen Cozinha L\xfacia Carvalho Alameda dos Can\xe0rios, 891 S\xe3o Paulo 05487-020 Brazil
63 63 QUICK-Stop Horst Kloss Taucherstra\xdfe 10 Cunewalde 1307 Germany
64 64 Rancho grande Sergio Guti\xe9rrez Av. del Libertador 900 Buenos Aires 1010 Argentina
65 65 Rattlesnake Canyon Grocery Paula Wilson 2817 Milton Dr. Albuquerque 87110 USA
66 66 Reggiani Caseifici Maurizio Moroni Strada Provinciale 124 Reggio Emilia 42100 Italy
67 67 Ricardo Adocicados Janete Limeira Av. Copacabana, 267 Rio de Janeiro 02389-890 Brazil
68 68 Richter Supermarkt Michael Holz Grenzacherweg 237 Gen\xe8ve 1203 Switzerland
69 69 Romero y tomillo Alejandra Camino Gran V\xeda, 1 Madrid 28001 Spain
70 70 Sant\xe9 Gourmet Jonas Bergulfsen Erling Skakkes gate 78 Stavern 4110 Norway
71 71 Save-a-lot Markets Jose Pavarotti 187 Suffolk Ln. Boise 83720 USA
72 72 Seven Seas Imports Hari Kumar 90 Wadhurst Rd. London OX15 4NB UK
73 73 Simons bistro Jytte Petersen Vinb\xe6ltet 34 K\xf8benhavn 1734 Denmark
74 74 Sp\xe9cialit\xe9s du monde Dominique Perrier 25, rue Lauriston Paris 75016 France
75 75 Split Rail Beer & Ale Art Braunschweiger P.O. Box 555 Lander 82520 USA
76 76 Supr\xeames d\xe9lices Pascale Cartrain Boulevard Tirou, 255 Charleroi B-6000 Belgium
77 77 The Big Cheese Liz Nixon 89 Jefferson Way Suite 2 Portland 97201 USA
78 78 The Cracker Box Liu Wong 55 Grizzly Peak Rd. Butte 59801 USA
79 79 Toms Spezialit\xe4ten Karin Josephs Luisenstr. 48 M\xfcnster 44087 Germany
80 80 Tortuga Restaurante Miguel Angel Paolino Avda. Azteca 123 M\xe9xico D.F. 5033 Mexico
81 81 Tradi\xe7\xe3o Hipermercados Anabela Domingues Av. In\xeas de Castro, 414 S\xe3o Paulo 05634-030 Brazil
82 82 Trail's Head Gourmet Provisioners Helvetius Nagy 722 DaVinci Blvd. Kirkland 98034 USA
83 83 Vaffeljernet Palle Ibsen Smagsl\xf8get 45 \xc5rhus 8200 Denmark
84 84 Victuailles en stock Mary Saveley 2, rue du Commerce Lyon 69004 France
85 85 Vins et alcools Chevalier Paul Henriot 59 rue de l'Abbaye Reims 51100 France
86 86 Die Wandernde Kuh Rita M\xfcller Adenauerallee 900 Stuttgart 70563 Germany
87 87 Wartian Herkku Pirkko Koskitalo Torikatu 38 Oulu 90110 Finland
88 88 Wellington Importadora Paula Parente Rua do Mercado, 12 Resende 08737-363 Brazil
89 89 White Clover Markets Karl Jablonski 305 - 14th Ave. S. Suite 3B Seattle 98128 USA
90 90 Wilman Kala Matti Karttunen Keskuskatu 45 Helsinki 21240 Finland
91 91 Wolski Zbyszek ul. Filtrowa 68 Walla 01-012 Poland
sqldf("select * from north_american_cities")
city country population latitude longitude
1 Guadalajara Mexico 1500800 20.65970 -103.34961
2 Toronto Canada 2795060 43.65323 -79.38318
3 Houston United States 2195914 29.76043 -95.36980
4 New York United States 8405837 40.71278 -74.00594
5 Philadelphia United States 1553165 39.95258 -75.16522
6 Havana Cuba 2106146 23.05407 -82.34519
7 Mexico City Mexico 8555500 19.43261 -99.13321
8 Phoenix United States 1513367 33.44838 -112.07404
9 Los Angeles United States 3884307 34.05223 -118.24368
10 Ecatepec de Morelos Mexico 1742000 19.60184 -99.05067
11 Montreal Canada 1717767 45.50169 -73.56726
12 Chicago United States 2718782 41.87811 -87.62980
27.5 The “Books Database”
The “books database” is a collection of several tables that contains data relating to books (i.e. “titles”), authors, publishers, etc.
You can download the data for the database in the following ways:
See below for details.
27.5.1 SQLite file for “Books Database”
Click here to download all the data for the books datbase as a single “SQLite” database file.
To use this file you can do one of the following:
Install SQLite software on your computer. There are many such programs avaiable. I recommend “DB Viewer for SQLite” which is a free open source program.
Website (download from here): https://sqlitebrowser.org/
Github repository: https://github.com/sqlitebrowser/sqlitebrowser
You can search online for other similar programs for working with sqlite3 files.
Free web based tool: https://y-rosenthal.github.io/yrSqliteViewer/yrSqliteViewer.html
This is a website that lets you upload a sqlite3 file and run SQL Select statements against the data. I created this website myself (with the help of some AI coding assistants).
PROs: you don’t need to install anything
CONs: currently only works with SQL SELECT statment but not any other types of sql statements.
See this chapter (in this book) https://y-rosenthal.github.io/DataManagementUsingR/sql0950-workingWithDatabasesInR-v010.html for instructions on how to access this database file directly via R commands.
27.5.2 CSV files for “Books Database”
Download the CSV files for the “books database”. These CSV files together comprise the data for the “books database”.
Download a tutorial on using Relational databases.
This word document contains:
- a complete description of the “books database”
- a tutorial on how “relational databases” are structured
- a tutorial on Entity Relationship Diagrams (ERDs)
- a set of questions and answers
Once you’ve downloaded the various CSV files you can run the following commands to import the data into R.
27.5.3 Import the data (books database)
if(!require(readr)){install.packages("readr");require(readr);}
# Read in the data for the books database - see the
= read_csv("titles.csv", na="NULL", show_col_types=FALSE)
titles = read_csv("authors.csv", na="NULL", show_col_types=FALSE)
authors = read_csv("publishers.csv", na="NULL", show_col_types=FALSE)
publishers = read_csv("title_authors.csv", na="NULL", show_col_types=FALSE)
title_authors = read_csv("royalties.csv", na="NULL", show_col_types=FALSE) royalties
27.5.4 Books Database - Entity Relationship Diagram (ERD)
The following is an “Entity Relationship Diagram (ERD)” that describes the relationships between the tables. See the Word document linked above for more info:
%%{init: {'theme': 'base', 'themeVariables': { 'primaryColor': '#ffffff', 'primaryTextColor': '#0000ff', 'primaryBorderColor': '#000000', 'lineColor': '#000000', 'secondaryColor': '#ffffff', 'tertiaryColor': '#ffffff', 'mainBkg': '#ffffff', 'background': '#ffffff', 'edgeLabelBackground': '#ffffff' }}}%% erDiagram authors ||--o{ title_authors : "" title_authors }|--|| titles : "" titles ||--o| royalties : "" publishers |o--o{ titles : "" authors { string au_id PK string au_fname string au_lname string phone string address string city string state string zip } title_authors { string title_id FK1 string au_id FK2 int au_order float royalty_share } titles { string title_id PK string title_name string type string pub_id FK int pages float price int sales date pubdate } publishers { string pub_id PK string pub_name string city string state string country } royalties { string title_id PK float advance float royalty_rate }
Below are the contents of these tables:
sqldf("select * from titles")
title_id title_name type pub_id pages price sales pubdate
1 T01 1977! history P01 107 21.99 566 8/1/2000 0:00
2 T02 200 Years of German Humor history P03 14 19.95 9566 4/1/1998 0:00
3 T03 Ask Yor System Administrator computer P02 1226 39.95 25667 9/1/2000 0:00
4 T04 But I Did It Unconciously psychology P01 510 12.99 13001 5/31/1999 0:00
5 T05 Exchange of Platitudes psychology P01 201 6.95 201440 1/1/2001 0:00
6 T06 How About Never? biography P01 473 19.95 11320 7/31/2000 0:00
7 T07 I Blame My Mother biography P03 333 23.95 1500200 10/1/1999 0:00
8 T08 Just Wait Until After School children P01 86 10.00 4095 6/1/2001 0:00
9 T09 Kiss My Boo Boo children P01 22 13.95 5000 5/31/2002 0:00
10 T10 Not Without My Fabrerge Egg biography P05 NA NA NA <NA>
11 T11 Perhaps It's a Glandular Problem psychology P05 826 7.99 94123 11/30/2000 0:00
12 T12 Spontaneous, Not Annoying biography P05 507 12.99 100001 8/31/2000 0:00
13 T13 What Are The Civilian Applications? history P03 802 29.99 10467 5/31/1999 0:00
sqldf("select * from publishers")
pub_id pub_name city state country
1 P01 Abatis Publishers New York NY USA
2 P02 Core Dump Books San Francisco CA USA
3 P03 Schandenfreude Press Hamburg <NA> Germany
4 P04 Tneterhooks Press Berkeley CA USA
5 P05 AAA Publishing Berkeley CA USA
sqldf("select * from title_authors")
title_id au_id au_order royalty_shares
1 T01 A01 1 1.0
2 T02 A01 1 1.0
3 T03 A05 1 1.0
4 T04 A03 1 0.6
5 T04 A04 2 0.4
6 T05 A04 1 1.0
7 T06 A02 1 1.0
8 T07 A02 1 0.5
9 T07 A04 2 0.5
10 T08 A06 1 1.0
11 T09 A06 1 1.0
12 T10 A02 1 1.0
13 T11 A03 2 0.3
14 T11 A04 3 0.3
15 T11 A06 1 0.4
16 T12 A02 1 1.0
17 T13 A01 1 1.0
sqldf("select * from authors")
au_id au_fname au_lname phone address city state zip
1 A01 Sarah Buchman 718-496-7223 75 West 205 St Bronx NY 10468
2 A02 Wendy Heydemark 303-986-7020 2922 Baseline Rd Boulder CO 80303
3 A03 Hallie Hull 415-549-4278 3800 Waldo Ave, #14F San Francisco CA 94123
4 A04 Klee Hull 415-549-4278 3800 Waldo Ave, #14F San Francisco CA 94123
5 A05 Christian Kells 212-771-4680 114 Horatio St New York NY 10014
6 A06 Harvey Kellsey 650-836-7128 390 Serra Mall Palo Alto CA 94305
7 A07 Paddy O'Furniture 941-925-0752 1442 Main St Sarasota FL 34236
sqldf("select * from royalties")
title_id advance royalty_rate
1 T01 10000 0.05
2 T02 1000 0.06
3 T03 15000 0.07
4 T04 20000 0.08
5 T05 100000 0.09
6 T06 20000 0.08
7 T07 1000000 0.11
8 T08 0 0.04
9 T09 0 0.05
10 T10 NA NA
11 T11 100000 0.07
12 T12 50000 0.09
13 T13 20000 0.06