-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathJupyterNotebooks_RunningLog.qmd
220 lines (157 loc) · 6.6 KB
/
JupyterNotebooks_RunningLog.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
---
title: "Jupyter Notebooks Running Log"
subtitle: 'As difficult as you would expect'
author: "Rob Donald"
date: today
date-format: "dddd DD MMMM YYYY"
format:
pdf:
toc: true
toc-depth: 2
number-sections: true
lot: false
lof: false
colorlinks: true
code-block-bg: 'ffffcc' # fafad2 lightgoldenrodyellow, ffffcc web safe
papersize: a4
include-in-header:
text: \addtokomafont{disposition}{\rmfamily}
\usepackage{xurl}
bibliography: ../../PostDocMaster.bib
cite-method: biblatex
biblatexoptions:
- citestyle = authoryear
- style = authoryear
- urldate = iso
- datezeros = true
- natbib = true
- sorting = nty
---
```{=tex}
\def\shrug{\texttt{\raisebox{0.75em}{\char`\_}\char`\\\char`\_\kern-0.5ex(\kern-0.25ex\raisebox{0.25ex}
{\rotatebox{45}{\raisebox{-.75ex}"\kern-1.5ex\rotatebox{-90})}}\kern-0.5ex)\kern-0.5ex\char`\_/\raisebox{0.75em}
{\char`\_}}}
```
```{r}
#| label: "library_setup"
#| include: false
suppressMessages({suppressWarnings({
library(dplyr)
library(tidyr)
library(readr)
library(lubridate)
library(here)
library(tibble)
library(ggplot2)
library(RobsRUtils)
library(pander)
library(kableExtra)
})})
```
{{< pagebreak >}}
# Executive Summary
A running log to support the understanding of how to view and use Jupyter Notebooks (python).
*Note this is all done on an M2 MacBook Pro, so how it works on older Intel based Macs and Windows
is unknown.*
# Background
In the mixed world of data science you often run across a thing called a "Jupyter Notebook" which
is a python version of a `.Rmd` or `.qmd` file, i.e. it mixes text with code.
I think it may have been called an "IPython Notebook" in the past \shrug.
This R project was motivated by a need to try to at least *read* a `.ipynb` file even if you can't get
it to run.
This has been asked about recently (2023-10-07) on the Posit Community site at:
* \url{https://community.rstudio.com/t/is-there-a-plugin-to-view-jupyter-notebooks-ipynb/174723}
But with no answer.
As you will see it all seems a bit tedious but I'll admit to a bias on all this because I still
find setting up a python environment *so* much more awkward that an RStudio project.
# 2024-01-17
Setting up the project.
## Install Python
This is now pretty straightforward. For every python project you are are going to
investigate you need to set up a virtual environment so that you can isolate things.
1. Create a new R Project
2. Set up a running log file
3. From the terminal
+ a) virtualenv Some_Sensible_Name_py_env
+ b) source Some_Sensible_Name_py_env/bin/activate
+ c) pip install numpy pandas matplotlib
This will get you a minimal environment. You can have a look at
the file `Venv_Setup_20240117_01.txt` in this repo to see the sort of thing that will go
screaming across the Rstudio terminal output as you run the above steps.
4. Quit and Restart the R Project. You should now see on the Rstudio terminal tab that
the virtual environment is running. For this project it looks like this:
+ `(JupyterNB_py_env) Taylor:JupyterNotebooks rob$`
5. Install any other python packages you need. I usually install `scipy` at least
+ pip install scipy
You can have a look at the top of file `Venv_Setup_20240117_02.txt` in this repo
to see the expected output.
## Install Jupyter Notebook
Now we get into the specifics of this project. The Jupyter project at:
* \url{https://jupyter.org/}
gives you some background. It is a major competitor to RStudio and their Quarto and R Shiny
systems as you can see from all the overlap.
For my purposes all I want to do is read a `.ipynb` file so I just need the 'Jupyter Notebook'
application which is the classic interface:
* \url{https://jupyter.org/install}
Still at the Rstudio terminal prompt you just install it as a python package.
* pip install notebook
This will install *lots and lots* of things, you can see the expected output in the
file `Venv_Setup_20240117_02.txt` in this repo.
This is an application which runs up a small local web server to read the `.ipynb` files.
### Starting Jupyter Notebook {#sec-Starting-Jupyter-Notebook}
Remember this is an application so you start it from the RStudio terminal tab with:
* $ `jupyter notebook`
After it settles down you will see some log messages like this
```{}
[I 2024-01- ... ServerApp] Jupyter Server 2.12.5 is running at:
[I 2024-01- ... ServerApp] http://localhost:8888/tree?token=ca163 ...
[I 2024-01- ... ServerApp] http://127.0.0.1:8888/tree?token=ca163 ...
[I 2024-01- ... ServerApp] Use Control-C to stop this server and
shut down all kernels (twice to skip confirmation).
```
It starts up a web page in your default browser which you can now try to use to read the `.ipynb` file you are interested in. See @sec-Using-Jupyter-Notebook
### Stopping Jupyter Notebook {#sec-Stopping-Jupyter-Notebook}
As it says above, use `Control-C` __in the RStudio terminal tab__ to stop the server. If you do it
once it comes up asking for confirmation. If you are not quick with the 'y' at the prompt
it will restart \shrug.
```{}
^C
[I 2024-01- ... ServerApp] interrupted
[I 2024-01- ... ServerApp] Serving notebooks from local directory: /Users/...
1 active kernel
Jupyter Server 2.12.5 is running at:
http://localhost:8888/tree?token=ca163 ...
http://127.0.0.1:8888/tree?token=ca163 ...
Shutdown this Jupyter server (y/[n])?
[I 2024-01- ... ServerApp] No answer for 5s:
[I 2024-01- ... ServerApp] resuming operation...
```
Assuming you answer 'y' it will shutdown with a message like this:
```{}
Shutdown this Jupyter server (y/[n])? y
[C 2024-01- ... ServerApp] Shutdown confirmed
[I 2024-01- ... ServerApp] Shutting down 5 extensions
(JupyterNB_py_env) Taylor:JupyterNotebooks rob$
```
Note that just closing the web page it started in @sec-Starting-Jupyter-Notebook will __*not*__ stop the local web server.
## Using Jupyter Notebook {#sec-Using-Jupyter-Notebook}
The thing I have found is that the files you want to look at need to be under the project root
for the Jupyter Notebook server. I can run with that.
There is a big hint with that if you try to use the file 'open' on the web page.
```{r}
#| echo: false
#| fig-pos: 'H'
#| fig.align: "center"
#| out.height: '6cm'
#| out.width: '9cm'
#| label: JNB-File-Open
#| fig-cap: 'Reading a file'
image.path <- here('images','JNB_Open_File.pdf')
knitr::include_graphics(path = image.path)
```
It says *relative to jlab root* which I take to mean the web server root which starts
at your R Project's top level folder.
{{< pagebreak >}}
# References {.unnumbered}
::: {#refs}
:::