- Published on
Uploading Jupyter Notebook Files to Blogdown
- Authors
- Name
I have been working quite a bit with Python recently, using the popular Jupyter Notebook interface. Have been thinking about uploading some of my machine learning experiments and notes on the blog but integrating python with a blog built on blogdown
seems problematic as I could not google a solution. Turns out it is actually quite simple (maybe that's why nobody posted a tutorial on it, or maybe people who blog in R do not really use Python).
Anyway, here's the solution:
From the cmd line navigate to the folder which contains the notebook file and run the following line substituting Panda_Plotting for the name of your ipynb file.
jupyter nbconvert --to markdown Panda_Plotting.ipynb
This generates a .md
file as well as a folder containing all the images from that file. In my case the folder was named Panda_Plotting_files.
Copy the image files to the static folder where the website is located. I placed my files in the static/img/python_img/
folder.
Now all that is left to be done is to create a new post in markdown format,
new_post('Hello Python World', ext='.md')
copy the markdown file over, and replace all the image directory paths to point to the newly created one in the static folder.
Here's my Panda_Plotting.ipynb file converted to markdown and hosted on my blog. It displays markdown and all the output quite nicely!
Converted Jupyter Notebook File
%matplotlib inline
import pandas as pd
import numpy as np
from datetime import datetime
import matplotlib.pyplot as plt
plt.plot(np.random.normal(size=100), np.random.normal(size=100), 'ro')
data:image/s3,"s3://crabby-images/3930e/3930eb59b0eb4683e4f0758f6717902e2a96a84c" alt="png"
Using pandas plotting functions
Line Graphs
normals = pd.Series(np.random.normal(size=10))
normals.plot()
data:image/s3,"s3://crabby-images/ca67f/ca67f6428cf278d973810bf8e684faaefe2be5a8" alt="png"
normals.cumsum().plot(grid=True)
data:image/s3,"s3://crabby-images/d7db7/d7db77c21ca2097330085d705ee99e58fc32b511" alt="png"
variables = pd.DataFrame({'normal': np.random.normal(size=100),
'gamma': np.random.gamma(1, size=100),
'poisson': np.random.poisson(size=100)})
variables.cumsum().plot()
data:image/s3,"s3://crabby-images/0d818/0d81884e1da8a6ed22d15f887a753bb2c948e8d5" alt="png"
variables.cumsum().plot(subplots=True)
data:image/s3,"s3://crabby-images/0770a/0770aed21d084fdfdc52333f88365dd1e30b32ae" alt="png"
### More control over subplots
fig, axes = plt.subplots(nrows=1, ncols=3, figsize=(12, 4))
for i,var in enumerate(['normal','gamma','poisson']):
variables[var].cumsum(0).plot(ax=axes[i], title=var)
axes[0].set_ylabel('cumulative sum')
data:image/s3,"s3://crabby-images/8662e/8662ed79c9d2bfd9837ec60709ae26a822be2779" alt="png"
Bar Graphs
segments = pd.read_csv("./data/transit_segments.csv")
segments.st_time = segments.st_time.apply(lambda d: datetime.strptime(d, '%m/%d/%y %H:%M'))
segments['year'] = segments.st_time.apply(lambda d: d.year)
segments['long_seg'] = (segments.seg_length > segments.seg_length.mean())
segments['long_sog'] = (segments.avg_sog > segments.avg_sog.mean())
segments.groupby('year').seg_length.mean().plot(kind='bar')
data:image/s3,"s3://crabby-images/8d111/8d1110c107578b53eb7c339e5779ff15ce2c781e" alt="png"
segments.groupby(['year','long_seg']).seg_length.count().plot(kind='bar')
data:image/s3,"s3://crabby-images/3dfec/3dfec4dd0b1fad82437d163370b6c2e49a905277" alt="png"
### Stacked Bars
temp = pd.crosstab([segments.year, segments.long_seg], segments.long_sog)
temp.plot(kind='bar', stacked=True)
data:image/s3,"s3://crabby-images/f2b10/f2b10674821d66c887a49dddf5608185dd760a27" alt="png"
Histograms
variables = pd.DataFrame({'normal': np.random.normal(size=100),
'gamma': np.random.gamma(1, size=100),
'poisson': np.random.poisson(size=100)})
variables.normal.hist(bins=30, grid=False)
data:image/s3,"s3://crabby-images/77f00/77f0073d91e83dd15b5c9a49eca8a33f42140d7a" alt="png"
variables.poisson.plot(kind='kde', xlim=(-4,6))
data:image/s3,"s3://crabby-images/78abc/78abcc16805212028418f9a02b2bdc8494ddd6d0" alt="png"
variables.gamma.hist(bins=20, normed=True)
variables.gamma.plot(kind='kde', style='r--')
data:image/s3,"s3://crabby-images/c7ae9/c7ae91c08074b0252c54230c96129006cf672fdc" alt="png"
Scatterplot
segments.plot(kind='scatter', x='seg_length', y='avg_sog')
data:image/s3,"s3://crabby-images/1d670/1d670126241f5d8d29c00e5b3dcbde39681a8d88" alt="png"
segments_subset = segments.loc[1:10000,['seg_length', 'avg_sog', 'min_sog', 'max_sog']]
pd.plotting.scatter_matrix (segments_subset, figsize=(12,8), diagonal='kde')
data:image/s3,"s3://crabby-images/8a77f/8a77f0492de16afcc69f28357e7ccbe13903a7e0" alt="png"