Making a Static Site Generator (and a New Site)
The Plan
Initially, I planned to create everything from scratch -- the file reader, the markdown parser, the syntax highlighter, everything. After a few attempts, I decided that this was entirely silly and scrapped that idea. I also toyed with the idea of using a compile-time layout system but I didn't want to have to recompile a Rust executable every time I made changes to the layouts.
I finally settled on using Tera as my templating engine. It's Jinja-like and made for Rust with a lot of really useful features. It's significantly slower than compile-time engines but also much more flexible. In addition to a templating engine, I also needed something to parse Markdown files which would be the basis for the content of each page. For this, I used pulldown-cmark.
With these two things decided on, my idea was simple: layouts would be defined in one location and content would be defined in another. Most fancy layout stuff would be handled by Tera (includes, extensions, etc) and the data would live in the Markdown. This created two distinct types of data: layouts
and content
. I also had the idea that the content
directory structure should define the site's structure as well. (eg. content/blog/my-post.md
becomes domain.com/blog/my-post
).
The "Implementation"
Creating it was straightforward -- in fact, it was so simple that I accidentally deleted the project while trying to push it to git and had to rewrite it. I did it in a few hours (and stayed up until 6 am doing it).
It only performs a few steps.
- Compile the content from the
content
directory- If this filename is prefixed with
.
(dot), skip it - Read front matter
- Parse the rest as markdown
- Parse the previous result as a one-off Tera template
- If this filename is prefixed with
- Assemble the content into a
HashMap
- This is so it can be easily accessed later
- There is a limitation in my implementation -- Roxy only compiles a map out of the root
content
directory. This will become relevant later.
- Create Tera context
- This is where the content map becomes relevant. This
HashMap
is serialized and given to each layout. One major use would be to iterate their contents, eg. a blog index that displays all posts. Subdirectories can't be accessed if they aren't recognized as content themselves).
- This is where the content map becomes relevant. This
- Create output files
- Iterate each content file
- Create a matching directory structure in the output folder (eg.
content/blog/my-post.md
becomesoutput/blog/my-post/index.md
) - If it has a
layout
frontmatter field, try to use that layout, otherwise, useindex.html
as a default - Render the template using the context made earlier
- Write out the file
- Copy static files to the output folder
There were a few things I didn't consider in my original plan. Most notably, syntax highlighting and command-line argument parsing. I ended up adding syntect
for syntax highlighting and clap
for argument parsing. Additionally, there's a quirk based on my decision to tie content
to page layout: to create a page, even if it's entirely layout driven, it requires a content file as a placeholder.
There's a bunch of things Roxy doesn't do
- Create a development web server
- Create new content files from a template
- Initialize a project to give the user an idea of how to use it
I ended up making bash files to do all that though and they're available in the repository for this website if you'd like to see or use them.
The code is a disaster and needs to be refactored. I'll do that the minute I have a really good reason to. For now, it works well enough for me, and I really like using it. This is one of the times I'm very happy I decided to make something superfluous for myself.