TODO

Create Cache for splitters and regexes

In progress, but a lot of stuff is still not cached.

Try to move more of the regex-based parsing to using splitters / BitArray

Inline parsing performance:

Clean up
Optimize escape parsing a la houdini
Optimize as much as possible without OTP, then..
Parse inlines concurrently

See if we can use glentities for HTML entity decoding

Note: A first pass at this failed because the Commonmark spec suite expectation doesn’t match the glentities output. :/ May try again and take a closer look at why it fails.

Use gluri for URI normalization

Generally minimize copies as much as possible

Create a new parser for inlines which isn’t line-based at all instead of misusing the line- based block parser.

When parsing the block structure, we should create inline parsing tasks, store a reference to the task in a task list and push the task to a job scheduler. Then during HTML conversion we can resolve jobs on demand and process them in parallel on the BEAM (and async on JS). So the block structure only contains inline IDs, and you have to query the context for the actual inlines.

Use string_tree when producing HTML output instead of just concatenating strings together.

Note: I tried a naive pass at this and everything became super slow.

Try building the Gleam website with mork isf jot

Refactor the API

Rename mork.parse to mork.from_string
Rename mork.parse_with_options to mork.parse
Make mork.strip_frontmatter take a Bool argument

mork - v1.11.1

Features

Safe HTML

Performance