Python Markdown extension to include local or remote files
Project description
Include extension for Python Markdown. It lets you include local or remote (downloadable) files into your markdown at arbitrary positions.
This project is motivated by markdown-include and provides the same functionalities with some extras.
Inclusion for local file is by default recursive and for remote file non-recursive. You can change this behavior through configuration.
You can include part of the file by slicing according to line/column number.
File/Downloaded contents are cached, i.e if you include same file multiple times in multiple places, they won't be read/downloaded more than once. This behavior can also be changed with configuration.
Circular inclusion by default raises an exception. You can change this behavior to include the affected files in non-recursive mode through configuration.
You should not use markdown-include along with this extension, choose either one, not both.
Syntax
- Simple:
{! file_path_or_url !}
- With explicit encoding:
{! file_path_or_url | encoding !}
- With recurs_state on:
{!+ file_path_or_url !}
or{!+ file_path_or_url | encoding !}
. This makes the included file to be able to include other files. This is meaningful only when recursion is set toNone
. If it is set toFalse
, this explicit recurs_state defintion can not force recursion. This is a depth 1 recursion, so you can choose which one to recurs and which one to not. - With recurs_state off:
{!- file_path_or_url !}
or{!- file_path_or_url | encoding !}
. This will force not to recurs even when recursion is set toTrue
. - Escaped syntax: You can escape it to get the literal. For example,
\{! file_path_or_url !}
will give you literal{! file_path_or_url !}
and\\\{! file_path_or_url !}
will give you\{! file_path_or_url !}
- File slice: You can slice a file by line and column number. The syntax is
{! file_path [ln:l.c-l.c,l.c-l.c,...] !}
. No spaces allowed inside file slice syntax[ln:l.c-l.c,l.c-l.c,]
. See more detals in File slicing section.
General syntax: {!recurs_state file_path_or_url [ln:slice_syntax] | encoding !}
The spaces are not necessary. They are just to make it look nice :) . No spaces allowed between
{!
and recurs_state (+-
)
You can change the syntax!!!
If you don't like the syntax you can change it through configuration.
There might be some complications with the syntax {!file!}
, for example, conflict with markdown.extensions.attr_list
which uses {:?something}
. As the :
is optional, a typical problem that occurs is this one:
A paragraph
{!our syntax!}
would produce:
<p syntax_="syntax!" _our="!our">A paragraph</p>
If you really want to avoid this type of collision, find some character sequence that is not being used by any extension that you are using and use those character sequences to make up the syntax.
See the configuration section for details
Install
Install from Pypi:
pip install mdx_include
Usage
text = r"""
some text {! test1.md !} some more text {! test2.md | utf-8 !}
Escaping will give you the exact literal \{! some_file !}
If you escape, then the backslash will be removed.
If you want the backslash too, then provide two more: \\\{! some_file !}
"""
md = markdown.Markdown(extensions=['mdx_include'])
html = md.convert(text)
print(html)
Example output:
(when test1.md contains a single line **This is test1.md**
and test2.md contains **This is test2.md**
)
<p>some text <strong>This is test1.md</strong> some more text <strong>This is test2.md</strong></p>
<p>Escaping will give you the exact literal {! some_file !}</p>
<p>If you escape, then the backslash will be removed.</p>
<p>If you want the backslash too, then provide two more: \{! some_file !}</p>
Configuration
Config param | Default | Details |
---|---|---|
base_path |
. |
The base path from which relative paths are normalized. |
encoding |
utf-8 |
The file encoding. |
allow_local |
True |
Whether to allow including local files. |
allow_remote |
True |
Whether to allow including remote files. |
truncate_on_failure |
True |
Whether to truncate the matched include syntax on failure. False value for both allow_local and allow_remote is treated as a failure. |
recurs_local |
True |
Whether the inclusions are recursive on local files. Options are: True , False and None . None is a neutral value with negative default and overridable with recurs_state (e.g {!+file!} ). False will permanently prevent recursion i.e you won't be able to override it with the recurs_state. True value is overridable with recurs_state (e.g {!-file!} ). |
recurs_remote |
False |
Whether the inclusions are recursive on remote files. Options are: True , False and None . None is a neutral value with negative default and overridable with recurs_state (e.g {!+file!} ). False will permanently prevent recursion i.e you won't be able to override it with the recurs_state. True value is overridable with recurs_state (e.g {!-file!} ). |
syntax_left |
\{! |
The left boundary of the syntax. (Used in regex, thus escaped { ). |
syntax_right |
!\} |
The right boundary of the syntax. (Used in regex, thus escaped } ). |
syntax_delim |
\| |
The delimiter that separates encoding from path_or_url. (Used in regex, thus escaped | ). |
syntax_recurs_on |
+ |
The character to specify recurs_state on. (Used in regex). |
syntax_recurs_off |
- |
The character to specify recurs_state off. (Used in regex). |
content_cache_local |
True |
Whether to cache content for local files. |
content_cache_remote |
True |
Whether to cache content for remote files. |
content_cache_clean_local |
False |
Whether to clean content cache for local files after processing all the includes |
content_cache_clean_remote |
False |
Whether to clean content cache for remote files after processing all the includes |
allow_circular_inclusion |
False |
Whether to allow circular inclusion. If allowed, the affected files will be included in non-recursive mode, otherwise it will raise an exception. |
line_slice_separator |
['',''] |
A list of lines that will be used to separate parts specified by line slice syntax: 1-2,3-4,5 etc. |
Example with configuration
configs = {
'mdx_include': {
'base_path': 'mdx_include/test/',
'encoding': 'utf-8',
'allow_local': True,
'allow_remote': True,
'truncate_on_failure': False,
'recurs_local': None,
'recurs_remote': False,
'syntax_left': r'\{!',
'syntax_right': r'!\}',
'syntax_delim': r'\|',
'syntax_recurs_on': '+',
'syntax_recurs_off': '-',
},
}
text = r"""
some text {! some_file !} some more text {! some_more_file | utf-8 !}
Escaping will give you the exact literal \{! some_file !}
If you escape, then the backslash will be removed.
If you want the backslash too, then provide two more: \\\{! some_file !}
"""
md = markdown.Markdown(extensions=['mdx_include'], extension_configs=configs)
html = md.convert(text)
print(html)
File slicing
You can include part of the file from certain line/column number to certain line/column number.
The general file slice syntax is: [ln:l.c-l.c,l.c-l.c,...]
, where l is the line number and c is the column number. All indexes are inclusive.
Examples:
Slice | Details |
---|---|
[ln:1-4] |
line 1 to 4 (both inclusive) |
[ln:1.2-3.4] |
character 2 in line 1 from character 4 in line 3 |
[ln:2-] |
line 2 to all of the rest |
[ln:-3] |
Last line to 3rd line (reversion) |
[ln:6-2] |
6th line to 2nd line (reversion) |
[ln:2.9-2.2] |
From 9th character of line 2 to 2nd character of line 2 (string reverse) |
[ln:.3-.10] |
Slice along the column from every row, from 3rd character to 10th character |
[ln:2] |
line 2 only |
Multiple slicing can be done by adding more slice expressions with commas (s,
). In this case, a separator (default is two newlines) is inserted between each slice. For example, with slice expression 1-2,4-9
, two newlines will be inserted between the lines 1-2 and 4-9.
More details on the rcslice doc
Manual cache control
The configuration gives you enough cache control, but that's not where it ends :). You can do manual cache cleaning instead of letting the extension handle it for itself. First turn the auto cache cleaning off by setting content_cache_clean_local
and/or content_cache_clean_remote
to False
(this is default), then call the cache cleaning function manually on the markdown object whenever you want:
md.mdx_include_content_cache_clean_local()
md.mdx_include_content_cache_clean_remote()
You can also get the internal cache dictionary and make inplace modification (e.g cleaning a specific cache for a specific file/URL, or even modify the cached content):
local_cache_dict = md.mdx_include_get_content_cache_local()
remote_cache_dict = md.mdx_include_get_content_cache_remote()
How circular inclusion works
Let's say, there are three files, A, B and C. A includes B, B includes C and C inclues A and we are doing recursive include.
If circular inclusion is not allowed in the config i.e if allow_circular_inclusion
is False
(which is the default) then it will raise an exception.
If allow_circular_inclusion
is set to True
, then it will work like this:
- A and B will be normally included
- B includes C normally too
- C includes A which is a circular inclusion (
C>A>B>C>A>B>C...
). Thus A will be included in non-recursive mode asallow_circular_inclusion
is set toTrue
i.e C will include A literally without parsing A anymore.
An example of including a gist
The following markdown:
Including a gist:
```python
{! https://gist.github.com/drgarcia1986/3cce1d134c3c3eeb01bd/raw/73951574d6b62a18b4c342235006ff89d299f879/django_hello.py !}
```
will produce (with fenced code block enabled):
<p>Including a gist:</p>
<pre><code class="python"># -*- coding: utf-8 -*-
# Settings
from django.conf import settings
settings.configure(
DEBUG=True,
SECRET_KEY='secretfoobar',
ROOT_URLCONF=__name__,
MIDDLEWARE_CLASSES=(
'django.middleware.common.CommonMiddleware',
'django.middleware.csrf.CsrfViewMiddleware',
'django.middleware.clickjacking.XFrameOptionsMiddleware',
)
)
# Views
from django.http import HttpResponse
from django.conf.urls import url
def index(request):
return HttpResponse('<h1>Hello Word</h1>')
# Routes
urlpatterns = (
url(r'^$', index),
)
# RunServer
if __name__ == '__main__':
from django.core.management import execute_from_command_line
import sys
execute_from_command_line(sys.argv)
</code></pre>
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file mdx_include-1.1.4.tar.gz
.
File metadata
- Download URL: mdx_include-1.1.4.tar.gz
- Upload date:
- Size: 14.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 775b0e57506a3aa50b25c227fec1919a6b5dacb7aec74cc5e15918727c2783c2 |
|
MD5 | 9d4eb67087d6083cae5eb01bf36009d5 |
|
BLAKE2b-256 | f24af787e78fbf75f27962b36c532a1c930c37c76340ca3e5a8ec6b603cca1f0 |
File details
Details for the file mdx_include-1.1.4-py3-none-any.whl
.
File metadata
- Download URL: mdx_include-1.1.4-py3-none-any.whl
- Upload date:
- Size: 10.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e6fb73cbc7ab36b1731ef161c062cfb9c69986c9a9e3f6969b9462180f931b7e |
|
MD5 | 96fc5674cfa2cfbd621150b84861485c |
|
BLAKE2b-256 | 72b82b492910713ba8494f8944144313a6f76c60d5a12bab04e30bacb25035e1 |