Built-in search plugin¶
The search plugin adds a search bar to the header, allowing users to search your documentation. It's powered by lunr.js, a lightweight full-text search engine for the browser, elimininating the need for external services, and even works when building offline-capable documentation.
Objective¶
How it works¶
The plugin scans the generated HTML and builds a search index from all pages and sections by extracting the section titles and contents. It preserves some inline formatting like code blocks and lists, but removes all other formatting, so the search index is as small as possible.
When a user visits your site, the search index is shipped to the browser, indexed with lunr.js and made available for fast and simple querying – no server needed. This ensures that the search index is always up to date with your documentation, yielding accurate results.
When to use it¶
It's generally recommended to use the plugin, as interactive search functionality is a vital part of every good documentation. Additionally, the plugin integrates perfectly with several of the other built-in plugins that Material for MkDocs offers:
-
The offline plugin adds support for building offline-capable documentation, so you can distribute the
site
directory as a.zip
file that can be downloaded.
Your documentation can work without connectivity to the internet
-
The meta plugin makes it easy to boost specific sections in search results or to exclude them entirely from being indexed, giving more granular control over search.
Simpler organization and management of search in different subsections
Configuration¶
As with all built-in plugins, getting started with the search plugin is straightforward. Just add the following lines to mkdocs.yml
, and your users will be able to search your documentation:
The search plugin is built into Material for MkDocs and doesn't need to be installed.
General¶
The following settings are available:
enabled
¶
9.2.9 true
Use this setting to enable or disable the plugin when building your project. It's normally not necessary to specify this setting, but if you want to disable the plugin, use:
Search¶
The following settings are available for search:
lang
¶
Use this setting to specify the language of the search index, enabling stemming support for other languages than English. The default value is automatically computed from the site language, but can be explicitly set to another language or even multiple languages with:
Language support is provided by lunr languages, a collection of language-specific stemmers and stop words for lunr.js maintained by the Open Source community.
The following languages are currently supported by lunr languages:
ar
– Arabicda
– Danishde
– Germandu
– Dutchen
– Englishes
– Spanishfi
– Finnishfr
– Frenchhi
– Hindihu
– Hungarianhy
– Armenianit
– Italianja
– Japanesekn
- Kannadako
– Koreanno
– Norwegianpt
– Portuguesero
– Romanianru
– Russiansa
– Sanskritsv
– Swedishta
– Tamilte
– Teluguth
– Thaitr
– Turkishvi
– Vietnamesezh
– Chinese
If lunr languages doesn't provide support for the selected site language, the plugin falls back to another language that yields the best stemming results. If you discover that the search results are not satisfactory, you can contribute to lunr languages by adding support for your language.
separator
¶
Use this setting to specify the separator used to split words when building the search index on the client side. The default value is automatically computed from the site language, but can also be explicitly set to another value with:
Separators support positive and negative lookahead assertions, which allows for rather complex expressions that yield precise control over how words are split when building the search index.
Broken into its parts, this separator induces the following behavior:
The first part of the expression inserts token boundaries for each document before and after whitespace, hyphens, commas, brackets and other special characters. If several of those special characters are adjacent, they are treated as one.
Many programming languages have naming conventions like PascalCase
or camelCase
. By adding this subexpression to the separator, words are split at case changes, tokenizing the word PascalCase
into Pascal
and Case
.
When adding .
to the separator, version strings like 1.2.3
are split into 1
, 2
and 3
, which makes them undiscoverable via search. When using this subexpression, a small lookahead is introduced which will preserve version strings and keep them discoverable.
If your documentation includes HTML/XML code examples, you may want to allow users to find specific tag names. Unfortunately, the <
and >
control characters are encoded in code blocks as <
and >
. Adding this subexpression to the separator allows for just that.
pipeline
¶
Use this setting to specify the pipeline functions that are used to filter and expand tokens after tokenizing them with the separator
and before adding them to the search index. The default value is automatically computed from the site language, but can also be explicitly set with:
The following pipeline functions can be used:
stemmer
– Stem tokens to their root form, e.g.running
torun
stopWordFilter
– Filter common words according, e.g.a
,the
, etc.trimmer
– Trim whitespace from tokens
Segmentation¶
The plugin supports text segmentation of Chinese via jieba, a popular Chinese text segmentation library. Other languages like Japanese and Korean are currently segmented on the client side, but we're considering to move this functionality into the plugin in the future.
The following settings are available for segmentation:
jieba_dict
¶
Use this setting to specify a custom dictionary to be used by jieba for segmenting text, replacing the default dictionary. jieba comes with several dictionaries, which can be used with:
The following dictionaries are provided by jieba:
- dict.txt.small – 占用内存较小的词典文件
- dict.txt.big – 支持繁体分词更好的词典文件
The provided path is resolved from the root directory.
jieba_dict_user
¶
Use this setting to specify an additional user dictionary to be used by jieba for segmenting text, augmenting the default dictionary. User dictionaries are ideal for tuning the segmenter:
The provided path is resolved from the root directory.
Usage¶
Metadata¶
The following properties are available:
boost
¶
Use this property to increase or decrease the relevance of a page in the search results, giving more weight to them. Use values above 1
to rank up and values below 1
to rank down:
exclude
¶
Use this property to exclude a page from the search results. Note that this will not only remove the page, but also all subsections of the page from the search results: