Skip to content

Instantly share code, notes, and snippets.

@eddiewebb
Last active October 24, 2024 06:51
Show Gist options
  • Save eddiewebb/735feb48f50f0ddd65ae5606a1cb41ae to your computer and use it in GitHub Desktop.
Save eddiewebb/735feb48f50f0ddd65ae5606a1cb41ae to your computer and use it in GitHub Desktop.
Hugo JS Searching with Fuse.js

Client side searching for Hugo.io with Fuse.js

WHY

This gist shows how to implement client side searching with nothing but Hugo and a few common JS tools.

  • No NPM, grunt, etc
  • No additional build time steps, just hugo as you would normally.
  • Easy to swap out choice of client side search tools, anything that can use a json index
  • Highlights matching keywords in results

Sample

You can visit the Hugo Resume Theme with example site to quickly explore this feature, or visit live site (try "devops", "atlassian developer", or "rest api" as good sample searches).

Setup

  1. Add the file shown here in root directory or under themes/<themeName>
  2. Add JSON as additional output format in config.toml
  3. hugo
  4. Visit localhost:1313/search

Files

content/search.md

---
title: "Search Results"
sitemap:
  priority : 0.1
layout: "search"
---


This file exists solely to respond to /search URL with the related `search` layout template.

No content shown here is rendered, all content is based in the template layouts/page/search.html

Setting a very low sitemap priority will tell search engines this is not important content.

This implementation uses Fusejs, jquery and mark.js


## Initial setup

Search  depends on additional output content type of JSON in config.toml
\```
[outputs]
  home = ["HTML", "JSON"]
\```

## Searching additional fileds

To search additional fields defined in front matter, you must add it in 2 places.

### Edit layouts/_default/index.JSON
This exposes the values in /index.json
i.e. add `category`
\```
...
  "contents":{{ .Content | plainify | jsonify }}
  {{ if .Params.tags }},
  "tags":{{ .Params.tags | jsonify }}{{end}},
  "categories" : {{ .Params.categories | jsonify }},
...
\```

### Edit fuse.js options to Search
`static/js/search.js`
\```
keys: [
  "title",
  "contents",
  "tags",
  "categories"
]
\```


layouts/_default/search.html

This is the page rendered when viewing /search in your browser. THis example uses the template functionality of "base" and "blocks", to add my required JS files right above </body> but only on this page. You can use any template, as long as you include the 3rd part libs (jquery, fuse, mark.js) before search.js, it will work.

{{ define "footerfiles" }}
<script src="https://code.jquery.com/jquery-3.3.1.min.js"></script>
<script src="https://cdnjs.cloudflare.com/ajax/libs/fuse.js/3.2.0/fuse.min.js"></script>
<script src="https://cdnjs.cloudflare.com/ajax/libs/mark.js/8.11.1/jquery.mark.min.js"></script>
<script src="{{ "js/search.js" | absURL }}"></script>
{{ end }}
{{ define "main" }}
<section class="resume-section p-3 p-lg-5 d-flex flex-column">
  <div class="my-auto" >
    <form action="{{ "search" | absURL }}">
      <input id="search-query" name="s"/>
    </form>
    <div id="search-results">
     <h3>Matching pages</h3>
    </div>
  </div>
</section>
<!-- this template is sucked in by search.js and appended to the search-results div above. So editing here will adjust style -->
<script id="search-result-template" type="text/x-js-template">
    <div id="summary-${key}">
      <h4><a href="${link}">${title}</a></h4>
      <p>${snippet}</p>
      ${ isset tags }<p>Tags: ${tags}</p>${ end }
      ${ isset categories }<p>Categories: ${categories}</p>${ end }
    </div>
</script>
{{ end }}

static/js/search.js

This file uses jquery, fuse.js, mark.js to search the hugo created index, and return matching content, with highlighting.


summaryInclude=60;
var fuseOptions = {
  shouldSort: true,
  includeMatches: true,
  threshold: 0.0,
  tokenize:true,
  location: 0,
  distance: 100,
  maxPatternLength: 32,
  minMatchCharLength: 1,
  keys: [
    {name:"title",weight:0.8},
    {name:"contents",weight:0.5},
    {name:"tags",weight:0.3},
    {name:"categories",weight:0.3}
  ]
};


var searchQuery = param("s");
if(searchQuery){
  $("#search-query").val(searchQuery);
  executeSearch(searchQuery);
}else {
  $('#search-results').append("<p>Please enter a word or phrase above</p>");
}



function executeSearch(searchQuery){
  $.getJSON( "/index.json", function( data ) {
    var pages = data;
    var fuse = new Fuse(pages, fuseOptions);
    var result = fuse.search(searchQuery);
    console.log({"matches":result});
    if(result.length > 0){
      populateResults(result);
    }else{
      $('#search-results').append("<p>No matches found</p>");
    }
  });
}

function populateResults(result){
  $.each(result,function(key,value){
    var contents= value.item.contents;
    var snippet = "";
    var snippetHighlights=[];
    var tags =[];
    if( fuseOptions.tokenize ){
      snippetHighlights.push(searchQuery);
    }else{
      $.each(value.matches,function(matchKey,mvalue){
        if(mvalue.key == "tags" || mvalue.key == "categories" ){
          snippetHighlights.push(mvalue.value);
        }else if(mvalue.key == "contents"){
          start = mvalue.indices[0][0]-summaryInclude>0?mvalue.indices[0][0]-summaryInclude:0;
          end = mvalue.indices[0][1]+summaryInclude<contents.length?mvalue.indices[0][1]+summaryInclude:contents.length;
          snippet += contents.substring(start,end);
          snippetHighlights.push(mvalue.value.substring(mvalue.indices[0][0],mvalue.indices[0][1]-mvalue.indices[0][0]+1));
        }
      });
    }

    if(snippet.length<1){
      snippet += contents.substring(0,summaryInclude*2);
    }
    //pull template from hugo templarte definition
    var templateDefinition = $('#search-result-template').html();
    //replace values
    var output = render(templateDefinition,{key:key,title:value.item.title,link:value.item.permalink,tags:value.item.tags,categories:value.item.categories,snippet:snippet});
    $('#search-results').append(output);

    $.each(snippetHighlights,function(snipkey,snipvalue){
      $("#summary-"+key).mark(snipvalue);
    });

  });
}

function param(name) {
    return decodeURIComponent((location.search.split(name + '=')[1] || '').split('&')[0]).replace(/\+/g, ' ');
}

function render(templateString, data) {
  var conditionalMatches,conditionalPattern,copy;
  conditionalPattern = /\$\{\s*isset ([a-zA-Z]*) \s*\}(.*)\$\{\s*end\s*}/g;
  //since loop below depends on re.lastInxdex, we use a copy to capture any manipulations whilst inside the loop
  copy = templateString;
  while ((conditionalMatches = conditionalPattern.exec(templateString)) !== null) {
    if(data[conditionalMatches[1]]){
      //valid key, remove conditionals, leave contents.
      copy = copy.replace(conditionalMatches[0],conditionalMatches[2]);
    }else{
      //not valid, remove entire section
      copy = copy.replace(conditionalMatches[0],'');
    }
  }
  templateString = copy;
  //now any conditionals removed we can do simple substitution
  var key, find, re;
  for (key in data) {
    find = '\\$\\{\\s*' + key + '\\s*\\}';
    re = new RegExp(find, 'g');
    templateString = templateString.replace(re, data[key]);
  }
  return templateString;
}



layouts/_default/index.json

Hugo already builds indexes of all pages, we can cherry-pick which aspects should be searchable. The result is a newly created JSON index at /index.json

{{- $.Scratch.Add "index" slice -}}
{{- range .Site.RegularPages -}}
    {{- $.Scratch.Add "index" (dict "title" .Title "tags" .Params.tags "categories" .Params.categories "contents" .Plain "permalink" .Permalink) -}}
{{- end -}}
{{- $.Scratch.Get "index" | jsonify -}}

Config.toml

Add this snippet to your config file to instruct Hugo to create the index file in JSON format. (RSS and HTML are default outputs, what's important is to add JSON.

...
[outputs]
  home = ["HTML", "RSS", "JSON"]

Alternately if using a custom _index.md for home page, you can just add the output formats to front matter.


outputs:
- html
- rss
- json

See https://gohugo.io/templates/output-formats#output-formats-for-pages

MIT License

Copyright 2022 Edward A. Webbinaro

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

@jgoerzen
Copy link

jgoerzen commented Feb 7, 2022

@eddiewebb Would you by any chance be able to put a license like MIT on this? I want to run my website with purely Open Source components and this would let me be very clear that it's happening that way. Thanks!

@eddiewebb
Copy link
Author

@jgoerzen, done.

@dsryu0822
Copy link

I guess live site doesn't work.

@eddiewebb
Copy link
Author

I guess live site doesn't work.

Sorry, bad jQuery version, it's fixed again.

@strobaek
Copy link

Hi,
Thank you for a great description for adding search to your hugo site.
I have tried to set up ad described.
I have the search page.
My question is: How to I tag the md-pages so they will show up when searching?
I currently just get a blank result, so I must be doing something wrong.
Thank you.

@strobaek
Copy link

Follow up:
I can see that if I add a tags and categories item in the header section of the md-page, they appear in the docs folder (after running hugo).
Unfortunately my search result is still blank.

@onegit20
Copy link

onegit20 commented Aug 24, 2022

thanks all!

针对hugo-theme-even修改支持中文搜索,不搜索categories
https://gist.github.com/onegit20/db426ec7c90c545391168a12ebadbd4e

站点效果:https://yanyong.cc/search/

@ludard
Copy link

ludard commented Oct 14, 2022

Thanks for this gist. I made changes to the hugo-theme-jane theme based on this gist.
The following features are now supported:

  • Chinese Content Search
  • Consistent web style with the jane theme
  • Search page adapts to mobile, tablet, PC and other devices

https://gist.github.com/ludard/cac7ec7f9c1b167dbcb8b87210c72956

Search Page: https://ludard.com/search


感谢gist和各位的贡献,我在此基础上做了些修改。主要功能:

  • 支持中文搜索
  • 搜索页面适配 jane 主题风格,相同的主题色、页头和页脚等
  • 搜索页面适配手机、平板、PC 等多设备

修改后的 gist: https://gist.github.com/ludard/cac7ec7f9c1b167dbcb8b87210c72956
搜索页面:https://ludard.com/search
中文搭建教程:https://ludard.com/post/tech/create-blog-step-by-step-5

@TigGrak
Copy link

TigGrak commented Nov 26, 2022

sorry but it could not work when i use this way to search my blog with the theme stack, the search web always shows nothing even i enter the key words which my posts' title have

and this is its erros
image

(i built my blog on url, http://xx.com/blog not http://xx.com/)

image

I think the error may be in the index.json
But I've set it up following the tutorial

@bfredo123
Copy link

Hi, many thanks for this contribution! I have an issue to search content generated by page variables, which is not indexed:

  • I am using a theme (gethugothemes' agico)
  • this theme provides templates to populate pages with some nice layout. For this, one has to create a 'feature.content' page variable, which is used by the template to generate the final page content. And the actual content in the .md file is empty, everything is in the front matter.
  • so, the page content is actually not available once the index.json file is built (at least, this is my understanding, because the ".Plain" attribute is empty, and same issue with ".Content" and ".RawContent")
  • as a result, this content cannot be searched, which is my issue (other pages with regular content can be searched pretty well).

Is there a way to work this around? (I guess, by enforcing the generation of index.json once the other pages have been built?)

Many thanks! [ I may be wrong in my assumptions because I am a newbye in Hugo ]

@ttntm
Copy link

ttntm commented Sep 2, 2023

@bfredo123 I've encountered something like you are deact a long time ago and figured out a workaround back then.

No idea how accurate this is in 2023, haven't used hugo in a long time, but it might give you some ideas: https://ttntm.me/blog/hugo-plain-function-ignores-page-resources/

@josineto
Copy link

josineto commented Sep 2, 2023

Hi @bfredo123 , I think your case is simpler than the solution proposed by @ttntm (which helps too). I haven't looked in a long time this code, so I'll explain broadly:

  • so, the page content is actually not available once the index.json file is built (at least, this is my understanding, because the ".Plain" attribute is empty, and same issue with ".Content" and ".RawContent")

In the template that creates index.json, simply add a line that, for each page, inserts a field with the frontmatter variable that you need to search (feature.content), to make it searchable.

  • as a result, this content cannot be searched, which is my issue (other pages with regular content can be searched pretty well).

In the code that makes the search (the .js file), in the section that uses the index.json field, insert the field you've just created as one of the portions to search.

And voilà !

@bfredo123
Copy link

Sounds great @ttntm and @jzeneto. Actually both seem to be the same approach: add an extra search key, fulfilled with page resources or equivalent .feature.content stuff. Will try that right away!
Thank you!

@bfredo123
Copy link

Another issue I am facing: some times (randomly?), the search returns twice the same page in the result list. I could verify that this comes from the fuse.search() result. But sometimes, restarting "Hugo server" fixes i. Any clue about what it is due to?

@josineto
Copy link

josineto commented Sep 2, 2023

@bfredo123 that happens to me too, but only in development, not in production. So I just ignore it haha

@bfredo123
Copy link

bfredo123 commented Sep 2, 2023

Sounds good... :)
Just implemented the tip about params-generated content search, seems to work great (it takes into account various params managed by the template: "description", "about", and the "title", "subtitle" and "content" of the "about_item" array within "about". Here it is for those interested:

{{- $.Scratch.Add "index" slice -}}
{{- range site.RegularPages -}}
  {{- $sc := newScratch -}}
  {{- if isset .Params "description" -}}
    {{- $sc.Add "ct" .Description  -}}
  {{- end -}}

  {{- if isset .Params "about" -}}
    {{- range .Params.About.about_item }}
      {{- $sc.Add "ct" (print .title " " .subtitle " " .content) -}}
    {{- end -}}
  {{- end -}}
  {{- $sc.Add "ct" .Plain -}}
  {{- $content := $sc.Get "ct" }}

{{ $date:= .PublishDate.Format "02"}}
  {{- $.Scratch.Add "index" (dict "title" .Title "date" $date "tags" .Params.tags "image" .Params.image "categories" .Params.categories "contents" $content "permalink" .Permalink) -}}
{{- end -}}
{{- $.Scratch.Get "index" | jsonify -}}

Thank you again for your great and superfast help, I was a bit desperate!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment