Traditionally websites have consisted of HTML-based web pages. They may also reference various document types, such as PDFs as well as Word and Powerpoint documents. These document types contain textual content as well as images that may be crucial for your business. Thus finding the content is of great importance.
In this tutorial, we’ll review information on AddSearch’s document types feature. First, we will describe some use cases to give you an idea of where finding PDFs and office documents are important. Then we’ll look at how you can set up document types for crawling and indexing as well as filters for search results based on the document types.
Our clients render PDFs and Microsoft Office documents searchable for many reasons. Here are some popular examples:
In all of these cases, making PDFs and Microsoft Office documents searchable gives better user experience and makes the information better accessible. All of the use cases may also benefit a mix of search results that pin together of product web pages and document types. Pinning search results can be done using the pinned results feature. Read more about the pinned results from our documentation and earlier blog post.
Document types indexing is available to free trial users and enterprise customers. For the Small and Large subscription plans the support is available with the purchase of the Plus package add-on.
You can set up document types feature by following these instructions or taking the following steps
When the setting is changed, a full re-crawl is required.
In addition to the content, AddSearch indexes the metadata from PDFs and Microsoft Office documents. There are settings we can use to enhance what is indexed as well as what is shown in the search results. Please contact AddSearch Customer Support if you need help in setting up the search.
You can filter the search results based on document types with category filters. Below you can see the supported category filters for document types
The category filter has the following syntax
categories=doctype_pdf which you can add to the search script.
The following scripts exemplify how to use the document types as filters in the search Widget script. Replace
#### with your site key.
Include search results with PDF as the document type
<script src="https://addsearch.com/js/?key=####&categories=doctype_pdf"> </script>
Include search results with pptx as the document type
<script src="https://addsearch.com/js/?key=####categories=doctype_pptx"> </script>
You can also create combinations of document types to include in the search results. For instance, the following filter includes search results with PDF, pptx and doc as the document type
<script src="https://addsearch.com/js/?key=####categories=doctype_pdf, doctype_pptx, doctype_doc"> </script>
For more information on category filters visit the documentation.
The use cases show why it is important to make different document types searchable. The reason for this is that websites have important content that may come in different document types. Regardless of the document type, it is important that your users find exactly the content they need easily.