

Only documents without copyright protection (documents with printing, copying, and editing enabled) will show cached versions and document previews.

In all cases, the values of the metadata fields are indexed as part of the document content. If the document title is the same as the file name, the search appliances uses the first text it discovers in a large font within the document itself. The search appliance uses the PDF document title property as the title in the search index. The search appliance ignores the title tag in a web page if it has only one character. If you want titles extracted from document metadata, do not use a value for the title metadata that is the same as the file name.

How the search appliance makes the determination varies by the document type. The Google Search Appliance analyzes documents during the indexing process to determine which text is the document title and which is the body text. If it cannot determine the mime type from the file extension, then the mime type returned is application/octet-stream.īack to top How the Google Search Appliance Determines the Document Title In this case, when the document is searched for, the search appliance guesses the actual mime type from the file extension. If the search appliance crawls or is fed a document with a mime type that it does not know how to interpret, it associates that document with text/other and indexes it.
