Query with Full-Text Search
https://docs.microsoft.com/en-us/sql/relational-databases/search/query-with-full-text-search?view=sql-server-2017
Last updated
https://docs.microsoft.com/en-us/sql/relational-databases/search/query-with-full-text-search?view=sql-server-2017
Last updated
APPLIES TO: SQL Server Azure SQL Database Azure SQL Data Warehouse Parallel Data Warehouse
Write full-text queries by using the predicates CONTAINS and FREETEXT and the rowset-valued functions CONTAINSTABLE and FREETEXTTABLE with a SELECT statement. This article provides examples of each predicate and function and helps you choose the best one to use.
To match words and phrases, use CONTAINS and CONTAINSTABLE.
To match the meaning, but not the exact wording, use FREETEXT and FREETEXTTABLE.
The following examples use the AdventureWorks sample database. For the final release of AdventureWorks, see AdventureWorks Databases and Scripts for SQL Server 2016 CTP3. To run the sample queries, you also have to set up Full-Text Search. For more info, see Get Started with Full-Text Search.
The following example finds all products with a price of $80.99
that contain the word "Mountain"
:SQLCopy
The following example searches for all documents that contain words related to vital safety components
:SQLCopy
The following example returns the description ID and description of all products for which the Descriptioncolumn contains the word "aluminum" near either the word "light" or the word "lightweight." Only rows with a rank of 2 or higher are returned.SQLCopy
The following example extends a FREETEXTTABLE query to return the highest ranked rows first and to add the ranking of each row to the select list. To write a similar query, you have to know that ProductDescriptionID is the unique key column for the ProductDescription table.SQLCopy
Here is an extension of the same query that only returns rows with a rank of 10 or greater:SQLCopy
CONTAINS
/CONTAINSTABLE
and FREETEXT
/FREETEXTTABLE
are useful for different kinds of matching. The following info helps you to choose the best predicate or function for your query:
Match single words and phrases with precise or fuzzy (less precise) matching.
You can also do the following things:
Specify the proximity of words within a certain distance of one another.
Return weighted matches.
Combine search conditions with logical operators. For more info, see Using Boolean operators (AND, OR, and NOT) later in this article.
Match the meaning, but not the exact wording, of specified words, phrases, or sentences (the freetext string).
Matches are generated if any term or form of any term is found in the full-text index of a specified column.
The predicates CONTAINS
/FREETEXT
and the rowset-valued functions CONTAINSTABLE
/FREETEXTTABLE
have different syntax and options. The following info helps you to choose the best predicate or function for your query:
Usage. Use the full-text predicates CONTAINS and FREETEXT in the WHERE or HAVING clause of a SELECT statement.
Results. The CONTAINS and FREETEXT predicates return a TRUE or FALSE value that indicates whether a given row matches the full-text query. Matching rows are returned in the result set.
More options. You can combine the predicates with any of the other Transact-SQL predicates, such as LIKE and BETWEEN.
You can specify either a single column, a list of columns, or all columns in the table to be searched.
Optionally, you can specify the language whose resources are used by the full-text query for word breaking and stemming, thesaurus lookups, and noise-word removal.
You can use a four-part name in the CONTAINS or FREETEXT predicate to query full-text indexed columns of the target tables on a linked server. To prepare a remote server to receive full-text queries, create a full-text index on the target tables and columns on the remote server and then add the remote server as a linked server.
More info. For more info about the syntax and arguments of these predicates, see CONTAINS and FREETEXT.
Usage. Use the full-text functions CONTAINSTABLE and FREETEXTTABLE functions like a regular table name in the FROM clause of a SELECT statement.
You have to specify the base table to search when you use either of these functions. As with the predicates, you can specify a single column, a list of columns, or all columns in the table to be searched, and optionally, the language whose resources are used by given full-text query.
Typically you have to join the results of CONTAINSTABLE or FREETEXTTABLE with the base table. To join the tables, you have to know the unique key column name. This column, which occurs in every full-text enabled table, is used to enforce unique rows for the table (the unique**key column). For more info about the key column, see Create and Manage Full-Text Indexes.
Results. These functions return a table of zero, one, or more rows that match the full-text query. The returned table contains only rows from the base table that match the selection criteria specified in the full-text search condition of the function.
Queries that use one of these functions also return a relevance ranking value (RANK) and full-text key (KEY) for each row returned, as follows:
KEY column. The KEY column returns unique values of the returned rows. The KEY column can be used to specify selection criteria.
RANK column. The RANK column returns a rank value for each row that indicates how well the row matched the selection criteria. The higher the rank value of the text or document in a row, the more relevant the row is for the given full-text query. Different rows can be ranked identically. You can limit the number of matches to be returned by specifying the optional top_n_by_rank parameter. For more information, see Limit Search Results with RANK.
More info. For more info about the syntax and arguments of these functions, see CONTAINSTABLE and FREETEXTTABLE.
You can use CONTAINS, CONTAINSTABLE, FREETEXT, or FREETEXTTABLE to search a table for a specific word or phrase. For example, if you want to search the ProductReview table in the AdventureWorks2012 database to find all comments about a product with the phrase "learning curve," you could use the CONTAINS predicate as follows:SQLCopy
The search condition, in this case "learning curve," can be complex and can be composed of one or more terms.
More info about simple term searches
In full-text search, a word (or token) is a string whose boundaries are identified by appropriate word breakers, following the linguistic rules of the specified language. A valid phrase consists of multiple words, with or without any punctuation marks between them.
For example, "croissant" is a word, and "café au lait" is a phrase. Words and phrases such as these are called simple terms.
CONTAINS and CONTAINSTABLE look for an exact match for the phrase. FREETEXT and FREETEXTTABLE break up the phrase into separate words.
You can use CONTAINS or CONTAINSTABLE to search for words or phrases with a specified prefix. All entries in the column that contain text beginning with the specified prefix are returned. For example, to search for all rows that contain the prefix top
-, as in top``ple
, top``ping
, and top
. The query looks like the following example:SQLCopy
All text that matches the text specified before the asterisk (*) is returned. If the text and asterisk are not delimited by double quotation marks, as in CONTAINS (DESCRIPTION, 'top*')
, full-text search does not consider the asterisk to be a wildcard..
When the prefix term is a phrase, each token making up the phrase is considered a separate prefix term. All rows that have words beginning with the prefix terms will be returned. For example, the prefix term "light bread*" will find rows with text of "light breaded," "lightly breaded," or "light bread," but it will not return "lightly toasted bread."
More info about prefix searches
A prefix term refers to a string that is affixed to the front of a word to produce a derivative word or an inflected form.
For a single prefix term, any word starting with the specified term will be part of the result set. For example, the term "auto*" matches "automatic," "automobile," and so forth.
For a phrase, each word within the phrase is considered to be a prefix term. For example, the term "auto tran*" matches "automatic transmission" and "automobile transducer," but it does not match "automatic motor transmission."
Prefix searches are supported by CONTAINS and CONTAINSTABLE.
You can use CONTAINS, CONTAINSTABLE, FREETEXT, or FREETEXTTABLE to search for all the different tenses and conjugations of a verb or both the singular and plural forms of a noun (an inflectional search) or for synonymous forms of a specific word (a thesaurus search).
The following example searches for any form of "foot" ("foot," "feet," and so on) in the Comments
column of the ProductReview
table in the AdventureWorks
database:SQLCopy
Full-text search uses stemmers, which allow you to search for the different tenses and conjugations of a verb, or both the singular and plural forms of a noun. For more information about stemmers, see Configure and Manage Word Breakers and Stemmers for Search.
More info about generation term searches
The inflectional forms are the different tenses and conjugations of a verb or the singular and plural forms of a noun.
For example, search for the inflectional form of the word "drive." If various rows in the table include the words "drive," "drives," "drove," "driving," and "driven," all would be in the result set because each of these can be inflectionally generated from the word drive.
FREETEXT and FREETEXTTABLE look for inflectional terms of all specified words by default. CONTAINS and CONTAINSTABLE support an optional INFLECTIONAL
argument.
A thesaurus defines user-specified synonyms for terms. For more info about thesaurus files, see Configure and Manage Thesaurus Files for Full-Text Search.
For example, if an entry, "{car, automobile, truck, van}," is added to a thesaurus, you can search for the thesaurus form of the word "car." All rows in the table queried that include the words "automobile," "truck," "van," or "car," appear in the result set because each of these words belongs to the synonym expansion set containing the word "car."
FREETEXT and FREETEXTTABLE use the thesaurus by default. CONTAINS and CONTAINSTABLE support an optional THESAURUS
argument.
A proximity term indicates words or phrases that are near to each other. You can also specify the maximum number of non-search terms that separate the first and last search terms. In addition, you can search for words or phrases in any order, or in the order in which you specify them.
For example, you want to find the rows in which the word "ice" is near the word "hockey" or in which the phrase "ice skating" is near the phrase "ice hockey."
For more info about proximity searches, see Search for Words Close to Another Word with NEAR.
You can use CONTAINSTABLE to search for words or phrases and specify a weighting value. Weight, measured as a number from 0.0 through 1.0, indicates the importance of each word and phrase within a set of words and phrases. A weight of 0.0 is the lowest, and a weight of 1.0 is the highest.
The following example shows a query that searches for all customer addresses, using weights, in which any text beginning with the string "Bay" has either "Street" or "View." The results give a higher rank to those rows that contain more of the words specified.SQLCopy
A weighted term can be used in conjunction with any simple term, prefix term, generation term, or proximity term.
More info about weighted term searches
In a weighted term search, a weighting value indicates the degree of importance for each word and phrase within a set of words and phrases. A weight value of 0.0 is the lowest, and a weight value of 1.0 is the highest.
For example, in a query searching for multiple terms, you can assign each search word a weight value indicating its importance relative to the other words in the search condition. The results for this type of query return the most relevant rows first, according to the relative weight you have assigned to search words. The result sets contain documents or rows containing any of the specified terms (or content between them); however, some results will be considered more relevant than others because of the variation in the weighted values associated with different searched terms.
Weighted term searches are supported by CONTAINSTABLE.
The CONTAINS predicate and CONTAINSTABLE function use the same search conditions. Both support combining several search terms by using Boolean operators - AND, OR, and NOT - to perform logical operations. You can use AND, for example, to find rows that contain both "latte" and "New York-style bagel." You can use AND NOT, for example, to find the rows that contain "bagel" but do not contain "cream cheese."
In contrast, FREETEXT and FREETEXTTABLE treat the Boolean terms as words to be searched.
For information about combining CONTAINS with other predicates that use the logical operators AND, OR, and NOT, see Search Condition (Transact-SQL).
The following example uses the CONTAINS predicate to search for descriptions in which the description ID is not equal to 5 and the description contains both the word "Aluminum" and the word "spindle." The search condition uses the AND Boolean operator. This example uses the ProductDescription table of the AdventureWorks2012database.SQLCopy
When you write full-text queries, you can also specify the following options:
Case sensitivity. Full-text search queries are case-insensitive. However, in Japanese, there are multiple phonetic orthographies in which the concept of orthographic normalization is akin to case insensitivity (for example, kana = insensitivity). This type of orthographic normalization is not supported.
Stopwords. When defining a full-text query, the Full-Text Engine discards stopwords (also called noise words) from the search criteria. Stopwords are words such as "a," "and," "is," or "the," that can occur frequently but that typically do not help when searching for particular text. Stopwords are listed in a stoplist. Each full-text index is associated with a specific stoplist, which determines what stopwords are omitted from the query or the index at indexing time. For more info, see Configure and Manage Stopwords and Stoplists for Full-Text Search.
Language, with the LANGUAGE option. Many query terms depend heavily on word-breaker behavior. To ensure that you are using the correct word breaker (and stemmer) and thesaurus file, we recommend that you specify the LANGUAGE option. For more information, see Choose a Language When Creating a Full-Text Index.
Thesaurus. FREETEXT and FREETEXTTABLE queries use the thesaurus by default. CONTAINS and CONTAINSTABLE support an optional THESAURUS argument. For more info, see Configure and Manage Thesaurus Files for Full-Text Search.
After you apply a given word breaker, thesaurus, and stoplist combination in a query, you can see how Full-Text Search tokenizes the results by using the sys.dm_fts_parser dynamic management view. For more information, see sys.dm_fts_parser (Transact-SQL).
CONTAINS (Transact-SQL) CONTAINSTABLE (Transact-SQL) FREETEXT (Transact-SQL) FREETEXTTABLE (Transact-SQL) Create Full-Text Search Queries (Visual Database Tools) Improve the Performance of Full-Text Queries
Query with Full-Text Search