Text fields can be evaluated as a set of tokens called terms. Depending on the underlying storage service, the terms may be generated and indexed when a text field is stored, or they may be computed dynamically. Generally, terms are alphanumeric character sequences separated by whitespace and/or punctuation.
In a term, the characters _,
@,
#, and
- are treated as term
separators, creating multiple terms that are part of the same literal. For example, the following literals all define two terms,
john and
smith – the separator character is not part of either term:
The characters ? and
* are single- and multi-character wildcards:
? matches any single character and
* matches any sequence of characters.