Rechercher dans le manuel MySQL
10.8.4 Collation Coercibility in Expressions
In the great majority of statements, it is obvious what
collation MySQL uses to resolve a comparison operation. For
example, in the following cases, it should be clear that the
collation is the collation of column x
:
However, with multiple operands, there can be ambiguity. For example:
Should the comparison use the collation of the column
x
, or of the string literal
'Y'
? Both x
and
'Y'
have collations, so which collation takes
precedence?
A mix of collations may also occur in contexts other than
comparison. For example, a multiple-argument concatenation
operation such as CONCAT(x,'Y')
combines its arguments to produce a single string. What
collation should the result have?
To resolve questions like these, MySQL checks whether the collation of one item can be coerced to the collation of the other. MySQL assigns coercibility values as follows:
An explicit
COLLATE
clause has a coercibility of 0 (not coercible at all).The concatenation of two strings with different collations has a coercibility of 1.
The collation of a column or a stored routine parameter or local variable has a coercibility of 2.
A “system constant” (the string returned by functions such as
USER()
orVERSION()
) has a coercibility of 3.The collation of a literal has a coercibility of 4.
The collation of a numeric or temporal value has a coercibility of 5.
NULL
or an expression that is derived fromNULL
has a coercibility of 6.
MySQL uses coercibility values with the following rules to resolve ambiguities:
Use the collation with the lowest coercibility value.
If both sides have the same coercibility, then:
If both sides are Unicode, or both sides are not Unicode, it is an error.
If one of the sides has a Unicode character set, and another side has a non-Unicode character set, the side with Unicode character set wins, and automatic character set conversion is applied to the non-Unicode side. For example, the following statement does not return an error:
It returns a result that has a character set of
utf8
and the same collation asutf8_column
. Values oflatin1_column
are automatically converted toutf8
before concatenating.For an operation with operands from the same character set but that mix a
_bin
collation and a_ci
or_cs
collation, the_bin
collation is used. This is similar to how operations that mix nonbinary and binary strings evaluate the operands as binary strings, except that it is for collations rather than data types.
Although automatic conversion is not in the SQL standard, the standard does say that every character set is (in terms of supported characters) a “subset” of Unicode. Because it is a well-known principle that “what applies to a superset can apply to a subset,” we believe that a collation for Unicode can apply for comparisons with non-Unicode strings.
The following table illustrates some applications of the preceding rules.
Comparison | Collation Used |
---|---|
column1 = 'A' |
Use collation of column1 |
column1 = 'A' COLLATE x |
Use collation of 'A' COLLATE x |
column1 COLLATE x = 'A' COLLATE y |
Error |
To determine the coercibility of a string expression, use the
COERCIBILITY()
function (see
Section 12.15, “Information Functions”):
- -> 0
- -> 3
- -> 4
- -> 5
For implicit conversion of a numeric or temporal value to a
string, such as occurs for the argument 1
in
the expression CONCAT(1, 'abc')
,
the result is a character (nonbinary) string that has a
character set and collation determined by the
character_set_connection
and
collation_connection
system
variables. See Section 12.2, “Type Conversion in Expression Evaluation”.
Traduction non disponible
Le manuel MySQL n'est pas encore traduit en français sur l'infobrol. Seule la version anglaise est disponible pour l'instant.
Document créé le 26/06/2006, dernière modification le 26/10/2018
Source du document imprimé : https://www.gaudry.be/mysql-rf-charset-collation-coercibility.html
L'infobrol est un site personnel dont le contenu n'engage que moi. Le texte est mis à disposition sous licence CreativeCommons(BY-NC-SA). Plus d'info sur les conditions d'utilisation et sur l'auteur.
Références
Ces références et liens indiquent des documents consultés lors de la rédaction de cette page, ou qui peuvent apporter un complément d'information, mais les auteurs de ces sources ne peuvent être tenus responsables du contenu de cette page.
L'auteur de ce site est seul responsable de la manière dont sont présentés ici les différents concepts, et des libertés qui sont prises avec les ouvrages de référence. N'oubliez pas que vous devez croiser les informations de sources multiples afin de diminuer les risques d'erreurs.