Rechercher dans le manuel MySQL
12.5 String Functions and Operators
[+/-]
Table 12.7 String Functions and Operators
Name | Description |
---|---|
ASCII() |
Return numeric value of left-most character |
BIN() |
Return a string containing binary representation of a number |
BIT_LENGTH() |
Return length of argument in bits |
CHAR() |
Return the character for each integer passed |
CHAR_LENGTH() |
Return number of characters in argument |
CHARACTER_LENGTH() |
Synonym for CHAR_LENGTH() |
CONCAT() |
Return concatenated string |
CONCAT_WS() |
Return concatenate with separator |
ELT() |
Return string at index number |
EXPORT_SET() |
Return a string such that for every bit set in the value bits, you get an on string and for every unset bit, you get an off string |
FIELD() |
Index (position) of first argument in subsequent arguments |
FIND_IN_SET() |
Index (position) of first argument within second argument |
FORMAT() |
Return a number formatted to specified number of decimal places |
FROM_BASE64() |
Decode base64 encoded string and return result |
HEX() |
Hexadecimal representation of decimal or string value |
INSERT() |
Insert substring at specified position up to specified number of characters |
INSTR() |
Return the index of the first occurrence of substring |
LCASE() |
Synonym for LOWER() |
LEFT() |
Return the leftmost number of characters as specified |
LENGTH() |
Return the length of a string in bytes |
LIKE |
Simple pattern matching |
LOAD_FILE() |
Load the named file |
LOCATE() |
Return the position of the first occurrence of substring |
LOWER() |
Return the argument in lowercase |
LPAD() |
Return the string argument, left-padded with the specified string |
LTRIM() |
Remove leading spaces |
MAKE_SET() |
Return a set of comma-separated strings that have the corresponding bit in bits set |
MATCH |
Perform full-text search |
MID() |
Return a substring starting from the specified position |
NOT LIKE |
Negation of simple pattern matching |
NOT REGEXP |
Negation of REGEXP |
OCT() |
Return a string containing octal representation of a number |
OCTET_LENGTH() |
Synonym for LENGTH() |
ORD() |
Return character code for leftmost character of the argument |
POSITION() |
Synonym for LOCATE() |
QUOTE() |
Escape the argument for use in an SQL statement |
REGEXP |
Whether string matches regular expression |
REGEXP_INSTR() |
Starting index of substring matching regular expression |
REGEXP_LIKE() |
Whether string matches regular expression |
REGEXP_REPLACE() |
Replace substrings matching regular expression |
REGEXP_SUBSTR() |
Return substring matching regular expression |
REPEAT() |
Repeat a string the specified number of times |
REPLACE() |
Replace occurrences of a specified string |
REVERSE() |
Reverse the characters in a string |
RIGHT() |
Return the specified rightmost number of characters |
RLIKE |
Whether string matches regular expression |
RPAD() |
Append string the specified number of times |
RTRIM() |
Remove trailing spaces |
SOUNDEX() |
Return a soundex string |
SOUNDS LIKE |
Compare sounds |
SPACE() |
Return a string of the specified number of spaces |
STRCMP() |
Compare two strings |
SUBSTR() |
Return the substring as specified |
SUBSTRING() |
Return the substring as specified |
SUBSTRING_INDEX() |
Return a substring from a string before the specified number of occurrences of the delimiter |
TO_BASE64() |
Return the argument converted to a base-64 string |
TRIM() |
Remove leading and trailing spaces |
UCASE() |
Synonym for UPPER() |
UNHEX() |
Return a string containing hex representation of a number |
UPPER() |
Convert to uppercase |
WEIGHT_STRING() |
Return the weight string for a string |
String-valued functions return NULL
if the
length of the result would be greater than the value of the
max_allowed_packet
system
variable. See Section 5.1.1, “Configuring the Server”.
For functions that operate on string positions, the first position is numbered 1.
For functions that take length arguments, noninteger arguments are rounded to the nearest integer.
Returns the numeric value of the leftmost character of the string
str
. Returns0
ifstr
is the empty string. ReturnsNULL
ifstr
isNULL
.ASCII()
works for 8-bit characters.See also the
ORD()
function.Returns a string representation of the binary value of
N
, whereN
is a longlong (BIGINT
) number. This is equivalent toCONV(
. ReturnsN
,10,2)NULL
ifN
isNULL
.Returns the length of the string
str
in bits.- -> 32
CHAR(
N
,... [USINGcharset_name
])CHAR()
interprets each argumentN
as an integer and returns a string consisting of the characters given by the code values of those integers.NULL
values are skipped.CHAR()
arguments larger than 255 are converted into multiple result bytes. For example,CHAR(256)
is equivalent toCHAR(1,0)
, andCHAR(256*256)
is equivalent toCHAR(1,0,0)
:- +----------------+----------------+
- +----------------+----------------+
- | 0100 | 0100 |
- +----------------+----------------+
- +------------------+--------------------+
- +------------------+--------------------+
- | 010000 | 010000 |
- +------------------+--------------------+
By default,
CHAR()
returns a binary string. To produce a string in a given character set, use the optionalUSING
clause:- +----------------------+---------------------------------+
- +----------------------+---------------------------------+
- +----------------------+---------------------------------+
If
USING
is given and the result string is illegal for the given character set, a warning is issued. Also, if strict SQL mode is enabled, the result fromCHAR()
becomesNULL
.Returns the length of the string
str
, measured in characters. A multibyte character counts as a single character. This means that for a string containing five 2-byte characters,LENGTH()
returns10
, whereasCHAR_LENGTH()
returns5
.CHARACTER_LENGTH()
is a synonym forCHAR_LENGTH()
.Returns the string that results from concatenating the arguments. May have one or more arguments. If all arguments are nonbinary strings, the result is a nonbinary string. If the arguments include any binary strings, the result is a binary string. A numeric argument is converted to its equivalent nonbinary string form.
CONCAT()
returnsNULL
if any argument isNULL
.- -> 'MySQL'
- -> NULL
- -> '14.3'
For quoted strings, concatenation can be performed by placing the strings next to each other:
- -> 'MySQL'
CONCAT_WS(
separator
,str1
,str2
,...)CONCAT_WS()
stands for Concatenate With Separator and is a special form ofCONCAT()
. The first argument is the separator for the rest of the arguments. The separator is added between the strings to be concatenated. The separator can be a string, as can the rest of the arguments. If the separator isNULL
, the result isNULL
.- -> 'First name,Second name,Last Name'
- -> 'First name,Last Name'
CONCAT_WS()
does not skip empty strings. However, it does skip anyNULL
values after the separator argument.ELT()
returns theN
th element of the list of strings:str1
ifN
=1
,str2
ifN
=2
, and so on. ReturnsNULL
ifN
is less than1
or greater than the number of arguments.ELT()
is the complement ofFIELD()
.EXPORT_SET(
bits
,on
,off
[,separator
[,number_of_bits
]])Returns a string such that for every bit set in the value
bits
, you get anon
string and for every bit not set in the value, you get anoff
string. Bits inbits
are examined from right to left (from low-order to high-order bits). Strings are added to the result from left to right, separated by theseparator
string (the default being the comma character,
). The number of bits examined is given bynumber_of_bits
, which has a default of 64 if not specified.number_of_bits
is silently clipped to 64 if larger than 64. It is treated as an unsigned integer, so a value of −1 is effectively the same as 64.- -> 'Y,N,Y,N'
- -> '0,1,1,0,0,0,0,0,0,0'
Returns the index (position) of
str
in thestr1
,str2
,str3
,...
list. Returns0
ifstr
is not found.If all arguments to
FIELD()
are strings, all arguments are compared as strings. If all arguments are numbers, they are compared as numbers. Otherwise, the arguments are compared as double.If
str
isNULL
, the return value is0
becauseNULL
fails equality comparison with any value.FIELD()
is the complement ofELT()
.Returns a value in the range of 1 to
N
if the stringstr
is in the string liststrlist
consisting ofN
substrings. A string list is a string composed of substrings separated by,
characters. If the first argument is a constant string and the second is a column of typeSET
, theFIND_IN_SET()
function is optimized to use bit arithmetic. Returns0
ifstr
is not instrlist
or ifstrlist
is the empty string. ReturnsNULL
if either argument isNULL
. This function does not work properly if the first argument contains a comma (,
) character.- -> 2
Formats the number
X
to a format like'#,###,###.##'
, rounded toD
decimal places, and returns the result as a string. IfD
is0
, the result has no decimal point or fractional part.The optional third parameter enables a locale to be specified to be used for the result number's decimal point, thousands separator, and grouping between separators. Permissible locale values are the same as the legal values for the
lc_time_names
system variable (see Section 10.15, “MySQL Server Locale Support”). If no locale is specified, the default is'en_US'
.Takes a string encoded with the base-64 encoded rules used by
TO_BASE64()
and returns the decoded result as a binary string. The result isNULL
if the argument isNULL
or not a valid base-64 string. See the description ofTO_BASE64()
for details about the encoding and decoding rules.- -> 'JWJj', 'abc'
For a string argument
str
,HEX()
returns a hexadecimal string representation ofstr
where each byte of each character instr
is converted to two hexadecimal digits. (Multibyte characters therefore become more than two digits.) The inverse of this operation is performed by theUNHEX()
function.For a numeric argument
N
,HEX()
returns a hexadecimal string representation of the value ofN
treated as a longlong (BIGINT
) number. This is equivalent toCONV(
. The inverse of this operation is performed byN
,10,16)CONV(HEX(
.N
),16,10)Returns the string
str
, with the substring beginning at positionpos
andlen
characters long replaced by the stringnewstr
. Returns the original string ifpos
is not within the length of the string. Replaces the rest of the string from positionpos
iflen
is not within the length of the rest of the string. ReturnsNULL
if any argument isNULL
.- -> 'QuWhattic'
- -> 'Quadratic'
- -> 'QuWhat'
This function is multibyte safe.
Returns the position of the first occurrence of substring
substr
in stringstr
. This is the same as the two-argument form ofLOCATE()
, except that the order of the arguments is reversed.This function is multibyte safe, and is case-sensitive only if at least one argument is a binary string.
LCASE()
is a synonym forLOWER()
.LCASE()
used in a view is rewritten asLOWER()
when storing the view's definition. (Bug #12844279)Returns the leftmost
len
characters from the stringstr
, orNULL
if any argument isNULL
.This function is multibyte safe.
Returns the length of the string
str
, measured in bytes. A multibyte character counts as multiple bytes. This means that for a string containing five 2-byte characters,LENGTH()
returns10
, whereasCHAR_LENGTH()
returns5
.NoteThe
Length()
OpenGIS spatial function is namedST_Length()
in MySQL.Reads the file and returns the file contents as a string. To use this function, the file must be located on the server host, you must specify the full path name to the file, and you must have the
FILE
privilege. The file must be readable by the server and its size less thanmax_allowed_packet
bytes. If thesecure_file_priv
system variable is set to a nonempty directory name, the file to be loaded must be located in that directory. (Prior to MySQL 8.0.17, the file must be readable by all, not just readable by the server.)If the file does not exist or cannot be read because one of the preceding conditions is not satisfied, the function returns
NULL
.The
character_set_filesystem
system variable controls interpretation of file names that are given as literal strings.LOCATE(
,substr
,str
)LOCATE(
substr
,str
,pos
)The first syntax returns the position of the first occurrence of substring
substr
in stringstr
. The second syntax returns the position of the first occurrence of substringsubstr
in stringstr
, starting at positionpos
. Returns0
ifsubstr
is not instr
. ReturnsNULL
if any argument isNULL
.- -> 4
- -> 0
- -> 7
This function is multibyte safe, and is case-sensitive only if at least one argument is a binary string.
Returns the string
str
with all characters changed to lowercase according to the current character set mapping. The default isutf8mb4
.LOWER()
(andUPPER()
) are ineffective when applied to binary strings (BINARY
,VARBINARY
,BLOB
). To perform lettercase conversion, convert the string to a nonbinary string:- +-------------+------------------------------------+
- +-------------+------------------------------------+
- | New York | new york |
- +-------------+------------------------------------+
For collations of Unicode character sets,
LOWER()
andUPPER()
work according to the Unicode Collation Algorithm (UCA) version in the collation name, if there is one, and UCA 4.0.0 if no version is specified. For example,utf8mb4_0900_ai_ci
andutf8_unicode_520_ci
work according to UCA 9.0.0 and 5.2.0, respectively, whereasutf8_unicode_ci
works according to UCA 4.0.0. See Section 10.10.1, “Unicode Character Sets”.This function is multibyte safe.
LCASE()
used within views is rewritten asLOWER()
.Returns the string
str
, left-padded with the stringpadstr
to a length oflen
characters. Ifstr
is longer thanlen
, the return value is shortened tolen
characters.Returns the string
str
with leading space characters removed.This function is multibyte safe.
Returns a set value (a string containing substrings separated by
,
characters) consisting of the strings that have the corresponding bit inbits
set.str1
corresponds to bit 0,str2
to bit 1, and so on.NULL
values instr1
,str2
,...
are not appended to the result.MID(
is a synonym forstr
,pos
,len
)SUBSTRING(
.str
,pos
,len
)Returns a string representation of the octal value of
N
, whereN
is a longlong (BIGINT
) number. This is equivalent toCONV(
. ReturnsN
,10,8)NULL
ifN
isNULL
.OCTET_LENGTH()
is a synonym forLENGTH()
.If the leftmost character of the string
str
is a multibyte character, returns the code for that character, calculated from the numeric values of its constituent bytes using this formula:(1st byte code) + (2nd byte code * 256) + (3rd byte code * 256^2) ...
If the leftmost character is not a multibyte character,
ORD()
returns the same value as theASCII()
function.POSITION(
is a synonym forsubstr
INstr
)LOCATE(
.substr
,str
)Quotes a string to produce a result that can be used as a properly escaped data value in an SQL statement. The string is returned enclosed by single quotation marks and with each instance of backslash (
\
), single quote ('
), ASCIINUL
, and Control+Z preceded by a backslash. If the argument isNULL
, the return value is the word “NULL” without enclosing single quotation marks.For comparison, see the quoting rules for literal strings and within the C API in Section 9.1.1, “String Literals”, and Section 28.7.7.56, “mysql_real_escape_string_quote()”.
Returns a string consisting of the string
str
repeatedcount
times. Ifcount
is less than 1, returns an empty string. ReturnsNULL
ifstr
orcount
areNULL
.Returns the string
str
with all occurrences of the stringfrom_str
replaced by the stringto_str
.REPLACE()
performs a case-sensitive match when searching forfrom_str
.This function is multibyte safe.
Returns the string
str
with the order of the characters reversed.This function is multibyte safe.
Returns the rightmost
len
characters from the stringstr
, orNULL
if any argument isNULL
.This function is multibyte safe.
Returns the string
str
, right-padded with the stringpadstr
to a length oflen
characters. Ifstr
is longer thanlen
, the return value is shortened tolen
characters.This function is multibyte safe.
Returns the string
str
with trailing space characters removed.This function is multibyte safe.
Returns a soundex string from
str
. Two strings that sound almost the same should have identical soundex strings. A standard soundex string is four characters long, but theSOUNDEX()
function returns an arbitrarily long string. You can useSUBSTRING()
on the result to get a standard soundex string. All nonalphabetic characters instr
are ignored. All international alphabetic characters outside the A-Z range are treated as vowels.ImportantWhen using
SOUNDEX()
, you should be aware of the following limitations:This function, as currently implemented, is intended to work well with strings that are in the English language only. Strings in other languages may not produce reliable results.
This function is not guaranteed to provide consistent results with strings that use multibyte character sets, including
utf-8
. See Bug #22638 for more information.
NoteThis function implements the original Soundex algorithm, not the more popular enhanced version (also described by D. Knuth). The difference is that original version discards vowels first and duplicates second, whereas the enhanced version discards duplicates first and vowels second.
This is the same as
SOUNDEX(
.expr1
) = SOUNDEX(expr2
)Returns a string consisting of
N
space characters.SUBSTR(
,str
,pos
)SUBSTR(
,str
FROMpos
)SUBSTR(
,str
,pos
,len
)SUBSTR(
str
FROMpos
FORlen
)SUBSTR()
is a synonym forSUBSTRING()
.SUBSTRING(
,str
,pos
)SUBSTRING(
,str
FROMpos
)SUBSTRING(
,str
,pos
,len
)SUBSTRING(
str
FROMpos
FORlen
)The forms without a
len
argument return a substring from stringstr
starting at positionpos
. The forms with alen
argument return a substringlen
characters long from stringstr
, starting at positionpos
. The forms that useFROM
are standard SQL syntax. It is also possible to use a negative value forpos
. In this case, the beginning of the substring ispos
characters from the end of the string, rather than the beginning. A negative value may be used forpos
in any of the forms of this function.For all forms of
SUBSTRING()
, the position of the first character in the string from which the substring is to be extracted is reckoned as1
.- -> 'ratically'
- -> 'barbar'
- -> 'ratica'
- -> 'ila'
- -> 'aki'
- -> 'ki'
This function is multibyte safe.
If
len
is less than 1, the result is the empty string.SUBSTRING_INDEX(
str
,delim
,count
)Returns the substring from string
str
beforecount
occurrences of the delimiterdelim
. Ifcount
is positive, everything to the left of the final delimiter (counting from the left) is returned. Ifcount
is negative, everything to the right of the final delimiter (counting from the right) is returned.SUBSTRING_INDEX()
performs a case-sensitive match when searching fordelim
.- -> 'www.mysql'
- -> 'mysql.com'
This function is multibyte safe.
Converts the string argument to base-64 encoded form and returns the result as a character string with the connection character set and collation. If the argument is not a string, it is converted to a string before conversion takes place. The result is
NULL
if the argument isNULL
. Base-64 encoded strings can be decoded using theFROM_BASE64()
function.- -> 'JWJj', 'abc'
Different base-64 encoding schemes exist. These are the encoding and decoding rules used by
TO_BASE64()
andFROM_BASE64()
:The encoding for alphabet value 62 is
'+'
.The encoding for alphabet value 63 is
'/'
.Encoded output consists of groups of 4 printable characters. Each 3 bytes of the input data are encoded using 4 characters. If the last group is incomplete, it is padded with
'='
characters to a length of 4.A newline is added after each 76 characters of encoded output to divide long output into multiple lines.
Decoding recognizes and ignores newline, carriage return, tab, and space.
TRIM([{BOTH | LEADING | TRAILING} [
,remstr
] FROM]str
)TRIM([
remstr
FROM]str
)Returns the string
str
with allremstr
prefixes or suffixes removed. If none of the specifiersBOTH
,LEADING
, orTRAILING
is given,BOTH
is assumed.remstr
is optional and, if not specified, spaces are removed.- -> 'bar'
- -> 'barxxx'
- -> 'bar'
- -> 'barx'
This function is multibyte safe.
UCASE()
is a synonym forUPPER()
.UCASE()
used within views is rewritten asUPPER()
.For a string argument
str
,UNHEX(
interprets each pair of characters in the argument as a hexadecimal number and converts it to the byte represented by the number. The return value is a binary string.str
)- -> 'MySQL'
- -> 'MySQL'
- -> 'string'
- -> '1267'
The characters in the argument string must be legal hexadecimal digits:
'0'
..'9'
,'A'
..'F'
,'a'
..'f'
. If the argument contains any nonhexadecimal digits, the result isNULL
:A
NULL
result can occur if the argument toUNHEX()
is aBINARY
column, because values are padded with 0x00 bytes when stored but those bytes are not stripped on retrieval. For example,'41'
is stored into aCHAR(3)
column as'41 '
and retrieved as'41'
(with the trailing pad space stripped), soUNHEX()
for the column value returns'A'
. By contrast'41'
is stored into aBINARY(3)
column as'41\0'
and retrieved as'41\0'
(with the trailing pad0x00
byte not stripped).'\0'
is not a legal hexadecimal digit, soUNHEX()
for the column value returnsNULL
.For a numeric argument
N
, the inverse ofHEX(
is not performed byN
)UNHEX()
. UseCONV(HEX(
instead. See the description ofN
),16,10)HEX()
.Returns the string
str
with all characters changed to uppercase according to the current character set mapping. The default isutf8mb4
.See the description of
LOWER()
for information that also applies toUPPER()
. This included information about how to perform lettercase conversion of binary strings (BINARY
,VARBINARY
,BLOB
) for which these functions are ineffective, and information about case folding for Unicode character sets.This function is multibyte safe.
UCASE()
used within views is rewritten asUPPER()
.WEIGHT_STRING(
str
[AS {CHAR|BINARY}(N
)] [flags
])This function returns the weight string for the input string. The return value is a binary string that represents the comparison and sorting value of the string. It has these properties:
If
WEIGHT_STRING(
=str1
)WEIGHT_STRING(
, thenstr2
)
(str1
=str2
str1
andstr2
are considered equal)If
WEIGHT_STRING(
<str1
)WEIGHT_STRING(
, thenstr2
)
(str1
<str2
str1
sorts beforestr2
)
WEIGHT_STRING()
is a debugging function intended for internal use. Its behavior can change without notice between MySQL versions. It can be used for testing and debugging of collations, especially if you are adding a new collation. See Section 10.13, “Adding a Collation to a Character Set”.This list briefly summarizes the arguments. More details are given in the discussion following the list.
str
: The input string expression.AS
clause: Optional; cast the input string to a given type and length.flags
: Optional; unused.
The input string,
str
, is a string expression. If the input is a nonbinary (character) string such as aCHAR
,VARCHAR
, orTEXT
value, the return value contains the collation weights for the string. If the input is a binary (byte) string such as aBINARY
,VARBINARY
, orBLOB
value, the return value is the same as the input (the weight for each byte in a binary string is the byte value). If the input isNULL
,WEIGHT_STRING()
returnsNULL
.Examples:
- +------+---------+------------------------+
- +------+---------+------------------------+
- | AB | 4142 | 1C471C60 |
- +------+---------+------------------------+
- +------+---------+------------------------+
- +------+---------+------------------------+
- | ab | 6162 | 1C471C60 |
- +------+---------+------------------------+
- +------+---------+------------------------+
- +------+---------+------------------------+
- | AB | 4142 | 4142 |
- +------+---------+------------------------+
- +------+---------+------------------------+
- +------+---------+------------------------+
- | ab | 6162 | 6162 |
- +------+---------+------------------------+
The preceding examples use
HEX()
to display theWEIGHT_STRING()
result. Because the result is a binary value,HEX()
can be especially useful when the result contains nonprinting values, to display it in printable form:- +------------------------+
- +------------------------+
- | 0FEA0FEA |
- +------------------------+
For non-
NULL
return values, the data type of the value isVARBINARY
if its length is within the maximum length forVARBINARY
, otherwise the data type isBLOB
.The
AS
clause may be given to cast the input string to a nonbinary or binary string and to force it to a given length:AS CHAR(
casts the string to a nonbinary string and pads it on the right with spaces to a length ofN
)N
characters.N
must be at least 1. IfN
is less than the length of the input string, the string is truncated toN
characters. No warning occurs for truncation.AS BINARY(
is similar but casts the string to a binary string,N
)N
is measured in bytes (not characters), and padding uses0x00
bytes (not spaces).
- +-------------------------------------+
- +-------------------------------------+
- | 41422020 |
- +-------------------------------------+
- +-------------------------------------+
- +-------------------------------------+
- | 0041004200200020 |
- +-------------------------------------+
- +---------------------------------------+
- +---------------------------------------+
- | 61620000 |
- +---------------------------------------+
The
flags
clause currently is unused.
Document created the 26/06/2006, last modified the 26/10/2018
Source of the printed document:https://www.gaudry.be/en/mysql-rf-string-functions.html
The infobrol is a personal site whose content is my sole responsibility. The text is available under CreativeCommons license (BY-NC-SA). More info on the terms of use and the author.
References
These references and links indicate documents consulted during the writing of this page, or which may provide additional information, but the authors of these sources can not be held responsible for the content of this page.
The author This site is solely responsible for the way in which the various concepts, and the freedoms that are taken with the reference works, are presented here. Remember that you must cross multiple source information to reduce the risk of errors.