The Normalizer class
(PHP 5 >= 5.3.0, PHP 7, PECL intl >= 1.0.0)
Introduction
Normalization is a process that involves transforming characters and sequences of characters into a formally-defined underlying representation. This process is most important when text needs to be compared for sorting and searching, but it is also used when storing text to ensure that the text is stored in a consistent representation.
The Unicode Consortium has defined a number of normalization forms reflecting the various needs of applications:
- Normalization Form D (NFD) - Canonical Decomposition
- Normalization Form C (NFC) - Canonical Decomposition followed by Canonical Composition
- Normalization Form KD (NFKD) - Compatibility Decomposition
- Normalization Form KC (NFKC) - Compatibility Decomposition followed by Canonical Composition
Class synopsis
Predefined Constants
The following constants define the normalization form used by the normalizer:
-
Normalizer::FORM_C
(integer) - Normalization Form C (NFC) - Canonical Decomposition followed by Canonical Composition
-
Normalizer::FORM_D
(integer) - Normalization Form D (NFD) - Canonical Decomposition
-
Normalizer::FORM_KC
(integer) - Normalization Form KC (NFKC) - Compatibility Decomposition, followed by Canonical Composition
-
Normalizer::FORM_KD
(integer) - Normalization Form KD (NFKD) - Compatibility Decomposition
-
Normalizer::NONE
(integer) - No decomposition/composition
-
Normalizer::OPTION_DEFAULT
(integer) - Default normalization options
Table of Contents
- Normalizer::getRawDecomposition — Gets the Decomposition_Mapping property for the given UTF-8 encoded code point
- Normalizer::isNormalized — Checks if the provided string is already in the specified normalization form
- Normalizer::normalize — Normalizes the input provided and returns the normalized string
English translation
You have asked to visit this site in English. For now, only the interface is translated, but not all the content yet.If you want to help me in translations, your contribution is welcome. All you need to do is register on the site, and send me a message asking me to add you to the group of translators, which will give you the opportunity to translate the pages you want. A link at the bottom of each translated page indicates that you are the translator, and has a link to your profile.
Thank you in advance.
Document created the 30/01/2003, last modified the 26/10/2018
Source of the printed document:https://www.gaudry.be/en/php-rf-class.normalizer.html
The infobrol is a personal site whose content is my sole responsibility. The text is available under CreativeCommons license (BY-NC-SA). More info on the terms of use and the author.
References
These references and links indicate documents consulted during the writing of this page, or which may provide additional information, but the authors of these sources can not be held responsible for the content of this page.
The author This site is solely responsible for the way in which the various concepts, and the freedoms that are taken with the reference works, are presented here. Remember that you must cross multiple source information to reduce the risk of errors.