Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering

This document defines a reference comparison method. This method is applicable to two or more character strings to determine their collating order in a sorted list. The method can be applied to strings containing characters from the full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire to produce ordering results valid (after tailoring) for a given set of languages for each script. This method uses collation tables derived either from the Common Template Tables (CTT) referenced by this document or from one of their tailoring. The format of the Common Template Table is described using the Backus-Naur Form (BNF). The format is used normatively within this document. This document also defines syntax elements to tailor these Common Template Tables used by the reference comparison method. Furthermore, it defines requirements for a declaration of the differences (delta) between a collation table and a given Common Template Table including the tailoring elements. These Common Template Tables describe an order for all characters encoded in the current and past ISO/IEC 10646 editions, including amendments. They allow for a specification of a fully deterministic ordering. These tables enable the specification of a string ordering adapted to local ordering rules, without requiring an implementer to have knowledge of all the different scripts already encoded in the Universal Coded Character Set (UCS). All these Common Template Tables have reference names which are related to a particular stage of development of the ISO/IEC 10646 Universal coded character set or a particular version of the Unicode Standard. These names and their relationship with ISO/IEC 10646 or the Unicode Standard repertoire are specified by an externally referenced document: Unicode Technical Standard, UTS #10, Unicode Collation Algorithm. This document does not: — mandate a specific comparison method; any equivalent method giving the same results is acceptable; — mandate a specific format for describing or tailoring tables in a given implementation; — mandate specific symbols to be used by implementations; — mandate any specific internal format for intermediate keys used when comparing, nor for the table used. The use of numeric keys is not mandated either; — mandate a context-dependent ordering; — mandate any particular preparation of character strings prior to comparison. NOTE 1 It is typical to do preparation of character strings prior to comparison even if it is not prescribed by this document (see Annex C). NOTE 2 Annex D describes problems that gave way to this document with their anticipated solutions.

Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement

Le présent document définit une méthode de comparaison de référence. Cette méthode est applicable à deux chaînes de caractères ou plus pour déterminer leur ordre de classement dans une liste triée. La méthode peut être appliquée aux chaînes contenant des caractères du répertoire complet de l'ISO/IEC 10646. Cette méthode est également applicable aux sous-ensembles de ce répertoire pour produire des résultats de tri valides (après adaptation) pour un ensemble donné de langues pour chaque script. Cette méthode de référence utilise des tables de tri dérivées soit des tables-modèles communes de classement définies dans le présent document, soit d’une de leurs adaptations. Le format de la table-modèle commune est décrit en notation BNF (Backus-Naur Form, Forme de Backus-Naur). Son emploi est normatif dans le présent document; Le présent document définit également les éléments de syntaxe pour adapter ces tables-modèles communes utilisées par la méthode de comparaison de référence. De plus, il définit les exigences relatives à une déclaration des différences (delta) entre une table de tri et une table-modèle commune donnée, y compris les éléments d'adaptation. Ces tables-modèles communes décrivent un ordre pour tous les caractères encodés dans les éditions actuelles et passées de l'ISO/IEC 10646, y compris les amendements. Elles permettent de spécifier un ordre complètement déterministe. Ces tables constituent le point de départ permettant de préciser un ordre de classement adapté aux règles de classement locales, sans qu’il soit nécessaire de connaître tous les systèmes d’écriture repris dans le jeu universel de caractères codés (JUC). Toutes ces tables-modèles communes comportent des noms de référence qui sont liés à un stade particulier de développement de l'ISO/IEC 10646 relative au jeu universel de caractères codés ou d'une version particulière du standard Unicode. Ces noms et leur relation avec l'ISO/IEC 10646 ou le répertoire du standard Unicode sont spécifiés par un document de référencement externe: Unicode Technical Standard, UTS #10, Unicode Collation Algorithm. Le présent document n'impose pas ce qui suit: — une méthode particulière de comparaison; toute méthode équivalente conduisant aux mêmes résultats est acceptable; — un format précis pour décrire ou pour adapter les tables dans une mise en œuvre donnée; — des symboles spécifiques à utiliser par les mises en œuvre; — un format interne particulier pour les clés intermédiaires utilisées dans les comparaisons ou pour la table de tri. L’utilisation de clés numériques n’est pas spécifiée non plus; — un ordre dépendant du contexte; — un prétraitement particulier des chaînes de caractères avant comparaison. NOTE 1 Bien que ceci ne soit pas spécifié par le présent document, il s’avère courant de préparer les chaînes de caractères avant leur comparaison (voir l’Annexe C). NOTE 2 L’Annexe D décrit les problèmes qui ont donné lieu au présent document avec leurs solutions anticipées.

General Information

Status: Published
Publication Date: 21-Jul-2025

ICS: 35.040.10 - Coding of character sets

Technical Committee: ISO/IEC JTC 1/SC 2 - Coded character sets
Drafting Committee: ISO/IEC JTC 1/SC 2 - Coded character sets

Current Stage: 6060 - International Standard published
Start Date: 22-Jul-2025
Due Date: 23-Jun-2026
Completion Date: 22-Jul-2025

Relations

Revises: ISO/IEC 14651:2020 - Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering
Effective Date: 01-Jul-2023

Overview

ISO/IEC 14651:2025 - Information technology - International string ordering and comparison - defines a reference method for comparing character strings to determine their collating order. The standard applies to strings containing characters from the full ISO/IEC 10646 (UCS) repertoire and to tailored subsets for specific languages and scripts. It relies on Common Template Tables (CTT) (referenced via Unicode Technical Standard UTS #10) and specifies a normative BNF format for those tables, plus syntax for tailoring and declaring deltas (differences) from a template.

Key topics and requirements

Reference comparison method: Builds ordering keys composed of subkeys and levels to determine string order (see Clause 6).
Common Template Tables (CTT): Describe deterministic orders for all UCS characters; templates are named relative to ISO/IEC 10646/Unicode versions. The CTT format is given in BNF and used normatively.
Tailoring and deltas: Mechanisms to adapt a template to local language rules; implementations must declare any deviations (deltas) from the referenced CTT.
Conformance declarations must include:
- name of the Common Template Table used;
- number of supported levels (minimum three);
- support for forward/backward processing parameters;
- the tailoring delta and its level coverage;
- any preparation method used for strings.
Non‑mandates: The standard does not mandate a specific algorithm, internal key format (numeric keys not required), symbol sets, context‑dependent ordering, or mandatory string preparation-provided equivalent results are produced.
Normative and informative annexes: Annex A (CTT), Annex B (example deltas), Annex C (preparation examples), Annex D (tutorial on lexical ordering issues), Annex E (searching/fuzzy matches).

Applications and who uses it

ISO/IEC 14651:2025 is essential for internationalization (i18n) and multilingual systems that require consistent, repeatable string ordering. Typical use cases and users:

Software and library developers implementing collation and sorting (databases, programming language runtimes, UI toolkits).
Database vendors and search/indexing engines that must sort and compare text across languages.
Operating system and file system developers implementing locale‑aware ordering.
Localization engineers and standards bodies defining language‑specific tailoring rules.
Applications needing reproducible, deterministic ordering for queries, lists, and user interfaces.

Related standards

ISO/IEC 10646 (Universal Coded Character Set - UCS)
Unicode Technical Standard UTS #10 (Unicode Collation Algorithm) - includes Common Template Tables and synchronization notes with ISO/IEC 14651
ISO/IEC 30112 (informative complement on ordering keywords)

Keywords: ISO/IEC 14651:2025, string ordering, collation, Common Template Tables, ISO/IEC 10646, Unicode Collation Algorithm, UTS #10, tailoring, delta, internationalization, collation table.

ISO/IEC 14651:2025 - Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering
Released:22. 07. 2025 - Page 1 preview

Standard

ISO/IEC 14651:2025 - Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering Released:22. 07. 2025

English language

51 pages

sale 15% off

Preview

sale 15% off

Preview

ISO/IEC 14651:2025 - Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement
Released:22. 07. 2025 - Page 1 preview

Standard

ISO/IEC 14651:2025 - Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement Released:22. 07. 2025

French language

49 pages

sale 15% off

Preview

sale 15% off

Preview

Get Certified

Connect with accredited certification bodies for this standard

BSI Group

BSI (British Standards Institution) is the business standards company that helps organizations make excellence a habit.

UKAS United Kingdom Verified

Visit Website

NYCE

Mexican standards and certification body.

EMA Mexico Verified

Visit Website

Frequently Asked Questions

What is ISO/IEC 14651:2025?

ISO/IEC 14651:2025 is a standard published by the International Organization for Standardization (ISO). Its full title is "Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering". This standard covers: This document defines a reference comparison method. This method is applicable to two or more character strings to determine their collating order in a sorted list. The method can be applied to strings containing characters from the full repertoire of ISO/IEC 10646. This method is also applicable to subsets of that repertoire to produce ordering results valid (after tailoring) for a given set of languages for each script. This method uses collation tables derived either from the Common Template Tables (CTT) referenced by this document or from one of their tailoring. The format of the Common Template Table is described using the Backus-Naur Form (BNF). The format is used normatively within this document. This document also defines syntax elements to tailor these Common Template Tables used by the reference comparison method. Furthermore, it defines requirements for a declaration of the differences (delta) between a collation table and a given Common Template Table including the tailoring elements. These Common Template Tables describe an order for all characters encoded in the current and past ISO/IEC 10646 editions, including amendments. They allow for a specification of a fully deterministic ordering. These tables enable the specification of a string ordering adapted to local ordering rules, without requiring an implementer to have knowledge of all the different scripts already encoded in the Universal Coded Character Set (UCS). All these Common Template Tables have reference names which are related to a particular stage of development of the ISO/IEC 10646 Universal coded character set or a particular version of the Unicode Standard. These names and their relationship with ISO/IEC 10646 or the Unicode Standard repertoire are specified by an externally referenced document: Unicode Technical Standard, UTS #10, Unicode Collation Algorithm. This document does not: — mandate a specific comparison method; any equivalent method giving the same results is acceptable; — mandate a specific format for describing or tailoring tables in a given implementation; — mandate specific symbols to be used by implementations; — mandate any specific internal format for intermediate keys used when comparing, nor for the table used. The use of numeric keys is not mandated either; — mandate a context-dependent ordering; — mandate any particular preparation of character strings prior to comparison. NOTE 1 It is typical to do preparation of character strings prior to comparison even if it is not prescribed by this document (see Annex C). NOTE 2 Annex D describes problems that gave way to this document with their anticipated solutions.

What is the scope of ISO/IEC 14651:2025?

What ICS categories does ISO/IEC 14651:2025 belong to?

ISO/IEC 14651:2025 is classified under the following ICS (International Classification for Standards) categories: 35.040.10 - Coding of character sets. The ICS classification helps identify the subject area and facilitates finding related standards.

What standards are related to ISO/IEC 14651:2025?

ISO/IEC 14651:2025 has the following relationships with other standards: It is inter standard links to ISO/IEC 14651:2020. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.

How can I access ISO/IEC 14651:2025?

ISO/IEC 14651:2025 is available in PDF format for immediate download after purchase. The document can be added to your cart and obtained through the secure checkout process. Digital delivery ensures instant access to the complete standard document.

Standards Content (Sample)

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.

Loading comments...

Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering

Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement

General Information

Relations

Overview

Key topics and requirements

Applications and who uses it

Related standards

ISO/IEC 14651:2025 - Information technology — International string ordering and comparison — Method for comparing character strings and description of the common template tailorable ordering Released:22. 07. 2025

ISO/IEC 14651:2025 - Technologies de l'information — Classement international et comparaison de chaînes de caractères — Méthode de comparaison de chaînes de caractères et description du modèle commun et adaptable d'ordre de classement Released:22. 07. 2025

Get Certified

BSI Group

NYCE

Frequently Asked Questions

Standards Content (Sample)

Questions, Comments and Discussion

This May Also Interest You