EN 13710:2011
European Ordering Rules - Ordering of characters from Latin, Greek, Cyrillic, Georgian and Armenian scripts
This European Standard specifies the order between two character strings composed of characters from the Modern European Scripts (MES) collection of ISO/IEC 10646:2003 or subsets of it.
NOTE Collection 283 Modern European Scripts (MES) of ISO/IEC 10646:2003 was originally specified in CEN Workshop Agreement 13873:2000 Multilingual European Subsets of ISO/IEC 10646 as Multilingual European Subset Number 3 and was subsequently incorporated as a collection in Annex A of ISO/IEC 10646:2003 alongside its sister collections MES-1 and MES-2.
The ordering rules specified in this European Standard are only applicable for lists of data in more than one European language and when this data is intended for a multicultural audience. They complement existing national standards or practices in the field.
Europäische Sortierregeln - Sortierung von lateinischen, griechischen, kyrillischen, georgischen und armenischen Schriftzeichen
This European Standard specifies the order between two character strings composed of characters from the Modern European Scripts (MES) collection of ISO/IEC 10646:2003 or subsets of it.
NOTE Collection 283 Modern European Scripts (MES) of ISO/IEC 10646:2003 was originally specified in CEN Workshop Agreement 13873:2000, Multilingual European Subsets of ISO/IEC 10646, as Multilingual European Subset Number 3 and was subsequently introduced as a collection in Annex A of ISO/IEC 10646:2003, alongside the related collections MES-1 and MES-2.
The rules for alphabetical arrangement given in this European Standard are intended only for data lists in more than one European language, and only when these data are intended for a multicultural audience. They complement existing national standards or established practices.
Règles de classement européen - Classement des caractères latins, grecs, cyrilliques, géorgiens et arméniens
This European Standard specifies the order between two character strings composed of characters from the Modern European Scripts (MES) collection of ISO/IEC 10646:2003 or its subsets.
NOTE Collection 283 Modern European Scripts (MES) of ISO/IEC 10646:2003 was originally specified in CEN Workshop Agreement 13873:2000, Multilingual European Subsets of ISO/IEC 10646, as Multilingual European Subset Number 3 and was subsequently incorporated as a collection in Annex A of ISO/IEC 10646:2003 together with its sister collections MES-1 and MES-2.
The ordering rules specified in this European Standard apply only to lists of data in several European languages, and when these data are intended for a multicultural audience. They complement existing national standards or practices in the field.
Evropska pravila za razpored - Razpored za latinsko, grško, cirilsko, gruzinsko in armensko pisavo
This European Standard specifies the order between two character strings composed of characters from the Modern European Scripts (MES) collection of ISO/IEC 10646:2003 or subsets of it.
NOTE Collection 283 Modern European Scripts of ISO/IEC 10646:2003 was originally specified in CEN Workshop Agreement 13873:2000, Multilingual European Subsets of ISO/IEC 10646, as Multilingual European Subset Number 3 and was subsequently included as a collection in Annex A of ISO/IEC 10646:2003, alongside its sister collections MES-1 and MES-2.
The ordering rules specified in this European Standard apply only to lists of data in more than one European language and when these data are intended for a multicultural audience. They complement existing national standards or practices in the field.
Standards Content (Sample)
SLOVENIAN STANDARD
SIST EN 13710:2011
1 May 2011
Replaces: SIST-TP CR 14400:2003, SIST ENV 13710:2003
This Slovenian standard is identical to: EN 13710:2011
ICS:
01.140.20 Information sciences
35.040 Character sets and information coding
en,de
2003-01. Slovenian Institute for Standardization. Reproduction of the whole or of parts of this standard is not permitted.
EUROPEAN STANDARD NORME EUROPÉENNE EUROPÄISCHE NORM
EN 13710
March 2011
ICS 01.140.20; 35.040
Supersedes CR 14400:2001, ENV 13710:2000
English Version
European Ordering Rules - Ordering of characters from Latin, Greek, Cyrillic, Georgian and Armenian scripts
Règles de classement européen - Classement des caractères latins, grecs, cyrilliques, géorgiens et arméniens
Europäische Sortierregeln - Sortierung von lateinischen, griechischen, kyrillischen, georgischen und armenischen Schriftzeichen
This European Standard was approved by CEN on 5 February 2011.
CEN members are bound to comply with the CEN/CENELEC Internal Regulations which stipulate the conditions for giving this European Standard the status of a national standard without any alteration. Up-to-date lists and bibliographical references concerning such national standards may be obtained on application to the CEN-CENELEC Management Centre or to any CEN member.
This European Standard exists in three official versions (English, French, German). A version in any other language made by translation under the responsibility of a CEN member into its own language and notified to the CEN-CENELEC Management Centre has the same status as the official versions.
CEN members are the national standards bodies of Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland and United Kingdom.
EUROPEAN COMMITTEE FOR STANDARDIZATION
COMITÉ EUROPÉEN DE NORMALISATION EUROPÄISCHES KOMITEE FÜR NORMUNG
Management Centre: Avenue Marnix 17, B-1000 Brussels
© 2011 CEN All rights of exploitation in any form and by any means reserved worldwide for CEN national Members.
Ref. No. EN 13710:2011: E
Annex A Principles behind the European Ordering Rules
Annex B (informative) Word-by-word ordering
Annex C (informative) Ordering by position and by style
Annex D (informative) Mixed-script ordering with one predominant script
Annex E (informative) Defining National Deltas based on the EOR
Annex F (informative) Modern European Scripts / MES
Annex G (informative) EOR Delta in LDML Syntax
Bibliography
ENV 13710:2000 mainly as follows:
a) ENV 13710:2000 and CR 14400:2001 have been consolidated;
b) the document has been partly revised and has been brought up to date.
According to the CEN/CENELEC Internal Regulations, the national standards organizations of the following countries are bound to implement this European Standard: Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia, Spain, Sweden, Switzerland and United Kingdom.
[ISO/IEC 10646:2003]
3.2 character string
sequence of characters considered as a single object
[ISO/IEC 14651:2007]
3.3 collating symbol
symbol used to specify weights assigned to a collating element
[ISO/IEC 14651:2007]
3.4 collating element
sequence of one or more characters that are considered a single entity for ordering
[ISO/IEC 14651:2007]
[ISO/IEC 14651:2007]
NOTE A special collation table is the Common Template Table (CTT) used in Annex A of ISO/IEC 14651 to express the default mapping from collating elements to weighting elements.
3.6 delta
list of the differences between a given collation table and another one
[ISO/IEC 14651]
NOTE The given collation table, together with a given delta, forms a new collation table. Unless otherwise specified in the European Standard, the term "delta" always refers to differences from the Common Template Table as defined in ISO/IEC 14651.
3.7 ordering
process by which, given two strings, it is determined whether the first one is less than, equal to, or greater than the second one
[ISO/IEC 14651:2007]
3.8 sorting
presentation of information in a structured way
NOTE Sorting may include the subdivision of information by subject matters, e.g. by having several registers in a book, by splitting a phone book into several sections, one for each town that falls into its purview or by having multiple indices in a library. Ordering is in most circumstances an integral part of this procedure.
4 Conformance
In order to be conformant to this European Standard an application shall meet the requirements prescribed in ISO/IEC 14651:2007, Clause 6 and its Common Template Table ISO14651_2006_TABLE1 after the application of the EOR delta table specified in Clause 6 of this European Standard. An equivalent description of the resulting tailored table shall equally conform to this European Standard.
5 Tailorability
The European Ordering Rules defined in this European Standard can be taken as a default template which can be tailored to the needs of any European country in the manner specified by ISO/IEC 14651 (cf. also informative Annex E). This European Standard is not meant to influence national standards or traditions in the field of ordering, its scope being the ordering of multilingual data. Nonetheless, national standards are encouraged to express their national ordering rules on this European Standard by declaring a formalized set of deviation rules ("delta"), as explained in Informative Annex E. This way, the respective ordering rules are automatically machine-processable and can be incorporated into international repositories of locale data, allowing for more widespread support of national ordering standards across software products.
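To illustrate the terms "delta" and "ordering" defined above, the following is a minimal Python sketch. It assumes a toy single-level collation table; the weights and the names `BASE_TABLE`, `DELTA`, `apply_delta`, `sort_key` and `order` are hypothetical and are not taken from the Common Template Table or from the EOR delta.

```python
from typing import Dict, Tuple

# Toy single-level collation table: collating element -> primary weight.
# Hypothetical values for illustration only; not taken from ISO/IEC 14651.
BASE_TABLE: Dict[str, int] = {"a": 1, "b": 2, "c": 3, "z": 26, "ä": 27}

# A delta lists only the differences from the base table.
# Here "ä" receives the same weight as "a" (an assumed tailoring).
DELTA: Dict[str, int] = {"ä": 1}


def apply_delta(table: Dict[str, int], delta: Dict[str, int]) -> Dict[str, int]:
    """Return a new collation table: the given table plus the delta."""
    tailored = dict(table)
    tailored.update(delta)
    return tailored


def sort_key(text: str, table: Dict[str, int]) -> Tuple[int, ...]:
    """Map each character to its weight; unknown characters sort last."""
    return tuple(table.get(ch, 10**6) for ch in text)


def order(first: str, second: str, table: Dict[str, int]) -> int:
    """Three-way comparison: -1 if first < second, 0 if equal, 1 if greater."""
    k1, k2 = sort_key(first, table), sort_key(second, table)
    return (k1 > k2) - (k1 < k2)


TAILORED = apply_delta(BASE_TABLE, DELTA)
```

With these assumed weights, `order("äb", "az", BASE_TABLE)` reports "greater", while the same call with `TAILORED` reports "less", because the delta changed only the weight of "ä". The base table itself is left unmodified.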
reorder-after
% Introduce the LIG weight.
collating-symbol
reorder-end
reorder-after
% Introduce more variants
collating-symbol
collating-symbol
collating-symbol
collating-symbol
collating-symbol
collating-symbol
reorder-end
reorder-after
% Introduce a weight for U0587 ARMENIAN SMALL LIGATURE ECH YIWN
collating-symbol
reorder-end
reorder-after
order_start forward;forward;forward;forward
% Non-alphanumeric characters (including some modifier letters):
% The DRACHMA SIGN is already in ISO14651_2006 ignorable on levels 1-3
IGNORE;IGNORE;IGNORE; % DOLLAR SIGN
IGNORE;IGNORE;IGNORE; % CENT SIGN
IGNORE;IGNORE;IGNORE; % POUND SIGN
IGNORE;IGNORE;IGNORE; % CURRENCY SIGN
IGNORE;IGNORE;IGNORE; % YEN SIGN
IGNORE;IGNORE;IGNORE; % EURO-CURRENCY SIGN
% Modifier letters that are not ignorable in ISO14651_2006_TABLE1_en.txt
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL H
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL H WITH HOOK
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL J
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL R
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL TURNED R
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL TURNED R WITH HOOK
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL CAPITAL INVERTED R
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL W
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL Y
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER TURNED COMMA
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER APOSTROPHE
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER REVERSED COMMA
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER RIGHT HALF RING
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER LEFT HALF RING
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER GLOTTAL STOP
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER REVERSED GLOTTAL STOP
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER TRIANGULAR COLON
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER HALF TRIANGULAR COLON
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL GAMMA
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL L
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL S
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
IGNORE;IGNORE;IGNORE; % MODIFIER LETTER DOUBLE APOSTROPHE
IGNORE;IGNORE;IGNORE; % LATIN LETTER GLOTTAL STOP
IGNORE;IGNORE;IGNORE; % LATIN LETTER PHARYNGEAL VOICED FRICATIVE
IGNORE;IGNORE;IGNORE; % LATIN LETTER INVERTED GLOTTAL STOP
IGNORE;IGNORE;IGNORE; % LATIN LETTER BILABIAL CLICK
IGNORE;IGNORE;IGNORE; % LATIN LETTER GLOTTAL STOP WITH STROKE
%%
% Latin
% Almost all changes here result from CEN/TC304's resolution
% for the Latin script part of the Modern European Scripts / MES-3 to
% treat only the letters a to z and thorn as distinct on the first
% level and treat other combinations as variants or ligatures
;"";""; % LATIN SMALL LETTER TURNED A
;"";""; % LATIN SMALL LETTER ALPHA
;"";""; % LATIN SMALL LETTER TURNED ALPHA
;"";""; % LATIN LETTER SMALL CAPITAL B
;"";""; % LATIN SMALL LETTER B WITH STROKE
;"";""; % LATIN CAPITAL LETTER B WITH STROKE
;"";""; % LATIN SMALL LETTER B WITH HOOK
;"";""; % LATIN CAPITAL LETTER B WITH HOOK
;"";""; % LATIN SMALL LETTER B WITH TOPBAR
;"";""; % LATIN CAPITAL LETTER B WITH TOPBAR
;"";""; % LATIN SMALL LETTER C WITH HOOK
;"";""; % LATIN CAPITAL LETTER C WITH HOOK
;"";""; % LATIN SMALL LETTER C WITH CURL
;"";""; % LATIN LETTER STRETCHED C
% is used for U00F0 LATIN SMALL LETTER ETH (already in CTT)
;"";""; % LATIN SMALL LETTER D WITH TAIL
;"";""; % LATIN CAPITAL LETTER AFRICAN D
;"";""; % LATIN SMALL LETTER D WITH HOOK
;"";""; % LATIN CAPITAL LETTER D WITH HOOK
;"";""; % LATIN SMALL LETTER D WITH TOPBAR
;"";""; % LATIN CAPITAL LETTER D WITH TOPBAR
;"";""; % LATIN SMALL LETTER D WITH CURL
;"";""; % LATIN SMALL LETTER TURNED DELTA
"";"";""; % LATIN SMALL LETTER DZ DIGRAPH WITH CURL
"";"";""; % LATIN SMALL LETTER DEZH DIGRAPH
;"";""; % LATIN SMALL LETTER TURNED E
;"";""; % LATIN CAPITAL LETTER REVERSED E
;"";""; % LATIN SMALL LETTER SCHWA
;"";""; % LATIN CAPITAL LETTER SCHWA
;"";""; % LATIN SMALL LETTER OPEN E
;"";""; % LATIN CAPITAL LETTER OPEN E
;"";""; % LATIN SMALL LETTER REVERSED E
;"";""; % LATIN SMALL LETTER SCHWA WITH HOOK
;"";""; % LATIN SMALL LETTER REVERSED OPEN E
;"";""; % LATIN SMALL LETTER REVERSED OPEN E WITH HOOK
;"";""; % LATIN SMALL LETTER CLOSED REVERSED OPEN E
;"";""; % LATIN SMALL LETTER F WITH HOOK
;"";""; % LATIN CAPITAL LETTER F WITH HOOK
;"";""; % LATIN SMALL LETTER SCRIPT G
;"";""; % LATIN LETTER SMALL CAPITAL G
;"";""; % LATIN SMALL LETTER G WITH STROKE
;"";""; % LATIN CAPITAL LETTER G WITH STROKE
;"";""; % LATIN SMALL LETTER G WITH HOOK
;"";""; % LATIN CAPITAL LETTER G WITH HOOK
;"";""; % LATIN LETTER SMALL CAPITAL G WITH HOOK
;"";""; % LATIN SMALL LETTER GAMMA
;"";""; % LATIN CAPITAL LETTER GAMMA
;"";""; % LATIN SMALL LETTER RAMS HORN
;"";""; % LATIN SMALL LETTER OI
;"";""; % LATIN CAPITAL LETTER OI
;"";""; % LATIN LETTER SMALL CAPITAL H
;"";""; % LATIN SMALL LETTER H WITH HOOK
;"";""; % LATIN SMALL LETTER HENG WITH HOOK
;"";""; % LATIN SMALL LETTER TURNED H
;"";""; % LATIN SMALL LETTER TURNED H WITH FISHHOOK
;"";""; % LATIN SMALL LETTER TURNED H WITH FISHHOOK AND TAIL
"";"";""; % LATIN SMALL LETTER HV
"";"";""; % LATIN CAPITAL LETTER HWAIR
;"";""; % LATIN SMALL LETTER DOTLESS I
;"";""; % LATIN LETTER SMALL CAPITAL I
;"";""; % LATIN SMALL LETTER I WITH STROKE
;"";""; % LATIN CAPITAL LETTER I WITH STROKE
;"";""; % LATIN SMALL LETTER IOTA
;"";""; % LATIN CAPITAL LETTER IOTA
;"";""; % LATIN SMALL LETTER J WITH CROSSED-TAIL
;"";""; % LATIN SMALL LETTER DOTLESS J WITH STROKE
;"";""; % LATIN SMALL LETTER DOTLESS J WITH STROKE AND HOOK
;"";""; % LATIN SMALL LETTER K WITH HOOK
;"";""; % LATIN CAPITAL LETTER K WITH HOOK
;"";""; % LATIN SMALL LETTER KRA
;"";""; % LATIN SMALL LETTER TURNED K
% is used for U0140 LATIN SMALL LETTER L WITH MIDDLE DOT (already in CTT)
;"";""; % LATIN LETTER SMALL CAPITAL L
;"";""; % LATIN SMALL LETTER L WITH BAR
;"";""; % LATIN CAPITAL LETTER L WITH BAR
;"";""; % LATIN SMALL LETTER L WITH MIDDLE TILDE
;"";""; % LATIN SMALL LETTER L WITH BELT
;"";""; % LATIN SMALL LETTER L WITH RETROFLEX HOOK
;"";""; % LATIN SMALL LETTER L WITH CURL
"";"";""; % LATIN SMALL LETTER LEZH
;"";""; % LATIN SMALL LETTER M WITH HOOK
;"";""; % LATIN SMALL LETTER TURNED M
;"";""; % LATIN CAPITAL LETTER TURNED M
;"";""; % LATIN SMALL LETTER TURNED M WITH LONG LEG
;"";""; % LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
;"";""; % LATIN LETTER SMALL CAPITAL N
;"";""; % LATIN SMALL LETTER N WITH LEFT HOOK
;"";""; % LATIN CAPITAL LETTER N WITH LEFT HOOK
;"";""; % LATIN SMALL LETTER N WITH LONG RIGHT LEG
;"";""; % LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
;"";""; % LATIN SMALL LETTER N WITH RETROFLEX HOOK
;"";""; % LATIN SMALL LETTER N WITH CURL
;"";""; % LATIN SMALL LETTER ENG
;"";""; % LATIN CAPITAL LETTER ENG
% is used for U0153 LATIN SMALL LIGATURE OE (already in CTT)
;"";""; % LATIN SMALL LETTER OPEN O
;"";""; % LATIN CAPITAL LETTER OPEN O
;"";""; % LATIN SMALL LETTER BARRED O
;"";""; % LATIN CAPITAL LETTER O WITH MIDDLE TILDE
;"";""; % LATIN SMALL LETTER CLOSED OMEGA
;"";""; % LATIN SMALL LETTER OU
;"";""; % LATIN CAPITAL LETTER OU
"";"";""; % LATIN LETTER SMALL CAPITAL OE
;"";""; % LATIN SMALL LETTER P WITH HOOK
;"";""; % LATIN CAPITAL LETTER P WITH HOOK
;"";""; % LATIN SMALL LETTER PHI
;"";""; % LATIN SMALL LETTER Q WITH HOOK
;"";""; % LATIN LETTER SMALL CAPITAL R
;"";""; % LATIN LETTER YR
;"";""; % LATIN SMALL LETTER TURNED R
;"";""; % LATIN SMALL LETTER TURNED R WITH LONG LEG
;"";""; % LATIN SMALL LETTER TURNED R WITH HOOK
;"";""; % LATIN SMALL LETTER R WITH LONG LEG
;"";""; % LATIN SMALL LETTER R WITH TAIL
;"";""; % LATIN SMALL LETTER R WITH FISHHOOK
;"";""; % LATIN SMALL LETTER REVERSED R WITH FISHHOOK
;"";""; % LATIN LETTER SMALL CAPITAL INVERTED R
% is used for U00DF LATIN SMALL LETTER SHARP S (already in CTT)
% is used for U017F LATIN SMALL LETTER LONG S (already in CTT)
;"";""; % LATIN SMALL LETTER S WITH HOOK
;"";""; % LATIN SMALL LETTER T WITH STROKE
;"";""; % LATIN CAPITAL LETTER T WITH STROKE
;"";""; % LATIN SMALL LETTER T WITH PALATAL HOOK
;"";""; % LATIN SMALL LETTER T WITH HOOK
;"";""; % LATIN CAPITAL LETTER T WITH HOOK
;"";""; % LATIN SMALL LETTER T WITH RETROFLEX HOOK
;"";""; % LATIN CAPITAL LETTER T WITH RETROFLEX HOOK
;"";""; % LATIN SMALL LETTER T WITH CURL
;"";""; % LATIN SMALL LETTER TURNED T
"";"";""; % LATIN SMALL LETTER TC DIGRAPH WITH CURL
;"";""; % LATIN SMALL LETTER U BAR
;"";""; % LATIN CAPITAL LETTER U BAR
;"";""; % LATIN SMALL LETTER UPSILON
;"";""; % LATIN CAPITAL LETTER UPSILON
;"";""; % LATIN SMALL LETTER V WITH HOOK
;"";""; % LATIN CAPITAL LETTER V WITH HOOK
;"";""; % LATIN SMALL LETTER TURNED V
;"";""; % LATIN CAPITAL LETTER TURNED V
;"";""; % LATIN SMALL LETTER TURNED W
;"";""; % LATIN LETTER WYNN
;"";""; % LATIN CAPITAL LETTER WYNN
;"";""; % LATIN LETTER SMALL CAPITAL Y
;"";""; % LATIN SMALL LETTER Y WITH HOOK
;"";""; % LATIN CAPITAL LETTER Y WITH HOOK
;"";""; % LATIN SMALL LETTER TURNED Y
;"";""; % LATIN SMALL LETTER YOGH
;"";""; % LATIN CAPITAL LETTER YOGH
;"";""; % LATIN SMALL LETTER Z WITH STROKE
;"";""; % LATIN CAPITAL LETTER Z WITH STROKE
;"";""; % LATIN SMALL LETTER Z WITH HOOK
;"";""; % LATIN CAPITAL LETTER Z WITH HOOK
;"";""; % LATIN SMALL LETTER Z WITH RETROFLEX HOOK
;"";""; % LATIN SMALL LETTER Z WITH CURL
;"";""; % LATIN SMALL LETTER EZH
;"";""; % LATIN CAPITAL LETTER EZH
;"";""; % LATIN SMALL LETTER EZH WITH CARON
;"";""; % LATIN CAPITAL LETTER EZH WITH CARON
;"";""; % LATIN SMALL LETTER EZH REVERSED
;"";""; % LATIN SMALL LETTER EZH WITH CURL
% Greek
% ISO14651_2006_TABLE1_en.txt now contains the tailorings of CR 14400 in its CTT
% Full conformance with GOST requirements for Cyrillic letters
;"";""; % CYRILLIC SMALL LETTER GJE
;"";""; % CYRILLIC CAPITAL LETTER GJE
;"";""; % CYRILLIC SMALL LETTER KJE
;"";""; % CYRILLIC CAPITAL LETTER KJE
% Georgian: Identical to ISO14651_2006_TABLE1_en.txt
% Armenian:
;;; % ARMENIAN SMALL LIGATURE ECH YIWN
reorder-end
%% for EOR's EORDeltaTable
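The entries marked `IGNORE;IGNORE;IGNORE;` carry no weight on levels 1 to 3, so such characters can only break ties on the fourth level. The Python sketch below illustrates that idea with a toy four-level comparison in which every level is scanned forward. The weight values and the names `WEIGHTS`, `IGNORE` and `multilevel_key` are hypothetical and far simpler than the actual tables of ISO/IEC 14651.

```python
from typing import Dict, List, Optional, Tuple

IGNORE = None  # marks a level on which a character carries no weight

# Toy weights for levels 1 to 4 per character; assumed values only.
WEIGHTS: Dict[str, Tuple[Optional[int], ...]] = {
    "a": (10, 1, 1, 1),
    "A": (10, 1, 2, 1),  # same letter as "a", differs on level 3
    "b": (11, 1, 1, 1),
    "$": (IGNORE, IGNORE, IGNORE, 5),  # ignorable on levels 1-3
}


def multilevel_key(text: str) -> Tuple[Tuple[int, ...], ...]:
    """Build one weight sequence per level, scanning forward on every level."""
    levels: List[List[int]] = [[], [], [], []]
    for ch in text:
        for level, weight in enumerate(WEIGHTS[ch]):
            if weight is not IGNORE:
                levels[level].append(weight)
    # Tuples compare element by element, so level 1 decides first,
    # then level 2, and so on down to level 4.
    return tuple(tuple(level) for level in levels)
```

Under these assumptions, `"a$b"` and `"ab"` have identical keys on levels 1 to 3 and are separated only on level 4, whereas `"Ab"` already differs from both on level 3.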
Principles behind the European Ordering Rules
A.0 Introduction
This annex aims to present the information inherent in Clause 6 in a more accessible form for those who are interested in the principles guiding the composition of the table. Those readers not concerned with implementation details may take this more traditional treatment of the matter as an authoritative interpretation of the body of this European Standard.
A.1 Terms and definitions
For the purpose of this annex, the following terms and definitions apply in addition to those in the body of this European Standard (see Clause 3).
A.1.1 digit
any of the characters 0 (U0030), 1 (U0031), 2 (U0032), 3 (U0033), 4 (U0034), 5 (U0035), 6 (U0036), 7 (U0037), 8 (U0038), 9 (U0039)
A.1.2 letter
character used to represent (either alone or in combination) sounds or sequences of sounds of a natural language in writing
NOTE Here equivalent to all characters of the Multilingual European Subset No 3 whose name contains one of the words LETTER or LIGATURE.
A.1.3 first level letter
character that is a member of the following list of letters:
Latin script:
a (U0061), A (U0041), b (U0062), B (U0042), c (U0063), C (U0043), d (U0064), D (U0044), e (U0065), E (U0045), f (U0066), F (U0046), g (U0067), G (U0047), h (U0068), H (U0048), i (U0069), I (U0049), j (U006A), J (U004A), k (U006B), K (U004B), l (U006C), L (U004C), m (U006D), M (U004D), n (U006E), N (U004E), o (U006F), O (U004F), p (U0070), P (U0050), q (U0071), Q (U0051), r (U0072), R (U0052), s (U0073), S (U0053), t (U0074), T (U0054), u (U0075), U (U0055), v (U0076), V (U0056), w (U0077), W (U0057), x (U0078), X (U0058), y (U0079), Y (U0059), z (U007A), Z (U005A), þ (U00FE), Þ (U00DE)
Greek script:
α (U03B1), Α (U0391), β (U03B2), Β (U0392), γ (U03B3), Γ (U0393), δ (U03B4), Δ (U0394), ε (U03B5), Ε (U0395), Ϝ (U03DC), Ϛ (U03DA), ζ (U03B6), Ζ (U0396), η (U03B7), Η (U0397), θ (U03B8), Θ (U0398), ι (U03B9), Ι (U0399), κ (U03BA), Κ (U039A), λ (U03BB), Λ (U039B), μ (U03BC), Μ (U039C), ν (U03BD), Ν (U039D), ξ (U03BE), Ξ (U039E), ο (U03BF), Ο (U039F), π (U03C0), Π (U03A0), Ϟ (U03DE), ρ (U03C1), Ρ (U03A1), σ (U03C3), Σ (U03A3), τ (U03C4), Τ (U03A4), υ (U03C5), Υ (U03A5), φ (U03C6), Φ (U03A6), χ (U03C7), Χ (U03A7), ψ (U03C8), Ψ (U03A8), ω (U03C9), Ω (U03A9), Ϡ (U03E0)
NOTE 1 Stigma (U03DA / U03DB), Koppa / Qoppa (U03DE / U03DF) and Sampi (U03E0 / U03E1) are archaic letters that are currently used to designate numerals. Digamma (U03DC) is not used in any modern language.
NOTE 2 Through collection 9 "Greek Symbols and Coptic" of ISO/IEC 10646:2003, MES also contains a number of Coptic letters. Their order is specified in ISO/IEC 14651.
Cyrillic script:
а (U0430), А (U0410), ӑ (U04D1), Ӑ (U04D0), ӓ (U04D3), Ӓ (U04D2), ә (U04D9), Ә (U04D8), ӛ (U04DB), Ӛ (U04DA), ӕ (U04D5), Ӕ (U04D4), б (U0431), Б (U0411), в (U0432), В (U0412), г (U0433), Г (U0413), ғ (U0493), Ғ (U0492), ҕ (U0495), Ҕ (U0494), д (U0434), Д (U0414), ђ (U0452), Ђ (U0402), ҙ (U0499), Ҙ (U0498), е (U0435), Е (U0415), ӗ (U04D7), Ӗ (U04D6), є (U0454), Є (U0404), ж (U0436), Ж (U0416), ӝ (U04DD), Ӝ (U04DC), җ (U0497), Җ (U0496), з (U0437), З (U0417), ӟ (U04DF), Ӟ (U04DE), ѕ (U0455), Ѕ (U0405), ӡ (U04E1), Ӡ (U04E0), и (U0438), И (U0418), ӥ (U04E5), Ӥ (U04E4), і (U0456), І (U0406), ї (U0457), Ї (U0407), й (U0439), Й (U0419), ҋ (U048B), Ҋ (U048A), ј (U0458), Ј (U0408), к (U043A), К (U041A), қ (U049B), Қ (U049A), ӄ (U04C4), Ӄ (U04C3), ҡ (U04A1), Ҡ (U04A0), ҟ (U049F), Ҟ (U049E), ҝ (U049D), Ҝ (U049C), л (U043B), Л (U041B), ӆ (U04C6), Ӆ (U04C5), љ (U0459), Љ (U0409), м (U043C), М (U041C), ӎ (U04CE), Ӎ (U04CD), н (U043D), Н (U041D), ӊ (U04CA), Ӊ (U04C9), ң (U04A3), Ң (U04A2), ӈ (U04C8), Ӈ (U04C7), ҥ (U04A5), Ҥ (U04A4), њ (U045A), Њ (U040A), о (U043E), О (U041E), ӧ (U04E7), Ӧ (U04E6), ө (U04E9), Ө (U04E8), ӫ (U04EB), Ӫ (U04EA), п (U043F), П (U041F), ҧ (U04A7), Ҧ (U04A6), ҁ (U0481), Ҁ (U0480), р (U0440), Р (U0420), ҏ (U048F), Ҏ (U048E), с (U0441), С (U0421), ҫ (U04AB), Ҫ (U04AA), т (U0442), Т (U0422), ҭ (U04AD), Ҭ (U04AC), ћ (U045B), Ћ (U040B), у (U0443), У (U0423), ў (U045E), Ў (U040E), ӱ (U04F1), Ӱ (U04F0), ӳ (U04F3), Ӳ (U04F2), ү (U04AF), Ү (U04AE), ұ (U04B1), Ұ (U04B0), ѹ (U0479), Ѹ (U0478), ф (U0444), Ф (U0424), х (U0445), Х (U0425), ҳ (U04B3), Ҳ (U04B2), һ (U04BB), Һ (U04BA), ѡ (U0461), Ѡ (U0460), ѿ (U047F), Ѿ (U047E), ѽ (U047D), Ѽ (U047C), ѻ (U047B), Ѻ (U047A), ц (U0446), Ц (U0426), ҵ (U04B5), Ҵ (U04B4), ч (U0447), Ч (U0427), ӵ (U04F5), Ӵ (U04F4), ҷ (U04B7), Ҷ (U04B6), ӌ (U04CC), Ӌ (U04CB), ҹ (U04B9), Ҹ (U04B8), ҽ (U04BD), Ҽ (U04BC), ҿ (U04BF), Ҿ (U04BE), џ (U045F), Џ (U040F), ш (U0448), Ш (U0428), щ (U0449), Щ (U0429), ъ (U044A), Ъ (U042A), ы (U044B), Ы (U042B), ӹ (U04F9), Ӹ (U04F8), ь (U044C), Ь (U042C), ҍ (U048D), Ҍ (U048C), ѣ (U0463), Ѣ (U0462), э (U044D), Э (U042D), ӭ (U04ED), Ӭ (U04EC), ю (U044E), Ю (U042E), я (U044F), Я (U042F), ѥ (U0465), Ѥ (U0464), ѧ (U0467), Ѧ (U0466), ѫ (U046B), Ѫ (U046A), ѩ (U0469), Ѩ (U0468), ѭ (U046D), Ѭ (U046C), ѯ (U046F), Ѯ (U046E), ѱ (U0471), Ѱ (U0470), ѳ (U0473), Ѳ (U0472), ѵ (U0475), Ѵ (U0474), ѷ (U0477), Ѷ (U0476), ҩ (U04A9), Ҩ (U04A8), Ӏ (U04C0)
Georgian script:
ა (U10D0), ბ (U10D1), გ (U10D2), დ (U10D3), ე (U10D4), ვ (U10D5), ზ (U10D6), ჱ (U10F1), თ (U10D7), ი (U10D8), კ (U10D9), ლ (U10DA), მ (U10DB), ნ (U10DC), ჲ (U10F2), ო (U10DD), პ (U10DE), ჟ (U10DF), რ (U10E0), ს (U10E1), ტ (U10E2), ჳ (U10F3), უ (U10E3), ფ (U10E4), ქ (U10E5), ღ (U10E6), ყ (U10E7), შ (U10E8), ჩ (U10E9), ც (U10EA), ძ (U10EB), წ (U10EC), ჭ (U10ED), ხ (U10EE), ჴ (U10F4), ჯ (U10EF), ჰ (U10F0), ჵ (U10F5), ჶ (U10F6), ჷ (U10F7), ჸ (U10F8)
ჳ (U10F3), ჴ (U10F4), ჵ (U10F5) and ჶ (U10F6) are today considered archaic letters.
Armenian script:
ա (U0561), Ա (U0531), բ (U0562), Բ (U0532), գ (U0563), Գ (U0533), դ (U0564), Դ (U0534), ե (U0565), Ե (U0535), զ (U0566), Զ (U0536), է (U0567), Է (U0537), ը (U0568), Ը (U0538), թ (U0569), Թ (U0539), ժ (U056A), Ժ (U053A), ի (U056B), Ի (U053B), լ (U056C), Լ (U053C), խ (U056D), Խ (U053D), ծ (U056E), Ծ (U053E), կ (U056F), Կ (U053F), հ (U0570), Հ (U0540), ձ (U0571), Ձ (U0541), ղ (U0572), Ղ (U0542), ճ (U0573), Ճ (U0543), մ (U0574), Մ (U0544), յ (U0575), Յ (U0545), ն (U0576), Ն (U0546), շ (U0577), Շ (U0547), ո (U0578), Ո (U0548), չ (U0579), Չ (U0549), պ (U057A), Պ (U054A), ջ (U057B), Ջ (U054B), ռ (U057C), Ռ (U054C), ս (U057D), Ս (U054D), վ (U057E), Վ (U054E), տ (U057F), Տ (U054F), ր (U0580), Ր (U0550), ց (U0581), Ց (U0551), ւ (U0582), Ւ (U0552), փ (U0583), Փ (U0553), ք (U0584), Ք (U0554), և (U0587), օ (U0585), Օ (U0555), ֆ (U0586), Ֆ (U0556)
A.1.4 diacritical mark
any of a number of recurring graphical structures placed over, under or next to a first level letter which does not significantly modify the shape of the first level letter itself and which in combination with that first level letter is a valid letter
NOTE These structures modify meaning or pronunciation or some other feature of the first level letter. The diacritical marks which are relevant to this European Standard are listed in A.8.1.
A.1.5 letter with diacritical marks
letter which can be seen as equivalent to the combination between a first level letter and one or more diacritical marks
NOTE Some letters with diacritical marks are treated as first level letters in some languages, e.g. ä in Swedish and ñ in Spanish. However, these are subject to national standards or local practices which are outside the scope of this European Standard.
NOTE Very few Latin letters such as ǻ (U01FB) have more than one diacritical mark. A considerable number of Greek letters have more than one diacritical mark.
A.1.6 equivalent letter form
character created by joining two or more distinct first level letters or two or more letters with diacritical marks or any combination of these
NOTE Examples for equivalent letter forms in MES are the LATIN SMALL LIGATURE FI (UFB01), LATIN SMALL LIGATURE FL (UFB02), and the Croatian dz (U01F2 / U01F3).
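As an illustration of A.1.4 and A.1.5, the Python sketch below uses the standard `unicodedata` module to split a letter into its base characters and its combining marks. Using Unicode canonical decomposition (NFD) for this purpose is an assumption made for illustration only: this European Standard refers to its own list of diacritical marks in A.8.1, which may not coincide with Unicode decompositions. The function name `split_diacritics` is hypothetical.

```python
import unicodedata
from typing import List, Tuple


def split_diacritics(letter: str) -> Tuple[str, List[str]]:
    """Return (base characters, names of combining marks) after NFD."""
    decomposed = unicodedata.normalize("NFD", letter)
    # unicodedata.combining() returns 0 for non-combining characters.
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    marks = [unicodedata.name(ch) for ch in decomposed if unicodedata.combining(ch)]
    return base, marks
```

For example, `split_diacritics("ä")` yields the base `a` with one mark, and U01FB yields the base `a` with two marks, which matches the NOTE about letters carrying more than one diacritical mark. For the equivalent letter forms of A.1.6, compatibility decomposition is the closer analogue: `unicodedata.normalize("NFKD", "\ufb01")` expands the ligature UFB01 to `fi`.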
NOTE 1 This definition works for the repertoire of MES, but not necessarily for the full repertoire of the UCS.
NOTE 2 A capital letter is also known as an uppercase letter.
NOTE 3 For the first level letters these are:
Latin script:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Þ
Greek script:
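Cyrillic script:
А Ӑ Ӓ Ә Ӛ Ӕ Б В Г Ғ Ҕ Д Ђ Ҙ Е Ӗ Є Ж Ӝ Җ З Ӟ Ѕ Ӡ И Ӥ І Ї Й Ҋ Ј К Қ Ӄ Ҡ Ҟ Ҝ Л Ӆ Љ М Ӎ Н Ӊ Ң Ӈ Ҥ Њ О Ӧ Ө Ӫ П Ҧ Ҁ Р Ҏ С Ҫ Т Ҭ Ћ У Ў Ӱ Ӳ Ү Ұ Ѹ Ф Х Ҳ Һ Ѡ Ѿ Ѽ Ѻ Ц Ҵ Ч Ӵ Ҷ Ӌ Ҹ Ҽ Ҿ Џ Ш Щ Ъ Ы Ӹ Ь Ҍ Ѣ Э Ӭ Ю Я Ѥ Ѧ Ѫ Ѩ Ѭ Ѯ Ѱ Ѳ Ѵ Ѷ Ҩ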
Georgian script:
(none in MES)
NOTE 4 The function of capital letters in Georgian differs significantly from the function of capital letters in the other four scripts. The Georgian letters which ISO/IEC 10646 calls GEORGIAN CAPITAL LETTER make up the asomtavruli script that is primarily used in Old Georgian texts. The remaining letters (classified simply as GEORGIAN LETTER in ISO/IEC 10646) are usually identified with the mxedruli or military script that is used almost exclusively for writing modern Georgian. For this reason the MES collection only comprises the "Basic Georgian" collection (10D0-10FF) with the mxedruli script.
Armenian script:
Ա Բ Գ Դ Ե Զ Է Ը Թ Ժ Ի Լ Խ Ծ Կ Հ Ձ Ղ Ճ Մ Յ Ն Շ Ո Չ Պ Ջ Ռ Ս Վ Տ Ր Ց Ւ Փ Ք Օ Ֆ
A.1.9 small letter
letter which is not a capital letter
NOTE A small letter is also known as a lowercase letter.
A.1.10 special character
character that is neither a letter nor a digit
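The definitions of digit (A.1.1), letter (A.1.2) and special character (A.1.10) can be sketched as a small classifier. The Python example below is a hedged illustration: it applies the name-based rule from the NOTE in A.1.2 to whatever character names Python's `unicodedata` module provides, without restricting the input to the MES repertoire, and the function name `classify` is hypothetical.

```python
import unicodedata

DIGITS = set("0123456789")  # A.1.1: U0030 to U0039


def classify(ch: str) -> str:
    """Classify a single character as 'digit', 'letter' or 'special'."""
    if ch in DIGITS:
        return "digit"
    # A.1.2 NOTE: a letter is a character whose name contains
    # one of the words LETTER or LIGATURE.
    words = unicodedata.name(ch, "").split()
    if "LETTER" in words or "LIGATURE" in words:
        return "letter"
    # A.1.10: neither a letter nor a digit.
    return "special"
```

Under this rule the ligature UFB01 and the Armenian ligature U0587 both count as letters, while the dollar sign is a special character.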
...







