SIST ISO 24613-1:2019
(Main)Language resource management -- Lexical markup framework (LMF) -- Part 1: Core model
Language resource management -- Lexical markup framework (LMF) -- Part 1: Core model
This document describes the core model of the lexical markup framework (LMF)l, a metamodel for
representing data in monolingual and multilingual lexical databases used with computer applications.
LMF provides mechanisms that allow the development and integration of a variety of electronic lexical
resource types.
Gestion de ressources linguistiques -- Cadre de balisage lexical -- Partie 1: Modèle de base
Upravljanje jezikovnih virov - Ogrodje za označevanje leksikonov (LMF) - 1. del: Jedrni model
Ta dokument opisuje jedrni model ogrodja za označevanje leksikonov (LMF)l, metamodel za predstavljanje podatkov v enojezičnih in večjezičnih leksikalnih zbirkah podatkov, ki se uporabljajo z računalniškimi aplikacijami.
LMF zagotavlja mehanizme, ki omogočajo razvoj in integracijo številnih vrst elektronskih leksikalnih virov.
General Information
Relations
Standards Content (Sample)
SLOVENSKI STANDARD
01-oktober-2019
Upravljanje jezikovnih virov - Ogrodje za označevanje leksikonov (LMF) - 1. del:
Jedrni model
Language resource management -- Lexical markup framework (LMF) -- Part 1: Core
model
Gestion de ressources linguistiques -- Cadre de balisage lexical -- Partie 1: Modèle de
base
Ta slovenski standard je istoveten z: ISO 24613-1:2019
ICS:
01.140.20 Informacijske vede Information sciences
35.240.30 Uporabniške rešitve IT v IT applications in information,
informatiki, dokumentiranju in documentation and
založništvu publishing
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
INTERNATIONAL ISO
STANDARD 24613-1
First edition
2019-06
Language resource management —
Lexical markup framework (LMF) —
Part 1:
Core model
Gestion des ressources linguistiques — Cadre de balisage lexical
(LMF) —
Partie 1: Modèle de base
Reference number
©
ISO 2019
© ISO 2019
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO 2019 – All rights reserved
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Key standards used by LMF . 3
4.1 Unicode . 3
4.2 Language coding . 3
4.3 Script coding . 3
4.4 Unified modeling language (UML) . 3
5 The LMF model . 3
5.1 Introduction . 3
5.2 Class inheritance and data category selection procedures . 4
5.2.1 Class inheritance . 4
5.2.2 LMF attributes . . 4
5.2.3 Data category selection (DCS) . 4
5.2.4 User-defined data categories . 4
5.3 LMF core package . 4
5.3.1 General. 4
5.3.2 LexicalResource class . 5
5.3.3 GlobalInformation class . 5
5.3.4 Lexicon class . 6
5.3.5 LexiconInformation class . 6
5.3.6 LexicalEntry class . 6
5.3.7 Form class . 6
5.3.8 OrthographicRepresentation class . 6
5.3.9 GrammaticalInformation Class . 6
5.3.10 Sense class . 7
5.3.11 Definition class . 7
5.4 Cross reference (CrossREF) model . 7
5.4.1 General. 7
5.4.2 CrossREF and CrossREFConstraint classes . 7
5.4.3 CrossREFConstraint class . 7
5.5 Methods for data category selection and subclass creation . 7
5.5.1 General. 7
5.5.2 Generalization (typing) . 8
5.5.3 Object instantiation . 8
5.5.4 Design choices . 8
5.5.5 Data categories for orthographic representation . 9
5.5.6 Principles for model simplification. 9
5.6 LMF extension use . 9
5.6.1 General. 9
5.6.2 Lexicon comparison .10
Annex A (informative) Data category examples .11
Bibliography .13
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www .iso .org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www .iso
.org/iso/foreword .html.
The document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee 4, Language resource management.
This first edition of ISO 24613-1, together with ISO 24613-2 to ISO 24613-6, cancels and replaces
ISO 24613:2008, which has been technically revised.
The main changes compared to the previous edition are as follows:
The content has been entirely revised and subdivided into parts. Part 1, Core model, contains the
body of the previous edition. New classes include LexiconInformation and GrammaticalInformation.
The Representation class has been renamed the OrthographicRepresentation class. In addition, the
OrthographicRepresentation subclasses, FormRepresentation and TextRepresentation, no longer are
part of the core model, providing it with greater modeling flexibility. The LexicalEntry subclass now
allows subclasses, providing improved extensibility and flexibility for modeling future parts. The
addition of the CrossREF class and associated metadata provides a formal model for cross-reference
design and implementation, closing a functional gap in the previous edition. A thoroughly revised
description of data category allocation mechanisms and their relationship to generalization by typing
provides a more incisive description of how these interdependent mechanisms enable flexible and
extensible designs.
A list of all parts in the ISO 24613 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/members .html.
iv © ISO 2019 – All rights reserved
Introduction
Optimizing the production, maintenance and extension of electronic lexical resources is one of the
crucial aspects impacting human language technologies (HLT) in general and natural language
processing (NLP) in particular, as well as human-oriented translation technologies. A second crucial
aspect involves optimizing the process leading to their integration in applications. Lexical markup
framework (LMF) is an abstract metamodel that provides a common, standardized framework for
the construction of computational lexicons. LMF ensures the encoding of linguistic information in a
way that enables reusability in different applicatio
...
SLOVENSKI STANDARD
01-oktober-2019
Upravljanje jezikovnih virov - Ogrodje za označevanje leksikonov (LMF) - 1. del:
Jedrni model
Language resource management -- Lexical markup framework (LMF) -- Part 1: Core
model
Gestion de ressources linguistiques -- Cadre de balisage lexical -- Partie 1: Modèle de
base
Ta slovenski standard je istoveten z: ISO 24613-1:2019
ICS:
01.020 Terminologija (načela in Terminology (principles and
koordinacija) coordination)
01.140.20 Informacijske vede Information sciences
35.240.30 Uporabniške rešitve IT v IT applications in information,
informatiki, dokumentiranju in documentation and
založništvu publishing
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
INTERNATIONAL ISO
STANDARD 24613-1
First edition
2019-06
Language resource management —
Lexical markup framework (LMF) —
Part 1:
Core model
Gestion des ressources linguistiques — Cadre de balisage lexical
(LMF) —
Partie 1: Modèle de base
Reference number
©
ISO 2019
© ISO 2019
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO 2019 – All rights reserved
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Key standards used by LMF . 3
4.1 Unicode . 3
4.2 Language coding . 3
4.3 Script coding . 3
4.4 Unified modeling language (UML) . 3
5 The LMF model . 3
5.1 Introduction . 3
5.2 Class inheritance and data category selection procedures . 4
5.2.1 Class inheritance . 4
5.2.2 LMF attributes . . 4
5.2.3 Data category selection (DCS) . 4
5.2.4 User-defined data categories . 4
5.3 LMF core package . 4
5.3.1 General. 4
5.3.2 LexicalResource class . 5
5.3.3 GlobalInformation class . 5
5.3.4 Lexicon class . 6
5.3.5 LexiconInformation class . 6
5.3.6 LexicalEntry class . 6
5.3.7 Form class . 6
5.3.8 OrthographicRepresentation class . 6
5.3.9 GrammaticalInformation Class . 6
5.3.10 Sense class . 7
5.3.11 Definition class . 7
5.4 Cross reference (CrossREF) model . 7
5.4.1 General. 7
5.4.2 CrossREF and CrossREFConstraint classes . 7
5.4.3 CrossREFConstraint class . 7
5.5 Methods for data category selection and subclass creation . 7
5.5.1 General. 7
5.5.2 Generalization (typing) . 8
5.5.3 Object instantiation . 8
5.5.4 Design choices . 8
5.5.5 Data categories for orthographic representation . 9
5.5.6 Principles for model simplification. 9
5.6 LMF extension use . 9
5.6.1 General. 9
5.6.2 Lexicon comparison .10
Annex A (informative) Data category examples .11
Bibliography .13
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www .iso .org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www .iso
.org/iso/foreword .html.
The document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee 4, Language resource management.
This first edition of ISO 24613-1, together with ISO 24613-2 to ISO 24613-6, cancels and replaces
ISO 24613:2008, which has been technically revised.
The main changes compared to the previous edition are as follows:
The content has been entirely revised and subdivided into parts. Part 1, Core model, contains the
body of the previous edition. New classes include LexiconInformation and GrammaticalInformation.
The Representation class has been renamed the OrthographicRepresentation class. In addition, the
OrthographicRepresentation subclasses, FormRepresentation and TextRepresentation, no longer are
part of the core model, providing it with greater modeling flexibility. The LexicalEntry subclass now
allows subclasses, providing improved extensibility and flexibility for modeling future parts. The
addition of the CrossREF class and associated metadata provides a formal model for cross-reference
design and implementation, closing a functional gap in the previous edition. A thoroughly revised
description of data category allocation mechanisms and their relationship to generalization by typing
provides a more incisive description of how these interdependent mechanisms enable flexible and
extensible designs.
A list of all parts in the ISO 24613 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/members .html.
iv © ISO 2019 – All rights reserved
Introduction
Optimizing the production, maintenance and extension of electronic lexical resources is one of the
crucial aspects impacting human language technologies (HLT) in general and natural language
processing (NLP) in particular, as well as human-oriented translation technologies. A second crucial
aspect involves optimizing the process leading to their integration in applications. Lexical markup
framework (LMF) is an abstract metamodel that provides a common, standardized framework for
the construction of computational lexicons. LMF ensures the enco
...
INTERNATIONAL ISO
STANDARD 24613-1
First edition
2019-06
Language resource management —
Lexical markup framework (LMF) —
Part 1:
Core model
Gestion des ressources linguistiques — Cadre de balisage lexical
(LMF) —
Partie 1: Modèle de base
Reference number
©
ISO 2019
© ISO 2019
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO 2019 – All rights reserved
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Key standards used by LMF . 3
4.1 Unicode . 3
4.2 Language coding . 3
4.3 Script coding . 3
4.4 Unified modeling language (UML) . 3
5 The LMF model . 3
5.1 Introduction . 3
5.2 Class inheritance and data category selection procedures . 4
5.2.1 Class inheritance . 4
5.2.2 LMF attributes . . 4
5.2.3 Data category selection (DCS) . 4
5.2.4 User-defined data categories . 4
5.3 LMF core package . 4
5.3.1 General. 4
5.3.2 LexicalResource class . 5
5.3.3 GlobalInformation class . 5
5.3.4 Lexicon class . 6
5.3.5 LexiconInformation class . 6
5.3.6 LexicalEntry class . 6
5.3.7 Form class . 6
5.3.8 OrthographicRepresentation class . 6
5.3.9 GrammaticalInformation Class . 6
5.3.10 Sense class . 7
5.3.11 Definition class . 7
5.4 Cross reference (CrossREF) model . 7
5.4.1 General. 7
5.4.2 CrossREF and CrossREFConstraint classes . 7
5.4.3 CrossREFConstraint class . 7
5.5 Methods for data category selection and subclass creation . 7
5.5.1 General. 7
5.5.2 Generalization (typing) . 8
5.5.3 Object instantiation . 8
5.5.4 Design choices . 8
5.5.5 Data categories for orthographic representation . 9
5.5.6 Principles for model simplification. 9
5.6 LMF extension use . 9
5.6.1 General. 9
5.6.2 Lexicon comparison .10
Annex A (informative) Data category examples .11
Bibliography .13
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www .iso .org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www .iso
.org/iso/foreword .html.
The document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee 4, Language resource management.
This first edition of ISO 24613-1, together with ISO 24613-2 to ISO 24613-6, cancels and replaces
ISO 24613:2008, which has been technically revised.
The main changes compared to the previous edition are as follows:
The content has been entirely revised and subdivided into parts. Part 1, Core model, contains the
body of the previous edition. New classes include LexiconInformation and GrammaticalInformation.
The Representation class has been renamed the OrthographicRepresentation class. In addition, the
OrthographicRepresentation subclasses, FormRepresentation and TextRepresentation, no longer are
part of the core model, providing it with greater modeling flexibility. The LexicalEntry subclass now
allows subclasses, providing improved extensibility and flexibility for modeling future parts. The
addition of the CrossREF class and associated metadata provides a formal model for cross-reference
design and implementation, closing a functional gap in the previous edition. A thoroughly revised
description of data category allocation mechanisms and their relationship to generalization by typing
provides a more incisive description of how these interdependent mechanisms enable flexible and
extensible designs.
A list of all parts in the ISO 24613 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/members .html.
iv © ISO 2019 – All rights reserved
Introduction
Optimizing the production, maintenance and extension of electronic lexical resources is one of the
crucial aspects impacting human language technologies (HLT) in general and natural language
processing (NLP) in particular, as well as human-oriented translation technologies. A second crucial
aspect involves optimizing the process leading to their integration in applications. Lexical markup
framework (LMF) is an abstract metamodel that provides a common, standardized framework for
the construction of computational lexicons. LMF ensures the encoding of linguistic information in a
way that enables reusability in different applications and for different tasks. LMF provides a common,
shared representation of lexical objects, including morphological, syntactic and semantic aspects.
The goals of LMF are to provide a common model for the creation and use of electronic lexical resources
ranging from small to large in scale, to manage the exchange of data between and among these
resources, and to facilitate the merging of large numbers of different individual electronic resources to
form extensive global electronic resources. The ultimate goal of LMF is to create a modular structure
that will facilitate true content interoperability across all aspects of electronic lexical resources.
[3]
LMF supports existing lexical resource models such as Genelex , the EAGLES International Standard
[4] [10]
for Language Engineering (ISLE) , Multilingual ISLE Lexical Entry (MILE) models , Text Encoding
[8] [7]
Initiative (TEI) guidelines , Ontolex , and the Language Base Exchange (LBX) serialization
...
INTERNATIONAL ISO
STANDARD 24613-1
First edition
2019-06
Language resource management —
Lexical markup framework (LMF) —
Part 1:
Core model
Gestion des ressources linguistiques — Cadre de balisage lexical
(LMF) —
Partie 1: Modèle de base
Reference number
©
ISO 2019
© ISO 2019
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii © ISO 2019 – All rights reserved
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Key standards used by LMF . 3
4.1 Unicode . 3
4.2 Language coding . 3
4.3 Script coding . 3
4.4 Unified modeling language (UML) . 3
5 The LMF model . 3
5.1 Introduction . 3
5.2 Class inheritance and data category selection procedures . 4
5.2.1 Class inheritance . 4
5.2.2 LMF attributes . . 4
5.2.3 Data category selection (DCS) . 4
5.2.4 User-defined data categories . 4
5.3 LMF core package . 4
5.3.1 General. 4
5.3.2 LexicalResource class . 5
5.3.3 GlobalInformation class . 5
5.3.4 Lexicon class . 6
5.3.5 LexiconInformation class . 6
5.3.6 LexicalEntry class . 6
5.3.7 Form class . 6
5.3.8 OrthographicRepresentation class . 6
5.3.9 GrammaticalInformation Class . 6
5.3.10 Sense class . 7
5.3.11 Definition class . 7
5.4 Cross reference (CrossREF) model . 7
5.4.1 General. 7
5.4.2 CrossREF and CrossREFConstraint classes . 7
5.4.3 CrossREFConstraint class . 7
5.5 Methods for data category selection and subclass creation . 7
5.5.1 General. 7
5.5.2 Generalization (typing) . 8
5.5.3 Object instantiation . 8
5.5.4 Design choices . 8
5.5.5 Data categories for orthographic representation . 9
5.5.6 Principles for model simplification. 9
5.6 LMF extension use . 9
5.6.1 General. 9
5.6.2 Lexicon comparison .10
Annex A (informative) Data category examples .11
Bibliography .13
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www .iso .org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www .iso .org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see www .iso
.org/iso/foreword .html.
The document was prepared by Technical Committee ISO/TC 37, Language and terminology,
Subcommittee 4, Language resource management.
This first edition of ISO 24613-1, together with ISO 24613-2 to ISO 24613-6, cancels and replaces
ISO 24613:2008, which has been technically revised.
The main changes compared to the previous edition are as follows:
The content has been entirely revised and subdivided into parts. Part 1, Core model, contains the
body of the previous edition. New classes include LexiconInformation and GrammaticalInformation.
The Representation class has been renamed the OrthographicRepresentation class. In addition, the
OrthographicRepresentation subclasses, FormRepresentation and TextRepresentation, no longer are
part of the core model, providing it with greater modeling flexibility. The LexicalEntry subclass now
allows subclasses, providing improved extensibility and flexibility for modeling future parts. The
addition of the CrossREF class and associated metadata provides a formal model for cross-reference
design and implementation, closing a functional gap in the previous edition. A thoroughly revised
description of data category allocation mechanisms and their relationship to generalization by typing
provides a more incisive description of how these interdependent mechanisms enable flexible and
extensible designs.
A list of all parts in the ISO 24613 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www .iso .org/members .html.
iv © ISO 2019 – All rights reserved
Introduction
Optimizing the production, maintenance and extension of electronic lexical resources is one of the
crucial aspects impacting human language technologies (HLT) in general and natural language
processing (NLP) in particular, as well as human-oriented translation technologies. A second crucial
aspect involves optimizing the process leading to their integration in applications. Lexical markup
framework (LMF) is an abstract metamodel that provides a common, standardized framework for
the construction of computational lexicons. LMF ensures the encoding of linguistic information in a
way that enables reusability in different applications and for different tasks. LMF provides a common,
shared representation of lexical objects, including morphological, syntactic and semantic aspects.
The goals of LMF are to provide a common model for the creation and use of electronic lexical resources
ranging from small to large in scale, to manage the exchange of data between and among these
resources, and to facilitate the merging of large numbers of different individual electronic resources to
form extensive global electronic resources. The ultimate goal of LMF is to create a modular structure
that will facilitate true content interoperability across all aspects of electronic lexical resources.
[3]
LMF supports existing lexical resource models such as Genelex , the EAGLES International Standard
[4] [10]
for Language Engineering (ISLE) , Multilingual ISLE Lexical Entry (MILE) models , Text Encoding
[8] [7]
Initiative (TEI) guidelines , Ontolex , and the Language Base Exchange (LBX) serialization
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.