ISO/TC 37/SC 4/WG 1 - Basic descriptors and mechanisms for language

	Jan 2026
28	29	30	31	1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31
1	2	3	4	5	6	7

	Feb 2026
25	26	27	28	29	30	31
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
1	2	3	4	5	6	7

	Jan 2026
28	29	30	31	1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31
1	2	3	4	5	6	7

	Feb 2026
25	26	27	28	29	30	31
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
1	2	3	4	5	6	7

	Jan 2026
28	29	30	31	1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31
1	2	3	4	5	6	7

	Feb 2026
25	26	27	28	29	30	31
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
1	2	3	4	5	6	7

The component metadata lifecycle needs a comprehensive infrastructure with systems that cooperate well together. To enable this level of cooperation this document provides in depth descriptions and definitions of what CMDI records, components and their representations in XML look like. This document describes these XML representations, which enable the flexible construction of interoperable metadata schemas suitable for, but not limited to, describing language resources. The metadata schemas based on these representations can be used to describe resources at different levels of granularity (e.g. descriptions on the collection level or on the level of individual resources).

Standard
38 pages
English language
e-Library read for
AI-Chat
1 day
Standard
38 pages
English language
e-Library read for
AI-Chat
1 day
Standard
32 pages
English language
sale 15% off

ISO

ISO 24622-1:2015(Main)

Language resource management - Component Metadata Infrastructure (CMDI) - Part 1: The Component Metadata Model

SIST ISO 24622-1:2018

ISO 24622-1:2015 describes a model that enables the flexible construction of interoperable metadata schemas for Language Resources (LRs). The metadata schemas based on this model can be used to describe resources at different levels of granularity (e.g. descriptions both on the collection level and on the level of individual resources).

Standard
17 pages
English language
e-Library read for
AI-Chat
1 day
Standard
17 pages
English language
e-Library read for
AI-Chat
1 day
Standard
11 pages
English language
sale 15% off

ISO

ISO 24612:2012(Main)

Language resource management - Linguistic annotation framework (LAF)

SIST ISO 24612:2013

ISO 24612:2012 specifies a linguistic annotation framework (LAF) for representing linguistic annotations of language data such as corpora, speech signal and video. The framework includes an abstract data model and an XML serialization of that model for representing annotations of primary data. The serialization serves as a pivot format to allow annotations expressed in one representation format to be mapped onto another.

Standard
24 pages
English language
e-Library read for
AI-Chat
1 day
Standard
24 pages
English language
e-Library read for
AI-Chat
1 day
Standard
19 pages
English language
sale 15% off
Standard
26 pages
Russian language
sale 15% off

ISO

ISO 24610-2:2011(Main)

Language resource management - Feature structures - Part 2: Feature system declaration

SIST ISO 24610-2:2013

ISO 24610-2:2011 provides a format to represent, store or exchange feature structures in natural language applications, for both annotation and production of linguistic data. It is ultimately designed to provide a computer format to define a type hierarchy and to declare the constraints that bear on a set of feature specifications and operations on feature structures, thus offering means to check the conformance of each feature structure with regards to a reference specification. Feature structures are an essential part of many linguistic formalisms as well as an underlying mechanism for representing the information consumed or produced by and for language engineering applications. A feature system declaration (FSD) is an auxiliary file used in conjunction with a certain type of text that makes use of fs (that is, feature structure) elements. The FSD serves four purposes. 1) It provides an encoding by which types and their subtyping and inheritance relationships can be introduced and defined, thus laying the basis for constructing a feature system. 2) It provides a mechanism by which the encoder can list all of the feature names and feature values and give a prose description as to what each represents. 3) It provides a mechanism by which type constraints can be declared, against which typed feature structures are validated relative to a given theory stated in typed feature logic. These constraints may involve constraints on the range of a feature's value, constraints on which features are permitted within certain types of feature structures, or constraints that prevent the co-occurrence of certain feature-value pairs. The source of these constraints is normally the empirical domain being modelled. 4) It provides a mechanism by which the encoder can define the intended interpretation of underspecified feature structures. This involves defining default values (whether literal or computed) for missing features. The scheme described in ISO 24610-2:2011 may be used to document any feature system, but is primarily intended for use with the typed feature structure representation defined in ISO 24610-1. The feature structure representations of ISO 24610-1 specify data structures that are subject to the typing conventions and constraints specified using ISO 24610-2:2011. The feature structure representations of ISO 24610-1 are also used within some of the elements defined in ISO 24610-2:2011.

Standard
55 pages
English language
e-Library read for
AI-Chat
1 day
Standard
55 pages
English language
e-Library read for
AI-Chat
1 day
Standard
50 pages
English language
sale 15% off
Standard
50 pages
Russian language
sale 15% off

ISO

ISO 24619:2011(Main)

Language resource management - Persistent identification and sustainable access (PISA)

SIST ISO 24619:2014

ISO 24619:2011 specifies requirements for the persistent identifier (PID) framework and for using PIDs as references and citations of language resources in documents as well as in language resources themselves. In this context, examples of language resources include such works as digital dictionaries, language-purposed terminological resources, machine-translation lexica, annotated multimedia/multimodal corpora, text corpora that have been annotated with, for example, morpho-syntactic information, and the like. Computational and applied linguists and information specialists create such resources. ISO 24619:2011 also addresses issues of persistence and granularity of references to resources, first by requiring that persistent references be implemented by using a PID framework and further by imposing requirements on any PID frameworks used for this purpose. PID frameworks also allow the association of general metadata with the identifier, which can also contain citation information. ISO 24619:2011 specifies minimum requirements for effective use of PIDs in language resources and cites the use of several possible existing standards and de-facto standards.

Standard
34 pages
English language
e-Library read for
AI-Chat
1 day
Standard
34 pages
English language
e-Library read for
AI-Chat
1 day
Standard
29 pages
English language
sale 15% off

ISO

ISO 24610-1:2006(Main)

Language resource management - Feature structures - Part 1: Feature structure representation

SIST ISO 24610-1:2013

ISO 24610-1:2006 provides a format for the representation, storage and exchange of feature structures in natural language applications concerned with the annotation, production or analysis of linguistic data. It also defines a computer format for the description of constraints that bear on a set of features, feature values, feature specifications and operations on feature structures, thus offering a means of checking the conformance of each feature structure with regards to a reference specification.

Standard
84 pages
English language
e-Library read for
AI-Chat
1 day
Standard
84 pages
English language
e-Library read for
AI-Chat
1 day
Standard
78 pages
English language
sale 15% off

ISO/TC 37/SC 4/WG 1 - Basic descriptors and mechanisms for language resources

Mécanismes et descripteurs de base pour les ressources linguistiques

General Information

Frequently Asked Questions