SIST ISO 5725-3:2024
(Main)Accuracy (trueness and precision) of measurement methods and results - Part 3: Intermediate precision and alternative designs for collaborative studies
Accuracy (trueness and precision) of measurement methods and results - Part 3: Intermediate precision and alternative designs for collaborative studies
This document provides
a) a discussion of alternative experimental designs for the determination of trueness and precision measures including reproducibility, repeatability and selected measures of intermediate precision of a standard measurement method, including a review of the circumstances in which their use is necessary or beneficial, and guidance as to the interpretation and application of the resulting estimates, and
b) worked examples including specific designs and computations.
Each of the alternative designs discussed in this document is intended to address one (or several) of the following issues:
a) a discussion of the implications of the definitions of intermediate precision measures;
b) a guidance on the interpretation and application of the estimates of intermediate precision measures in practical situations;
c) determining reproducibility, repeatability and selected measures of intermediate precision;
d) improved determination of reproducibility and other measures of precision;
e) improving the estimate of the sample mean;
f) determining the range of in-house repeatability standard deviations;
g) determining other precision components such as operator variability;
h) determining the level of reliability of precision estimates;
i) reducing the minimum number of participating laboratories by optimizing the reliability of precision estimates;
j) avoiding distorted estimations of repeatability (split-level designs);
k) avoiding distorted estimations of reproducibility (taking the heterogeneity of the material into consideration).
Often, the performance of the method whose precision is being evaluated in a collaborative study will have previously been assessed in a single-laboratory validation study conducted by the laboratory which developed it. Relevant factors for the determination of intermediary precision will have been identified in this prior single-laboratory study.
Exactitude (justesse et fidélité) des résultats et méthodes de mesure — Partie 3: Fidélité intermédiaire et plans alternatifs pour les études collaboratives
Le présent document fournit:
a) une discussion de plans d’expérience alternatifs pour la détermination de mesures de justesse et de fidélité, y compris la reproductibilité, la répétabilité et les mesures sélectionnées de la fidélité intermédiaire d’une méthode de mesure normalisée, incluant un examen des circonstances dans lesquelles leur utilisation est nécessaire ou bénéfique, ainsi que des recommandations relatives à l’interprétation et à l’application des estimations en résultant; et
b) des exemples détaillés, incluant des plans et des calculs spécifiques.
Chacun des plans alternatifs abordés dans le présent document est destiné à traiter l’un (ou plusieurs) des problèmes suivants:
a) une discussion des implications des définitions des mesures de fidélité intermédiaire;
b) des recommandations relatives à l’interprétation et à l’application des estimations des mesures de fidélité intermédiaire dans des situations pratiques;
c) la détermination de la reproductibilité, de la répétabilité et de mesures sélectionnées de la fidélité intermédiaire;
d) la détermination améliorée[1] de la reproductibilité et d’autres mesures de la fidélité;
e) l’amélioration de l’estimation de la moyenne de l’échantillon;
f) la détermination de la plage des écarts-types de répétabilité interne;
g) la détermination d’autres composantes de la fidélité, telles que la variabilité des opérateurs;
h) la détermination du niveau de fiabilité des estimations de la fidélité;
i) la réduction du nombre minimal de laboratoires participants en optimisant la fiabilité des estimations de la fidélité;
j) l’évitement d’estimations biaisées de la répétabilité (plans à niveau fractionné);
k) l’évitement d’estimations biaisées de la reproductibilité (en tenant compte de l’hétérogénéité du matériau).
Il arrive souvent que la performance de la méthode dont la fidélité est soumise à évaluation dans une étude collaborative ait déjà été évaluée dans le cadre d’une étude de validation intralaboratoire menée par le laboratoire qui l’a élaborée. Des facteurs pertinents pour la détermination de la fidélité intermédiaire ont donc déjà été identifiés lors de cette étude intralaboratoire antérieure.
[1] Autorisant une réduction du nombre de laboratoires.
Točnost (pravilnost in natančnost) merilnih metod in rezultatov – 3. del : Vmesne mere natančnosti in alternativni pristopi za primerjalne študije
Ta dokument zagotavlja
a) razpravo o alternativnih poskusnih pristopih k določanju mer pravilnosti in natančnosti, vključno z obnovljivostjo, ponovljivostjo in izbranimi merami vmesne natančnosti standardne merilne metode, kar vključuje pregled okoliščin, v katerih je njihova uporaba potrebna ali koristna, ter smernice za interpretacijo in uporabo dobljenih ocen ter
b) praktične primere, vključno s posebnimi pristopi in izračuni.
Vsak od alternativnih pristopov, obravnavanih v tem dokumentu, je namenjen obravnavanju enega (ali več) od naslednjih vprašanj:
a) razprava o posledicah opredelitve mer vmesne natančnosti;
b) navodila za interpretacijo in uporabo ocenjenih mer vmesne natančnosti v praktičnih situacijah;
c) določanje obnovljivosti, ponovljivosti in izbranih mer vmesne natančnosti;
d) izboljšano določanje obnovljivosti in drugih mer natančnosti;
e) izboljšanje ocene vzorčnega povprečja;
f) določanje obsega standardnih odklonov interne ponovljivosti;
g) določanje drugih komponent natančnosti, kot je spremenljivost izvajalca;
h) določanje stopnje zanesljivosti ocen natančnosti;
i) zmanjšanje najmanjšega števila sodelujočih laboratorijev z optimizacijo zanesljivosti ocen natančnosti;
j) preprečevanje popačenja ocen ponovljivosti (pristopi na dveh ravneh);
k) izogibanje popačenju ocen obnovljivosti (upoštevanje heterogenosti materiala).
Pogosto je učinkovitost metode, katere natančnost se ocenjuje v primerjalni študiji, predhodno ocenjena v študiji potrjevanja, ki jo izvede laboratorij, v katerem je bila metoda razvita. V tej predhodni študiji, ki jo izvede en laboratorij, se opredelijo dejavniki, ki se upoštevajo pri določitvi vmesne natančnosti.
General Information
- Status
- Published
- Publication Date
- 17-Jun-2024
- Technical Committee
- ISTM - Statistical methods
- Current Stage
- 6060 - National Implementation/Publication (Adopted Project)
- Start Date
- 17-Jun-2024
- Due Date
- 22-Aug-2024
- Completion Date
- 18-Jun-2024
- Directive
- TP037 - Pravilnik o pitni vodi
Relations
- Effective Date
- 04-Nov-2015
- Effective Date
- 04-Nov-2015
Overview
ISO 5725-3:2023 - "Accuracy (trueness and precision) of measurement methods and results - Part 3" provides guidance on intermediate precision and alternative experimental designs for collaborative studies. The standard explains how to determine and interpret measures of precision (repeatability, reproducibility and selected intermediate precision measures), reviews when alternative designs are necessary or beneficial, and includes worked examples with specific layouts and computations. It helps laboratories and method developers characterise method variability under practical, between‑run and between‑operator conditions.
Key topics and requirements
- Scope of precision: Definitions and interpretation of trueness, precision, repeatability, reproducibility and intermediate precision in the context of standard measurement methods.
- Factor selection and modelling: Guidance on identifying relevant factors (laboratory, operator, equipment, reagent batch, time, environment, etc.), selection of factor levels, and treatment of random vs fixed effects.
- Alternative experimental designs: Practical layouts and analysis for:
- Balanced fully‑nested, staggered‑nested and partially‑nested designs
- Orthogonal array designs
- Split‑level designs to avoid distorted repeatability estimates
- Designs for heterogeneous material
- Designs across levels
- Statistical analysis methods: Analysis of variance (ANOVA) approaches and Restricted Maximum Likelihood (REML) options are referenced (annexes include worked ANOVA/REML examples).
- Reliability and optimisation: Methods to assess the reliability of precision and overall mean estimates, reduce required participant numbers, estimate operator variability, and improve sample mean estimation.
- Worked examples: Annexes provide step‑by‑step analyses and example computations to apply the designs and interpret results.
Practical applications and users
ISO 5725-3:2023 is intended for:
- Analytical and testing laboratories validating or verifying measurement methods
- Method developers preparing single‑laboratory validation and planning collaborative studies
- Accreditation bodies and conformity assessment organizations assessing method performance
- Standards committees and statisticians designing interlaboratory studies
Typical uses:
- Estimating how much variability is introduced by operators, equipment or days (intermediate precision)
- Selecting efficient collaborative study designs that reduce workload while maintaining reliable precision estimates
- Designing experiments for heterogeneous test materials or multi‑level measurement systems
Related standards
- ISO 5725-1 - Concepts and general principles for accuracy (trueness and precision)
- ISO 5725-2 - Basic method for the determination of repeatability and reproducibility
- ISO 5725-5 - Previously addressed designs for heterogeneous materials (related content now included)
Keywords: ISO 5725-3:2023, intermediate precision, collaborative studies, reproducibility, repeatability, measurement methods, split-level design, orthogonal array, heterogeneous material, ANOVA, REML.
Frequently Asked Questions
SIST ISO 5725-3:2024 is a standard published by the Slovenian Institute for Standardization (SIST). Its full title is "Accuracy (trueness and precision) of measurement methods and results - Part 3: Intermediate precision and alternative designs for collaborative studies". This standard covers: This document provides a) a discussion of alternative experimental designs for the determination of trueness and precision measures including reproducibility, repeatability and selected measures of intermediate precision of a standard measurement method, including a review of the circumstances in which their use is necessary or beneficial, and guidance as to the interpretation and application of the resulting estimates, and b) worked examples including specific designs and computations. Each of the alternative designs discussed in this document is intended to address one (or several) of the following issues: a) a discussion of the implications of the definitions of intermediate precision measures; b) a guidance on the interpretation and application of the estimates of intermediate precision measures in practical situations; c) determining reproducibility, repeatability and selected measures of intermediate precision; d) improved determination of reproducibility and other measures of precision; e) improving the estimate of the sample mean; f) determining the range of in-house repeatability standard deviations; g) determining other precision components such as operator variability; h) determining the level of reliability of precision estimates; i) reducing the minimum number of participating laboratories by optimizing the reliability of precision estimates; j) avoiding distorted estimations of repeatability (split-level designs); k) avoiding distorted estimations of reproducibility (taking the heterogeneity of the material into consideration). Often, the performance of the method whose precision is being evaluated in a collaborative study will have previously been assessed in a single-laboratory validation study conducted by the laboratory which developed it. Relevant factors for the determination of intermediary precision will have been identified in this prior single-laboratory study.
This document provides a) a discussion of alternative experimental designs for the determination of trueness and precision measures including reproducibility, repeatability and selected measures of intermediate precision of a standard measurement method, including a review of the circumstances in which their use is necessary or beneficial, and guidance as to the interpretation and application of the resulting estimates, and b) worked examples including specific designs and computations. Each of the alternative designs discussed in this document is intended to address one (or several) of the following issues: a) a discussion of the implications of the definitions of intermediate precision measures; b) a guidance on the interpretation and application of the estimates of intermediate precision measures in practical situations; c) determining reproducibility, repeatability and selected measures of intermediate precision; d) improved determination of reproducibility and other measures of precision; e) improving the estimate of the sample mean; f) determining the range of in-house repeatability standard deviations; g) determining other precision components such as operator variability; h) determining the level of reliability of precision estimates; i) reducing the minimum number of participating laboratories by optimizing the reliability of precision estimates; j) avoiding distorted estimations of repeatability (split-level designs); k) avoiding distorted estimations of reproducibility (taking the heterogeneity of the material into consideration). Often, the performance of the method whose precision is being evaluated in a collaborative study will have previously been assessed in a single-laboratory validation study conducted by the laboratory which developed it. Relevant factors for the determination of intermediary precision will have been identified in this prior single-laboratory study.
SIST ISO 5725-3:2024 is classified under the following ICS (International Classification for Standards) categories: 03.120.30 - Application of statistical methods; 17.020 - Metrology and measurement in general. The ICS classification helps identify the subject area and facilitates finding related standards.
SIST ISO 5725-3:2024 has the following relationships with other standards: It is inter standard links to SIST ISO 5725-3:2003, SIST ISO 5725-3:2003/C1:2003. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.
SIST ISO 5725-3:2024 is associated with the following European legislation: EU Directives/Regulations: 2009-01-4018, 2011-01-2525, TP037. When a standard is cited in the Official Journal of the European Union, products manufactured in conformity with it benefit from a presumption of conformity with the essential requirements of the corresponding EU directive or regulation.
You can purchase SIST ISO 5725-3:2024 directly from iTeh Standards. The document is available in PDF format and is delivered instantly after payment. Add the standard to your cart and complete the secure checkout process. iTeh Standards is an authorized distributor of SIST standards.
Standards Content (Sample)
SLOVENSKI STANDARD
01-september-2024
Točnost (pravilnost in natančnost) merilnih metod in rezultatov – 3. del : Vmesne
mere natančnosti in alternativni pristopi za primerjalne študije
Accuracy (trueness and precision) of measurement methods and results — Part 3:
Intermediate precision and alternative designs for collaborative studies
Exactitude (justesse et fidélité) des résultats et méthodes de mesure — Partie 3: Fidélité
intermédiaire et plans alternatifs pour les études collaboratives
Ta slovenski standard je istoveten z: ISO 5725-3:2023
ICS:
03.120.30 Uporaba statističnih metod Application of statistical
methods
17.020 Meroslovje in merjenje na Metrology and measurement
splošno in general
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
INTERNATIONAL ISO
STANDARD 5725-3
Second edition
2023-06
Accuracy (trueness and precision) of
measurement methods and results —
Part 3:
Intermediate precision and alternative
designs for collaborative studies
Exactitude (justesse et fidélité) des résultats et méthodes de mesure —
Partie 3: Fidélité intermédiaire et plans alternatifs pour les études
collaboratives
Reference number
© ISO 2023
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii
Contents Page
Foreword .v
Introduction . vi
1 Scope . 1
2 Normative references . 2
3 Terms and definitions . 2
4 Symbols . 3
5 General requirements . 4
6 Intermediate measures of the precision of a standard measurement method .5
6.1 Factors and factor levels . 5
6.1.1 Definitions and examples . 5
6.1.2 Selection of factors of interest . 6
6.1.3 Random and fixed effects . 6
6.1.4 Statistical model . 7
6.2 Within-laboratory study and analysis of intermediate precision measures . 9
6.2.1 Simplest approach . 9
6.2.2 Alternative method . 10
6.2.3 Effect of the measurement conditions on the final quoted result . 10
7 Nested design .11
7.1 Balanced fully-nested design . 11
7.2 Staggered-nested design . 12
7.3 Balanced partially-nested design . 13
7.4 Orthogonal array design . 14
8 Design for heterogeneous material .16
8.1 Applications of the design for a heterogeneous material . 16
8.2 Layout of the design for a heterogeneous material . 17
8.3 Statistical analysis . 17
9 Split-level design .17
9.1 Applications of the split-level design . 17
9.2 Layout of the split-level design . 19
9.3 Statistical analysis . 19
10 Design across levels .19
10.1 Applications of the design across levels . 19
10.2 Layout of the design across levels . 20
10.3 Statistical analysis . 20
11 Reliability of interlaboratory parameters .20
11.1 Reliability of precision estimates . 20
11.2 Reliability of estimates of the overall mean . 21
11.2.1 General . 21
11.2.2 Balanced fully-nested design (2 factors) . 21
11.2.3 Staggered nested design (2 factors) . 21
11.2.4 Balanced partially-nested design . 21
11.2.5 Orthogonal array design . 21
11.2.6 Split-level design . 22
Annex A (informative) Fully- and partially-nested designs .23
Annex B (informative) Analysis of variance for balanced fully-nested design .25
Annex C (informative) Analysis of variance for staggered design .30
Annex D (informative) Analysis of variance for the balanced partially-nested design (three
factors) .38
iii
Annex E (informative) Statistical model for an experiment with heterogeneous material .41
Annex F (informative) Analysis of variance for split-level design .42
Annex G (informative) Example for split-level design . 44
Annex H (informative) Design across levels .47
Annex I (informative) Restricted maximum likelihood (REML) .48
Annex J (informative) Examples of the statistical analysis of intermediate precision
experiment .49
Annex K (informative) Example for an analysis across levels .55
Bibliography .57
iv
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO document should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
ISO draws attention to the possibility that the implementation of this document may involve the use
of (a) patent(s). ISO takes no position concerning the evidence, validity or applicability of any claimed
patent rights in respect thereof. As of the date of publication of this document, ISO had not received
notice of (a) patent(s) which may be required to implement this document. However, implementers are
cautioned that this may not represent the latest information, which may be obtained from the patent
database available at www.iso.org/patents. ISO shall not be held responsible for identifying any or all
such patent rights.
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see
www.iso.org/iso/foreword.html.
This document was prepared by Technical Committee ISO/TC 69, Applications of statistical methods,
Subcommittee SC 6, Measurement methods and results.
This second edition cancels and replaces the first edition (ISO 5725-3:1994), which has been technically
revised. It also incorporates the Technical Corrigendum ISO 5725-3:1994/Cor.1:2001.
The main changes are as follows:
— Several additional experimental designs have been added to this version compared to the previous
version, some of them from ISO 5725-5. These are orthogonal array designs, split level designs,
designs for heterogeneous sample material as well as designs across levels.
— Furthermore, the standard was supplemented by considerations on the selection of factors and
modelling of the factorial effects, as well as by a section in which the reliability of the various
interlaboratory test parameters (mean and precision parameters) are considered.
A list of all parts in the ISO 5725 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.
v
Introduction
0.1 ISO 5725 uses two terms “trueness” and "precision” to describe the accuracy of a measurement
method. “Trueness” refers to the degree of agreement between the average value of a large number
of test results and the true or accepted reference value. “Precision” refers to the degree of agreement
between test results.
0.2 General consideration of these quantities is given in ISO 5725-1 and is not repeated here. It is
stressed that ISO 5725-1 provides underlying definitions and general principles should be read in
conjunction with all other parts of ISO 5725.
0.3 Many different factors (apart from test material heterogeneity) may contribute to the variability of
results from a measurement method, including:
a) the laboratory;
b) the operator;
c) the equipment used;
d) the calibration of the equipment;
e) the batch of a reagent;
f) the time elapsed between measurements;
g) environment (temperature, humidity, air pollution, etc.);
h) other factors.
0.4 Two conditions of precision, termed repeatability and reproducibility conditions, have been found
necessary and, for many practical cases, useful for describing the variability of a measurement method.
Under repeatability conditions, none of the factors a) to h) in 0.3 are considered to vary, while under
reproducibility conditions, all of the factors are considered to vary and contribute to the variability of
the test results. Thus, repeatability and reproducibility conditions are the two extremes of precision,
the first describing the minimum and the second the maximum variability in results. Intermediate
conditions between these two extreme conditions of precision are also conceivable, when one or more
of the factors listed in b) to g) are allowed to vary.
To illustrate the need for including a consideration of intermediate conditions in method validation,
consider the operation of a present-day laboratory connected with a production plant involving, for
example, a three-shift working system where measurements are made by different operators on
different equipment. Operators and equipment are then some of the factors that contribute to the
variability in the test results.
The standard deviation of test results obtained under repeatability conditions is generally less than
that obtained under intermediate precision conditions. Generally, in chemical analysis, the standard
deviation under intermediate precision conditions may be two or three times larger than that under
repeatability conditions. It should not, of course, exceed the reproducibility standard deviation.
As an example, in the determination of copper in copper ore, a collaborative study among 35 laboratories
revealed that the standard deviation under intermediate precision conditions (different times) was
1,5 times larger than that under repeatability conditions, both for the electrolytic gravimetry and
Na S 0 titration methods.
2 2 3
0.5 This document focuses on intermediate precision and alternative designs for collaborative studies
of a measurement method. Apart from the determination of intermediate precision measures, the
aims of these alternative designs include reducing the number of required measurements, increasing
the reliability of the estimates for precision and overall mean and taking into account test material
heterogeneity.
vi
Indeed, a t -factor fully-nested experiment with two levels per factor (inside each laboratory, there are
t−1
t−1 factors) and two replicates per setting requires 22 · test results from each laboratory, which
can be an excessive requirement on the laboratories. For this reason, in the previous version of
ISO 5725-3, the staggered nested design is also discussed. While the estimation of the precision
parameters is more complex and subject to greater uncertainty in a staggered nested design, the
workload is reduced. This document offers alternative strategies to reduce the workload without
compromising the reliability of the precision estimates.
As far as the special designs for sample heterogeneity are concerned, they were discussed in the
previous version of ISO 5725-5. However, it is convenient to have one part of this standard dedicated to
the question of the design of experiments.
0.6 The repeatability precision as determined in accordance with ISO 5725-2 is computed as a mean
across participating laboratories. Whether it can be used for quality control purposes depends on
whether the repeatability standard deviation can be considered to remain constant across laboratories.
For this reason, it is important to obtain information on how the repeatability standard deviation varies
within and between the laboratories under different conditions.
0.7 In many collaborative studies, the between-laboratory variability is large in comparison to the
repeatability, and it would be useful to a) decompose it into several different precision components, b)
reduce, if possible, some sources of variability which are due to the intermediate precision conditions.
This can be done by identifying factors (e.g. time, calibration, operator or equipment) which contribute
to the variability under intermediate precision conditions of measurement, by quantifying the
corresponding variability components and, wherever achievable, decreasing their contribution. In this
manner, the intermediate precision component of the overall variance is enlarged while the between-
laboratory component of the overall variance is reduced. Only random effects are considered: it is only
reasonable to model a factor as a fixed effect after a method or calibration optimization study has been
conducted. In this standard, different relationships between factors are taken into account, e.g. whether
a particular factor is subsumed under another factor or not.
0.8 Estimates for precision and overall mean are subject to random variability. Accordingly, it
is important to determine the uncertainty associated with each estimate, and to understand the
relationships between this uncertainty, the number of participants and the design. Once these
relationships are understood, it becomes possible to make much more informed decisions concerning
the number of participants and the experimental design.
0.9 Provided different factorial effects do contribute to the variability, determining the respective
precision components may make it possible to reduce the required number of participating laboratories,
since the between-laboratory variability can be expected to be less dominant. However, it is highly
recommended to have a reasonable number of participating laboratories in order to ensure a realistic
assessment of the overall method variability obtained under routine conditions of operation.
0.10 In the uniform-level design according to part 2 of this standard, there is a risk that an operator will
allow the result of a measurement on one sample to influence the result of a subsequent measurement
on another sample of the same material, causing the estimates of the repeatability and reproducibility
standard deviations to be biased. When this risk is considered to be serious, the split-level design
described in this document may be preferred as it reduces this risk. Care should be taken that the
two materials used at a particular level of the experiment are sufficiently similar to ensure that the
same precision measures can be expected (in other words: the question arises whether the precision
component associated with a particular factor remains unchanged across a range of similar matrices).
0.11 The experimental design presented in ISO 5725-2 requires the preparation of a number of
identical samples of the material for use in the experiment. With heterogeneous materials this may not
be possible, so that the use of the basic method then gives estimates of the reproducibility standard
deviation that are inflated by the variation between the samples. The design for a heterogeneous
material given in this document yields information about the variability between samples which is not
obtainable from the basic method; it may be used to calculate an estimate of reproducibility from which
the between-sample variation has been removed.
vii
INTERNATIONAL STANDARD ISO 5725-3:2023(E)
Accuracy (trueness and precision) of measurement
methods and results —
Part 3:
Intermediate precision and alternative designs for
collaborative studies
1 Scope
This document provides
a) a discussion of alternative experimental designs for the determination of trueness and precision
measures including reproducibility, repeatability and selected measures of intermediate precision
of a standard measurement method, including a review of the circumstances in which their use
is necessary or beneficial, and guidance as to the interpretation and application of the resulting
estimates, and
b) worked examples including specific designs and computations.
Each of the alternative designs discussed in this document is intended to address one (or several) of the
following issues:
a) a discussion of the implications of the definitions of intermediate precision measures;
b) a guidance on the interpretation and application of the estimates of intermediate precision
measures in practical situations;
c) determining reproducibility, repeatability and selected measures of intermediate precision;
1)
d) improved determination of reproducibility and other measures of precision;
e) improving the estimate of the sample mean;
f) determining the range of in-house repeatability standard deviations;
g) determining other precision components such as operator variability;
h) determining the level of reliability of precision estimates;
i) reducing the minimum number of participating laboratories by optimizing the reliability of
precision estimates;
j) avoiding distorted estimations of repeatability (split-level designs);
k) avoiding distorted estimations of reproducibility (taking the heterogeneity of the material into
consideration).
Often, the performance of the method whose precision is being evaluated in a collaborative study will
have previously been assessed in a single-laboratory validation study conducted by the laboratory
which developed it. Relevant factors for the determination of intermediary precision will have been
identified in this prior single-laboratory study.
1) Allowing a reduction in the number of laboratories.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content
constitutes requirements of this document. For dated references, only the edition cited applies. For
undated references, the latest edition of the referenced document (including any amendments) applies.
ISO 3534-1, Statistics — Vocabulary and symbols — Part 1: General statistical terms and terms used in
probability
ISO 3534-2, Statistics — Vocabulary and symbols — Part 2: Applied statistics
ISO 5725-1, Accuracy (trueness and precision) of measurement methods and results — Part 1: General
principles and definitions
ISO Guide 33, Reference materials — Good practice in using reference materials
ISO Guide 35, Reference materials — Guidance for characterization and assessment of homogeneity and
stability
3 Terms and definitions
For the purposes of this document, the terms and definitions given in ISO 3534-1, ISO 3534-2 and
ISO 5725-1 and the following apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at https:// www .electropedia .org/
3.1
block
group of settings (3.7) conducted in parallel or within a short time interval, and with the same samples
EXAMPLE Two settings:
Operator 1 + Calibration 1 + Equipment 1 + Batch 1
and
Operator 1 + Calibration 2 + Equipment 2 + Batch 1
Note 1 to entry: This definition is more specific than the general definition given in ISO 3534-3:2013, 3.1.25,
where block is defined as a collection of experimental units.
3.2
factor
feature under examination as a potential source of variation
EXAMPLE Operator, calibration, equipment, day, reagent batch, storage temperature, shaker orbit, shaker
frequency.
Note 1 to entry: Strictly speaking, the factor laboratory is a factor just like any other. However, since the ISO 5725
standard focuses on method validation by means of interlaboratory studies, the factor laboratory can be
considered to have a somewhat privileged role. The following characteristics distinguish it from other factors:
— The factor laboratory is indispensable: For each measurement, the name of the particular laboratory where
it was performed will always be provided in a collaborative study.
— The factor laboratory will almost always have more levels than other factors.
It should also be noted that categories such as measurand, sample/matrix and level may also be
considered to be factors. However, in collaborative studies, they are often not taken into account
as such in the factorial design. The reason is that, for these factors, one is interested in a separate
statistical analysis for each separate factor level. In other words, one is interested in obtaining separate
precision measures for each particular measurand or concentration level, not across measurands or
concentration levels. However, in cases where it is required to quantify precision across, say, matrices,
then the factor sample/matrix should also be included in the design. Accordingly, in this document,
designs are discussed to be applied for a particular measurand or concentration level by different
laboratories all applying the same measurement procedure.
[SOURCE: ISO 3534-3:2013, 3.1.5, modified — Note 1 to entry was modified and Note 2 to entry was
deleted.]
3.3
factor level
setting (3.7), value or assignment of a factor (3.2)
EXAMPLE Operator 1, Operator 2
Note 1 to entry: In many designs, the majority of factors will be varied across two levels.
3.4
fully-nested design
nested design, where there is a nesting hierarchy for every pair of factors (3.2)
EXAMPLE There are 2 operators in each laboratory, and each operator performs 2 calibrations, i.e., the
study includes 2 operators and 4 calibrations for each laboratory.
3.5
partially-nested design
nested design where one factor (3.2) (the factor laboratory) is ranked higher than all other factors (i.e.,
all other factors are nested within the factor laboratory), and there is at least one factor pair without a
nesting hierarchy
EXAMPLE There are 2 operators and 2 instruments in each laboratory, and each operator performs
measurements on 2 instruments, i.e., the study includes 2 operators and 2 instruments for each laboratory.
3.6
run
actual measurement carried out for a particular setting (3.7) and for a particular laboratory
EXAMPLE Operator 1 + Equipment 1 + Batch 1 + Day 1 carried out in laboratory 1
Note 1 to entry: This definition is more specific than the general definition given in ISO 3534-3 (3.1.13), where
run is defined as specific settings of every factor used on a particular experimental unit.
Note 2 to entry: “Identical” runs are called replicates, whereby “identical” means that the different time points are
close enough to each other to allow for the results to be considered as obtained under repeatability conditions.
3.7
setting
combination of factor levels (3.3), for all factors (3.2) except the factor laboratory
EXAMPLE Operator 1 + Equipment 1 + Batch 1 + Day 1.
4 Symbols
B
Component in a test result representing the deviation of a laboratory from the general
average (laboratory component of bias)
B
Component of B representing all factors that do not vary under intermediate precision
conditions – laboratory bias per se
BB,, etc.
Components of B representing factors that vary under intermediate precision conditions
() ()
e
Component representing the random error occurring in every test result, corresponding
to the analytical, repeatability, model or residual error
m
Overall mean of the measurand or test property for a particular matrix; level
ˆ
m Estimate of the overall mean
n
Number of replicate test results obtained in one laboratory at one level for one setting
p
Number of laboratories participating in the collaborative study
q
Number of levels of the test property in the collaborative study
Within-laboratory standard deviation of the residual term e
σ
w
σ
Repeatability standard deviation
r
σ
Reproducibility standard deviation
R
σ Standard deviation corresponding to factor B
0 0
σ Standard deviation corresponding to factor B
()1 ()1
σ Standard deviation corresponding to factor B
()2 ()2
Standard deviation corresponding to factor A
σ
A
σ
Standard deviation corresponding to the interaction of two factors
Interaction
Standard deviation corresponding to the interaction of the two factors A and B
σ
AB
s
Estimate of a standard deviation
se
Standard error
Variance of X
VarX()
w
Range of a set of test results
y
Test result
Mean of X
X
Absolute value of X
X
5 General requirements
In order to ensure that measurements are carried out in the same way, the measurement method shall
have been standardized. All measurements obtained in the framework of an experiment within a
specific laboratory or of a collaborative study shall be carried out according to that standard.
NOTE The terms collaborative experiment, collaborative trial and interlaboratory experiment are
used interchangeably to denote a collaborative study conducted in order to characterize and/or assess the
performance of a measurement method.
6 Intermediate measures of the precision of a standard measurement method
6.1 Factors and factor levels
6.1.1 Definitions and examples
In this document, the term factor denotes an identifiable and quantifiable source of variability such
as time, calibration, operator or equipment (see 3.2). In order to investigate a factor’s contribution to
variability, it is necessary to conduct measurements under different conditions or states. For instance,
measurements shall be carried out with different pieces of equipment, or with different operators. The
different states associated with a particular factor are called factor levels (see 3.3). Table 1 provides
typical examples of factors and their factor levels.
Table 1 — Examples of factors
Description/example of the
Factor Comments
different factor levels
Laboratory The different participating labo- Some of the special designs presented in this document
ratories, typically between 4 and allow reliable precision estimates with as few as 4 partic-
15 different laboratories. ipating laboratories.
Point in time Two different time points (e.g. Differences between “measurements made at different
different days, different weeks, times”, i.e. separated by a relatively long time interval (as
etc.) compared with the repeatability interval) will reflect effects
which correspond to uncontrolled changes in environmental
conditions as well as other “controlled” sources of variability
such as the use of different reagent batches, etc.
Calibration Before and after instrument is Calibration does not refer here to any calibration required
sent to the manufacturer for a as an integral part of obtaining a test result by the measure-
recalibration ment method. It refers to the calibration process that takes
place at regular intervals between groups of measurements
within a laboratory.
Operator The different technicians working In some circumstances, the operator may be, in fact, a team
in the laboratory of operators, each of whom performs some specific part of
the procedure. In such a case, the team should be regarded
as the operator, and any change in membership or in the
allotment of duties within the team should be regarded as
constituting a different operator.
Equipment Two different pieces of equipment Equipment is often a set of equipment, and any change in any
significant component should be regarded as constituting
different equipment. As to what constitutes a significant
component, common sense must prevail (e.g. different
burettes/pipettes, thermometers, pH meters, centrifuges,
shaker orbits or frequencies).
Consumables (buffer Different batches or producers A change of a batch of a reagent should be considered a sig-
solutions, reagents, nificant component. It can lead to different equipment or to
calibrators, cartridg- a recalibration if such a change is followed by calibration.
es)
NOTE 1 In practice, it may not be possible to consider factors in isolation from one another; this is due to a characteristic
of experimental designs called confounding. In theory, it should always be possible to disentangle the effects of different
factors by additional testing. For instance, if Operator 1 always carried out tests with Equipment 1 (e.g. HPLC system 1) and
Operator 2 with Equipment 2, then it would be possible to tell the effects of the two factors Operator and Equipment apart
by adding further runs for Operator 1 with Equipment 2 and for Operator 2 with Equipment 1.
NOTE 2 Further effects called interaction effects are not explicitly considered here. However, some interaction effects are
implicitly taken into consideration. For instance, the effect of skill or fatigue of an operator may be considered to be the
interaction of operator and time. Similarly, the performance of a piece of equipment may be different at the time it is first
turned on and after many hours of use: this is an example of interaction between equipment and time.
NOTE 3 In ISO 5725-2, the factor laboratory is implicitly included in the analysis.
6.1.2 Selection of factors of interest
In the standard for a measurement method, the repeatability and reproducibility standard deviations
should always be specified, but it is not necessary (or even feasible) to state all possible intermediate
precision measures. The selection of relevant factors is informed by experience and an understanding
of the relevant physical, chemical or microbiological processes.
Practical considerations in most laboratories, such as the desired precision of the final quoted result
and the cost of performing the measurements, will govern the number and choice of factors taken into
consideration in the standardization of the measurement method.
Finally, the choice of factors to include in the design should reflect concerns with uncontrollable
variations between the laboratories.
It will often be sufficient to specify only one suitable intermediate precision measure, together with
a detailed stipulation of the specific measurement conditions associated with it. The factors should
be carefully defined; in particular, for the intermediate precision associated with the factor Time, a
practical mean time interval between successive measurements should be specified.
It is assumed that, in the case of a standardized measurement method, the bias inherent in the method
itself will have been corrected by technical means. For this reason, this document only addresses the
bias arising in connection with different measurement conditions.
6.1.3 Random and fixed effects
This subclause provides a discussion of the question why, in this document, factors are modelled as
random rather than as fixed effects.
The term fixed effect is used to describe a contribution to the deviation from the overall mean or true
value whose direction and magnitude is predictable and can thus be determined. Say, for example, that
measurements always lie below the true value with equipment 1 or reagent supplier 1 and above the
true value with equipment 2 or reagent supplier 2. Then it would be appropriate to model the factor
Equipment or Reagent supplier as a fixed effect.
On the other hand, the term random effect is used to describe a contribution to the deviation from the
overall mean or true value whose direction varies – and thus cannot be determined. In such cases, the
only quantity of interest is the magnitude of the contribution (independently of its direction) often
described in terms of a standard deviation.
NOTE A factor is modelled as a fixed effect if the specific factor levels included in the experiment are of
interest in and of themselves. On the other hand, if the aim is to characterize the variability associated with
the underlying population from which the factor levels were selected, the factor is modelled as a random effect.
In this document, it is usually the variability of the underlying population which is of interest, rather than the
individual factor levels included in the experiment – this is the rationale for modelling factors as random.
The rationale for modelling factors as random rather than as fixed effects is now illustrated on the
basis of several examples.
Table 2 — Rationale for modelling factors as random rather than as fixed effects
Factor Discussion
Operator Effects due to differences between operators include personal habits in operating measurement
methods, e.g. in reading graduations on scales, etc. Thus, even though there is a bias in the test
results obtained by an individual operator, this bias is not always constant. The magnitude of
such a bias should be reduced by use of a clear operation manual and training. Under such cir-
cumstances, the effect of changing operators can be considered to be of a random nature.
Equipment Effects due to different equipment include the effects due to different places of installation,
particularly in fluctuations of the indicator, etc. Systematic differences should be corrected by
calibration and such a procedure should be included in the standard method (e.g. a change in the
batch of a reagent). An accepted reference value is needed for this, for which ISO Guide 33 and ISO
Guide 35 shall be consulted. Remaining equipment effects are considered random.
Time Effects due to time may be caused by environmental differences, such as changes in room
temperature, humidity, etc. Standardization of environmental conditions should be attempted
to minimize these effects. Clearly, achieving an ideal degree of standardization would make it
appropriate to model the factor Time as a fixed effect. However, it is more realistic to model this
factor in terms of random effects.
6.1.4 Statistical model
6.1.4.1 Basic model
For the reader’s convenience and ease of reference, the basic model described in ISO 5725-1 is
reproduced here. For estimating the accuracy (trueness and precision) of a measurement method, it is
useful to assume that every test result y is the sum of three components given by Formula (1):
ym=+Be+ (1)
where, for the particular material tested
m is the overall mean (expectation);
B is the laboratory component of bias under repeatability conditions;
e is the random error occurring in every measurement under repeatability conditions.
For a general discussion of these components, the reader is referred to ISO 5725-1, 5.1.
NOTE 1 Depending on the context, m denotes either the theoretical (unknown) overall mean or its estimate.
ˆ
It is possible to use different symbols (e.g. m versus m ) in order to distinguish between a theoretical quantity
and its estimate. However, this type of notational nuance seems unnecessary in this document. The same holds
for the other symbols used to denote quantities which are to be estimated – though the symbol σ will be
reserved for theoretical standard deviations and s for their estimates. The reader is referred to ISO 5725-1 for a
discussion of this issue.
NOTE 2 In ISO 5725-4, the bias is further decomposed into two parts: method bias and l
...
INTERNATIONAL ISO
STANDARD 5725-3
Second edition
2023-06
Accuracy (trueness and precision) of
measurement methods and results —
Part 3:
Intermediate precision and alternative
designs for collaborative studies
Exactitude (justesse et fidélité) des résultats et méthodes de mesure —
Partie 3: Fidélité intermédiaire et plans alternatifs pour les études
collaboratives
Reference number
© ISO 2023
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii
Contents Page
Foreword .v
Introduction . vi
1 Scope . 1
2 Normative references . 2
3 Terms and definitions . 2
4 Symbols . 3
5 General requirements . 4
6 Intermediate measures of the precision of a standard measurement method .5
6.1 Factors and factor levels . 5
6.1.1 Definitions and examples . 5
6.1.2 Selection of factors of interest . 6
6.1.3 Random and fixed effects . 6
6.1.4 Statistical model . 7
6.2 Within-laboratory study and analysis of intermediate precision measures . 9
6.2.1 Simplest approach . 9
6.2.2 Alternative method . 10
6.2.3 Effect of the measurement conditions on the final quoted result . 10
7 Nested design .11
7.1 Balanced fully-nested design . 11
7.2 Staggered-nested design . 12
7.3 Balanced partially-nested design . 13
7.4 Orthogonal array design . 14
8 Design for heterogeneous material .16
8.1 Applications of the design for a heterogeneous material . 16
8.2 Layout of the design for a heterogeneous material . 17
8.3 Statistical analysis . 17
9 Split-level design .17
9.1 Applications of the split-level design . 17
9.2 Layout of the split-level design . 19
9.3 Statistical analysis . 19
10 Design across levels .19
10.1 Applications of the design across levels . 19
10.2 Layout of the design across levels . 20
10.3 Statistical analysis . 20
11 Reliability of interlaboratory parameters .20
11.1 Reliability of precision estimates . 20
11.2 Reliability of estimates of the overall mean . 21
11.2.1 General . 21
11.2.2 Balanced fully-nested design (2 factors) . 21
11.2.3 Staggered nested design (2 factors) . 21
11.2.4 Balanced partially-nested design . 21
11.2.5 Orthogonal array design . 21
11.2.6 Split-level design . 22
Annex A (informative) Fully- and partially-nested designs .23
Annex B (informative) Analysis of variance for balanced fully-nested design .25
Annex C (informative) Analysis of variance for staggered design .30
Annex D (informative) Analysis of variance for the balanced partially-nested design (three
factors) .38
iii
Annex E (informative) Statistical model for an experiment with heterogeneous material .41
Annex F (informative) Analysis of variance for split-level design .42
Annex G (informative) Example for split-level design . 44
Annex H (informative) Design across levels .47
Annex I (informative) Restricted maximum likelihood (REML) .48
Annex J (informative) Examples of the statistical analysis of intermediate precision
experiment .49
Annex K (informative) Example for an analysis across levels .55
Bibliography .57
iv
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO document should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
ISO draws attention to the possibility that the implementation of this document may involve the use
of (a) patent(s). ISO takes no position concerning the evidence, validity or applicability of any claimed
patent rights in respect thereof. As of the date of publication of this document, ISO had not received
notice of (a) patent(s) which may be required to implement this document. However, implementers are
cautioned that this may not represent the latest information, which may be obtained from the patent
database available at www.iso.org/patents. ISO shall not be held responsible for identifying any or all
such patent rights.
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see
www.iso.org/iso/foreword.html.
This document was prepared by Technical Committee ISO/TC 69, Applications of statistical methods,
Subcommittee SC 6, Measurement methods and results.
This second edition cancels and replaces the first edition (ISO 5725-3:1994), which has been technically
revised. It also incorporates the Technical Corrigendum ISO 5725-3:1994/Cor.1:2001.
The main changes are as follows:
— Several additional experimental designs have been added to this version compared to the previous
version, some of them from ISO 5725-5. These are orthogonal array designs, split level designs,
designs for heterogeneous sample material as well as designs across levels.
— Furthermore, the standard was supplemented by considerations on the selection of factors and
modelling of the factorial effects, as well as by a section in which the reliability of the various
interlaboratory test parameters (mean and precision parameters) are considered.
A list of all parts in the ISO 5725 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.
v
Introduction
0.1 ISO 5725 uses two terms “trueness” and "precision” to describe the accuracy of a measurement
method. “Trueness” refers to the degree of agreement between the average value of a large number
of test results and the true or accepted reference value. “Precision” refers to the degree of agreement
between test results.
0.2 General consideration of these quantities is given in ISO 5725-1 and is not repeated here. It is
stressed that ISO 5725-1 provides underlying definitions and general principles should be read in
conjunction with all other parts of ISO 5725.
0.3 Many different factors (apart from test material heterogeneity) may contribute to the variability of
results from a measurement method, including:
a) the laboratory;
b) the operator;
c) the equipment used;
d) the calibration of the equipment;
e) the batch of a reagent;
f) the time elapsed between measurements;
g) environment (temperature, humidity, air pollution, etc.);
h) other factors.
0.4 Two conditions of precision, termed repeatability and reproducibility conditions, have been found
necessary and, for many practical cases, useful for describing the variability of a measurement method.
Under repeatability conditions, none of the factors a) to h) in 0.3 are considered to vary, while under
reproducibility conditions, all of the factors are considered to vary and contribute to the variability of
the test results. Thus, repeatability and reproducibility conditions are the two extremes of precision,
the first describing the minimum and the second the maximum variability in results. Intermediate
conditions between these two extreme conditions of precision are also conceivable, when one or more
of the factors listed in b) to g) are allowed to vary.
To illustrate the need for including a consideration of intermediate conditions in method validation,
consider the operation of a present-day laboratory connected with a production plant involving, for
example, a three-shift working system where measurements are made by different operators on
different equipment. Operators and equipment are then some of the factors that contribute to the
variability in the test results.
The standard deviation of test results obtained under repeatability conditions is generally less than
that obtained under intermediate precision conditions. Generally, in chemical analysis, the standard
deviation under intermediate precision conditions may be two or three times larger than that under
repeatability conditions. It should not, of course, exceed the reproducibility standard deviation.
As an example, in the determination of copper in copper ore, a collaborative study among 35 laboratories
revealed that the standard deviation under intermediate precision conditions (different times) was
1,5 times larger than that under repeatability conditions, both for the electrolytic gravimetry and
Na S 0 titration methods.
2 2 3
0.5 This document focuses on intermediate precision and alternative designs for collaborative studies
of a measurement method. Apart from the determination of intermediate precision measures, the
aims of these alternative designs include reducing the number of required measurements, increasing
the reliability of the estimates for precision and overall mean and taking into account test material
heterogeneity.
vi
Indeed, a t -factor fully-nested experiment with two levels per factor (inside each laboratory, there are
t−1
t−1 factors) and two replicates per setting requires 22 · test results from each laboratory, which
can be an excessive requirement on the laboratories. For this reason, in the previous version of
ISO 5725-3, the staggered nested design is also discussed. While the estimation of the precision
parameters is more complex and subject to greater uncertainty in a staggered nested design, the
workload is reduced. This document offers alternative strategies to reduce the workload without
compromising the reliability of the precision estimates.
As far as the special designs for sample heterogeneity are concerned, they were discussed in the
previous version of ISO 5725-5. However, it is convenient to have one part of this standard dedicated to
the question of the design of experiments.
0.6 The repeatability precision as determined in accordance with ISO 5725-2 is computed as a mean
across participating laboratories. Whether it can be used for quality control purposes depends on
whether the repeatability standard deviation can be considered to remain constant across laboratories.
For this reason, it is important to obtain information on how the repeatability standard deviation varies
within and between the laboratories under different conditions.
0.7 In many collaborative studies, the between-laboratory variability is large in comparison to the
repeatability, and it would be useful to a) decompose it into several different precision components, b)
reduce, if possible, some sources of variability which are due to the intermediate precision conditions.
This can be done by identifying factors (e.g. time, calibration, operator or equipment) which contribute
to the variability under intermediate precision conditions of measurement, by quantifying the
corresponding variability components and, wherever achievable, decreasing their contribution. In this
manner, the intermediate precision component of the overall variance is enlarged while the between-
laboratory component of the overall variance is reduced. Only random effects are considered: it is only
reasonable to model a factor as a fixed effect after a method or calibration optimization study has been
conducted. In this standard, different relationships between factors are taken into account, e.g. whether
a particular factor is subsumed under another factor or not.
0.8 Estimates for precision and overall mean are subject to random variability. Accordingly, it
is important to determine the uncertainty associated with each estimate, and to understand the
relationships between this uncertainty, the number of participants and the design. Once these
relationships are understood, it becomes possible to make much more informed decisions concerning
the number of participants and the experimental design.
0.9 Provided different factorial effects do contribute to the variability, determining the respective
precision components may make it possible to reduce the required number of participating laboratories,
since the between-laboratory variability can be expected to be less dominant. However, it is highly
recommended to have a reasonable number of participating laboratories in order to ensure a realistic
assessment of the overall method variability obtained under routine conditions of operation.
0.10 In the uniform-level design according to part 2 of this standard, there is a risk that an operator will
allow the result of a measurement on one sample to influence the result of a subsequent measurement
on another sample of the same material, causing the estimates of the repeatability and reproducibility
standard deviations to be biased. When this risk is considered to be serious, the split-level design
described in this document may be preferred as it reduces this risk. Care should be taken that the
two materials used at a particular level of the experiment are sufficiently similar to ensure that the
same precision measures can be expected (in other words: the question arises whether the precision
component associated with a particular factor remains unchanged across a range of similar matrices).
0.11 The experimental design presented in ISO 5725-2 requires the preparation of a number of
identical samples of the material for use in the experiment. With heterogeneous materials this may not
be possible, so that the use of the basic method then gives estimates of the reproducibility standard
deviation that are inflated by the variation between the samples. The design for a heterogeneous
material given in this document yields information about the variability between samples which is not
obtainable from the basic method; it may be used to calculate an estimate of reproducibility from which
the between-sample variation has been removed.
vii
INTERNATIONAL STANDARD ISO 5725-3:2023(E)
Accuracy (trueness and precision) of measurement
methods and results —
Part 3:
Intermediate precision and alternative designs for
collaborative studies
1 Scope
This document provides
a) a discussion of alternative experimental designs for the determination of trueness and precision
measures including reproducibility, repeatability and selected measures of intermediate precision
of a standard measurement method, including a review of the circumstances in which their use
is necessary or beneficial, and guidance as to the interpretation and application of the resulting
estimates, and
b) worked examples including specific designs and computations.
Each of the alternative designs discussed in this document is intended to address one (or several) of the
following issues:
a) a discussion of the implications of the definitions of intermediate precision measures;
b) a guidance on the interpretation and application of the estimates of intermediate precision
measures in practical situations;
c) determining reproducibility, repeatability and selected measures of intermediate precision;
1)
d) improved determination of reproducibility and other measures of precision;
e) improving the estimate of the sample mean;
f) determining the range of in-house repeatability standard deviations;
g) determining other precision components such as operator variability;
h) determining the level of reliability of precision estimates;
i) reducing the minimum number of participating laboratories by optimizing the reliability of
precision estimates;
j) avoiding distorted estimations of repeatability (split-level designs);
k) avoiding distorted estimations of reproducibility (taking the heterogeneity of the material into
consideration).
Often, the performance of the method whose precision is being evaluated in a collaborative study will
have previously been assessed in a single-laboratory validation study conducted by the laboratory
which developed it. Relevant factors for the determination of intermediary precision will have been
identified in this prior single-laboratory study.
1) Allowing a reduction in the number of laboratories.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content
constitutes requirements of this document. For dated references, only the edition cited applies. For
undated references, the latest edition of the referenced document (including any amendments) applies.
ISO 3534-1, Statistics — Vocabulary and symbols — Part 1: General statistical terms and terms used in
probability
ISO 3534-2, Statistics — Vocabulary and symbols — Part 2: Applied statistics
ISO 5725-1, Accuracy (trueness and precision) of measurement methods and results — Part 1: General
principles and definitions
ISO Guide 33, Reference materials — Good practice in using reference materials
ISO Guide 35, Reference materials — Guidance for characterization and assessment of homogeneity and
stability
3 Terms and definitions
For the purposes of this document, the terms and definitions given in ISO 3534-1, ISO 3534-2 and
ISO 5725-1 and the following apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at https:// www .electropedia .org/
3.1
block
group of settings (3.7) conducted in parallel or within a short time interval, and with the same samples
EXAMPLE Two settings:
Operator 1 + Calibration 1 + Equipment 1 + Batch 1
and
Operator 1 + Calibration 2 + Equipment 2 + Batch 1
Note 1 to entry: This definition is more specific than the general definition given in ISO 3534-3:2013, 3.1.25,
where block is defined as a collection of experimental units.
3.2
factor
feature under examination as a potential source of variation
EXAMPLE Operator, calibration, equipment, day, reagent batch, storage temperature, shaker orbit, shaker
frequency.
Note 1 to entry: Strictly speaking, the factor laboratory is a factor just like any other. However, since the ISO 5725
standard focuses on method validation by means of interlaboratory studies, the factor laboratory can be
considered to have a somewhat privileged role. The following characteristics distinguish it from other factors:
— The factor laboratory is indispensable: For each measurement, the name of the particular laboratory where
it was performed will always be provided in a collaborative study.
— The factor laboratory will almost always have more levels than other factors.
It should also be noted that categories such as measurand, sample/matrix and level may also be
considered to be factors. However, in collaborative studies, they are often not taken into account
as such in the factorial design. The reason is that, for these factors, one is interested in a separate
statistical analysis for each separate factor level. In other words, one is interested in obtaining separate
precision measures for each particular measurand or concentration level, not across measurands or
concentration levels. However, in cases where it is required to quantify precision across, say, matrices,
then the factor sample/matrix should also be included in the design. Accordingly, in this document,
designs are discussed to be applied for a particular measurand or concentration level by different
laboratories all applying the same measurement procedure.
[SOURCE: ISO 3534-3:2013, 3.1.5, modified — Note 1 to entry was modified and Note 2 to entry was
deleted.]
3.3
factor level
setting (3.7), value or assignment of a factor (3.2)
EXAMPLE Operator 1, Operator 2
Note 1 to entry: In many designs, the majority of factors will be varied across two levels.
3.4
fully-nested design
nested design, where there is a nesting hierarchy for every pair of factors (3.2)
EXAMPLE There are 2 operators in each laboratory, and each operator performs 2 calibrations, i.e., the
study includes 2 operators and 4 calibrations for each laboratory.
3.5
partially-nested design
nested design where one factor (3.2) (the factor laboratory) is ranked higher than all other factors (i.e.,
all other factors are nested within the factor laboratory), and there is at least one factor pair without a
nesting hierarchy
EXAMPLE There are 2 operators and 2 instruments in each laboratory, and each operator performs
measurements on 2 instruments, i.e., the study includes 2 operators and 2 instruments for each laboratory.
3.6
run
actual measurement carried out for a particular setting (3.7) and for a particular laboratory
EXAMPLE Operator 1 + Equipment 1 + Batch 1 + Day 1 carried out in laboratory 1
Note 1 to entry: This definition is more specific than the general definition given in ISO 3534-3 (3.1.13), where
run is defined as specific settings of every factor used on a particular experimental unit.
Note 2 to entry: “Identical” runs are called replicates, whereby “identical” means that the different time points are
close enough to each other to allow for the results to be considered as obtained under repeatability conditions.
3.7
setting
combination of factor levels (3.3), for all factors (3.2) except the factor laboratory
EXAMPLE Operator 1 + Equipment 1 + Batch 1 + Day 1.
4 Symbols
B
Component in a test result representing the deviation of a laboratory from the general
average (laboratory component of bias)
B
Component of B representing all factors that do not vary under intermediate precision
conditions – laboratory bias per se
BB,, etc.
Components of B representing factors that vary under intermediate precision conditions
() ()
e
Component representing the random error occurring in every test result, corresponding
to the analytical, repeatability, model or residual error
m
Overall mean of the measurand or test property for a particular matrix; level
ˆ
m Estimate of the overall mean
n
Number of replicate test results obtained in one laboratory at one level for one setting
p
Number of laboratories participating in the collaborative study
q
Number of levels of the test property in the collaborative study
Within-laboratory standard deviation of the residual term e
σ
w
σ
Repeatability standard deviation
r
σ
Reproducibility standard deviation
R
σ Standard deviation corresponding to factor B
0 0
σ Standard deviation corresponding to factor B
()1 ()1
σ Standard deviation corresponding to factor B
()2 ()2
Standard deviation corresponding to factor A
σ
A
σ
Standard deviation corresponding to the interaction of two factors
Interaction
Standard deviation corresponding to the interaction of the two factors A and B
σ
AB
s
Estimate of a standard deviation
se
Standard error
Variance of X
VarX()
w
Range of a set of test results
y
Test result
Mean of X
X
Absolute value of X
X
5 General requirements
In order to ensure that measurements are carried out in the same way, the measurement method shall
have been standardized. All measurements obtained in the framework of an experiment within a
specific laboratory or of a collaborative study shall be carried out according to that standard.
NOTE The terms collaborative experiment, collaborative trial and interlaboratory experiment are
used interchangeably to denote a collaborative study conducted in order to characterize and/or assess the
performance of a measurement method.
6 Intermediate measures of the precision of a standard measurement method
6.1 Factors and factor levels
6.1.1 Definitions and examples
In this document, the term factor denotes an identifiable and quantifiable source of variability such
as time, calibration, operator or equipment (see 3.2). In order to investigate a factor’s contribution to
variability, it is necessary to conduct measurements under different conditions or states. For instance,
measurements shall be carried out with different pieces of equipment, or with different operators. The
different states associated with a particular factor are called factor levels (see 3.3). Table 1 provides
typical examples of factors and their factor levels.
Table 1 — Examples of factors
Description/example of the
Factor Comments
different factor levels
Laboratory The different participating labo- Some of the special designs presented in this document
ratories, typically between 4 and allow reliable precision estimates with as few as 4 partic-
15 different laboratories. ipating laboratories.
Point in time Two different time points (e.g. Differences between “measurements made at different
different days, different weeks, times”, i.e. separated by a relatively long time interval (as
etc.) compared with the repeatability interval) will reflect effects
which correspond to uncontrolled changes in environmental
conditions as well as other “controlled” sources of variability
such as the use of different reagent batches, etc.
Calibration Before and after instrument is Calibration does not refer here to any calibration required
sent to the manufacturer for a as an integral part of obtaining a test result by the measure-
recalibration ment method. It refers to the calibration process that takes
place at regular intervals between groups of measurements
within a laboratory.
Operator The different technicians working In some circumstances, the operator may be, in fact, a team
in the laboratory of operators, each of whom performs some specific part of
the procedure. In such a case, the team should be regarded
as the operator, and any change in membership or in the
allotment of duties within the team should be regarded as
constituting a different operator.
Equipment Two different pieces of equipment Equipment is often a set of equipment, and any change in any
significant component should be regarded as constituting
different equipment. As to what constitutes a significant
component, common sense must prevail (e.g. different
burettes/pipettes, thermometers, pH meters, centrifuges,
shaker orbits or frequencies).
Consumables (buffer Different batches or producers A change of a batch of a reagent should be considered a sig-
solutions, reagents, nificant component. It can lead to different equipment or to
calibrators, cartridg- a recalibration if such a change is followed by calibration.
es)
NOTE 1 In practice, it may not be possible to consider factors in isolation from one another; this is due to a characteristic
of experimental designs called confounding. In theory, it should always be possible to disentangle the effects of different
factors by additional testing. For instance, if Operator 1 always carried out tests with Equipment 1 (e.g. HPLC system 1) and
Operator 2 with Equipment 2, then it would be possible to tell the effects of the two factors Operator and Equipment apart
by adding further runs for Operator 1 with Equipment 2 and for Operator 2 with Equipment 1.
NOTE 2 Further effects called interaction effects are not explicitly considered here. However, some interaction effects are
implicitly taken into consideration. For instance, the effect of skill or fatigue of an operator may be considered to be the
interaction of operator and time. Similarly, the performance of a piece of equipment may be different at the time it is first
turned on and after many hours of use: this is an example of interaction between equipment and time.
NOTE 3 In ISO 5725-2, the factor laboratory is implicitly included in the analysis.
6.1.2 Selection of factors of interest
In the standard for a measurement method, the repeatability and reproducibility standard deviations
should always be specified, but it is not necessary (or even feasible) to state all possible intermediate
precision measures. The selection of relevant factors is informed by experience and an understanding
of the relevant physical, chemical or microbiological processes.
Practical considerations in most laboratories, such as the desired precision of the final quoted result
and the cost of performing the measurements, will govern the number and choice of factors taken into
consideration in the standardization of the measurement method.
Finally, the choice of factors to include in the design should reflect concerns with uncontrollable
variations between the laboratories.
It will often be sufficient to specify only one suitable intermediate precision measure, together with
a detailed stipulation of the specific measurement conditions associated with it. The factors should
be carefully defined; in particular, for the intermediate precision associated with the factor Time, a
practical mean time interval between successive measurements should be specified.
It is assumed that, in the case of a standardized measurement method, the bias inherent in the method
itself will have been corrected by technical means. For this reason, this document only addresses the
bias arising in connection with different measurement conditions.
6.1.3 Random and fixed effects
This subclause provides a discussion of the question why, in this document, factors are modelled as
random rather than as fixed effects.
The term fixed effect is used to describe a contribution to the deviation from the overall mean or true
value whose direction and magnitude is predictable and can thus be determined. Say, for example, that
measurements always lie below the true value with equipment 1 or reagent supplier 1 and above the
true value with equipment 2 or reagent supplier 2. Then it would be appropriate to model the factor
Equipment or Reagent supplier as a fixed effect.
On the other hand, the term random effect is used to describe a contribution to the deviation from the
overall mean or true value whose direction varies – and thus cannot be determined. In such cases, the
only quantity of interest is the magnitude of the contribution (independently of its direction) often
described in terms of a standard deviation.
NOTE A factor is modelled as a fixed effect if the specific factor levels included in the experiment are of
interest in and of themselves. On the other hand, if the aim is to characterize the variability associated with
the underlying population from which the factor levels were selected, the factor is modelled as a random effect.
In this document, it is usually the variability of the underlying population which is of interest, rather than the
individual factor levels included in the experiment – this is the rationale for modelling factors as random.
The rationale for modelling factors as random rather than as fixed effects is now illustrated on the
basis of several examples.
Table 2 — Rationale for modelling factors as random rather than as fixed effects
Factor Discussion
Operator Effects due to differences between operators include personal habits in operating measurement
methods, e.g. in reading graduations on scales, etc. Thus, even though there is a bias in the test
results obtained by an individual operator, this bias is not always constant. The magnitude of
such a bias should be reduced by use of a clear operation manual and training. Under such cir-
cumstances, the effect of changing operators can be considered to be of a random nature.
Equipment Effects due to different equipment include the effects due to different places of installation,
particularly in fluctuations of the indicator, etc. Systematic differences should be corrected by
calibration and such a procedure should be included in the standard method (e.g. a change in the
batch of a reagent). An accepted reference value is needed for this, for which ISO Guide 33 and ISO
Guide 35 shall be consulted. Remaining equipment effects are considered random.
Time Effects due to time may be caused by environmental differences, such as changes in room
temperature, humidity, etc. Standardization of environmental conditions should be attempted
to minimize these effects. Clearly, achieving an ideal degree of standardization would make it
appropriate to model the factor Time as a fixed effect. However, it is more realistic to model this
factor in terms of random effects.
6.1.4 Statistical model
6.1.4.1 Basic model
For the reader’s convenience and ease of reference, the basic model described in ISO 5725-1 is
reproduced here. For estimating the accuracy (trueness and precision) of a measurement method, it is
useful to assume that every test result y is the sum of three components given by Formula (1):
ym=+Be+ (1)
where, for the particular material tested
m is the overall mean (expectation);
B is the laboratory component of bias under repeatability conditions;
e is the random error occurring in every measurement under repeatability conditions.
For a general discussion of these components, the reader is referred to ISO 5725-1, 5.1.
NOTE 1 Depending on the context, m denotes either the theoretical (unknown) overall mean or its estimate.
ˆ
It is possible to use different symbols (e.g. m versus m ) in order to distinguish between a theoretical quantity
and its estimate. However, this type of notational nuance seems unnecessary in this document. The same holds
for the other symbols used to denote quantities which are to be estimated – though the symbol σ will be
reserved for theoretical standard deviations and s for their estimates. The reader is referred to ISO 5725-1 for a
discussion of this issue.
NOTE 2 In ISO 5725-4, the bias is further decomposed into two parts: method bias and laboratory bias. While
laboratory bias is modelled as a random effect, method bias is modelled as a fixed effect.
6.1.4.2 Partitioning the laboratory bias term
The model described in Formula (1) is appropriate for the situation described in ISO 5725-2, where,
within each laboratory, results are obtained under repeatability conditions (i.e. within a short period
of time, by the same operator, etc.). Under these conditions, B can be considered constant and is called
the “laboratory component of bias”. In practice, however, B arises from a combination of a number of
effects. The statistical model as given in Formula (1) can be rewritten in the form given by Formula (2):
ym=+BB++Be+…+ (2)
0 ()12()
where B is partitioned into contributions from variates
B
the residual component of the laboratory bias;
B B
effects corresponding to intermediate precision factors (such as those in Table 1).
()1, ()2 , …
6.1.4.3 Terms B , B , B , etc.
0 (1) (2)
Under repeatability conditions, these terms all remain constant and add to the bias of the test results.
Under intermediate precision conditions, B is the effect corresponding to the residual laboratory bias,
i.e. it characterizes the background component of laboratory bias which remains invariant as th
...
The SIST ISO 5725-3:2024 standard provides a comprehensive framework for assessing the accuracy, specifically trueness and precision, of measurement methods and results. The document's scope is notably expansive, covering various alternative experimental designs that enrich our understanding of the determination of trueness and precision measures. Notably, it discusses key concepts such as reproducibility, repeatability, and selected measures of intermediate precision, which are crucial in evaluating the reliability of measurement methods. One of the significant strengths of this standard is its thorough examination of scenarios where the indicated measures and alternative designs are most beneficial. By providing worked examples that detail specific designs and computations, the standard enhances its practical applicability for professionals in the field. It sets forth clear guidance on the interpretation and application of intermediate precision measures, ensuring users grasp the essential implications of these definitions as they pertain to practical situations. The documentation effectively addresses various critical issues in measurement methodologies. For instance, it offers insights on improving reproducibility, refining determinations of precision, and facilitating a better estimate of the sample mean. Additionally, it emphasizes the importance of accounting for operator variability and the reliability of precision estimates, which are vital for establishing and validating measurement methods in collaborative studies. The standard also incorporates strategies for optimizing collaboration by reducing the minimum number of participating laboratories while still enhancing the reliability of precision estimates. This aspect is particularly relevant in today's context, where interdisciplinary collaboration is becoming increasingly important. Moreover, by providing solutions to avoid distorted estimations of repeatability and reproducibility, the standard helps maintain the integrity of precision measures-a key concern in empirical research. The relevance of the SIST ISO 5725-3:2024 document extends beyond mere compliance; it serves as an essential guide for practitioners seeking to elevate their measurement accuracy through informed practices. By elaborating on the interconnectedness of single-laboratory validation studies and collaborative assessments, the document reinforces the foundational concepts that underpin the evaluation of intermediate precision in measurement methodologies. Overall, the SIST ISO 5725-3:2024 standard stands as a robust resource, integral to the standards of accuracy in measurements. Its detailed exploration of alternative designs and practical applications reinforces its significance, making it an invaluable tool for professionals striving for precision in their measurement techniques.
SIST ISO 5725-3:2024は、測定方法および結果の精度(真実性と精密性)に関する重要な標準であり、共同研究における中間精密性と代替設計に焦点を当てています。この文書は、標準測定方法の精度の評価を行うためのさまざまな実験設計の選択肢を提供しており、特定の状況下でこれらのアプローチが有益である理由を詳しく説明しています。また、得られた推定値の解釈と適用に関するガイダンスも含まれています。 この標準の強みは、実務上の状況で中間精密性指標の解釈および応用に関する具体的な指導を提供する点です。特に、再現性、繰り返し性、そして中間精密性の選択的指標を確実に評価するための手法が含まれており、研究者にとって非常に有益です。さらに、実際の共同研究における方法のパフォーマンスを以前の単一ラボバリデーション研究と関連づけることで、実用的な見識が加わります。 また、サンプル平均の推定の改善や、オペレーター変動性など他の精密性成分の特定もこの文書において検討されています。特に、精密性推定値の信頼性を高めることや、参加するラボの最小数を削減するための最適化手法についても論じられており、研究者が効果的にリソースを使用できるようになっています。 この標準は、共同研究における反復性(スプリットレベルデザインによる歪んだ推定の回避)や、材料の不均一性を考慮した再現性の歪んだ推定の回避といった課題に対処するための代替設計の提案を含んでおり、より信頼性のある測定結果を得るためのアプローチを示しています。このように、SIST ISO 5725-3:2024は、測定方法の精密性を正確に評価するための包括的なガイドラインを提供し、様々な実験設計の選択肢を示しているため、広範囲にわたる研究分野においてその重要性と関連性が持続的に高まっています。
SIST ISO 5725-3:2024 offers a comprehensive framework for evaluating the accuracy-specifically trueness and precision-of measurement methods through intermediate precision and innovative designs for collaborative studies. The standard's scope is meticulously outlined, addressing the necessity and advantages of alternative experimental designs that focus on trueness and the various measures of precision. One of the key strengths of this standard is its inclusive discussion regarding the definitions of intermediate precision measures, guiding users to accurately interpret and apply these estimates in practical scenarios. This guidance is essential for practitioners who rely on precision measurements in diverse fields, ensuring that users can draw meaningful conclusions from their findings. The standard also emphasizes the importance of determining reproducibility and repeatability, significantly impacting the reliability of measurement outcomes. By providing worked examples and specific designs, practitioners are equipped with practical tools to conduct their studies. Moreover, the discussion on improving the estimate of the sample mean and determining in-house repeatability standard deviations is particularly valuable, highlighting the standard's commitment to enhancing measurement accuracy. Another noteworthy aspect is its focus on operator variability and the reliability of precision estimates. By optimizing the number of participating laboratories, the standard seeks to reduce logistical barriers while maintaining high reliability in precision estimates-a significant consideration in collaborative studies. Furthermore, the standard's approach to avoiding distorted estimations of repeatability and reproducibility by accounting for material heterogeneity reflects a sophisticated understanding of the factors influencing measurement precision. Through this comprehensive perspective, SIST ISO 5725-3:2024 not only serves as a guideline for practitioners but also ensures the integrity and accuracy of collaborative efforts in measurement standardization. Overall, SIST ISO 5725-3:2024 stands out for its thorough approach to the complexities of measurement precision, making it a vital resource for anyone involved in the development and evaluation of measurement methods. Its relevance in today’s scientific and industrial environments cannot be overstated, as it promotes enhanced collaboration and increasingly reliable results in accuracy assessments.
SIST ISO 5725-3:2024の標準文書は、計測方法と結果の正確性(真実性および精度)に関する重要なガイドラインを提供しています。この標準は、特に共同研究における中間精度とそのための代替デザインについて焦点を当てており、その内容は新しい実験デザインの導入や既存の手法の適切な運用に対する重要な指針を与えます。 この標準の範囲には、再現性や繰り返し性、中間精度を測定するための選定された基準の考察が含まれており、具体的な実験デザインや計算例も提示されています。これにより、研究者や技術者は、実験設計の選択や結果の解釈に際して、どのような状況で特定の方法が必要であるかを理解することが可能となります。 SIST ISO 5725-3:2024の強みは、その詳細なフレームワークにあります。中間精度測定の定義に関する考察は、計測プロセス全体を通じての信頼性を高め、特定の環境下での再現性や精度を向上させるための実用的なガイダンスを提供します。また、操作者のばらつきやサンプル平均の推定に関する新たな測定法を示すことで、より信頼性の高い精度評価が実現します。 この標準が現在の科学研究および工業プロセスにおいて特に重要なのは、計測結果の信頼性がその後の意思決定や品質管理に直接的に影響を与えるためです。特に、参加するラボの数を最小限に抑えつつ、精度の見積もりの信頼性を高めることに焦点を当てた設計を活用することで、効率的なデータ収集が可能となります。 SIST ISO 5725-3:2024は、現代の測定精度の評価において不可欠な標準であり、計測の分野における新しいステージを示すバイブルと言えるでしょう。その内容は、具体的な応用においても広範囲にわたり利用でき、研究者やエンジニアが直面する複雑な課題を解決するための土台を提供しています。
SIST ISO 5725-3:2024 표준 문서는 측정 방법 및 결과의 정확성(정확성 및 정밀성)과 관련된 중요한 내용을 다루고 있습니다. 이 문서의 주요 범위는 대안적인 실험 설계를 통해 중간 정밀성 및 정밀성 측정의 필요성을 논의하고, 이러한 설계들이 실질적으로 필요하거나 유익한 경우에 대한 지침을 제공합니다. 이 표준의 강점 중 하나는 다양한 대안 설계의 사용이 구체적인 문제 해결에 어떻게 기여하는지에 대한 상세한 논의입니다. 특히 중간 정밀성의 정의와 그 측정 추정값의 해석 및 적용에 대한 지침이 포함되어 있어, 연구자들이 더욱 높은 신뢰성을 가진 정밀성 추정값을 얻는 데 도움을 줍니다. 또한, 샘플 평균의 개선된 결정, 실험실 내 반복성 표준 편차의 범위 결정, 그리고 운영자 가변성과 같은 정밀성 구성 요소의 평가 등 다양한 주제를 다루고 있어 실험적 연구에서 매우 유용합니다. 또한, 참여 실험실의 최소 수를 최적화하여 정밀성 추정의 신뢰성을 높이는 방법을 제시하여 실험 설계의 효율성을 극대화하고, 결과적인 왜곡된 반복성과 재현성 측정의 추정치를 피할 수 있는 방안도 제공합니다. 특히, 물질의 이질성을 고려한 재현성 측정의 개선도 포함되어 있어, 다양한 상황에서의 활용 가능성을 높이고 있습니다. SIST ISO 5725-3:2024 표준은 협력 연구에서 평가되는 방법의 성능이 단일 실험실 검증 연구에서 어떻게 사전에 평가되었는지를 반영하고 있어, 중간 정밀성 결정에 있어 핵심적인 요소를 잘 준비하고 있습니다. 이와 같은 조치는 연구자들이 실험을 설계하는 데 있어 더욱 유용하고 실질적인 지침을 제공합니다. 이 표준은 현재測점 정밀성과 결과의 신뢰성을 극대화하고자 하는 모든 연구자, 품질 관리 전문가, 계측 기기 개발자들에게 필수적이며, 실험법의 표준화 및 보편화를 위한 중요한 기준을 제공합니다. 이러한 이유로 SIST ISO 5725-3:2024는 측정 정확성과 정밀성 개선을 위한 매우 중요한 기준으로 평가됩니다.
SIST ISO 5725-3:2024 표준 문서는 측정 방법과 결과의 정확성(진실성과 정밀성)에 대한 중간 정밀도와 협력 연구를 위한 대안 설계에 대한 포괄적인 논의를 제공합니다. 이 문서는 다음과 같은 주요 항목들을 포함하여, 정밀성 측정을 위한 대안 실험 설계의 필요성과 이점을 명확하게 설명하고 있습니다. 첫째, 중간 정밀도 측정의 정의가 미치는 의미에 대한 상세한 논의가 포함되어 있습니다. 이는 다양한 실험적 맥락에서 정의된 중간 정밀도의 정확한 해석을 가능하게 하여 실제 적용에 유용하게 합니다. 둘째, 이 문서는 재현성, 반복성 및 중간 정밀도의 특정 측정들이 어떻게 결정되는지에 대한 길잡이를 제시합니다. 이러한 가이드는 연구자들이 실험 결과의 신뢰성을 판단하는 데 중요한 역할을 합니다. 셋째, 본 표준은 반복성 및 다른 정밀도 측정치를 개선하기 위한 방법론을 제공합니다. 이는 샘플 평균의 정확한 결정과 같은 측면을 포함하여, 연구의 질을 높이는 데 기여합니다. 넷째, 내실에서의 반복 표준 편차 범위를 결정하고, 운영자 변동성과 같은 다른 정밀도 구성 요소를 평가하는 과정에서 유용한 정보를 제시합니다. 다섯째, 측정 신뢰성을 확보하기 위해 최소한의 참여 연구소 수를 줄이는 방법도 설명되어 있으며, 이는 자원 효율성을 높이는 데 기여합니다. 마지막으로, 분할 수준 설계를 통해 반복성의 왜곡된 추정을 피하고, 물질의 이질성을 고려하여 재현성의 왜곡된 추정을 피하는 방법도 제시하고 있습니다. 이처럼 SIST ISO 5725-3:2024 표준은 측정 방법의 정밀도와 신뢰성을 높이기 위한 종합적인 접근 방식을 제공하여 연구자들과 실험실들이 보다 정확하고 신뢰할 수 있는 결과를 도출할 수 있도록 돕습니다.














Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...