A.1. Annex 1: Methodology

The primary goal of the YDI is to provide an evidence base on the condition of youth around the world, focusing on opportunities for their development. The theoretical framework for the development of the YDI is derived from the work of Sen (1985) and Nussbaum (2000, 2003) on capabilities.

The primary goal of the YDI is to provide an evidence base on the condition of youth around the world, focusing on opportunities for their development. The theoretical framework for the development of the YDI is derived from the work of Sen (1985) and Nussbaum (2000, 2003) on capabilities.

The 2023 iteration of the YDI has three main aims.

  1. Assess the data context for updating the global YDI in 2023, including acknowledging the admission of Gabon and Togo into the Commonwealth.
  2. Update the global YDI database of 27 indicators across six domains with more recent data, where available and taking into account additional countries for which data is now available.
  3. Provide an analysis of the situation.

The calculation and indicators remain the same as in the YDI 2020 report (Commonwealth Secretariat, 2021). The YDI is designed to measure youth development based on six domains:

  • Education
  • Employment and Opportunity
  • Equality and Inclusion
  • Health and Wellbeing
  • Peace and Security
  • Political and Civic Participation

These domains, and the indicators within each domain, were decided upon through consultation with the YDI Expert Panel. To capture youth development within each country across all domains, 27 indicators were sourced. Table A1.1 presents the indicators selected to capture these domains and includes information on their sources and the year of their most recent update.

 

Table A1.1 YDI 2023 indicators by domain

Health and Wellbeing

Indicator

Definition

Source

No. countries covered

Latest year of data

Alcohol abuse

YLL from alcohol use disorders, ages 15–29

IHME GBD

204

2019

Drug abuse

YLL from drug use disorders, ages 15–29

IHME GBD

204

2019

HIV rate

HIV rate, ages 15–29

UNAIDS estimates

131

2021

Mental health

YLL from mental disorders, ages 15–29

IHME GBD

204

2019

Mortality rate

Deaths from all causes, ages 15–29

IHME GBD

204

2019

Tobacco consumption

Tobacco smokers, % of ages 15–29

IHME GBD

201

2015

Self-harm

YLL from self-harm, ages 15–29

IHME GBD

204

2019

 

Education

Indicator

Definition

Source

No. countries covered

Latest year of data

Digital natives

Five or more years’ experience using the internet, % of ages 15–29

ITU

181

2013

Literacy rate

Literacy rate, youth total, % of ages 15–24

UNESCO Institute for Statistics

216

2021

School completion

Lower secondary completion rate, total, % of country-specific age group

UNESCO Institute for Statistics

216

2021

 

Employment and Opportunity

Indicator

Definition

Source

No. countries covered

Latest year of data

Account

Respondents who report having an account (by themselves or together with someone else) at a bank or other financial institution or report using mobile money in the past 12 months, % ages 15–24

World Bank Global Findex Database

158

2021

Adolescent fertility rate

Adolescent fertility rate, births per 1,000 women ages 15–19

United Nations Population Division, World Population Prospects

205

2020

NEET

NEET youth, % of ages 15–24

ILO

163

2022

Underemployment*

Time-related underemployment, ages 15-24

ILO modelled estimates

187

2019

 

Equality and Inclusion

Indicator

Definition

Source

No. countries covered

Latest year of data

Economic marginalisation

Population percentage classified as extremely poor (<US$1.90 PPP) or moderately poor (>=US$1.90 and <US$ 3.20 PPP), ages 15–24

ILO modelled estimates

191

2019

Gender parity in literacy

Literacy rate, youth, ages 15–24 GPI

UNESCO Institute for Statistics

162

2021

Gender parity in NEET

Distance from parity between percentages of NEET young women and NEET young men, ages 15–24

UNDESA Global SDG Indicators Database, IEP calculations

78

2018

Gender parity in safety and security

Distance from parity between percentages of young women and young men who report feeling safe walking alone in their neighbourhood at night

GWP, IEP calculations

167

2021

Early marriage

Women first married by age 18, % of women ages 20–24

Country surveys collected by World Bank and OECD

137

2021

 

Political and Civic Participation

Indicator

Definition

Source

No. countries covered

Latest year of data

Recognition for community improvement

Responding ‘agree’ or ‘strongly agree’ with the statement ‘In the past 12 months, you have received recognition for helping to improve the city or area where you live’, % ages 15–29

GWP

153

2016

Voiced opinion to an official

Responding that they have voiced their opinion to an official in the past 30 days, % ages 15–29

GWP

167

2021

Volunteered time

Responding that they have volunteered time in the past 30 days, % ages 15–29

GWP

168

2021

Youth policy score

Scores on youth policy and legislation, public institutions, youth representation, and public budget and spending

Youth Policy Labs, IEP calculation

196

2016

 

Peace and Security

Indicator

Definition

Source

No. countries covered

Latest year of data

Conflict and terrorism

YLL from armed conflict and terrorism

IHME GBD

204

2019

INFORM score

Risk of humanitarian crisis and disaster, including climate change related risks

EU INFORM

192

2022

Internal peace score

Composite score for domestic peace and safety and security

IEP Global Peace Index

163

2022

Interpersonal violence

YLL from interpersonal violence, ages 15–29

IHME GBD

204

2019

* This indicator is based on the absolute stocks of underemployed youth within a country’s borders, rather than a rate-based or per capita measure. The rationale behind this approach is that the indicator penalises countries with the largest proportions of youth who are struggling with underemployment. Underemployment exists when the hours of work of an employed person are below a threshold and are insufficient in relation to an alternative employment situation in which the person is willing and available to engage.

† The Gender Parity Index (GPI) calculates a value based on the number of females divided by the number of males. A value of 1 reflects equality or parity between females and males. A value below 1 usually favours males while a value over 1 usually favours females. The original values for the gender parity in literacy indicator are converted into an YDI indicator score where a lower score reflects greater inequality between men and women in favour of men and a higher score suggests greater equality in favour of women.

 

Imputations

The 2023 YDI’s methodology has been designed to be in line with other prominent global development indices, and substantial effort has been made to populate the index with the best available country data. However, consistent and comprehensive datasets covering youth development continue to remain scarce.

Effort has been made in the design of the methodology to include as many Commonwealth countries as possible, the required threshold for inclusion being 50 per cent availability of the needed data. Even with the high imputation threshold, for certain smaller-island countries there is simply not enough data available to justify their inclusion. Table A1.2 details data availability for these countries, indicating for which countries data coverage is above the threshold and for which additional data is needed.

 

Table A1.2 Data availability for small-island Commonwealth countries

Country

Percentage of data available

Antigua and Barbuda

40.74%

The Bahamas

59.26%

Barbados

77.78%

Brunei Darussalam

66.67%

Dominica

40.74%

Fiji

66.67%

Gabon

92.59%

Grenada

55.56%

Kiribati

51.85%

Maldives

70.37%

Nauru

44.44%

St Kitts and Nevis

40.74%

Saint Lucia

66.67%

St Vincent and the Grenadines

44.44%

Samoa

70.37%

Seychelles

59.26%

Solomon Islands

62.96%

Togo

92.59%

Tonga

74.07%

Tuvalu

48.15%

Vanuatu

66.67%

 

Data availability has been a particular challenge for the Equality and Inclusion and the Political and Civic Participation domains. Constructing the YDI has therefore highlighted gaps in youth development data and the need for further improvement in data collection.

A major challenge to developing a harmonised composite index lies in attempting to overcome the significant variation in data across very diverse countries around the world, not just in terms of demographic and geographic characteristics but also in terms of socio-economic characteristics, which often times can affect data collection and quality.

The issue of data gaps is a common challenge to creating an index. The Organisation for Economic Co-operation and Development (OECD) et al. (2008) recommend several statistical techniques for dealing with data imputation to fill in data gaps. Table A1.3 lists the approaches used in the YDI. Using a combination of these techniques, the YDI represents the use of the best possible data without an overly complex methodology.

 

Table A1.3 Data imputation methods applied in the YDI

Imputation method

Description

Application in the YDI

Time series imputation

Replace missing values using linear interpolation

The YDI uses this method when at least two data points exist in a time series for an indicator-country pair, to estimate data for unreported years.

Similarly, when only one year of data is available for all countries, the values for that year are used for all years in the index.

Cold deck imputation

Replace the missing value with a value from another source

The YDI uses this method when alternative country statistics from a different source are available to fill in gaps.

Hot deck imputation

Replace the missing value with a KNN (k-nearest neighbours) imputation

KNN is an algorithm that is useful for matching a point with its closest k neighbours in a multi-dimensional space. It can be used for data that are continuous, discrete, ordinal and categorical, which makes it particularly useful for dealing with missing data. The algorithm fills in data gaps using similar countries to impute a value.

The YDI uses this method for data that is not available for all countries and when time series and cold deck imputations fail. A value is assigned based on the average of the five most similar countries in the same year. These may be five countries selected, in order of preference, from among countries:

  1. in the same region;
  2. in the same income bracket as the country as defined by the World Bank; or
  3. with the same government type as defined by the Economist Intelligence Unit.

Only the most preferable of the three hot deck imputation techniques listed is used for any single missing data instance.

 

The banding process

In order to aggregate the incommensurable indicators, all indicators have been banded (normalised). This means each indicator is scaled to a score ranging between 0 and 1, relative to the initial global range. Appropriate minimum and maximum values are, therefore, chosen for each indicator so that any values below the minimum are assigned 0 and values above the maximum are assigned 1. All other values are scaled between 0 and 1, equivalent to their position in the original minimum-maximum range. Depending on the nature of the data, the banding process can take slightly different forms.

For example, for the literacy rate indicator, a higher score reflects a more desirable situation. Therefore, in this case, the banding process has assigned the largest data point a value of 1. Conversely, the lowest data point in the indicator has been assigned a value of 0, while all other data is scaled relative to these two points. This process is referred to as forward banding. On the other hand, a lower score on the mortality indicator reflects a more desirable situation. In this case, the data is reverse banded, so the lowest value is assigned 1 while the highest is assigned 0.

Therefore, for year y, Equation 1 calculates a forward banded score for indicator i. A reverse banded score is calculated using Equation 2.

 

Equation 1: Banding equation

Image
image of the banding equation

Equation 2: Reverse banding equation

Image
image of the reverse banding equation

An integral part of the banding process is to set appropriate minimum and maximum cutoff values for the banded scores. Some data has a normal distribution and therefore outliers can be easily defined as those greater than three standard deviations from the mean. However, other datasets do not follow the bell-curved distribution trend. A number of considerations are therefore essential in choosing the appropriate technique: the nature of the data, the distribution, the purpose of the index, the information to be conveyed and so on. When investigating global datasets for the YDI, very few can be classified as having a normal distribution. The presence of outliers defines the variance, skewing both the minimum and the maximum values. To account for this, IEP has set artificial minimum and maximum values to ensure that results are not too heavily influenced by outliers. Table A1.4 outlines the data distribution and bands for each indicator. The upper and lower bands are indicated when they are not based on the distribution of the data but rather have been input manually by IEP.

 

Table A1.4 Banding limits for the YDI by domain

Education

Indicator

Minimum

Maximum

Mean

Standard deviation

Lower band

Upper band

Digital natives

0.00

.996

.4011

.3082

0

1

Literacy rate

13.1

100.00

84.09

15.56

0

100

School completion

9.81

176.92

78.57

24.48

10.5

113.82

 

Employment and Opportunity

Indicator

Minimum

Maximum

Mean

Standard deviation

Lower band

Upper band

Account

0.00

100.00

50.19

27.51

0

1

Adolescent fertility rate

1.45

185.124

48.028

40.19

0

250

NEET

1

5

2.58

0.82

0

100

Underemployment

0.08

7,526.1

180.60

492.16

0.08

433.21

 

Equality and Inclusion

Indicator

Minimum

Maximum

Mean

Standard deviation

Lower band

Upper band

Economic marginalisation

00

0.95

0.24

NA

0

0.91

Gender parity in literacy

0.00

1.230157

0.09

0.11

0

1

Gender parity in NEET

0.00

30.66

6.93

6.19

32.56

Gender parity in safety and security

0.00

0.42

0.13

0.06

0

1

Early marriage

0.00

83.50

26.94

17.91

0

100

 

Health and Wellbeing

Indicator

Minimum

Maximum

Mean

Standard deviation

Lower band

Upper band

Alcohol abuse

5.95

965.10

94.71

142.46

2.49 

66.29

Mortality rate

70.4

2,177.31

374.62

259.36

0

2,500

Drug abuse

12.5

3,575.95

205.55

320.69

0

500

HIV rate

0.10

22.03

0.70

1.99

 0.1

0.725

Mental health

0.02

19.65

1.29

2.64

0

4

Tobacco consumption

0.04

1.55

0.44

0.27

0

1

Self-harm

395.

15,393.2

2,058

1,635.1

0

5,000

 

Peace and Security

Indicator

Minimum

Maximum

Mean

Standard deviation

Lower band

Upper band

Conflict and terrorism

0.00

30,411.1

355

1,948.94

32,615

INFORM score

0.50

8.7

3.91

1.71

0

10

Internal peace score

1.17

4.271

3.70

0.57

1

5

Interpersonal violence

55.3

36,790.69

2,218.34

3,191.15

20.91

1,601.5

 

Political and Civic Participation

Indicator

Minimum

Maximum

Mean

Standard deviation

Lower band

Upper band

Recognition for community improvement

0.02

0.78

0.26

0.10

0

1

Voiced opinion to an official

0.01

0.51

0.17

0.07

0

1

Volunteered time

0.03

0.66

0.22

0.09

0

1

Youth policy score

0.00

4.00

2.72

0.80

0

4

 

Weighting indicators and domains

Table A1.5 shows the indicators and respective weights applied in the YDI. The YDI assigns a higher weighting to three domains: Health and Wellbeing, Education, and Employment and Opportunity receive 22 per cent each, as these domains are considered key to youth development and data quality and availability are higher here. Equality and Inclusion is weighted at 14 per cent, while the Peace and Security domain and the Political and Civic Participation domain are weighted at 10 per cent each.

Within each domain, indicators are weighted by their importance relative to the other indicators in the respective domain. In some instances, they are weighted equally, indicating that they together comprise the core features of the respective domain and are equally essential. Across the three core domains, which comprise 66 per cent of the overall index, three indicators are considered primary: mortality rate, literacy rate and NEET. These primary indicators are weighted slightly higher than others in the index and therefore have a big impact on domain scores. In some cases, they grant countries a more pronounced domain score regardless of their overall rank in the YDI.

 

Table A1.5 Weights used in the YDI, by domain

Education

Domain weight

Indicator

Indicator weight

22%

Literacy rate

10%

School completion

8%

Digital natives

4%

 

Employment and Opportunity

Domain weight

Indicator

Indicator weight

22%

NEET

10%

Underemployment

4%

Adolescent fertility rate

4%

Account

4%

 

Equality and Inclusion

Domain weight

Indicator

Indicator weight

14%

Gender parity in NEET

2.80%

Gender parity in safety and security

2.80%

Gender parity in literacy

2.80%

Early marriage

2.80%

Economic marginalisation

2.80%

 

Health and Wellbeing

Domain weight

Indicator

Indicator weight

22%

Mortality rate

10%

HIV rate

2%

Self-harm

2%

Mental health

2%

Drug abuse

2%

Alcohol abuse

2%

Tobacco consumption

2%

 

Peace and Security

Domain weight

Indicator

Indicator weight

10%

Internal peace score

2.50%

Interpersonal violence

2.50%

Conflict and terrorism

2.50%

INFORM score

2.50%

 

Political and Civic Participation

Domain weight

Indicator

Indicator weight

10%

Youth policy score

2.50%

Voiced opinion to an official

2.50%

Volunteered time

2.50%

Recognition for community improvement

2.50%

 

Aggregation and calculation

Once the data has been banded and weights have been assigned, the final stage is to multiply each banded indicator with its corresponding weight and add each country’s performance to arrive at an overall YDI score. Final scores are calculated by combining scores for the six individual domains into the overall YDI score, as demonstrated in Figure A1.1.

 

Figure A1.1 Composition of indicators into domains and the final YDI scores

Image
figureA1.1

Downloads