Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning by : Difficulty Assessment for Swedish Learners

How difficult is Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 20,631, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning to have a difficulty score of 58. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 58% 58
Vocabulary Difficulty 63% 63
Grammatical Difficulty 54% 54

Vocabulary Difficulty: Breakdown

63%

Vocabulary difficulty: 63%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning:

Vocabulary difficulty breakdown for Boken om Astrid  - Den svenska prinsessan som blev belgiernas drottning: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning:

Measure Score
Measure Score
Number of words 20,631
Number of unique words 5,098
Number of recognized words for names/places/other entities 1,445
Number of very rare non-entity words 536
Number of sentences 3,215
Average number of words/sentence 6

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 4,996 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

54%

Grammatical difficulty: 54%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 6
Coleman-Liau Index 9
Type/Token Ratio (TTR) 0.247104
Root type/Token Ratio (RTTR) 0.0000119773
Corrected type/Token Ratio (CTTR) 0.00000598865
MTLD Index 65
HDD Index 64
Yule's I Index 68
Lexical Diversity Index (MTLD + HD-D + Yule's I) 66

The type-token ratio (TTR) of Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning is 0.247104. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 5,098, while the number of words is 20,631, so the TTR is 5,098 / 20,631 = 0.247104. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 5,098 / (20,631 * 20,631) = 0.0000119773), while in CTTR, it is divided by a square of the number of words, multiplied twice 5,098 / 2 * (20,631 * 20,631) = 0.00000598865). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 6, making it understandable for 6-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 66 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 54.

Other Information about Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning by

We provide you a sample of the text below, however, the full text of the Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning is also available free of charge on our website.

Sample of text:

Nästan varje dag skickade kronprinsessan Astrid ett litet brev eller kort hem till de sina i Sverige från bröllopsresan; det är mest några korta men vältaliga rader, som alltid sluta med en försäkran om hur obeskrivligt lyckliga de båda äro. Astrid njuter av att ha det helt naturligt ganska tröttande bröllopet bakom sig och av att äntligen få ha Leopold för sig själv, men »ändå längtar jag hem — nu är det Bryssel som är "hem" — och till att riktigt få lära känna mitt nya land och mina nya landsmän», säger hon. Och säkert var kronprinsessan också ivrig att få börja iordningställandet av Stuyvenberg, det slott i Laeken strax utanför Bryssel som kung Albert skänkt sin son och sonhustru, och där de båda unga beslutat att bo, fast de formellt skulle residera i västra flygeln av kungliga ...

Top most frequently used words in Boken om Astrid - Den svenska prinsessan som blev belgiernas drottning by *

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 883 4.28%
2 som 342 1.66%
3 att 323 1.57%
4 en 304 1.47%
5 292 1.42%
6 av 287 1.39%
7 med 287 1.39%
8 hon 272 1.32%
9 Astrid 253 1.23%
10 till 234 1.13%
11 den 194 0.94%
12 var 191 0.93%
13 för 190 0.92%
14 det 175 0.85%
15 de 174 0.84%
16 sig 170 0.82%
17 sin 168 0.81%
18 145 0.7%
19 ett 133 0.64%
20 prinsessan 130 0.63%
21 om 111 0.54%
22 vid 101 0.49%
23 Leopold 96 0.47%
24 hade 93 0.45%
25 från 83 0.4%
26 är 81 0.39%
27 sina 81 0.39%
28 henne 78 0.38%
29 han 78 0.38%
30 72 0.35%
31 inte 72 0.35%
32 drottning 71 0.34%
33 barn 70 0.34%
34 alla 69 0.33%
35 hennes 65 0.32%
36 sitt 57 0.28%
37 prins 57 0.28%
38 Belgien 48 0.23%
39 när 47 0.23%
40 efter 47 0.23%
41 men 46 0.22%
42 dag 44 0.21%
43 har 44 0.21%
44 där 43 0.21%
45 Carl 43 0.21%
46 Bryssel 43 0.21%
47 jag 42 0.2%
48 mycket 42 0.2%
49 under 42 0.2%
50 man 41 0.2%
51 blev 39 0.19%
52 drottningen 39 0.19%
53 Astrids 39 0.19%
54 skulle 38 0.18%
55 ha 38 0.18%
56 hem 38 0.18%
57 åt 37 0.18%
58 dem 37 0.18%
59 än 37 0.18%
60 kom 37 0.18%
61 belgiska 37 0.18%
62 kronprins 36 0.17%
63 ut 35 0.17%
64 själv 35 0.17%
65 över 35 0.17%
66 kung 34 0.16%
67 lilla 34 0.16%
68 mot 34 0.16%
69 också 33 0.16%
70 fick 33 0.16%
71 kunde 32 0.16%
72 upp 32 0.16%
73 Sverige 31 0.15%
74 varje 31 0.15%
75 eller 30 0.15%
76 år 29 0.14%
77 Ingeborg 29 0.14%
78 genom 29 0.14%
79 gång 29 0.14%
80 Charlotte 28 0.14%
81 Joséphine 28 0.14%
82 Fridhem 28 0.14%
83 28 0.14%
84 sade 27 0.13%
85 unga 27 0.13%
86 vara 27 0.13%
87 allt 26 0.13%
88 något 26 0.13%
89 in 26 0.13%
90 nu 26 0.13%
91 små 25 0.12%
92 hos 25 0.12%
93 barnen 24 0.12%
94 litet 24 0.12%
95 någon 24 0.12%
96 stora 24 0.12%
97 liten 24 0.12%
98 kungliga 24 0.12%
99 första 24 0.12%
100 hur 24 0.12%
101 älskade 24 0.12%
102 alltid 23 0.11%
103 två 23 0.11%
104 voro 23 0.11%
105 Belgiens 23 0.11%
106 äro 23 0.11%
107 besök 23 0.11%
108 fått 23 0.11%
109 honom 23 0.11%
110 hand 23 0.11%
111 några 22 0.11%
112 tid 22 0.11%
113 hans 21 0.1%
114 aldrig 21 0.1%
115 nya 21 0.1%
116 slottet 20 0.1%
117 stor 19 0.09%
118 kronprinsessan 19 0.09%
119 hela 19 0.09%
120 denna 19 0.09%
121 fram 19 0.09%
122 sedan 19 0.09%
123 vi 19 0.09%
124 båda 18 0.09%
125 gick 18 0.09%
126 se 18 0.09%
127 nästan 17 0.08%
128 samma 17 0.08%
129 helt 17 0.08%
130 vad 17 0.08%
131 Märtha 17 0.08%
132 Albert 17 0.08%
133 par 17 0.08%
134 deras 17 0.08%
135 Stockholm 17 0.08%
136 kärlek 17 0.08%
137 började 16 0.08%
138 mellan 16 0.08%
139 dagen 16 0.08%
140 varit 16 0.08%
141 Stuyvenberg 16 0.08%
142 skall 16 0.08%
143 sista 16 0.08%
144 sida 16 0.08%
145 Baudouin 16 0.08%
146 svenska 15 0.07%
147 leende 15 0.07%
148 ord 15 0.07%
149 omkring 15 0.07%
150 vår 15 0.07%
151 gjorde 15 0.07%
152 väl 15 0.07%
153 mamma 15 0.07%
154 lika 15 0.07%
155 bara 15 0.07%
156 alldeles 15 0.07%
157 sorg 14 0.07%
158 andra 14 0.07%
159 föräldrar 14 0.07%
160 14 0.07%
161 ständigt 14 0.07%
162 vackra 14 0.07%
163 folket 14 0.07%
164 Nenne 14 0.07%
165 komma 14 0.07%
166 mer 14 0.07%
167 kommer 14 0.07%
168 land 14 0.07%
169 redan 14 0.07%
170 snart 14 0.07%
171 många 14 0.07%
172 här 14 0.07%
173 fingo 13 0.06%
174 mest 13 0.06%
175 Margaretha 13 0.06%
176 göra 13 0.06%
177 framför 13 0.06%
178 ung 13 0.06%
179 utan 13 0.06%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Boken om Astrid  - Den svenska prinsessan som blev belgiernas drottning by

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.