Lotten Brenners ferier by Hjalmar Bergman : Difficulty Assessment for Swedish Learners

How difficult is Lotten Brenners ferier for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 45,403, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Lotten Brenners ferier to have a difficulty score of 57. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 57% 57
Vocabulary Difficulty 64% 64
Grammatical Difficulty 51% 51

Vocabulary Difficulty: Breakdown

64%

Vocabulary difficulty: 64%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Lotten Brenners ferier's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Lotten Brenners ferier:

Vocabulary difficulty breakdown for Lotten Brenners ferier: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Lotten Brenners ferier:

Measure Score
Measure Score
Number of words 45,403
Number of unique words 9,791
Number of recognized words for names/places/other entities 1,781
Number of very rare non-entity words 1,324
Number of sentences 10,839
Average number of words/sentence 4

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 9,595 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Lotten Brenners ferier without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

51%

Grammatical difficulty: 51%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 4
Coleman-Liau Index 6
Type/Token Ratio (TTR) 0.215647
Root type/Token Ratio (RTTR) 0.00000474961
Corrected type/Token Ratio (CTTR) 0.00000237481
MTLD Index 62
HDD Index 68
Yule's I Index 78
Lexical Diversity Index (MTLD + HD-D + Yule's I) 69

The type-token ratio (TTR) of Lotten Brenners ferier is 0.215647. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 9,791, while the number of words is 45,403, so the TTR is 9,791 / 45,403 = 0.215647. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 9,791 / (45,403 * 45,403) = 0.00000474961), while in CTTR, it is divided by a square of the number of words, multiplied twice 9,791 / 2 * (45,403 * 45,403) = 0.00000237481). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 4, making it understandable for 4-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 69 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 51.

Other Information about Lotten Brenners ferier by Hjalmar Bergman

We provide you a sample of the text below, however, the full text of the Lotten Brenners ferier is also available free of charge on our website.

Sample of text:

Ett lappri. Ferierna är slut, vardagen begynner. Känns alltid lite trist, känns som om något försummats. Stranden ligger öde. Farväl strand, farväl badianer. Farväl Blinkmåns. Tack för mig. Hon gick direkt in i matsalen. Hon satte sig vid sitt bord och hade missmodet till kavaljer. Ho! Hi! ...

Top most frequently used words in Lotten Brenners ferier by Hjalmar Bergman*

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 1,512 3.33%
2 att 774 1.7%
3 en 744 1.64%
4 hon 589 1.3%
5 531 1.17%
6 det 526 1.16%
7 är 511 1.13%
8 som 498 1.1%
9 den 476 1.05%
10 med 452 1%
11 jag 429 0.94%
12 inte 421 0.93%
13 sig 373 0.82%
14 till 362 0.8%
15 av 322 0.71%
16 han 317 0.7%
17 för 307 0.68%
18 ett 298 0.66%
19 var 276 0.61%
20 de 259 0.57%
21 henne 254 0.56%
22 Brenner 214 0.47%
23 om 213 0.47%
24 sin 211 0.46%
25 mig 210 0.46%
26 sa 203 0.45%
27 har 189 0.42%
28 hade 184 0.41%
29 183 0.4%
30 eller 168 0.37%
31 Lotten 157 0.35%
32 skulle 156 0.34%
33 men 152 0.33%
34 hennes 150 0.33%
35 ni 147 0.32%
36 honom 142 0.31%
37 över 132 0.29%
38 man 131 0.29%
39 ha 126 0.28%
40 dem 119 0.26%
41 upp 115 0.25%
42 hans 110 0.24%
43 icke 100 0.22%
44 er 97 0.21%
45 Markurell 95 0.21%
46 kunde 94 0.21%
47 än 93 0.2%
48 min 91 0.2%
49 kan 91 0.2%
50 ut 87 0.19%
51 mot 86 0.19%
52 nu 84 0.19%
53 ska 82 0.18%
54 vara 82 0.18%
55 du 79 0.17%
56 vad 77 0.17%
57 vid 76 0.17%
58 sina 75 0.17%
59 här 74 0.16%
60 73 0.16%
61 utan 70 0.15%
62 vill 69 0.15%
63 åt 66 0.15%
64 från 64 0.14%
65 vet 61 0.13%
66 ned 61 0.13%
67 efter 59 0.13%
68 sitt 59 0.13%
69 vi 58 0.13%
70 alla 58 0.13%
71 där 58 0.13%
72 blev 57 0.13%
73 någon 55 0.12%
74 tog 54 0.12%
75 gick 54 0.12%
76 säger 54 0.12%
77 själv 53 0.12%
78 Ludwig 53 0.12%
79 dig 53 0.12%
80 blir 52 0.11%
81 Varför 52 0.11%
82 in 49 0.11%
83 går 49 0.11%
84 bland 49 0.11%
85 varit 48 0.11%
86 mycket 48 0.11%
87 säga 47 0.1%
88 denna 46 0.1%
89 detta 45 0.1%
90 deras 44 0.1%
91 båda 44 0.1%
92 gjorde 44 0.1%
93 fram 44 0.1%
94 Husch 43 0.09%
95 får 43 0.09%
96 åter 43 0.09%
97 bort 42 0.09%
98 andra 42 0.09%
99 tänkte 42 0.09%
100 kommer 42 0.09%
101 ringen 42 0.09%
102 ur 41 0.09%
103 såg 40 0.09%
104 måste 40 0.09%
105 människa 39 0.09%
106 några 39 0.09%
107 små 39 0.09%
108 någonting 39 0.09%
109 satt 39 0.09%
110 Rödhättan 38 0.08%
111 svarade 38 0.08%
112 lilla 38 0.08%
113 Främmande 37 0.08%
114 37 0.08%
115 kanske 37 0.08%
116 sätt 37 0.08%
117 ingen 37 0.08%
118 just 36 0.08%
119 började 36 0.08%
120 se 35 0.08%
121 sagt 35 0.08%
122 Johan 34 0.07%
123 väl 34 0.07%
124 göra 34 0.07%
125 fick 33 0.07%
126 aldrig 33 0.07%
127 Brenners 33 0.07%
128 ej 33 0.07%
129 mellan 33 0.07%
130 omkring 33 0.07%
131 allt 32 0.07%
132 kusten 32 0.07%
133 dess 32 0.07%
134 bli 32 0.07%
135 stranden 32 0.07%
136 mer 32 0.07%
137 oss 32 0.07%
138 Kanaljen 31 0.07%
139 bra 31 0.07%
140 annan 31 0.07%
141 något 31 0.07%
142 fall 31 0.07%
143 två 31 0.07%
144 betraktade 31 0.07%
145 hand 30 0.07%
146 gav 30 0.07%
147 förstår 30 0.07%
148 kände 30 0.07%
149 under 30 0.07%
150 kom 30 0.07%
151 gör 30 0.07%
152 annat 29 0.06%
153 steg 29 0.06%
154 känner 29 0.06%
155 ögon 29 0.06%
156 hur 29 0.06%
157 många 29 0.06%
158 estraden 29 0.06%
159 rätt 29 0.06%
160 huvudet 28 0.06%
161 gång 28 0.06%
162 barn 28 0.06%
163 Främlingen 27 0.06%
164 vore 27 0.06%
165 framför 27 0.06%
166 kring 27 0.06%
167 Lissilussi 27 0.06%
168 dansa 27 0.06%
169 mitt 27 0.06%
170 badianer 27 0.06%
171 när 27 0.06%
172 hela 27 0.06%
173 Lizzie 27 0.06%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Lotten Brenners ferier by Hjalmar Bergman

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.