Brukspatron Adamsson eller Hvar bor du? by P. P. Waldenström : Difficulty Assessment for Swedish Learners

How difficult is Brukspatron Adamsson eller Hvar bor du? for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 50,551, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Brukspatron Adamsson eller Hvar bor du? to have a difficulty score of 53. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 53% 53
Vocabulary Difficulty 56% 56
Grammatical Difficulty 51% 51

Vocabulary Difficulty: Breakdown

56%

Vocabulary difficulty: 56%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Brukspatron Adamsson eller Hvar bor du?'s text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Brukspatron Adamsson eller Hvar bor du?:

Vocabulary difficulty breakdown for Brukspatron Adamsson eller Hvar bor du?: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Brukspatron Adamsson eller Hvar bor du?:

Measure Score
Measure Score
Number of words 50,551
Number of unique words 6,862
Number of recognized words for names/places/other entities 2,104
Number of very rare non-entity words 1,067
Number of sentences 7,663
Average number of words/sentence 7

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 6,724 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Brukspatron Adamsson eller Hvar bor du? without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

51%

Grammatical difficulty: 51%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 3
Coleman-Liau Index 6
Type/Token Ratio (TTR) 0.135744
Root type/Token Ratio (RTTR) 0.00000268529
Corrected type/Token Ratio (CTTR) 0.00000134265
MTLD Index 63
HDD Index 67
Yule's I Index 79
Lexical Diversity Index (MTLD + HD-D + Yule's I) 70

The type-token ratio (TTR) of Brukspatron Adamsson eller Hvar bor du? is 0.135744. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 6,862, while the number of words is 50,551, so the TTR is 6,862 / 50,551 = 0.135744. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 6,862 / (50,551 * 50,551) = 0.00000268529), while in CTTR, it is divided by a square of the number of words, multiplied twice 6,862 / 2 * (50,551 * 50,551) = 0.00000134265). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 3, making it understandable for 3-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 70 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 51.

Other Information about Brukspatron Adamsson eller Hvar bor du? by P. P. Waldenström

We provide you a sample of the text below, however, the full text of the Brukspatron Adamsson eller Hvar bor du? is also available free of charge on our website.

Sample of text:

Synnerligen fröjdefulla voro de små utflykter eller lustresor, som dess innevånare understundom anställde till den några mil därifrån belägna staden Världen, för att bese dess innevånares tillställningar eller, som de vanligen kallades, »oskyldiga nöjen». De deltogo icke själva i dem utan voro blott åskådare. Till staden Lösaktighet ledde en port, som hette Köttslighet, men genom den gick sällan någon, ty över densamma lästes de förfärliga orden: Faren icke vilse! Gud låter icke gäcka sig, ty vad människan sår, det skall hon ock uppskära! Därför begagnade man sig merendels av en bakväg och gick genom en port, som kallades Frihet, och som icke var så avskräckande. Inne-vånarne i staden kallade sig också »frisinnade», emedan de icke voro så nogräknade, vare sig med läran eller med levernet. Vad de egentligen lade vikt uppå, var icke ...

Top most frequently used words in Brukspatron Adamsson eller Hvar bor du? by P. P. Waldenström*

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 1,328 2.63%
2 att 1,234 2.44%
3 han 889 1.76%
4 det 800 1.58%
5 som 742 1.47%
6 en 626 1.24%
7 till 582 1.15%
8 540 1.07%
9 jag 536 1.06%
10 för 496 0.98%
11 de 475 0.94%
12 var 473 0.94%
13 icke 459 0.91%
14 med 445 0.88%
15 den 431 0.85%
16 sig 416 0.82%
17 413 0.82%
18 av 386 0.76%
19 är 364 0.72%
20 om 343 0.68%
21 hon 343 0.68%
22 honom 338 0.67%
23 hade 307 0.61%
24 Men 295 0.58%
25 man 292 0.58%
26 245 0.48%
27 ett 234 0.46%
28 nu 219 0.43%
29 mig 216 0.43%
30 du 213 0.42%
31 sin 213 0.42%
32 Abrahamsson 208 0.41%
33 mor 205 0.41%
34 Immanuel 203 0.4%
35 skulle 186 0.37%
36 har 185 0.37%
37 hans 183 0.36%
38 patron 179 0.35%
39 Enfaldig 174 0.34%
40 där 170 0.34%
41 väl 167 0.33%
42 vad 152 0.3%
43 dem 151 0.3%
44 sade 148 0.29%
45 alla 141 0.28%
46 skall 140 0.28%
47 vara 138 0.27%
48 Evangelium 136 0.27%
49 mycket 127 0.25%
50 allt 126 0.25%
51 henne 124 0.25%
52 kan 120 0.24%
53 voro 119 0.24%
54 vi 117 0.23%
55 nåd 112 0.22%
56 vid 109 0.22%
57 utan 109 0.22%
58 staden 108 0.21%
59 nåden 108 0.21%
60 ej 108 0.21%
61 ord 106 0.21%
62 från 106 0.21%
63 sitt 105 0.21%
64 detta 103 0.2%
65 mer 102 0.2%
66 något 102 0.2%
67 denna 99 0.2%
68 Ja 98 0.19%
69 kunde 96 0.19%
70 andra 96 0.19%
71 när 96 0.19%
72 ty 95 0.19%
73 eller 95 0.19%
74 svarade 94 0.19%
75 här 93 0.18%
76 än 91 0.18%
77 samt 91 0.18%
78 såsom 90 0.18%
79 sina 89 0.18%
80 huru 87 0.17%
81 över 87 0.17%
82 ut 86 0.17%
83 85 0.17%
84 själv 85 0.17%
85 se 84 0.17%
86 genom 83 0.16%
87 någon 83 0.16%
88 kom 83 0.16%
89 inte 82 0.16%
90 åter 81 0.16%
91 blev 81 0.16%
92 många 81 0.16%
93 gång 80 0.16%
94 gick 80 0.16%
95 hennes 79 0.16%
96 ock 79 0.16%
97 under 78 0.15%
98 måste 77 0.15%
99 ju 75 0.15%
100 herr 75 0.15%
101 dig 72 0.14%
102 vill 71 0.14%
103 aldrig 71 0.14%
104 Justus 70 0.14%
105 göra 66 0.13%
106 dessa 66 0.13%
107 efter 65 0.13%
108 frågade 65 0.13%
109 ännu 64 0.13%
110 Adamsson 63 0.12%
111 bliva 63 0.12%
112 varit 62 0.12%
113 min 62 0.12%
114 gumman 61 0.12%
115 stor 59 0.12%
116 även 59 0.12%
117 in 58 0.11%
118 vilken 58 0.11%
119 ingen 58 0.11%
120 57 0.11%
121 Immanuels 57 0.11%
122 tid 56 0.11%
123 alltid 56 0.11%
124 såg 56 0.11%
125 komma 55 0.11%
126 hava 55 0.11%
127 säga 54 0.11%
128 ville 54 0.11%
129 tro 54 0.11%
130 ofta 53 0.1%
131 dock 53 0.1%
132 åt 52 0.1%
133 några 50 0.1%
134 Världen 50 0.1%
135 vet 50 0.1%
136 alldeles 50 0.1%
137 hjärta 49 0.1%
138 Jo 49 0.1%
139 namn 49 0.1%
140 Därför 49 0.1%
141 nog 49 0.1%
142 fick 47 0.09%
143 Nej 47 0.09%
144 oss 46 0.09%
145 samma 45 0.09%
146 upp 45 0.09%
147 får 45 0.09%
148 kände 45 0.09%
149 började 45 0.09%
150 visste 44 0.09%
151 snart 43 0.09%
152 sådant 43 0.09%
153 annat 43 0.09%
154 ooh 42 0.08%
155 leva 42 0.08%
156 emot 41 0.08%
157 just 41 0.08%
158 rätt 41 0.08%
159 hörde 41 0.08%
160 hela 40 0.08%
161 sådana 40 0.08%
162 tala 40 0.08%
163 deras 40 0.08%
164 hos 40 0.08%
165 lian 40 0.08%
166 dag 39 0.08%
167 tänkte 39 0.08%
168 Väktare 39 0.08%
169 nämligen 38 0.08%
170 blott 38 0.08%
171 blivit 38 0.08%
172 sagt 37 0.07%
173 äro 37 0.07%
174 igen 37 0.07%
175 ganska 36 0.07%
176 länge 36 0.07%
177 Ack 36 0.07%
178 tiden 36 0.07%
179 vore 36 0.07%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Brukspatron Adamsson eller Hvar bor du? by P. P. Waldenström

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.