Finska kriget och Finlands krigare 1808-1809 by Johan Richard Danielson-Kalmari: Difficulty Assessment for Swedish Learners

How difficult is Finska kriget och Finlands krigare 1808-1809 for Swedish learners? We ran multiple tests on its full text of approximately 201,123 words (freely available here), crunched the numbers, and present the results below.

Difficulty Assessment Summary

We estimate that Finska kriget och Finlands krigare 1808-1809 has an overall difficulty score of 75 out of 100. Its scores are as follows:

Measure Score (1 = easy, 100 = difficult)
Overall Difficulty 75
Vocabulary Difficulty 81
Grammatical Difficulty 69

Vocabulary Difficulty: Breakdown

Vocabulary difficulty: 81%

This score has been calculated based on frequency vocabulary (the most frequently used words in Swedish). It combines several measures of the text of Finska kriget och Finlands krigare 1808-1809 in terms of frequency vocabulary: a plain vocabulary score, a frequency-weighted vocabulary score, banded frequency vocabulary scores based on the share of the text's vocabulary falling within the top 1,000 or 2,000 most frequent words, and so on. Here is a further breakdown of how often the most frequently used words in Swedish appear in the full text of Finska kriget och Finlands krigare 1808-1809:

Vocabulary difficulty breakdown for Finska kriget och Finlands krigare 1808-1809: a test for Swedish top frequency vocabulary
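The banded scores mentioned above can be illustrated with a minimal sketch. It assumes a frequency-ranked word list is available (most frequent first); the function name `band_coverage` and the toy data are hypothetical, and this is not necessarily how the scores on this page are computed.

```python
def band_coverage(tokens, ranked_words, bands=(1000, 2000)):
    """Return, for each band size N, the share of tokens found in the top-N words."""
    if not tokens:
        raise ValueError("tokens must not be empty")
    lowered = [t.lower() for t in tokens]
    coverage = {}
    for n in bands:
        top_n = set(ranked_words[:n])  # the N most frequent words of the language
        coverage[n] = sum(t in top_n for t in lowered) / len(lowered)
    return coverage


# Toy example with a four-word "frequency list" and bands of 2 and 4 words.
ranked = ["och", "att", "till", "den"]
sample = ["Och", "att", "fienden", "till"]
print(band_coverage(sample, ranked, bands=(2, 4)))
```

A higher share in the top bands suggests that more of the running text consists of very common words.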

We have also calculated the following approximate data on the vocabulary in Finska kriget och Finlands krigare 1808-1809:

Measure Score
Number of words 201,123
Number of unique words 24,790
Number of recognized words for names/places/other entities 12,833
Number of very rare non-entity words 15,754
Number of sentences 27,728
Average number of words/sentence 7

There is some research suggesting that you need to know about 98% of a text's vocabulary to be able to infer the meaning of unknown words while reading. If true, this means that you would need to know around 24,294 words in Swedish (counting each form of a word as a separate unique word) to be able to read Finska kriget och Finlands krigare 1808-1809 without a dictionary and fully understand it.
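The arithmetic behind that estimate can be reproduced with a short sketch. It simply applies the 98% threshold to the number of unique words reported above; the function name `words_needed` is hypothetical.

```python
def words_needed(unique_words, coverage=0.98):
    """Number of unique word forms to know in order to reach the given coverage."""
    if not 0 < coverage <= 1:
        raise ValueError("coverage must be in the interval (0, 1]")
    return round(unique_words * coverage)


print(words_needed(24_790))  # 24294
```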

Grammatical Difficulty: Breakdown

Grammatical difficulty: 69%

Here is a further breakdown of the grammatical measures for this text. You can find an explanation of all these scores below.

Measure Score
Automated Readability Index 8
Coleman-Liau Index 12
Type/Token Ratio (TTR) 0.123258
Root type/Token Ratio (RTTR) 0.000000612848
Corrected type/Token Ratio (CTTR) 0.000000306424
MTLD Index 88
HDD Index 71
Yule's I Index 89
Lexical Diversity Index (MTLD + HD-D + Yule's I) 83

The type-token ratio (TTR) of Finska kriget och Finlands krigare 1808-1809 is 0.123258. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For this text, the number of unique words is 24,790 and the number of words is 201,123, so the TTR is 24,790 / 201,123 = 0.123258. However, the TTR is a very crude measure, as it depends heavily on text length. The longer the text, the lower the TTR usually is, since common words tend to repeat often. Since this text is far longer than 1,000 words, its TTR is unlikely to be an accurate measure.
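As a minimal sketch, the TTR calculation can be written as follows. The function name `type_token_ratio` is hypothetical, and how words are tokenized and case-folded before counting is left open here.

```python
def type_token_ratio(tokens):
    """Return the number of unique words divided by the total number of words."""
    if not tokens:
        raise ValueError("tokens must not be empty")
    return len(set(tokens)) / len(tokens)


# Using the counts reported on this page rather than the text itself:
unique_words = 24_790
total_words = 201_123
ttr = unique_words / total_words
print(round(ttr, 6))  # 0.123258
```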

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) were proposed by researchers to partially address the TTR's dependence on text length. In their standard definitions, the RTTR divides the number of unique words by the square root of the number of words, and the CTTR divides it by the square root of twice the number of words; for this text, that would give roughly 24,790 / √201,123 ≈ 55.28 and 24,790 / √(2 × 201,123) ≈ 39.09. The figures in the table above were instead obtained by dividing by the square of the number of words (24,790 / (201,123 × 201,123) = 0.000000612848) and by twice that square (24,790 / (2 × (201,123 × 201,123)) = 0.000000306424), so they should not be compared with RTTR or CTTR values published elsewhere. These measures are also not as easily readable, and a growing body of research asserts that the CTTR and RTTR do not effectively address the problem of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these figures do not form part of our final calculations.
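For reference, here is a minimal sketch of the RTTR and CTTR under their standard square-root definitions (Guiraud's root TTR and Carroll's corrected TTR). Treating these as the intended measures is an assumption, and the function names are hypothetical.

```python
import math


def root_ttr(types, tokens):
    """Guiraud's root TTR: unique words divided by the square root of all words."""
    return types / math.sqrt(tokens)


def corrected_ttr(types, tokens):
    """Carroll's corrected TTR: unique words divided by the square root of twice all words."""
    return types / math.sqrt(2 * tokens)


# Counts reported on this page for the full text.
print(round(root_ttr(24_790, 201_123), 2))
print(round(corrected_ttr(24_790, 201_123), 2))
```

With the counts reported on this page, these definitions give values of roughly 55.3 and 39.1.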

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
$$\mathrm{ARI} = 4.71 \cdot \frac{\text{characters}}{\text{words}} + 0.5 \cdot \frac{\text{words}}{\text{sentences}} - 21.43$$

The ARI yields a reading level that approximately corresponds to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 8, making it understandable for eighth-grade students at their expected level of education.
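A minimal sketch of the ARI calculation follows, using the commonly published constants (4.71, 0.5 and 21.43). The tokenization here is an assumption: words are runs of letters and digits, and sentences are split on ".", "!" and "?". The counting rules used for the score on this page may differ, and the function name is hypothetical.

```python
import re


def automated_readability_index(text):
    """ARI = 4.71 * (characters / words) + 0.5 * (words / sentences) - 21.43."""
    words = re.findall(r"\w+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")
    characters = sum(len(word) for word in words)
    return (4.71 * characters / len(words)
            + 0.5 * len(words) / len(sentences)
            - 21.43)


print(automated_readability_index("Det var en dag. Han kom hem."))
```

Very short, simple sentences like the ones in this toy example produce values below 1, which is why the index is only meaningful for longer passages.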

The Coleman-Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
$$\mathrm{CLI} = 0.0588 \cdot L - 0.296 \cdot S - 15.8$$
where L is the average number of letters per 100 words and S is the average number of sentences per 100 words.
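The CLI can be sketched in the same way, using the commonly published constants (0.0588, 0.296 and 15.8), where L is the number of letters per 100 words and S the number of sentences per 100 words. The tokenization is again an assumption and the function name is hypothetical.

```python
import re


def coleman_liau_index(text):
    """CLI = 0.0588 * L - 0.296 * S - 15.8, with L and S measured per 100 words."""
    words = re.findall(r"\w+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")
    letters = sum(ch.isalpha() for word in words for ch in word)
    letters_per_100 = letters / len(words) * 100
    sentences_per_100 = len(sentences) / len(words) * 100
    return 0.0588 * letters_per_100 - 0.296 * sentences_per_100 - 15.8


print(coleman_liau_index("Det var en dag. Han kom hem."))
```

Because it counts letters rather than syllables, the same function can be applied to Swedish text without modification.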

Other indexes exist, such as the Flesch Reading Ease, the Flesch-Kincaid Grade Level and the Gunning fog index, but we have chosen not to include them: unlike the ARI and CLI, they rely on syllable counts and therefore arguably work only for English, not Swedish.

We also compute a compound lexical diversity index, which should range from 1 to 100 (with an average value of around 50 and a standard deviation of around 10) - it is 83 in the present case. The compound lexical diversity index is the average of the following indexes (also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; at the end, the number of words in the text is divided by the counter; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his characteristic K in his book "The Statistical Study of Literary Vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute the probability of each word appearing in word samples extracted from the text; these probabilities are then divided by the sample size and added up.
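The three indexes above can be sketched as follows. These are minimal illustrations of the commonly published formulations, not necessarily the implementation used for the scores on this page: the MTLD threshold of 0.72 and the HD-D sample size of 42 are commonly cited defaults and are assumptions here, as are all function names.

```python
import math
from collections import Counter


def _mtld_pass(tokens, threshold):
    """One directional MTLD pass: count how often the running TTR drops below the threshold."""
    factors, count, types = 0.0, 0, set()
    for token in tokens:
        count += 1
        types.add(token)
        if len(types) / count < threshold:
            factors += 1          # a full factor is complete: reset the running TTR
            count, types = 0, set()
    if count:                     # credit the unfinished segment as a partial factor
        factors += (1 - len(types) / count) / (1 - threshold)
    return len(tokens) / factors if factors else float("inf")


def mtld(tokens, threshold=0.72):
    """Mean of a forward and a backward pass; infinite if no word ever repeats."""
    if not tokens:
        raise ValueError("tokens must not be empty")
    return (_mtld_pass(tokens, threshold) + _mtld_pass(tokens[::-1], threshold)) / 2


def yules_i(tokens):
    """Yule's I as the inverse of K without the 10^4 factor: N^2 / (sum of f^2 - N)."""
    n = len(tokens)
    m2 = sum(freq * freq for freq in Counter(tokens).values())
    return n * n / (m2 - n) if m2 > n else float("inf")


def hdd(tokens, sample_size=42):
    """HD-D: summed probability of each word appearing in a random sample, over sample size."""
    n = len(tokens)
    if n < sample_size:
        raise ValueError("text is shorter than the sample size")
    all_samples = math.comb(n, sample_size)
    total = 0.0
    for freq in Counter(tokens).values():
        p_absent = math.comb(n - freq, sample_size) / all_samples
        total += (1 - p_absent) / sample_size
    return total


words = "och att och en dag och att han kom hem och det var en dag".split()
print(mtld(words), yules_i(words), hdd(words, sample_size=5))
```

The raw values produced by these functions are on different scales, so they would need to be normalized before being averaged into a single compound index.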

Our overall measure of grammatical difficulty is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and the CLI, each normalized and given a certain weight. The score should normally range from 1 to 100. In this case, the score is 69.

Other Information about Finska kriget och Finlands krigare 1808-1809 by Johan Richard Danielson-Kalmari

We provide a sample of the text below. The full text of Finska kriget och Finlands krigare 1808-1809 is also available free of charge on our website.

Sample of text:

Vid första skott som sker på något Batteri samlas hela servisen." 3) Infanteriet exercerades hvarje dag, i fall icke en alldeles ovanlig köld förhindrade det. Därjämte fortsattes befästningsvärkens förstärkande. Alt blef härefter, skrifver Wärnhjelm, ,tyst och stilla, och vi började åter *) Sprengtportens bref till kejsaren den 2 mars. — Sederholm berättar hurusom ryssame förskräkta gjorde sig beredda till en snabb reträtt dä en dag några kanoner släpades ut p& isen. Jfr ock kapten Qvists berättelse. Svenska krigsarkivet’ *) Löjtnant Hagelstams berättelse. Svenska krigsarkivet. *) Ofvannämda Asikkala kompanis journal.våra arbeten på de ännu till stor del oiärdige och bristfällige fastningsvärken.M Emellertid mångdubblades fiendens styrka. Under förra ...

Top most frequently used words in Finska kriget och Finlands krigare 1808-1809 by Johan Richard Danielson-Kalmari*

Position Word Repetitions Part of all words
1 och 5,551 2.76%
2 att 5,117 2.54%
3 till 3,445 1.71%
4 den 2,955 1.47%
5 af 2,902 1.44%
6 en 2,167 1.08%
7 som 2,010 1%
8 1,916 0.95%
9 de 1,881 0.94%
10 för 1,825 0.91%
11 med 1,720 0.86%
12 det 1,669 0.83%
13 han 1,581 0.79%
14 sig 1,468 0.73%
15 om 1,316 0.65%
16 icke 1,213 0.6%
17 hade 1,149 0.57%
18 var 1,108 0.55%
19 från 1,049 0.52%
20 ett 996 0.5%
21 vid 977 0.49%
22 sin 947 0.47%
23 man 920 0.46%
24 jag 855 0.43%
25 skulle 854 0.42%
26 801 0.4%
27 är 770 0.38%
28 men 700 0.35%
29 denna 657 0.33%
30 652 0.32%
31 äfven 639 0.32%
32 detta 610 0.3%
33 under 582 0.29%
34 hans 534 0.27%
35 vi 500 0.25%
36 ej 498 0.25%
37 än 466 0.23%
38 dem 465 0.23%
39 har 451 0.22%
40 sitt 447 0.22%
41 sina 445 0.22%
42 alla 444 0.22%
43 eller 436 0.22%
44 kunde 436 0.22%
45 hvilka 430 0.21%
46 finska 408 0.2%
47 öfver 406 0.2%
48 honom 404 0.2%
49 efter 404 0.2%
50 redan 399 0.2%
51 Aminoff 394 0.2%
52 hvilken 390 0.19%
53 utan 384 0.19%
54 ännu 381 0.19%
55 mig 378 0.19%
56 fienden 377 0.19%
57 bref 361 0.18%
58 nu 349 0.17%
59 Finland 343 0.17%
60 armén 339 0.17%
61 mot 333 0.17%
62 oss 323 0.16%
63 Adlercreutz 320 0.16%
64 hafva 318 0.16%
65 samma 309 0.15%
66 hvad 300 0.15%
67 där 289 0.14%
68 dock 289 0.14%
69 alt 284 0.14%
70 dessa 284 0.14%
71 kan 277 0.14%
72 vara 274 0.14%
73 varit 267 0.13%
74 såsom 267 0.13%
75 voro 266 0.13%
76 deras 263 0.13%
77 genom 258 0.13%
78 andra 257 0.13%
79 samt 253 0.13%
80 åt 252 0.13%
81 sedan 249 0.12%
82 kunna 235 0.12%
83 min 233 0.12%
84 tid 229 0.11%
85 mycket 226 0.11%
86 dess 225 0.11%
87 hos 223 0.11%
88 endast 222 0.11%
89 dag 221 0.11%
90 någon 221 0.11%
91 borde 218 0.11%
92 kejsaren 208 0.1%
93 ty 206 0.1%
94 von 201 0.1%
95 Buxhövden 199 0.1%
96 ryska 199 0.1%
97 några 198 0.1%
98 vår 196 0.1%
99 blifvit 191 0.09%
100 Klingspor 190 0.09%
101 blef 189 0.09%
102 skall 186 0.09%
103 hvilket 184 0.09%
104 mera 183 0.09%
105 hela 181 0.09%
106 göra 179 0.09%
107 Sverge 178 0.09%
108 Sandels 174 0.09%
109 här 173 0.09%
110 våra 173 0.09%
111 konungen 172 0.09%
112 general 169 0.08%
113 måste 166 0.08%
114 sätt 165 0.08%
115 något 163 0.08%
116 svenska 163 0.08%
117 163 0.08%
118 senare 162 0.08%
119 del 162 0.08%
120 dagar 162 0.08%
121 sålunda 162 0.08%
122 år 161 0.08%
123 Gripenberg 161 0.08%
124 Cronstedt 157 0.08%
125 första 156 0.08%
126 säger 155 0.08%
127 följande 155 0.08%
128 således 155 0.08%
129 inom 154 0.08%
130 äro 154 0.08%
131 officerare 154 0.08%
132 åter 152 0.08%
133 Finlands 151 0.08%
134 arméns 149 0.07%
135 fiendens 148 0.07%
136 trupper 148 0.07%
137 vore 143 0.07%
138 grefve 142 0.07%
139 annat 141 0.07%
140 komma 140 0.07%
141 gång 140 0.07%
142 fästningen 134 0.07%
143 väl 133 0.07%
144 ryssarne 132 0.07%
145 kriget 132 0.07%
146 Sveaborg 131 0.07%
147 Petersburg 131 0.07%
148 gjorde 131 0.07%
149 ock 130 0.06%
150 huru 129 0.06%
151 blott 129 0.06%
152 ur 128 0.06%
153 snart 127 0.06%
154 skrifver 126 0.06%
155 order 126 0.06%
156 mars 123 0.06%
157 öfverste 123 0.06%
158 bland 122 0.06%
159 ställning 119 0.06%
160 därför 119 0.06%
161 innan 119 0.06%
162 blifva 119 0.06%
163 mer 119 0.06%
164 Uleåborg 118 0.06%
165 först 118 0.06%
166 mellan 117 0.06%
167 enligt 116 0.06%
168 Stockholm 116 0.06%
169 vägen 116 0.06%
170 tillbaka 116 0.06%
171 många 115 0.06%
172 maj 115 0.06%
173 folk 115 0.06%
174 själf 113 0.06%
175 Vasa 113 0.06%
176 fick 113 0.06%
177 Lappo 112 0.06%

*This list excludes punctuation and single-letter words, as well as some different-case repeats of the same words.
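A list of this kind can be produced with a short sketch such as the one below. It assumes simple regex tokenization and, like the list above, skips single-letter words while expressing each count as a share of all words; the function name `top_words` is hypothetical.

```python
import re
from collections import Counter


def top_words(text, limit=10):
    """Return (word, repetitions, share of all words) for the most frequent words."""
    tokens = re.findall(r"\w+", text)
    if not tokens:
        raise ValueError("text must contain at least one word")
    total = len(tokens)
    counts = Counter(token for token in tokens if len(token) > 1)
    return [(word, count, count / total) for word, count in counts.most_common(limit)]


for word, count, share in top_words("och att och en i och att", limit=3):
    print(f"{word} {count} {share:.2%}")
```

Applied to the counts in the list above, the same share calculation gives 5,551 / 201,123 ≈ 2.76% for "och".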

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Finska kriget och Finlands krigare 1808-1809 by Johan Richard Danielson-Kalmari

Other resources and languages

If you like this analysis, you should have a look at our lists of Swedish short stories and Swedish books.

If you like literature as a means of learning languages, please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.