På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier by Alfred Jensen : Difficulty Assessment for Swedish Learners

How difficult is På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 55,312, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier to have a difficulty score of 78. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 78% 78
Vocabulary Difficulty 91% 91
Grammatical Difficulty 65% 65

Vocabulary Difficulty: Breakdown

91%

Vocabulary difficulty: 91%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier:

Vocabulary difficulty breakdown for På Romanovs och Habsburgs ruiner  - kulturpolitiska nutidsstudier: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier:

Measure Score
Measure Score
Number of words 55,312
Number of unique words 13,407
Number of recognized words for names/places/other entities 3,283
Number of very rare non-entity words 3,756
Number of sentences 8,219
Average number of words/sentence 7

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 13,138 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

65%

Grammatical difficulty: 65%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 8
Coleman-Liau Index 12
Type/Token Ratio (TTR) 0.242389
Root type/Token Ratio (RTTR) 0.00000438221
Corrected type/Token Ratio (CTTR) 0.0000021911
MTLD Index 83
HDD Index 67
Yule's I Index 75
Lexical Diversity Index (MTLD + HD-D + Yule's I) 75

The type-token ratio (TTR) of På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier is 0.242389. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 13,407, while the number of words is 55,312, so the TTR is 13,407 / 55,312 = 0.242389. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 13,407 / (55,312 * 55,312) = 0.00000438221), while in CTTR, it is divided by a square of the number of words, multiplied twice 13,407 / 2 * (55,312 * 55,312) = 0.0000021911). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 8, making it understandable for 8-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 75 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 65.

Other Information about På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier by Alfred Jensen

We provide you a sample of the text below, however, the full text of the På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier is also available free of charge on our website.

Sample of text:

Men på den korta våren följde en lång höst och en förfärlig vinter. Måtte det icke upprepas nu! Det är med svidande hjärta, som jag målat det nya politiska Polen i delvis mörka färger. Jag har hittills gällt som ärlig vän av Polen, och jag ämnar förbliva det, ty jag har hög tanke om dess litteratur och vetenskapliga arbete och är fullt medveten om de urskuldande svårigheter, som den polska statens konsoliderande innebär. Men vilja polackerna behålla sina gamla sympatier i Europa, åtminstone hos de "neutrala", är det av stor vikt för dem, att de följa en både klok och högsinnad politik — icke blott mot Litauen och Ukraina, utan ock mot Tyskland, om de verkligen vilja vara civilisationens utpost i öster. Och om Polens oberschlesiska politik skulle spränga ententen, vore det kanske sorgligast ...

Top most frequently used words in På Romanovs och Habsburgs ruiner - kulturpolitiska nutidsstudier by Alfred Jensen*

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 1,845 3.34%
2 den 1,279 2.31%
3 det 980 1.77%
4 som 937 1.69%
5 en 900 1.63%
6 av 820 1.48%
7 att 702 1.27%
8 för 572 1.03%
9 538 0.97%
10 till 528 0.95%
11 är 502 0.91%
12 med 496 0.9%
13 de 489 0.88%
14 har 442 0.8%
15 icke 402 0.73%
16 ett 400 0.72%
17 om 316 0.57%
18 sig 308 0.56%
19 han 264 0.48%
20 men 233 0.42%
21 sin 231 0.42%
22 var 223 0.4%
23 från 204 0.37%
24 genom 199 0.36%
25 ha 190 0.34%
26 eller 172 0.31%
27 under 171 0.31%
28 vid 145 0.26%
29 145 0.26%
30 man 143 0.26%
31 Ryssland 142 0.26%
32 jag 141 0.25%
33 140 0.25%
34 sina 137 0.25%
35 kan 134 0.24%
36 skulle 133 0.24%
37 än 130 0.24%
38 ryska 130 0.24%
39 ännu 129 0.23%
40 sitt 127 0.23%
41 måste 116 0.21%
42 hade 116 0.21%
43 hans 115 0.21%
44 denna 115 0.21%
45 detta 114 0.21%
46 mot 113 0.2%
47 nya 112 0.2%
48 äro 111 0.2%
49 Polen 111 0.2%
50 gamla 110 0.2%
51 mycket 109 0.2%
52 polska 104 0.19%
53 även 104 0.19%
54 utan 103 0.19%
55 skall 100 0.18%
56 efter 99 0.18%
57 mer 97 0.18%
58 nu 97 0.18%
59 år 88 0.16%
60 ock 88 0.16%
61 dess 87 0.16%
62 varit 84 0.15%
63 över 83 0.15%
64 där 83 0.15%
65 ej 81 0.15%
66 slaviska 80 0.14%
67 ty 79 0.14%
68 blev 78 0.14%
69 dock 77 0.14%
70 blott 77 0.14%
71 redan 76 0.14%
72 också 75 0.14%
73 något 72 0.13%
74 vara 72 0.13%
75 dessa 72 0.13%
76 tyska 72 0.13%
77 kunna 71 0.13%
78 sedan 71 0.13%
79 sydslaviska 67 0.12%
80 världskriget 66 0.12%
81 alla 65 0.12%
82 stora 63 0.11%
83 republiken 62 0.11%
84 ur 62 0.11%
85 både 61 0.11%
86 Europa 60 0.11%
87 honom 60 0.11%
88 kanske 59 0.11%
89 tjechiska 58 0.1%
90 polackerna 57 0.1%
91 åt 57 0.1%
92 många 56 0.1%
93 allt 56 0.1%
94 därför 56 0.1%
95 56 0.1%
96 politiska 54 0.1%
97 samma 54 0.1%
98 deras 51 0.09%
99 fick 51 0.09%
100 själva 51 0.09%
101 Warszawa 51 0.09%
102 kunde 51 0.09%
103 bliva 51 0.09%
104 andra 51 0.09%
105 vi 50 0.09%
106 namn 49 0.09%
107 Österrike 49 0.09%
108 naturligtvis 48 0.09%
109 Polens 48 0.09%
110 grund 48 0.09%
111 tid 47 0.08%
112 Ukraina 47 0.08%
113 lika 44 0.08%
114 första 44 0.08%
115 nästan 44 0.08%
116 mig 44 0.08%
117 inte 44 0.08%
118 land 44 0.08%
119 hela 43 0.08%
120 hos 43 0.08%
121 mellan 42 0.08%
122 vilken 42 0.08%
123 världen 41 0.07%
124 skola 40 0.07%
125 mindre 40 0.07%
126 Masaryk 40 0.07%
127 haft 40 0.07%
128 själv 40 0.07%
129 slovakiska 39 0.07%
130 just 39 0.07%
131 länge 39 0.07%
132 såsom 38 0.07%
133 särskilt 38 0.07%
134 blivit 38 0.07%
135 väl 38 0.07%
136 österrikiska 37 0.07%
137 gång 37 0.07%
138 nog 37 0.07%
139 ingen 36 0.07%
140 ex 36 0.07%
141 Lenin 36 0.07%
142 någon 36 0.07%
143 Tyskland 35 0.06%
144 polsk 35 0.06%
145 båda 35 0.06%
146 serbiska 34 0.06%
147 Ungern 34 0.06%
148 ganska 34 0.06%
149 kunnat 34 0.06%
150 stor 34 0.06%
151 dem 34 0.06%
152 Moskva 33 0.06%
153 längre 32 0.06%
154 rent 32 0.06%
155 sista 32 0.06%
156 får 32 0.06%
157 När 32 0.06%
158 talet 31 0.06%
159 serberna 31 0.06%
160 tjechoslovakiska 31 0.06%
161 II 31 0.06%
162 statsbildningen 31 0.06%
163 helt 31 0.06%
164 visserligen 31 0.06%
165 Prag 31 0.06%
166 gjort 31 0.06%
167 ukrainska 31 0.06%
168 Serbien 30 0.05%
169 endast 30 0.05%
170 Rysslands 30 0.05%
171 voro 30 0.05%
172 folk 30 0.05%
173 nyligen 29 0.05%
174 göra 29 0.05%
175 närvarande 29 0.05%
176 folket 29 0.05%
177 här 29 0.05%
178 tre 29 0.05%
179 emot 29 0.05%
180 fast 28 0.05%
181 samt 28 0.05%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of På Romanovs och Habsburgs ruiner  - kulturpolitiska nutidsstudier by Alfred Jensen

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.