Svenskt flyg och dess män by : Difficulty Assessment for Swedish Learners

How difficult is Svenskt flyg och dess män for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 91,224, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Svenskt flyg och dess män to have a difficulty score of 76. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 76% 76
Vocabulary Difficulty 86% 86
Grammatical Difficulty 67% 67

Vocabulary Difficulty: Breakdown

86%

Vocabulary difficulty: 86%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Svenskt flyg och dess män's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Svenskt flyg och dess män:

Vocabulary difficulty breakdown for Svenskt flyg och dess män: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Svenskt flyg och dess män:

Measure Score
Measure Score
Number of words 91,224
Number of unique words 17,409
Number of recognized words for names/places/other entities 5,017
Number of very rare non-entity words 7,286
Number of sentences 15,283
Average number of words/sentence 6

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 17,060 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Svenskt flyg och dess män without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

67%

Grammatical difficulty: 67%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 8
Coleman-Liau Index 13
Type/Token Ratio (TTR) 0.190838
Root type/Token Ratio (RTTR) 0.00000209197
Corrected type/Token Ratio (CTTR) 0.00000104599
MTLD Index 78
HDD Index 70
Yule's I Index 83
Lexical Diversity Index (MTLD + HD-D + Yule's I) 77

The type-token ratio (TTR) of Svenskt flyg och dess män is 0.190838. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 17,409, while the number of words is 91,224, so the TTR is 17,409 / 91,224 = 0.190838. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 17,409 / (91,224 * 91,224) = 0.00000209197), while in CTTR, it is divided by a square of the number of words, multiplied twice 17,409 / 2 * (91,224 * 91,224) = 0.00000104599). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 8, making it understandable for 8-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 77 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 67.

Other Information about Svenskt flyg och dess män by

We provide you a sample of the text below, however, the full text of the Svenskt flyg och dess män is also available free of charge on our website.

Sample of text:

I mitten av mars 1937 godkände regeringen flygförvaltningens förslag till avtal med de samgående företagen Svenska Järnvägsverkstäderna och Nydqvist & Holm, och ett särskilt bolag för verksamheten vid Trollhättan bildades med namnet Svenska Aeroplanaktiebolaget •— SAAB. Dessutom bildades ett bolag, AB Förenade Flygverkstäderna, som skulle bedriva forskning och konstruktionsverksamhet på det flygtekniska området. Verkstäderna vid Trollhättan tillverkade först endast motorer och flygplanpropellrar — Hamilton constant speed och hydromatic propeller — men under sommaren 1938 kunde även tillverkning av medeltunga helmetallbombplan av Junkers typ Ju 86 påbörjas. Både vid Trollhättan och vid Linköping ha särskilda flygfält anlagts intill fabrikerna. Utöver dessa två intimt samverkande flygverkstäder, som i början av år 1939 också formellt sammanslogos, har Götaverken i Göteborg anlagt en särskild flygplanavdelning, som tillverkat lätta bombplan och sportflygplanen GV 38. ...

Top most frequently used words in Svenskt flyg och dess män by *

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 3,077 3.37%
2 att 1,719 1.88%
3 av 1,611 1.77%
4 en 1,547 1.7%
5 som 1,411 1.55%
6 1,174 1.29%
7 för 1,144 1.25%
8 med 1,110 1.22%
9 den 1,062 1.16%
10 till 1,061 1.16%
11 det 818 0.9%
12 ett 741 0.81%
13 de 695 0.76%
14 var 557 0.61%
15 sig 472 0.52%
16 om 404 0.44%
17 men 400 0.44%
18 vid 396 0.43%
19 under 381 0.42%
20 man 376 0.41%
21 hade 371 0.41%
22 första 362 0.4%
23 från 357 0.39%
24 är 354 0.39%
25 har 327 0.36%
26 han 293 0.32%
27 290 0.32%
28 sin 272 0.3%
29 inte 255 0.28%
30 kunde 252 0.28%
31 ha 241 0.26%
32 genom 236 0.26%
33 även 226 0.25%
34 klubben 219 0.24%
35 denna 213 0.23%
36 över 207 0.23%
37 år 204 0.22%
38 skulle 196 0.21%
39 186 0.2%
40 blev 182 0.2%
41 icke 175 0.19%
42 också 173 0.19%
43 efter 162 0.18%
44 andra 162 0.18%
45 kan 159 0.17%
46 stora 152 0.17%
47 än 148 0.16%
48 eller 147 0.16%
49 ordförande 143 0.16%
50 mycket 142 0.16%
51 sina 140 0.15%
52 svenska 139 0.15%
53 kunna 131 0.14%
54 klubbens 129 0.14%
55 voro 127 0.14%
56 stor 126 0.14%
57 såsom 126 0.14%
58 detta 125 0.14%
59 1939 123 0.13%
60 dessa 113 0.12%
61 någon 113 0.12%
62 redan 112 0.12%
63 sitt 112 0.12%
64 utan 112 0.12%
65 när 111 0.12%
66 alla 110 0.12%
67 större 108 0.12%
68 vara 105 0.12%
69 senare 105 0.12%
70 Flygklubb 104 0.11%
71 samt 101 0.11%
72 sedan 100 0.11%
73 flygplan 100 0.11%
74 två 100 0.11%
75 kom 99 0.11%
76 löjtnant 99 0.11%
77 fick 99 0.11%
78 emellertid 98 0.11%
79 något 98 0.11%
80 där 98 0.11%
81 varit 97 0.11%
82 1938 97 0.11%
83 hela 95 0.1%
84 början 94 0.1%
85 Stockholm 93 0.1%
86 93 0.1%
87 vilken 93 0.1%
88 ingenjör 93 0.1%
89 dess 92 0.1%
90 inom 91 0.1%
91 del 91 0.1%
92 tre 89 0.1%
93 nu 88 0.1%
94 luften 88 0.1%
95 gjorde 87 0.1%
96 måste 87 0.1%
97 upp 87 0.1%
98 olika 85 0.09%
99 1937 84 0.09%
100 ganska 84 0.09%
101 endast 84 0.09%
102 medlemmar 84 0.09%
103 många 83 0.09%
104 både 81 0.09%
105 dock 80 0.09%
106 ännu 80 0.09%
107 tid 79 0.09%
108 samma 79 0.09%
109 hans 78 0.09%
110 några 77 0.08%
111 nya 77 0.08%
112 mot 77 0.08%
113 000 77 0.08%
114 intresse 73 0.08%
115 flygare 73 0.08%
116 sätt 73 0.08%
117 kapten 73 0.08%
118 sekreterare 72 0.08%
119 mer 71 0.08%
120 flygning 71 0.08%
121 allt 70 0.08%
122 nästan 69 0.08%
123 utveckling 68 0.07%
124 flygfält 68 0.07%
125 dels 67 0.07%
126 helt 67 0.07%
127 började 66 0.07%
128 mindre 65 0.07%
129 väl 65 0.07%
130 därför 65 0.07%
131 resultat 64 0.07%
132 först 64 0.07%
133 Sverige 64 0.07%
134 bland 63 0.07%
135 mera 62 0.07%
136 hösten 62 0.07%
137 flyga 61 0.07%
138 stort 60 0.07%
139 gång 60 0.07%
140 plan 59 0.06%
141 annat 58 0.06%
142 vilket 58 0.06%
143 1936 57 0.06%
144 visade 57 0.06%
145 åt 57 0.06%
146 sällskapet 57 0.06%
147 styrelseledamot 56 0.06%
148 fram 56 0.06%
149 göra 56 0.06%
150 äro 55 0.06%
151 hur 55 0.06%
152 mellan 54 0.06%
153 komma 54 0.06%
154 kunnat 54 0.06%
155 långt 53 0.06%
156 flera 53 0.06%
157 dem 53 0.06%
158 ej 52 0.06%
159 torde 52 0.06%
160 KSAK 51 0.06%
161 ut 51 0.06%
162 åter 50 0.05%
163 hos 50 0.05%
164 vad 50 0.05%
165 bildades 50 0.05%
166 bli 49 0.05%
167 vilka 49 0.05%
168 rätt 48 0.05%
169 året 48 0.05%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Svenskt flyg och dess män by

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.