Svensk-Amerikanerna - Deras materiella och andliga strävanden by Carl Sundbeck : Difficulty Assessment for Swedish Learners

How difficult is Svensk-Amerikanerna - Deras materiella och andliga strävanden for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 75,580, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Svensk-Amerikanerna - Deras materiella och andliga strävanden to have a difficulty score of 79. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 79% 79
Vocabulary Difficulty 96% 96
Grammatical Difficulty 62% 62

Vocabulary Difficulty: Breakdown

96%

Vocabulary difficulty: 96%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Svensk-Amerikanerna - Deras materiella och andliga strävanden's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Svensk-Amerikanerna - Deras materiella och andliga strävanden:

Vocabulary difficulty breakdown for Svensk-Amerikanerna  - Deras materiella och andliga strävanden: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Svensk-Amerikanerna - Deras materiella och andliga strävanden:

Measure Score
Measure Score
Number of words 75,580
Number of unique words 16,425
Number of recognized words for names/places/other entities 5,068
Number of very rare non-entity words 6,298
Number of sentences 12,878
Average number of words/sentence 6

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 16,096 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Svensk-Amerikanerna - Deras materiella och andliga strävanden without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

62%

Grammatical difficulty: 62%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 6
Coleman-Liau Index 10
Type/Token Ratio (TTR) 0.217319
Root type/Token Ratio (RTTR) 0.00000287536
Corrected type/Token Ratio (CTTR) 0.00000143768
MTLD Index 82
HDD Index 69
Yule's I Index 81
Lexical Diversity Index (MTLD + HD-D + Yule's I) 78

The type-token ratio (TTR) of Svensk-Amerikanerna - Deras materiella och andliga strävanden is 0.217319. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 16,425, while the number of words is 75,580, so the TTR is 16,425 / 75,580 = 0.217319. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 16,425 / (75,580 * 75,580) = 0.00000287536), while in CTTR, it is divided by a square of the number of words, multiplied twice 16,425 / 2 * (75,580 * 75,580) = 0.00000143768). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 6, making it understandable for 6-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 78 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 62.

Other Information about Svensk-Amerikanerna - Deras materiella och andliga strävanden by Carl Sundbeck

We provide you a sample of the text below, however, the full text of the Svensk-Amerikanerna - Deras materiella och andliga strävanden is also available free of charge on our website.

Sample of text:

Men människorna, som syssla härmed, kunna på detta afstånd endast urskiljas, när de i större flockar mala på sin grottekvarn. New Yorks väldiga betydelse som handelsplats framgår däraf, att mer än 50 procent af Förenta staternas utrikeshandel går genom dess hamnar. Det årliga bytesvärdet öfverstiger fyra milliarder kronor. Skeppens, de in och utgåendes, tontal stiger till femton millioner. New Yorks industri är proportionsvis mot dess varuutbyte och handel icke så storartad. Men riktar man kikaren åt väster, tyckes hela horisonten öfver Montclair, Bloomfield, Arlington, Lyndhurst, Kingsland o. s. v. vara insvept i rök. Likaså skönjes vid East rivers stränder helt nära oss fabrik vid fabrik Deras antal, som växer för hvarje dag, har nått höga fyrsiffriga tal, deras årliga tillverkningsvärde når upp till tre milliarder kronor, och de sotiga, oljiga och ...

Top most frequently used words in Svensk-Amerikanerna - Deras materiella och andliga strävanden by Carl Sundbeck*

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 2,480 3.28%
2 som 1,315 1.74%
3 en 1,290 1.71%
4 af 1,104 1.46%
5 att 911 1.21%
6 de 867 1.15%
7 det 854 1.13%
8 är 852 1.13%
9 den 819 1.08%
10 till 766 1.01%
11 732 0.97%
12 med 693 0.92%
13 för 635 0.84%
14 ett 453 0.6%
15 från 368 0.49%
16 var 367 0.49%
17 sig 350 0.46%
18 svenska 329 0.44%
19 man 321 0.42%
20 där 320 0.42%
21 vi 320 0.42%
22 287 0.38%
23 vid 282 0.37%
24 icke 279 0.37%
25 om 269 0.36%
26 jag 264 0.35%
27 sin 257 0.34%
28 men 246 0.33%
29 äro 237 0.31%
30 denna 223 0.3%
31 har 222 0.29%
32 Amerika 211 0.28%
33 svensk 203 0.27%
34 äfven 202 0.27%
35 öfver 198 0.26%
36 här 194 0.26%
37 dessa 188 0.25%
38 alla 177 0.23%
39 än 170 0.22%
40 ha 161 0.21%
41 detta 155 0.21%
42 under 154 0.2%
43 andra 154 0.2%
44 hade 149 0.2%
45 eller 144 0.19%
46 genom 142 0.19%
47 han 142 0.19%
48 nu 140 0.19%
49 ej 137 0.18%
50 mycket 127 0.17%
51 sina 126 0.17%
52 Chicago 124 0.16%
53 dem 120 0.16%
54 Sverige 114 0.15%
55 samt 112 0.15%
56 vara 110 0.15%
57 amerikanska 109 0.14%
58 hela 108 0.14%
59 108 0.14%
60 stora 107 0.14%
61 också 105 0.14%
62 endast 103 0.14%
63 voro 102 0.13%
64 dess 99 0.13%
65 99 0.13%
66 år 99 0.13%
67 oss 95 0.13%
68 mot 92 0.12%
69 utan 92 0.12%
70 allt 92 0.12%
71 sitt 92 0.12%
72 mig 92 0.12%
73 ut 91 0.12%
74 kan 90 0.12%
75 sedan 90 0.12%
76 ännu 89 0.12%
77 åt 88 0.12%
78 skulle 88 0.12%
79 när 87 0.12%
80 deras 86 0.11%
81 hvad 84 0.11%
82 såsom 83 0.11%
83 hans 81 0.11%
84 Augustana 81 0.11%
85 många 81 0.11%
86 samma 81 0.11%
87 dock 80 0.11%
88 mer 80 0.11%
89 efter 80 0.11%
90 in 80 0.11%
91 första 79 0.1%
92 engelska 77 0.1%
93 bland 77 0.1%
94 Amerikas 76 0.1%
95 farm 73 0.1%
96 stor 73 0.1%
97 New 73 0.1%
98 land 71 0.09%
99 dollars 71 0.09%
100 nya 70 0.09%
101 dag 69 0.09%
102 upp 69 0.09%
103 gång 69 0.09%
104 omkring 68 0.09%
105 staden 67 0.09%
106 stad 66 0.09%
107 finnes 66 0.09%
108 våra 66 0.09%
109 Kansas 65 0.09%
110 del 62 0.08%
111 se 61 0.08%
112 hvars 61 0.08%
113 väldiga 61 0.08%
114 måste 60 0.08%
115 finnas 59 0.08%
116 några 59 0.08%
117 svenskar 59 0.08%
118 göra 59 0.08%
119 kunna 59 0.08%
120 män 58 0.08%
121 hos 58 0.08%
122 historia 58 0.08%
123 dr 58 0.08%
124 flera 57 0.08%
125 något 57 0.08%
126 vår 57 0.08%
127 två 57 0.08%
128 gamla 57 0.08%
129 min 57 0.08%
130 mellan 56 0.07%
131 York 54 0.07%
132 annat 54 0.07%
133 mest 54 0.07%
134 väl 53 0.07%
135 någon 53 0.07%
136 namn 53 0.07%
137 Minneapolis 53 0.07%
138 hvilka 52 0.07%
139 tid 51 0.07%
140 tre 51 0.07%
141 Minnesota 50 0.07%
142 komma 50 0.07%
143 hvarje 49 0.06%
144 kyrkan 48 0.06%
145 folk 48 0.06%
146 inom 48 0.06%
147 blott 47 0.06%
148 pastor 47 0.06%
149 helt 47 0.06%
150 hr 46 0.06%
151 får 46 0.06%
152 fram 46 0.06%
153 alltid 46 0.06%
154 barn 46 0.06%
155 kom 46 0.06%
156 redan 45 0.06%
157 unga 45 0.06%
158 hvilken 45 0.06%
159 mera 45 0.06%
160 ligger 45 0.06%
161 varit 44 0.06%
162 of 44 0.06%
163 ord 43 0.06%
164 lika 43 0.06%
165 tal 43 0.06%
166 43 0.06%
167 Illinois 43 0.06%
168 största 43 0.06%
169 vårt 43 0.06%
170 kring 43 0.06%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Svensk-Amerikanerna  - Deras materiella och andliga strävanden by Carl Sundbeck

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.