Från det nya och gamla landet. Noveller by Hilma Angered-Strandberg : Difficulty Assessment for Swedish Learners

How difficult is Från det nya och gamla landet. Noveller for Swedish learners? We have performed multiple tests on its full text (freely available here) of approximately 18,924, crunched all the numbers for you and present the results below.

Read the Full Text Now for Free!

Difficulty Assessment Summary

We have estimated Från det nya och gamla landet. Noveller to have a difficulty score of 72. Here're its scores:

Measure Score
easy difficult (1 - 100)
Overall Difficulty 72% 72
Vocabulary Difficulty 92% 92
Grammatical Difficulty 52% 52

Vocabulary Difficulty: Breakdown

92%

Vocabulary difficulty: 92%

This score has been calculated based on frequency vocabulary (the top most frequently used words in Swedish). It combines various measures of Från det nya och gamla landet. Noveller's text analyzed in terms of frequency vocabulary: a plain vocabulary score, frequency-weighted vocabulary score, banded frequency vocabulary scores based on vocabulary of the text falling in the top 1,000 or 2,000 most frequent words, etc. Here's a further breakdown of how often the top most frequently used words in Swedish appear in the full text of Från det nya och gamla landet. Noveller:

Vocabulary difficulty breakdown for Från det nya och gamla landet. Noveller: a test for Swedish top frequency vocabulary

We have also calculated the following approximate data on the vocabulary in Från det nya och gamla landet. Noveller:

Measure Score
Measure Score
Number of words 18,924
Number of unique words 5,529
Number of recognized words for names/places/other entities 754
Number of very rare non-entity words 1,262
Number of sentences 3,130
Average number of words/sentence 6

There is some research suggesting that that you need to know about 98% of a text's vocabulary in order to be able to infer the meaning of unknown words when reading. If true, this means that you would need to know around 5,418 words (where all the forms of the word are still counted as unique words) in Swedish to be able to read Från det nya och gamla landet. Noveller without a dictionary and fully understand it.

Grammatical Difficulty: Breakdown

52%

Grammatical difficulty: 52%

Here is the further grammatical comparison on this text. You can find an explanation of all these scores below.

Measure Score
Measure Score
Automated Readability Index 4
Coleman-Liau Index 7
Type/Token Ratio (TTR) 0.292169
Root type/Token Ratio (RTTR) 0.0000154391
Corrected type/Token Ratio (CTTR) 0.00000771953
MTLD Index 68
HDD Index 65
Yule's I Index 71
Lexical Diversity Index (MTLD + HD-D + Yule's I) 68

The type-token ratio (TTR) of Från det nya och gamla landet. Noveller is 0.292169. The TTR is the most basic measure of lexical diversity. To calculate it, we divide the number of unique words by the number of words in the text. For example, for this text, the number of unique words is 5,529, while the number of words is 18,924, so the TTR is 5,529 / 18,924 = 0.292169. However, the TTR is a very crude measure, as it is extremely dependent on text length. The longer the text, the lower the TTR is usually going to be, since common words tend to often repeat. Especially since the number of words in this text is more than 1,000, the TTR is not likely to give an accurate measure.

The root type-token ratio (RTTR) and corrected type-token ratio (CTTR) are measures which were suggested by researchers to partially address the problem of TTR's variance on text length. In the RTTR, the number of unique words is divided by a square of the number of words (therefore, 5,529 / (18,924 * 18,924) = 0.0000154391), while in CTTR, it is divided by a square of the number of words, multiplied twice 5,529 / 2 * (18,924 * 18,924) = 0.00000771953). However, these measures are not as easily readable, and also there is a growing body of research asserting that CTTR and RTTR do not effectively address the problems of text length. Therefore, while we do provide the full text's TTR, RTTR and CTTR on this page, these fiqures do not form part of our final calculations.

The Automated Readability Index (ARI) is one readability measure that has been developed by researchers over the years. The formula for calculating the ARI is as follows:
Formula for calculating the Automated Readability Index

The ARI should compute a reading level approximately corresponding to the reader's grade level (assuming the reader undertakes formal education). Thus, for example, a value of 1 is kindergarten level, while a value of 12 or 13 is the last year of school, and 14 is a sophomore at college. The current ARI of this text is 4, making it understandable for 4-grade students at their expected level of education.

The Coleman Liau Index (CLI) is a similar index designed by Meri Coleman and T. L. Liau, and it is supposed to compute the grade level of the reader (thus, for example, sophomore level material would be around grade 14, or year 14 of formal education, while kindergarten / primary school level material would be close to grade 1 in the CLI). The CLI is usually slightly higher than the ARI. The CLI is computed with this formula:
Formula for calculating the Coleman-Liau Readability Index

It is notable that other indexes exist, such as the Flesch-Kincaid Reading Ease, Gunning-Fog Score, and others, but we have chosen not to include them, since, contrary to the ARI and CLI, such other indexes are based on a syllable count and therefore arguably only work for English and not Swedish.

We compute a further compound lexical diversity index, which should range from 1 to a 100 (with the standard deviation being around 10, and its average value being around 50) - it is 68 in the present case. The compound lexical diversity index consists of the following indexes, averaged out (and also provided in the table above):

  • the Measure of Textual Lexical Diversity (MTLD) index - a measure which is based on computing the TTR for increasingly larger parts of the text until the TTR drops below a certain threshold point (around 0.7 in our case) - in which case, the TTR is reset, and the overall counter is increased; the counter is at the end divided by the number of words in text; as a result, the MTLD does not significantly vary by text length;
  • the Yule's I index (based on Yule's K characteristic inverted) - an index based on the work of the statistician G.U. Yule, who published his index of Frequency Vocabulary in his paper "The statistical study of literary vocabulary"; Yule's I takes into account the number of words in the text, and a compound summed measure of word frequency;
  • the Hypergeometric Distribution D (HD-D) index (based on vocd) - an index which assesses the contribution of each word to the diversity of the text; to calculate such contributions, a hypergeometric distribution is used to compute probabilities of each word appearing in word samples extracted from the text; then such distributions are divided by sample sizes and added up;

Our overall measure of grammatical diversity is based on a combination of the compound lexical diversity index (which includes the MTLD, Yule's I and HD-D indexes), the ARI and CLI, all normalized and given certain weight. The score should normally range from 1 to 100. In this case, the score is 52.

Other Information about Från det nya och gamla landet. Noveller by Hilma Angered-Strandberg

We provide you a sample of the text below, however, the full text of the Från det nya och gamla landet. Noveller is also available free of charge on our website.

Sample of text:

Icke på många år hade hon känt lifvet så spännande, som då hon skyndade hem genom de larmande gatorna. Det stormade i hennes själ. Syntes det icke tydligt, att hon var ämnad till något? Nu finge hon pröfva, hvad hon dugde till. Och hvem vet — om ett år eller par skulle hon kanske vara med i den stora kampen — som talare eller skriftställare — förtjäna penningar -—- höja deras lif — blifva omtalad! Ärelystnaden slog sina klor i henne. — Lilla flicka — barnet -— behöfde du gå bort för att diu mamma skulle få lefva, tänkte ...

Top most frequently used words in Från det nya och gamla landet. Noveller by Hilma Angered-Strandberg*

Position Word Repetitions Part of all words
Position Word Repetitions Part of all words
1 och 757 4%
2 det 380 2.01%
3 som 283 1.5%
4 243 1.28%
5 att 239 1.26%
6 en 227 1.2%
7 de 205 1.08%
8 af 201 1.06%
9 hon 200 1.06%
10 var 196 1.04%
11 med 178 0.94%
12 sig 157 0.83%
13 han 157 0.83%
14 den 143 0.76%
15 jag 138 0.73%
16 för 131 0.69%
17 är 122 0.64%
18 till 119 0.63%
19 icke 117 0.62%
20 ett 107 0.57%
21 104 0.55%
22 hade 92 0.49%
23 du 84 0.44%
24 där 81 0.43%
25 öfver 77 0.41%
26 men 71 0.38%
27 ut 70 0.37%
28 inte 61 0.32%
29 skulle 54 0.29%
30 henne 54 0.29%
31 man 53 0.28%
32 allt 53 0.28%
33 om 51 0.27%
34 sitt 51 0.27%
35 sin 51 0.27%
36 från 50 0.26%
37 efter 49 0.26%
38 hennes 48 0.25%
39 upp 46 0.24%
40 45 0.24%
41 något 45 0.24%
42 här 45 0.24%
43 nu 45 0.24%
44 kunde 44 0.23%
45 Nils 43 0.23%
46 detta 42 0.22%
47 sade 41 0.22%
48 in 41 0.22%
49 dem 40 0.21%
50 än 38 0.2%
51 deras 37 0.2%
52 kom 37 0.2%
53 mig 36 0.19%
54 utan 36 0.19%
55 hvad 35 0.18%
56 vid 34 0.18%
57 hans 34 0.18%
58 ned 34 0.18%
59 fram 33 0.17%
60 vara 32 0.17%
61 ju 32 0.17%
62 vi 32 0.17%
63 sina 32 0.17%
64 barn 31 0.16%
65 eller 31 0.16%
66 blef 31 0.16%
67 lif 30 0.16%
68 har 29 0.15%
69 ögon 29 0.15%
70 honom 29 0.15%
71 gamla 28 0.15%
72 ingen 28 0.15%
73 nya 28 0.15%
74 alla 28 0.15%
75 när 28 0.15%
76 under 28 0.15%
77 ville 26 0.14%
78 kan 26 0.14%
79 denna 25 0.13%
80 väl 25 0.13%
81 satt 25 0.13%
82 lilla 25 0.13%
83 mer 25 0.13%
84 dessa 24 0.13%
85 mot 24 0.13%
86 blott 24 0.13%
87 24 0.13%
88 kände 23 0.12%
89 små 23 0.12%
90 dag 23 0.12%
91 Jan 23 0.12%
92 se 23 0.12%
93 ni 23 0.12%
94 Anna 23 0.12%
95 stod 23 0.12%
96 hur 23 0.12%
97 voro 22 0.12%
98 stora 22 0.12%
99 0111 22 0.12%
100 aldrig 22 0.12%
101 bort 22 0.12%
102 tog 21 0.11%
103 hela 21 0.11%
104 Ja 21 0.11%
105 20 0.11%
106 såg 20 0.11%
107 gick 19 0.1%
108 Almroth 19 0.1%
109 tankar 19 0.1%
110 mycket 18 0.1%
111 ej 18 0.1%
112 skall 18 0.1%
113 Gud 18 0.1%
114 lifvet 17 0.09%
115 år 17 0.09%
116 fick 17 0.09%
117 gjorde 17 0.09%
118 visste 17 0.09%
119 ord 17 0.09%
120 vet 17 0.09%
121 genom 17 0.09%
122 själf 16 0.08%
123 andra 16 0.08%
124 världen 16 0.08%
125 ha 16 0.08%
126 någon 15 0.08%
127 dock 15 0.08%
128 lian 15 0.08%
129 hvita 15 0.08%
130 Tora 15 0.08%
131 Arne 15 0.08%
132 låg 15 0.08%
133 lefva 15 0.08%
134 ingenting 15 0.08%
135 plötsligt 15 0.08%
136 mitt 15 0.08%
137 göra 14 0.07%
138 tyst 14 0.07%
139 dig 14 0.07%
140 själ 14 0.07%
141 vill 14 0.07%
142 åter 14 0.07%
143 komma 14 0.07%
144 min 14 0.07%
145 tala 14 0.07%
146 enda 14 0.07%
147 rätt 14 0.07%
148 värld 13 0.07%
149 Olof 13 0.07%
150 stilla 13 0.07%
151 Nej 13 0.07%
152 ännu 13 0.07%
153 folk 13 0.07%
154 oss 13 0.07%
155 ens 13 0.07%
156 vore 13 0.07%
157 bästa 13 0.07%
158 ton 13 0.07%
159 talade 13 0.07%
160 ska 13 0.07%
161 några 12 0.06%
162 taga 12 0.06%
163 steg 12 0.06%
164 landet 12 0.06%
165 nytt 12 0.06%
166 hufvud 12 0.06%
167 Fredman 12 0.06%
168 samma 12 0.06%
169 sätt 12 0.06%
170 ögonen 12 0.06%
171 måste 12 0.06%
172 fast 12 0.06%
173 ser 12 0.06%
174 åt 11 0.06%
175 mellan 11 0.06%
176 fru 11 0.06%
177 barnet 11 0.06%
178 lade 11 0.06%

This list excludes punctuation or single-letter words, also some different-case repeats of the same words.

If you think the text would be accessible to you, you can read it on our site (click on the cover to access):

Cover of Från det nya och gamla landet. Noveller by Hilma Angered-Strandberg

Other resources and languages

If you like this analysis, you should have a look at out our lists of Swedish short stories and Swedish books.

If you like literature as a means to learn languages - please take a look at our project Interlinear Books. We even have a Swedish Interlinear book available for purchase.